SentenceTransformer based on thebajajra/RexBERT-base-embed-pf-v0.1
This is a sentence-transformers model finetuned from thebajajra/RexBERT-base-embed-pf-v0.1 on the nomic-embed-supervised-data dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
Model Details
Model Description
- Model Type: Sentence Transformer
- Base model: thebajajra/RexBERT-base-embed-pf-v0.1
- Maximum Sequence Length: 1024 tokens
- Output Dimensionality: 768 dimensions
- Similarity Function: Cosine Similarity
- Training Dataset:
Model Sources
- Documentation: Sentence Transformers Documentation
- Repository: Sentence Transformers on GitHub
- Hugging Face: Sentence Transformers on Hugging Face
Full Model Architecture
SentenceTransformer(
(0): Transformer({'max_seq_length': 1024, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
Usage
Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
pip install -U sentence-transformers
Then you can load this model and run inference.
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
queries = [
"None of this proves that Loral wasn\u0027t disloyal or criminally negligent in its dealings with China.",
]
documents = [
'There is nothing that proves that Loral was negligent in dealing with China. ',
'Trimipramine is a tricyclic antidepressant. Trimipramine affects chemicals in the brain that may become unbalanced. Trimipramine is used to treat symptoms of depression.Trimipramine may also be used for purposes not listed in this medication guide. You should not take trimipramine if you have recently had a heart attack. Do not use trimipramine if you have used an MAO inhibitor in the past 14 days.A dangerous drug interaction could occur.ou should not use trimipramine if you are allergic to it, or if you have: 1 if you have recently had a heart attack; or. 2 if you are allergic to antidepressants such as amitriptyline, amoxapine, clomipramine, desipramine, doxepin, imipramine, nortriptyline, or protriptyline.',
'Lorraine Dunn Lorraine Dunn (12 September 1942 – 16 October 2003) was a Panamanian sprinter. She competed in the 4 × 100 metres relay at the 1960 Summer Olympics and the 1964 Summer Olympics. Dunn finished third in the 200 metres and finished fourth in the 80 metres hurdles at the 1963 Pan American Games.',
]
query_embeddings = model.encode_query(queries)
document_embeddings = model.encode_document(documents)
print(query_embeddings.shape, document_embeddings.shape)
# [1, 768] [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(query_embeddings, document_embeddings)
print(similarities)
# tensor([[ 0.7875, -0.0236, -0.0537]])
Training Details
Training Dataset
nomic-embed-supervised-data
- Dataset: nomic-embed-supervised-data at 13eef8a
- Size: 1,611,024 training samples
- Columns:
query,positive, andnegative - Approximate statistics based on the first 1000 samples:
query positive negative type string string list details - min: 4 tokens
- mean: 37.16 tokens
- max: 1024 tokens
- min: 5 tokens
- mean: 99.03 tokens
- max: 1024 tokens
- min: 20 elements
- mean: 155.14 elements
- max: 209 elements
- Samples:
query positive negative Japan's biggest daily newspaper, Asahi Shimbun, published an interview Sunday with Woody Allen, who, asked to sum up the 20 th century in a single word, replied, Disappointing.Woody Allen described the 20th century as "disappointing'' in an interview with the Japaense newspaper, Asahi Shimbun.['Billy Joel described the 20th century as "outlandish" in a recent interview with Japan's leading newspaper. ', "aside for the kids' education and--BOP--I have so much trouble with that the", 'An old man is jogging.', 'It was my favorite spot.', 'Two women are playing volleyball.', ...]Sedimentation is the term for when clumped solids sink to the bottom of the water.Deposition occurs where the water motion slows.['The law of superposition is best described by: in undisturbed layers of sedimentary rock, the lowest layers contain the older rocks.', "Review: I felt this film - throughout. I waas impressed with Russell Crowe's talent in developing his relationship with Lillie, such a typical Aussie blend of softly softly approach, a bit self depreciating and very persistent. Really loved the cinematography and direction. Pace was just right and the portrayals of nearly all characters was impressive.Gosh, didn't Russell's talent even in 1993 shine! .. and I have yet to see Gladiator. Question: is it a negative review?", 'Question: How many times did Chopin and Liszy perform together in public? (Answer: seven).', 'Fact 1: Birth may be followed by a period of parental care of the offspring. \nFact 2: Human birth is the rarest of all births.', 'acts. Id. at 963. The Lawson court stated that a "critical distinction” of Pylant from the facts before it in Lawson was that in Pylant, there was "not a factu...Volleyball involves techniques like jumping.Volleyball Volleyball is a team sport in which two teams of six players are separated by a net . Each team tries to score points by grounding a ball on the other team 's court under organized rules . It has been a part of the official program of the Summer Olympic Games since 1964 . The complete rules are extensive . But simply , play proceeds as follows : a player on one of the teams begins a ` rally ' by serving the ball ( tossing or releasing it and then hitting it with a hand or arm ) , from behind the back boundary line of the court , over the net , and into the receiving team 's court . The receiving team must not let the ball be grounded within their court . The team may touch the ball up to 3 times but individual players may not touch the ball twice consecutively . Typically , the first two touches are used to set up for an attack , an attempt to direct the ball back over the net in such a way that the serving team is unable to prevent it from being grounded in their court . ...['Volleyball jump serve The Volleyball jump Serve is a type of volleyball serve where the player increases the power and height of their serve by jumping into the hit . The Jump Serve itself was popularized by the brazilian national volleyball team in 1984 on the Olympics .', "Volleyball variations As volleyball is one of the world 's most popular team sports , second only to football ( soccer ) in the number of players , there are numerous variations of the basic rules . The rules have changed around the world since its creation in 1895 , as skills have developed , to make the game more suited for spectators , for learning or other special needs . Some variations have been included as a change in the international rules by Fédération Internationale de Volleyball ( FIVB ) , others have resulted in variations with specific names . The more notable variations include :", 'Volleyball (ball) A volleyball is a ball used to play indoor volleyball , beach volleyball , or other less common v... - Loss:
MultipleNegativesRankingLosswith these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim", "gather_across_devices": false }
Evaluation Dataset
nomic-embed-supervised-data
- Dataset: nomic-embed-supervised-data at 13eef8a
- Size: 84,795 evaluation samples
- Columns:
query,positive, andnegative - Approximate statistics based on the first 1000 samples:
query positive negative type string string list details - min: 4 tokens
- mean: 34.78 tokens
- max: 1024 tokens
- min: 5 tokens
- mean: 91.63 tokens
- max: 1024 tokens
- min: 15 elements
- mean: 151.78 elements
- max: 209 elements
- Samples:
query positive negative I think people are upset that they're "copying" a mechanic. Which is ridiculous. Every platformer copied from Mario. That doesn't mean they weren't good games. And this says nothing of the end result of the flow of the game.The people who say spiderman is copying another game. Are the same retarded little children who think any game that has drivable vehicles in it, is a gta rip off.
You can't take them serious.['i unfollowed all of them earlier this year theres too many and all their tweets and instaposts made up damn near half my feed i said id just wait for the music and then iridesence came out and was pretty mid so ', 'There are five, FIVE fucking boxes of doughnuts by the printer in the office this morning and I am overtired and hungry and in the worst mood, and I’m sorry r/1200isplenty, tea is not a goddamn dessert substitute; I keep drinking it and not only am I not satiated, I have to pee every ten minutes, so I’m here guzzling tea and growling ”nothing tastes as good as skinny feels” under my breath and if I sound crazy, it’s because I am. \n\nEdit: down to one box. Just checked the calories for one doughnut—380. Three hundred. And eighty. Calories. For one doughnut. ', 'What a strange thing to call your cock.', "It doesn't. That's a very personal decision. You're not right or wrong on that. I was just replying because you were clear you don't want to raise the child as your own....duties of a medical assistant for resumeInclude clinical skills in a health care resume. Those that apply to a medical assistant job may be: 1 Preparing patients, including taking of medical histories. 2 The ability to take vital signs. 3 Preparing medication and treatments. 4 Basic first aid, CPR and infection control knowledge. Assisting physicians with exams.['Include clinical skills in a health care resume. Those that apply to a medical assistant job may be: 1 Preparing patients, including taking of medical histories. 2 The ability to take vital signs. 3 Preparing medication and treatments. Basic first aid, CPR and infection control knowledge.', 'Clinical Medical Assistant Resume. The position of a medical assistant is a key position in any hospital, clinic or a health care center. A medical assistant may perform a variety of duties, pertaining to administrative, front office or clinical duties. Here we will discuss the job profile of a clinical medical assistant. Also, there is a clinical medical assistant resume example provided below, for your further reference.', "Community Q&A. A medical assistant is a member of a health care team that undertakes administrative and clinical job duties. A health care resume must emphasize skills as well as education and job experience. Learn how to write a resume for a medical assistant job and yo...when did british rule end in south africaHistory of South Africa British colonies: Cape Colony, Natal Colony, Transvaal Colony, and Orange River Colony. The country became a self-governing nation state within the British Empire, in 1934 following enactment of the Status of the Union Act. The dominion came to an end on 31 May 1961 as the consequence of a 1960 referendum, which legitimised the country becoming a sovereign state named Republic of South Africa. A republican constitution was adopted. From 1948–1994, South African politics were dominated by Afrikaner nationalism. Racial segregation and white minority rule known officially as apartheid, an Afrikaans word meaning "separateness”, came into existence in 1948 (under["South Africa 1910. The Union was a dominion that included the former territories of the Cape, Transvaal and Natal colonies, as well as the Orange Free State republic. The Natives' Land Act of 1913 severely restricted the ownership of land by blacks; at that stage natives controlled only 7% of the country. The amount of land reserved for indigenous peoples was later marginally increased. In 1931, the union was fully sovereign from the United Kingdom with the passage of the Statute of Westminster, which abolished the last powers of the British Government on the country. In 1934, the South African Party and", 'History of South Africa conducted raids in Free State territories. Both sides adopted scorched-earth tactics, with large swathes of pasturage and cropland being destroyed. Faced with starvation, Moshoeshoe signed a peace treaty on 15 October 1858, though crucial boundary issues remained unresolved. War broke out again in 1865. After an unsuccessful appeal for aid from the British ... - Loss:
MultipleNegativesRankingLosswith these parameters:{ "scale": 20.0, "similarity_fct": "cos_sim", "gather_across_devices": false }
Training Hyperparameters
Non-Default Hyperparameters
eval_strategy: stepsper_device_train_batch_size: 256per_device_eval_batch_size: 128learning_rate: 0.0001num_train_epochs: 20warmup_ratio: 0.1bf16: Truedataloader_num_workers: 10dataloader_prefetch_factor: 10
All Hyperparameters
Click to expand
overwrite_output_dir: Falsedo_predict: Falseeval_strategy: stepsprediction_loss_only: Trueper_device_train_batch_size: 256per_device_eval_batch_size: 128per_gpu_train_batch_size: Noneper_gpu_eval_batch_size: Nonegradient_accumulation_steps: 1eval_accumulation_steps: Nonetorch_empty_cache_steps: Nonelearning_rate: 0.0001weight_decay: 0.0adam_beta1: 0.9adam_beta2: 0.999adam_epsilon: 1e-08max_grad_norm: 1.0num_train_epochs: 20max_steps: -1lr_scheduler_type: linearlr_scheduler_kwargs: {}warmup_ratio: 0.1warmup_steps: 0log_level: passivelog_level_replica: warninglog_on_each_node: Truelogging_nan_inf_filter: Truesave_safetensors: Truesave_on_each_node: Falsesave_only_model: Falserestore_callback_states_from_checkpoint: Falseno_cuda: Falseuse_cpu: Falseuse_mps_device: Falseseed: 42data_seed: Nonejit_mode_eval: Falsebf16: Truefp16: Falsefp16_opt_level: O1half_precision_backend: autobf16_full_eval: Falsefp16_full_eval: Falsetf32: Nonelocal_rank: 0ddp_backend: Nonetpu_num_cores: Nonetpu_metrics_debug: Falsedebug: []dataloader_drop_last: Truedataloader_num_workers: 10dataloader_prefetch_factor: 10past_index: -1disable_tqdm: Falseremove_unused_columns: Truelabel_names: Noneload_best_model_at_end: Falseignore_data_skip: Falsefsdp: []fsdp_min_num_params: 0fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}fsdp_transformer_layer_cls_to_wrap: Noneaccelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}parallelism_config: Nonedeepspeed: Nonelabel_smoothing_factor: 0.0optim: adamw_torch_fusedoptim_args: Noneadafactor: Falsegroup_by_length: Falselength_column_name: lengthproject: huggingfacetrackio_space_id: trackioddp_find_unused_parameters: Noneddp_bucket_cap_mb: Noneddp_broadcast_buffers: Falsedataloader_pin_memory: Truedataloader_persistent_workers: Falseskip_memory_metrics: Trueuse_legacy_prediction_loop: Falsepush_to_hub: Falseresume_from_checkpoint: Nonehub_model_id: Nonehub_strategy: every_savehub_private_repo: Nonehub_always_push: Falsehub_revision: Nonegradient_checkpointing: Falsegradient_checkpointing_kwargs: Noneinclude_inputs_for_metrics: Falseinclude_for_metrics: []eval_do_concat_batches: Truefp16_backend: autopush_to_hub_model_id: Nonepush_to_hub_organization: Nonemp_parameters:auto_find_batch_size: Falsefull_determinism: Falsetorchdynamo: Noneray_scope: lastddp_timeout: 1800torch_compile: Falsetorch_compile_backend: Nonetorch_compile_mode: Noneinclude_tokens_per_second: Falseinclude_num_input_tokens_seen: noneftune_noise_alpha: Noneoptim_target_modules: Nonebatch_eval_metrics: Falseeval_on_start: Falseuse_liger_kernel: Falseliger_kernel_config: Noneeval_use_gather_object: Falseaverage_tokens_across_devices: Trueprompts: Nonebatch_sampler: batch_samplermulti_dataset_batch_sampler: proportionalrouter_mapping: {}learning_rate_mapping: {}
Training Logs
Click to expand
| Epoch | Step | Training Loss | Validation Loss |
|---|---|---|---|
| 0.0127 | 10 | 1.278 | - |
| 0.0254 | 20 | 1.2803 | - |
| 0.0382 | 30 | 1.2583 | - |
| 0.0509 | 40 | 1.2256 | - |
| 0.0636 | 50 | 1.2424 | - |
| 0.0763 | 60 | 1.148 | - |
| 0.0891 | 70 | 1.1033 | - |
| 0.1018 | 80 | 1.0141 | - |
| 0.1145 | 90 | 0.8885 | - |
| 0.1272 | 100 | 0.7322 | - |
| 0.1399 | 110 | 0.5795 | - |
| 0.1527 | 120 | 0.4847 | - |
| 0.1654 | 130 | 0.4266 | - |
| 0.1781 | 140 | 0.3894 | - |
| 0.1908 | 150 | 0.3716 | - |
| 0.2036 | 160 | 0.3361 | - |
| 0.2163 | 170 | 0.33 | - |
| 0.2290 | 180 | 0.3192 | - |
| 0.2417 | 190 | 0.3084 | - |
| 0.2545 | 200 | 0.3092 | - |
| 0.2672 | 210 | 0.298 | - |
| 0.2799 | 220 | 0.2783 | - |
| 0.2926 | 230 | 0.2855 | - |
| 0.3053 | 240 | 0.2767 | - |
| 0.3181 | 250 | 0.2743 | - |
| 0.3308 | 260 | 0.2677 | - |
| 0.3435 | 270 | 0.2644 | - |
| 0.3562 | 280 | 0.2662 | - |
| 0.3690 | 290 | 0.2595 | - |
| 0.3817 | 300 | 0.2548 | - |
| 0.3944 | 310 | 0.2587 | - |
| 0.4071 | 320 | 0.2538 | - |
| 0.4198 | 330 | 0.2525 | - |
| 0.4326 | 340 | 0.2522 | - |
| 0.4453 | 350 | 0.2545 | - |
| 0.4580 | 360 | 0.2408 | - |
| 0.4707 | 370 | 0.2548 | - |
| 0.4835 | 380 | 0.2419 | - |
| 0.4962 | 390 | 0.2276 | - |
| 0.5 | 393 | - | 0.1599 |
| 0.5089 | 400 | 0.2377 | - |
| 0.5216 | 410 | 0.2309 | - |
| 0.5344 | 420 | 0.2322 | - |
| 0.5471 | 430 | 0.2277 | - |
| 0.5598 | 440 | 0.2286 | - |
| 0.5725 | 450 | 0.2338 | - |
| 0.5852 | 460 | 0.2247 | - |
| 0.5980 | 470 | 0.2239 | - |
| 0.6107 | 480 | 0.2317 | - |
| 0.6234 | 490 | 0.2262 | - |
| 0.6361 | 500 | 0.2282 | - |
| 0.6489 | 510 | 0.2165 | - |
| 0.6616 | 520 | 0.2186 | - |
| 0.6743 | 530 | 0.2169 | - |
| 0.6870 | 540 | 0.2197 | - |
| 0.6997 | 550 | 0.2203 | - |
| 0.7125 | 560 | 0.2183 | - |
| 0.7252 | 570 | 0.2188 | - |
| 0.7379 | 580 | 0.2167 | - |
| 0.7506 | 590 | 0.2026 | - |
| 0.7634 | 600 | 0.2174 | - |
| 0.7761 | 610 | 0.2129 | - |
| 0.7888 | 620 | 0.207 | - |
| 0.8015 | 630 | 0.2123 | - |
| 0.8142 | 640 | 0.2079 | - |
| 0.8270 | 650 | 0.2176 | - |
| 0.8397 | 660 | 0.2123 | - |
| 0.8524 | 670 | 0.2061 | - |
| 0.8651 | 680 | 0.2058 | - |
| 0.8779 | 690 | 0.2027 | - |
| 0.8906 | 700 | 0.2138 | - |
| 0.9033 | 710 | 0.2025 | - |
| 0.9160 | 720 | 0.2015 | - |
| 0.9288 | 730 | 0.2112 | - |
| 0.9415 | 740 | 0.2142 | - |
| 0.9542 | 750 | 0.2 | - |
| 0.9669 | 760 | 0.2046 | - |
| 0.9796 | 770 | 0.1986 | - |
| 0.9924 | 780 | 0.2044 | - |
| 1.0 | 786 | - | 0.1360 |
| 1.0051 | 790 | 0.1905 | - |
| 1.0178 | 800 | 0.1724 | - |
| 1.0305 | 810 | 0.1899 | - |
| 1.0433 | 820 | 0.1739 | - |
| 1.0560 | 830 | 0.1682 | - |
| 1.0687 | 840 | 0.1804 | - |
| 1.0814 | 850 | 0.1794 | - |
| 1.0941 | 860 | 0.1735 | - |
| 1.1069 | 870 | 0.1792 | - |
| 1.1196 | 880 | 0.1849 | - |
| 1.1323 | 890 | 0.178 | - |
| 1.1450 | 900 | 0.1824 | - |
| 1.1578 | 910 | 0.1841 | - |
| 1.1705 | 920 | 0.1874 | - |
| 1.1832 | 930 | 0.1759 | - |
| 1.1959 | 940 | 0.1737 | - |
| 1.2087 | 950 | 0.1779 | - |
| 1.2214 | 960 | 0.1866 | - |
| 1.2341 | 970 | 0.1806 | - |
| 1.2468 | 980 | 0.174 | - |
| 1.2595 | 990 | 0.1803 | - |
| 1.2723 | 1000 | 0.1707 | - |
| 1.2850 | 1010 | 0.1796 | - |
| 1.2977 | 1020 | 0.1769 | - |
| 1.3104 | 1030 | 0.1885 | - |
| 1.3232 | 1040 | 0.1752 | - |
| 1.3359 | 1050 | 0.1856 | - |
| 1.3486 | 1060 | 0.1782 | - |
| 1.3613 | 1070 | 0.1772 | - |
| 1.3740 | 1080 | 0.1841 | - |
| 1.3868 | 1090 | 0.1774 | - |
| 1.3995 | 1100 | 0.1822 | - |
| 1.4122 | 1110 | 0.1837 | - |
| 1.4249 | 1120 | 0.176 | - |
| 1.4377 | 1130 | 0.1865 | - |
| 1.4504 | 1140 | 0.1841 | - |
| 1.4631 | 1150 | 0.1736 | - |
| 1.4758 | 1160 | 0.1788 | - |
| 1.4885 | 1170 | 0.1821 | - |
| 1.5 | 1179 | - | 0.1318 |
| 1.5013 | 1180 | 0.1753 | - |
| 1.5140 | 1190 | 0.1739 | - |
| 1.5267 | 1200 | 0.1841 | - |
| 1.5394 | 1210 | 0.1749 | - |
| 1.5522 | 1220 | 0.1847 | - |
| 1.5649 | 1230 | 0.1731 | - |
| 1.5776 | 1240 | 0.1901 | - |
| 1.5903 | 1250 | 0.1777 | - |
| 1.6031 | 1260 | 0.1812 | - |
| 1.6158 | 1270 | 0.1861 | - |
| 1.6285 | 1280 | 0.1808 | - |
| 1.6412 | 1290 | 0.1834 | - |
| 1.6539 | 1300 | 0.1797 | - |
| 1.6667 | 1310 | 0.1822 | - |
| 1.6794 | 1320 | 0.1846 | - |
| 1.6921 | 1330 | 0.1829 | - |
| 1.7048 | 1340 | 0.1838 | - |
| 1.7176 | 1350 | 0.1832 | - |
| 1.7303 | 1360 | 0.1766 | - |
| 1.7430 | 1370 | 0.186 | - |
| 1.7557 | 1380 | 0.1886 | - |
| 1.7684 | 1390 | 0.1794 | - |
| 1.7812 | 1400 | 0.1851 | - |
| 1.7939 | 1410 | 0.1785 | - |
| 1.8066 | 1420 | 0.1887 | - |
| 1.8193 | 1430 | 0.1766 | - |
| 1.8321 | 1440 | 0.1746 | - |
| 1.8448 | 1450 | 0.1826 | - |
| 1.8575 | 1460 | 0.1861 | - |
| 1.8702 | 1470 | 0.1758 | - |
| 1.8830 | 1480 | 0.1801 | - |
| 1.8957 | 1490 | 0.178 | - |
| 1.9084 | 1500 | 0.1771 | - |
| 1.9211 | 1510 | 0.17 | - |
| 1.9338 | 1520 | 0.1827 | - |
| 1.9466 | 1530 | 0.185 | - |
| 1.9593 | 1540 | 0.1814 | - |
| 1.9720 | 1550 | 0.1824 | - |
| 1.9847 | 1560 | 0.1738 | - |
| 1.9975 | 1570 | 0.1805 | - |
| 2.0 | 1572 | - | 0.1268 |
| 2.0102 | 1580 | 0.1264 | - |
| 2.0229 | 1590 | 0.1183 | - |
| 2.0356 | 1600 | 0.1155 | - |
| 2.0483 | 1610 | 0.1232 | - |
| 2.0611 | 1620 | 0.1133 | - |
| 2.0738 | 1630 | 0.1208 | - |
| 2.0865 | 1640 | 0.1218 | - |
| 2.0992 | 1650 | 0.111 | - |
| 2.1120 | 1660 | 0.1159 | - |
| 2.1247 | 1670 | 0.1194 | - |
| 2.1374 | 1680 | 0.1235 | - |
| 2.1501 | 1690 | 0.1207 | - |
| 2.1628 | 1700 | 0.1193 | - |
| 2.1756 | 1710 | 0.1279 | - |
| 2.1883 | 1720 | 0.1229 | - |
| 2.2010 | 1730 | 0.1252 | - |
| 2.2137 | 1740 | 0.1189 | - |
| 2.2265 | 1750 | 0.1242 | - |
| 2.2392 | 1760 | 0.1321 | - |
| 2.2519 | 1770 | 0.1243 | - |
| 2.2646 | 1780 | 0.125 | - |
| 2.2774 | 1790 | 0.1218 | - |
| 2.2901 | 1800 | 0.1324 | - |
| 2.3028 | 1810 | 0.1342 | - |
| 2.3155 | 1820 | 0.1272 | - |
| 2.3282 | 1830 | 0.1252 | - |
| 2.3410 | 1840 | 0.1261 | - |
| 2.3537 | 1850 | 0.1297 | - |
| 2.3664 | 1860 | 0.1224 | - |
| 2.3791 | 1870 | 0.1294 | - |
| 2.3919 | 1880 | 0.119 | - |
| 2.4046 | 1890 | 0.1269 | - |
| 2.4173 | 1900 | 0.1286 | - |
| 2.4300 | 1910 | 0.1262 | - |
| 2.4427 | 1920 | 0.1255 | - |
| 2.4555 | 1930 | 0.1229 | - |
| 2.4682 | 1940 | 0.129 | - |
| 2.4809 | 1950 | 0.127 | - |
| 2.4936 | 1960 | 0.1226 | - |
| 2.5 | 1965 | - | 0.1274 |
| 2.5064 | 1970 | 0.1292 | - |
| 2.5191 | 1980 | 0.1306 | - |
| 2.5318 | 1990 | 0.1241 | - |
| 2.5445 | 2000 | 0.1229 | - |
| 2.5573 | 2010 | 0.1293 | - |
| 2.5700 | 2020 | 0.1271 | - |
| 2.5827 | 2030 | 0.123 | - |
| 2.5954 | 2040 | 0.1344 | - |
| 2.6081 | 2050 | 0.1252 | - |
| 2.6209 | 2060 | 0.126 | - |
| 2.6336 | 2070 | 0.1307 | - |
| 2.6463 | 2080 | 0.1272 | - |
| 2.6590 | 2090 | 0.1264 | - |
| 2.6718 | 2100 | 0.1322 | - |
| 2.6845 | 2110 | 0.1245 | - |
| 2.6972 | 2120 | 0.1278 | - |
| 2.7099 | 2130 | 0.133 | - |
| 2.7226 | 2140 | 0.1309 | - |
| 2.7354 | 2150 | 0.1366 | - |
| 2.7481 | 2160 | 0.1279 | - |
| 2.7608 | 2170 | 0.1292 | - |
| 2.7735 | 2180 | 0.1236 | - |
| 2.7863 | 2190 | 0.1273 | - |
| 2.7990 | 2200 | 0.1308 | - |
| 2.8117 | 2210 | 0.135 | - |
| 2.8244 | 2220 | 0.1302 | - |
| 2.8372 | 2230 | 0.1248 | - |
| 2.8499 | 2240 | 0.1206 | - |
| 2.8626 | 2250 | 0.1254 | - |
| 2.8753 | 2260 | 0.1229 | - |
| 2.8880 | 2270 | 0.1305 | - |
| 2.9008 | 2280 | 0.1288 | - |
| 2.9135 | 2290 | 0.1284 | - |
| 2.9262 | 2300 | 0.1293 | - |
| 2.9389 | 2310 | 0.1354 | - |
| 2.9517 | 2320 | 0.127 | - |
| 2.9644 | 2330 | 0.124 | - |
| 2.9771 | 2340 | 0.1307 | - |
| 2.9898 | 2350 | 0.1326 | - |
| 3.0 | 2358 | - | 0.1239 |
Framework Versions
- Python: 3.11.13
- Sentence Transformers: 5.1.2
- Transformers: 4.57.1
- PyTorch: 2.8.0+cu129
- Accelerate: 1.11.0
- Datasets: 4.3.0
- Tokenizers: 0.22.1
Citation
BibTeX
Sentence Transformers
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
MultipleNegativesRankingLoss
@misc{henderson2017efficient,
title={Efficient Natural Language Response Suggestion for Smart Reply},
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
year={2017},
eprint={1705.00652},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
- Downloads last month
- 55
Model tree for thebajajra/RexBERT-base-embed-pf-v0.4a
Base model
thebajajra/RexBERT-base
Finetuned
thebajajra/RexBERT-base-embed-pf-v0.1