gmguarino commited on
Commit
f669219
·
verified ·
1 Parent(s): 40c9d31

Add new CrossEncoder model

Browse files
Files changed (7) hide show
  1. README.md +475 -0
  2. config.json +34 -0
  3. model.safetensors +3 -0
  4. special_tokens_map.json +37 -0
  5. tokenizer.json +0 -0
  6. tokenizer_config.json +58 -0
  7. vocab.txt +0 -0
README.md ADDED
@@ -0,0 +1,475 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: apache-2.0
5
+ tags:
6
+ - sentence-transformers
7
+ - cross-encoder
8
+ - generated_from_trainer
9
+ - dataset_size:2369
10
+ - loss:BinaryCrossEntropyLoss
11
+ base_model: answerdotai/answerai-colbert-small-v1
12
+ pipeline_tag: text-ranking
13
+ library_name: sentence-transformers
14
+ ---
15
+
16
+ # colbert-small-v1 trained on climatecheck
17
+
18
+ This is a [Cross Encoder](https://www.sbert.net/docs/cross_encoder/usage/usage.html) model finetuned from [answerdotai/answerai-colbert-small-v1](https://huggingface.co/answerdotai/answerai-colbert-small-v1) using the [sentence-transformers](https://www.SBERT.net) library. It computes scores for pairs of texts, which can be used for text reranking and semantic search.
19
+
20
+ ## Model Details
21
+
22
+ ### Model Description
23
+ - **Model Type:** Cross Encoder
24
+ - **Base model:** [answerdotai/answerai-colbert-small-v1](https://huggingface.co/answerdotai/answerai-colbert-small-v1) <!-- at revision be1703c55532145a844da800eea4c9a692d7e267 -->
25
+ - **Maximum Sequence Length:** 512 tokens
26
+ - **Number of Output Labels:** 1 label
27
+ <!-- - **Training Dataset:** Unknown -->
28
+ - **Language:** en
29
+ - **License:** apache-2.0
30
+
31
+ ### Model Sources
32
+
33
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
34
+ - **Documentation:** [Cross Encoder Documentation](https://www.sbert.net/docs/cross_encoder/usage/usage.html)
35
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
36
+ - **Hugging Face:** [Cross Encoders on Hugging Face](https://huggingface.co/models?library=sentence-transformers&other=cross-encoder)
37
+
38
+ ## Usage
39
+
40
+ ### Direct Usage (Sentence Transformers)
41
+
42
+ First install the Sentence Transformers library:
43
+
44
+ ```bash
45
+ pip install -U sentence-transformers
46
+ ```
47
+
48
+ Then you can load this model and run inference.
49
+ ```python
50
+ from sentence_transformers import CrossEncoder
51
+
52
+ # Download from the 🤗 Hub
53
+ model = CrossEncoder("gmguarino/answerai-colbert-small-v1-climatecheck")
54
+ # Get scores for pairs of texts
55
+ pairs = [
56
+ ['Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?', 'Welcome to the city Human populations are shifting en masse to cities, which is leading to rapid increases in the number and extent of urban areas. Such changes are well known to cause declines in many species, but they can also act as alternative selection pressures to which some species are able to adapt. Johnson and Munshi-South review the suite of pressures that urban environments exert, the ways in which species may (or may not) adapt, and the larger impact of these evolutionary events on natural processes and human populations. Understanding such urban evolution patterns will improve our ability to foster species persistence in the face of urbanization and to mitigate some of the challenges, such as disease, that adaptation can bring. Science, this issue p. eaam8327 BACKGROUND The extent of urban areas is increasing around the world, and most humans now live in cities. Urbanization results in dramatic environmental change, including increased temperatures, more impervious surface cover, altered hydrology, and elevated pollution. Urban areas also host more non-native species and reduced abundance and diversity of many native species. These environmental changes brought by global urbanization are creating novel ecosystems with unknown consequences for the evolution of life.'],
57
+ ['Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?', 'These environmental changes brought by global urbanization are creating novel ecosystems with unknown consequences for the evolution of life. Here, we consider how early human settlements led to the evolution of human commensals, including some of the most notorious pests and disease vectors. We also comprehensively review how contemporary urbanization affects the evolution of species that coinhabit cities. ADVANCES A recent surge of research shows that urbanization affects both nonadaptive and adaptive evolution. Some of the clearest results of urban evolution show that cities elevate the strength of random genetic drift (stochastic changes in allele frequencies) and restrict gene flow (the movement of alleles between populations due to dispersal and mating). Populations of native species in cities often represent either relicts that predate urbanization or populations that established after a city formed. Both scenarios frequently result in a loss of genetic diversity within populations and increased differentiation between populations. Fragmentation and urban infrastructure also create barriers to dispersal, and consequently, gene flow is often reduced among city populations, which further contributes to genetic differentiation between populations. The influence of urbanization on mutation and adaptive evolution are less clear.'],
58
+ ['Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?', 'The influence of urbanization on mutation and adaptive evolution are less clear. A small number of studies suggest that industrial pollution can elevate mutation rates, but the pervasiveness of this effect is unknown. A better studied phenomenon are the effects of urbanization on evolution by natural selection. A growing number of studies show that plant and animal populations experience divergent selection between urban and nonurban environments. This divergent selection has led to adaptive evolution in life history, morphology, physiology, behavior, and reproductive traits. These adaptations typically evolve in response to pesticide use, pollution, local climate, or the physical structure of cities. Despite these important results, the genetic basis of adaptive evolution is known from only a few cases. Most studies also examine only a few populations in one city, and experimental validation is rare. OUTLOOK The study of evolution in urban areas provides insights into both fundamental and applied problems in biology. The thousands of cities throughout the world share some features while differing in other aspects related to their age, historical context, governmental policies, and local climate. Thus, the phenomenon of global urbanization represents an unintended but highly replicated global study of experimental evolution.'],
59
+ ['Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?', 'Thus, the phenomenon of global urbanization represents an unintended but highly replicated global study of experimental evolution. We can harness this global urban experiment to understand the repeatability and pace of evolution in response to human activity. Among the most important unresolved questions is, how often do native and exotic species adapt to the particular environmental challenges found in cities? Such adaptations could be the difference as to whether a species persists or vanishes from urban areas. In this way, the study of urban evolution can help us understand how evolution in populations may contribute to conservation of rare species, and how populations can be managed to facilitate the establishment of resilient and sustainable urban ecosystems. In a similar way, understanding evolution in urban areas can lead to improved human health. For example, human pests frequently adapt to pesticides and evade control efforts because of our limited understanding of the size of populations and movement of individuals. Applied evolutionary studies could lead to more effective mitigation of pests and disease agents. The study of urban evolution has rapidly become an important frontier in biology, with implications for healthy and sustainable human populations in urban ecosystems.'],
60
+ ['Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?', 'The study of urban evolution has rapidly become an important frontier in biology, with implications for healthy and sustainable human populations in urban ecosystems. A gradient in urbanization showing the skyline of Canada’s sixth largest city (Mississauga, Canada) on the horizon, and the Credit Valley and the University of Toronto Mississauga campus in the foreground. PHOTO CREDIT: ARJUN YADAV Our planet is an increasingly urbanized landscape, with over half of the human population residing in cities. Despite advances in urban ecology, we do not adequately understand how urbanization affects the evolution of organisms, nor how this evolution may affect ecosystems and human health. Here, we review evidence for the effects of urbanization on the evolution of microbes, plants, and animals that inhabit cities. Urbanization affects adaptive and nonadaptive evolutionary processes that shape the genetic diversity within and between populations. Rapid adaptation has facilitated the success of some native species in urban areas, but it has also allowed human pests and disease to spread more rapidly. The nascent field of urban evolution brings together efforts to understand evolution in response to environmental change while developing new hypotheses concerning adaptation to urban infrastructure and human socioeconomic activity.'],
61
+ ]
62
+ scores = model.predict(pairs)
63
+ print(scores.shape)
64
+ # (5,)
65
+
66
+ # Or rank different texts based on similarity to a single text
67
+ ranks = model.rank(
68
+ 'Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?',
69
+ [
70
+ 'Welcome to the city Human populations are shifting en masse to cities, which is leading to rapid increases in the number and extent of urban areas. Such changes are well known to cause declines in many species, but they can also act as alternative selection pressures to which some species are able to adapt. Johnson and Munshi-South review the suite of pressures that urban environments exert, the ways in which species may (or may not) adapt, and the larger impact of these evolutionary events on natural processes and human populations. Understanding such urban evolution patterns will improve our ability to foster species persistence in the face of urbanization and to mitigate some of the challenges, such as disease, that adaptation can bring. Science, this issue p. eaam8327 BACKGROUND The extent of urban areas is increasing around the world, and most humans now live in cities. Urbanization results in dramatic environmental change, including increased temperatures, more impervious surface cover, altered hydrology, and elevated pollution. Urban areas also host more non-native species and reduced abundance and diversity of many native species. These environmental changes brought by global urbanization are creating novel ecosystems with unknown consequences for the evolution of life.',
71
+ 'These environmental changes brought by global urbanization are creating novel ecosystems with unknown consequences for the evolution of life. Here, we consider how early human settlements led to the evolution of human commensals, including some of the most notorious pests and disease vectors. We also comprehensively review how contemporary urbanization affects the evolution of species that coinhabit cities. ADVANCES A recent surge of research shows that urbanization affects both nonadaptive and adaptive evolution. Some of the clearest results of urban evolution show that cities elevate the strength of random genetic drift (stochastic changes in allele frequencies) and restrict gene flow (the movement of alleles between populations due to dispersal and mating). Populations of native species in cities often represent either relicts that predate urbanization or populations that established after a city formed. Both scenarios frequently result in a loss of genetic diversity within populations and increased differentiation between populations. Fragmentation and urban infrastructure also create barriers to dispersal, and consequently, gene flow is often reduced among city populations, which further contributes to genetic differentiation between populations. The influence of urbanization on mutation and adaptive evolution are less clear.',
72
+ 'The influence of urbanization on mutation and adaptive evolution are less clear. A small number of studies suggest that industrial pollution can elevate mutation rates, but the pervasiveness of this effect is unknown. A better studied phenomenon are the effects of urbanization on evolution by natural selection. A growing number of studies show that plant and animal populations experience divergent selection between urban and nonurban environments. This divergent selection has led to adaptive evolution in life history, morphology, physiology, behavior, and reproductive traits. These adaptations typically evolve in response to pesticide use, pollution, local climate, or the physical structure of cities. Despite these important results, the genetic basis of adaptive evolution is known from only a few cases. Most studies also examine only a few populations in one city, and experimental validation is rare. OUTLOOK The study of evolution in urban areas provides insights into both fundamental and applied problems in biology. The thousands of cities throughout the world share some features while differing in other aspects related to their age, historical context, governmental policies, and local climate. Thus, the phenomenon of global urbanization represents an unintended but highly replicated global study of experimental evolution.',
73
+ 'Thus, the phenomenon of global urbanization represents an unintended but highly replicated global study of experimental evolution. We can harness this global urban experiment to understand the repeatability and pace of evolution in response to human activity. Among the most important unresolved questions is, how often do native and exotic species adapt to the particular environmental challenges found in cities? Such adaptations could be the difference as to whether a species persists or vanishes from urban areas. In this way, the study of urban evolution can help us understand how evolution in populations may contribute to conservation of rare species, and how populations can be managed to facilitate the establishment of resilient and sustainable urban ecosystems. In a similar way, understanding evolution in urban areas can lead to improved human health. For example, human pests frequently adapt to pesticides and evade control efforts because of our limited understanding of the size of populations and movement of individuals. Applied evolutionary studies could lead to more effective mitigation of pests and disease agents. The study of urban evolution has rapidly become an important frontier in biology, with implications for healthy and sustainable human populations in urban ecosystems.',
74
+ 'The study of urban evolution has rapidly become an important frontier in biology, with implications for healthy and sustainable human populations in urban ecosystems. A gradient in urbanization showing the skyline of Canada’s sixth largest city (Mississauga, Canada) on the horizon, and the Credit Valley and the University of Toronto Mississauga campus in the foreground. PHOTO CREDIT: ARJUN YADAV Our planet is an increasingly urbanized landscape, with over half of the human population residing in cities. Despite advances in urban ecology, we do not adequately understand how urbanization affects the evolution of organisms, nor how this evolution may affect ecosystems and human health. Here, we review evidence for the effects of urbanization on the evolution of microbes, plants, and animals that inhabit cities. Urbanization affects adaptive and nonadaptive evolutionary processes that shape the genetic diversity within and between populations. Rapid adaptation has facilitated the success of some native species in urban areas, but it has also allowed human pests and disease to spread more rapidly. The nascent field of urban evolution brings together efforts to understand evolution in response to environmental change while developing new hypotheses concerning adaptation to urban infrastructure and human socioeconomic activity.',
75
+ ]
76
+ )
77
+ # [{'corpus_id': ..., 'score': ...}, {'corpus_id': ..., 'score': ...}, ...]
78
+ ```
79
+
80
+ <!--
81
+ ### Direct Usage (Transformers)
82
+
83
+ <details><summary>Click to see the direct usage in Transformers</summary>
84
+
85
+ </details>
86
+ -->
87
+
88
+ <!--
89
+ ### Downstream Usage (Sentence Transformers)
90
+
91
+ You can finetune this model on your own dataset.
92
+
93
+ <details><summary>Click to expand</summary>
94
+
95
+ </details>
96
+ -->
97
+
98
+ <!--
99
+ ### Out-of-Scope Use
100
+
101
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
102
+ -->
103
+
104
+ <!--
105
+ ## Bias, Risks and Limitations
106
+
107
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
108
+ -->
109
+
110
+ <!--
111
+ ### Recommendations
112
+
113
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
114
+ -->
115
+
116
+ ## Training Details
117
+
118
+ ### Training Dataset
119
+
120
+ #### Unnamed Dataset
121
+
122
+ * Size: 2,369 training samples
123
+ * Columns: <code>anchor</code>, <code>passage</code>, and <code>label</code>
124
+ * Approximate statistics based on the first 1000 samples:
125
+ | | anchor | passage | label |
126
+ |:--------|:-------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------|:---------------------------------------------------------------|
127
+ | type | string | string | float |
128
+ | details | <ul><li>min: 30 characters</li><li>mean: 114.16 characters</li><li>max: 209 characters</li></ul> | <ul><li>min: 77 characters</li><li>mean: 972.69 characters</li><li>max: 1537 characters</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.58</li><li>max: 1.0</li></ul> |
129
+ * Samples:
130
+ | anchor | passage | label |
131
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------|
132
+ | <code>Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?</code> | <code>Welcome to the city Human populations are shifting en masse to cities, which is leading to rapid increases in the number and extent of urban areas. Such changes are well known to cause declines in many species, but they can also act as alternative selection pressures to which some species are able to adapt. Johnson and Munshi-South review the suite of pressures that urban environments exert, the ways in which species may (or may not) adapt, and the larger impact of these evolutionary events on natural processes and human populations. Understanding such urban evolution patterns will improve our ability to foster species persistence in the face of urbanization and to mitigate some of the challenges, such as disease, that adaptation can bring. Science, this issue p. eaam8327 BACKGROUND The extent of urban areas is increasing around the world, and most humans now live in cities. Urbanization results in dramatic environmental change, including increased temperatures, more impervious surface...</code> | <code>0.0</code> |
133
+ | <code>Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?</code> | <code>These environmental changes brought by global urbanization are creating novel ecosystems with unknown consequences for the evolution of life. Here, we consider how early human settlements led to the evolution of human commensals, including some of the most notorious pests and disease vectors. We also comprehensively review how contemporary urbanization affects the evolution of species that coinhabit cities. ADVANCES A recent surge of research shows that urbanization affects both nonadaptive and adaptive evolution. Some of the clearest results of urban evolution show that cities elevate the strength of random genetic drift (stochastic changes in allele frequencies) and restrict gene flow (the movement of alleles between populations due to dispersal and mating). Populations of native species in cities often represent either relicts that predate urbanization or populations that established after a city formed. Both scenarios frequently result in a loss of genetic diversity within populati...</code> | <code>0.0</code> |
134
+ | <code>Turns out, species that can adapt easily to different environments are often the ones that can survive in a wide variety of places. Interesting, right?</code> | <code>The influence of urbanization on mutation and adaptive evolution are less clear. A small number of studies suggest that industrial pollution can elevate mutation rates, but the pervasiveness of this effect is unknown. A better studied phenomenon are the effects of urbanization on evolution by natural selection. A growing number of studies show that plant and animal populations experience divergent selection between urban and nonurban environments. This divergent selection has led to adaptive evolution in life history, morphology, physiology, behavior, and reproductive traits. These adaptations typically evolve in response to pesticide use, pollution, local climate, or the physical structure of cities. Despite these important results, the genetic basis of adaptive evolution is known from only a few cases. Most studies also examine only a few populations in one city, and experimental validation is rare. OUTLOOK The study of evolution in urban areas provides insights into both fundamental...</code> | <code>0.0</code> |
135
+ * Loss: [<code>BinaryCrossEntropyLoss</code>](https://sbert.net/docs/package_reference/cross_encoder/losses.html#binarycrossentropyloss) with these parameters:
136
+ ```json
137
+ {
138
+ "activation_fn": "torch.nn.modules.linear.Identity",
139
+ "pos_weight": null
140
+ }
141
+ ```
142
+
143
+ ### Training Hyperparameters
144
+ #### Non-Default Hyperparameters
145
+
146
+ - `per_device_train_batch_size`: 16
147
+ - `learning_rate`: 2e-05
148
+ - `num_train_epochs`: 10
149
+ - `warmup_ratio`: 0.1
150
+ - `fp16`: True
151
+ - `dataloader_num_workers`: 2
152
+
153
+ #### All Hyperparameters
154
+ <details><summary>Click to expand</summary>
155
+
156
+ - `overwrite_output_dir`: False
157
+ - `do_predict`: False
158
+ - `eval_strategy`: no
159
+ - `prediction_loss_only`: True
160
+ - `per_device_train_batch_size`: 16
161
+ - `per_device_eval_batch_size`: 8
162
+ - `per_gpu_train_batch_size`: None
163
+ - `per_gpu_eval_batch_size`: None
164
+ - `gradient_accumulation_steps`: 1
165
+ - `eval_accumulation_steps`: None
166
+ - `torch_empty_cache_steps`: None
167
+ - `learning_rate`: 2e-05
168
+ - `weight_decay`: 0.0
169
+ - `adam_beta1`: 0.9
170
+ - `adam_beta2`: 0.999
171
+ - `adam_epsilon`: 1e-08
172
+ - `max_grad_norm`: 1.0
173
+ - `num_train_epochs`: 10
174
+ - `max_steps`: -1
175
+ - `lr_scheduler_type`: linear
176
+ - `lr_scheduler_kwargs`: {}
177
+ - `warmup_ratio`: 0.1
178
+ - `warmup_steps`: 0
179
+ - `log_level`: passive
180
+ - `log_level_replica`: warning
181
+ - `log_on_each_node`: True
182
+ - `logging_nan_inf_filter`: True
183
+ - `save_safetensors`: True
184
+ - `save_on_each_node`: False
185
+ - `save_only_model`: False
186
+ - `restore_callback_states_from_checkpoint`: False
187
+ - `no_cuda`: False
188
+ - `use_cpu`: False
189
+ - `use_mps_device`: False
190
+ - `seed`: 42
191
+ - `data_seed`: None
192
+ - `jit_mode_eval`: False
193
+ - `use_ipex`: False
194
+ - `bf16`: False
195
+ - `fp16`: True
196
+ - `fp16_opt_level`: O1
197
+ - `half_precision_backend`: auto
198
+ - `bf16_full_eval`: False
199
+ - `fp16_full_eval`: False
200
+ - `tf32`: None
201
+ - `local_rank`: 0
202
+ - `ddp_backend`: None
203
+ - `tpu_num_cores`: None
204
+ - `tpu_metrics_debug`: False
205
+ - `debug`: []
206
+ - `dataloader_drop_last`: False
207
+ - `dataloader_num_workers`: 2
208
+ - `dataloader_prefetch_factor`: None
209
+ - `past_index`: -1
210
+ - `disable_tqdm`: False
211
+ - `remove_unused_columns`: True
212
+ - `label_names`: None
213
+ - `load_best_model_at_end`: False
214
+ - `ignore_data_skip`: False
215
+ - `fsdp`: []
216
+ - `fsdp_min_num_params`: 0
217
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
218
+ - `tp_size`: 0
219
+ - `fsdp_transformer_layer_cls_to_wrap`: None
220
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
221
+ - `deepspeed`: None
222
+ - `label_smoothing_factor`: 0.0
223
+ - `optim`: adamw_torch
224
+ - `optim_args`: None
225
+ - `adafactor`: False
226
+ - `group_by_length`: False
227
+ - `length_column_name`: length
228
+ - `ddp_find_unused_parameters`: None
229
+ - `ddp_bucket_cap_mb`: None
230
+ - `ddp_broadcast_buffers`: False
231
+ - `dataloader_pin_memory`: True
232
+ - `dataloader_persistent_workers`: False
233
+ - `skip_memory_metrics`: True
234
+ - `use_legacy_prediction_loop`: False
235
+ - `push_to_hub`: False
236
+ - `resume_from_checkpoint`: None
237
+ - `hub_model_id`: None
238
+ - `hub_strategy`: every_save
239
+ - `hub_private_repo`: None
240
+ - `hub_always_push`: False
241
+ - `gradient_checkpointing`: False
242
+ - `gradient_checkpointing_kwargs`: None
243
+ - `include_inputs_for_metrics`: False
244
+ - `include_for_metrics`: []
245
+ - `eval_do_concat_batches`: True
246
+ - `fp16_backend`: auto
247
+ - `push_to_hub_model_id`: None
248
+ - `push_to_hub_organization`: None
249
+ - `mp_parameters`:
250
+ - `auto_find_batch_size`: False
251
+ - `full_determinism`: False
252
+ - `torchdynamo`: None
253
+ - `ray_scope`: last
254
+ - `ddp_timeout`: 1800
255
+ - `torch_compile`: False
256
+ - `torch_compile_backend`: None
257
+ - `torch_compile_mode`: None
258
+ - `include_tokens_per_second`: False
259
+ - `include_num_input_tokens_seen`: False
260
+ - `neftune_noise_alpha`: None
261
+ - `optim_target_modules`: None
262
+ - `batch_eval_metrics`: False
263
+ - `eval_on_start`: False
264
+ - `use_liger_kernel`: False
265
+ - `eval_use_gather_object`: False
266
+ - `average_tokens_across_devices`: False
267
+ - `prompts`: None
268
+ - `batch_sampler`: batch_sampler
269
+ - `multi_dataset_batch_sampler`: proportional
270
+
271
+ </details>
272
+
273
+ ### Training Logs
274
+ <details><summary>Click to expand</summary>
275
+
276
+ | Epoch | Step | Training Loss |
277
+ |:------:|:----:|:-------------:|
278
+ | 0.0067 | 1 | 0.6935 |
279
+ | 0.6711 | 100 | 0.6812 |
280
+ | 0.0067 | 1 | 0.6977 |
281
+ | 0.0671 | 10 | 0.676 |
282
+ | 0.1342 | 20 | 0.6758 |
283
+ | 0.2013 | 30 | 0.6902 |
284
+ | 0.2685 | 40 | 0.6628 |
285
+ | 0.3356 | 50 | 0.6718 |
286
+ | 0.4027 | 60 | 0.6607 |
287
+ | 0.4698 | 70 | 0.6761 |
288
+ | 0.5369 | 80 | 0.6759 |
289
+ | 0.6040 | 90 | 0.6695 |
290
+ | 0.6711 | 100 | 0.6791 |
291
+ | 0.7383 | 110 | 0.6687 |
292
+ | 0.8054 | 120 | 0.6823 |
293
+ | 0.8725 | 130 | 0.682 |
294
+ | 0.9396 | 140 | 0.6784 |
295
+ | 1.0067 | 150 | 0.6745 |
296
+ | 1.0738 | 160 | 0.6505 |
297
+ | 1.1409 | 170 | 0.6132 |
298
+ | 1.2081 | 180 | 0.5973 |
299
+ | 1.2752 | 190 | 0.5939 |
300
+ | 1.3423 | 200 | 0.5827 |
301
+ | 1.4094 | 210 | 0.6278 |
302
+ | 1.4765 | 220 | 0.5761 |
303
+ | 1.5436 | 230 | 0.6245 |
304
+ | 1.6107 | 240 | 0.5832 |
305
+ | 1.6779 | 250 | 0.5128 |
306
+ | 1.7450 | 260 | 0.5529 |
307
+ | 1.8121 | 270 | 0.4886 |
308
+ | 1.8792 | 280 | 0.4943 |
309
+ | 1.9463 | 290 | 0.5186 |
310
+ | 2.0134 | 300 | 0.436 |
311
+ | 2.0805 | 310 | 0.4821 |
312
+ | 2.1477 | 320 | 0.4655 |
313
+ | 2.2148 | 330 | 0.369 |
314
+ | 2.2819 | 340 | 0.4655 |
315
+ | 2.3490 | 350 | 0.3635 |
316
+ | 2.4161 | 360 | 0.5141 |
317
+ | 2.4832 | 370 | 0.4408 |
318
+ | 2.5503 | 380 | 0.4071 |
319
+ | 2.6174 | 390 | 0.5308 |
320
+ | 2.6846 | 400 | 0.4403 |
321
+ | 2.7517 | 410 | 0.3813 |
322
+ | 2.8188 | 420 | 0.5094 |
323
+ | 2.8859 | 430 | 0.3671 |
324
+ | 2.9530 | 440 | 0.4118 |
325
+ | 3.0201 | 450 | 0.3945 |
326
+ | 3.0872 | 460 | 0.3318 |
327
+ | 3.1544 | 470 | 0.3555 |
328
+ | 3.2215 | 480 | 0.4101 |
329
+ | 3.2886 | 490 | 0.3764 |
330
+ | 3.3557 | 500 | 0.4482 |
331
+ | 3.4228 | 510 | 0.3738 |
332
+ | 3.4899 | 520 | 0.3805 |
333
+ | 3.5570 | 530 | 0.3611 |
334
+ | 3.6242 | 540 | 0.4942 |
335
+ | 3.6913 | 550 | 0.3983 |
336
+ | 3.7584 | 560 | 0.353 |
337
+ | 3.8255 | 570 | 0.3006 |
338
+ | 3.8926 | 580 | 0.343 |
339
+ | 3.9597 | 590 | 0.3998 |
340
+ | 4.0268 | 600 | 0.3937 |
341
+ | 4.0940 | 610 | 0.3768 |
342
+ | 4.1611 | 620 | 0.2224 |
343
+ | 4.2282 | 630 | 0.3857 |
344
+ | 4.2953 | 640 | 0.3617 |
345
+ | 4.3624 | 650 | 0.3278 |
346
+ | 4.4295 | 660 | 0.3239 |
347
+ | 4.4966 | 670 | 0.4745 |
348
+ | 4.5638 | 680 | 0.291 |
349
+ | 4.6309 | 690 | 0.3666 |
350
+ | 4.6980 | 700 | 0.322 |
351
+ | 4.7651 | 710 | 0.3243 |
352
+ | 4.8322 | 720 | 0.3856 |
353
+ | 4.8993 | 730 | 0.2332 |
354
+ | 4.9664 | 740 | 0.3998 |
355
+ | 5.0336 | 750 | 0.4055 |
356
+ | 5.1007 | 760 | 0.2819 |
357
+ | 5.1678 | 770 | 0.2425 |
358
+ | 5.2349 | 780 | 0.3387 |
359
+ | 5.3020 | 790 | 0.3621 |
360
+ | 5.3691 | 800 | 0.3063 |
361
+ | 5.4362 | 810 | 0.3306 |
362
+ | 5.5034 | 820 | 0.3163 |
363
+ | 5.5705 | 830 | 0.2516 |
364
+ | 5.6376 | 840 | 0.2746 |
365
+ | 5.7047 | 850 | 0.2841 |
366
+ | 5.7718 | 860 | 0.2799 |
367
+ | 5.8389 | 870 | 0.3608 |
368
+ | 5.9060 | 880 | 0.2268 |
369
+ | 5.9732 | 890 | 0.2612 |
370
+ | 6.0403 | 900 | 0.1797 |
371
+ | 6.1074 | 910 | 0.2328 |
372
+ | 6.1745 | 920 | 0.2906 |
373
+ | 6.2416 | 930 | 0.2722 |
374
+ | 6.3087 | 940 | 0.2567 |
375
+ | 6.3758 | 950 | 0.3215 |
376
+ | 6.4430 | 960 | 0.2563 |
377
+ | 6.5101 | 970 | 0.2294 |
378
+ | 6.5772 | 980 | 0.233 |
379
+ | 6.6443 | 990 | 0.2697 |
380
+ | 6.7114 | 1000 | 0.2585 |
381
+ | 6.7785 | 1010 | 0.2863 |
382
+ | 6.8456 | 1020 | 0.2296 |
383
+ | 6.9128 | 1030 | 0.2399 |
384
+ | 6.9799 | 1040 | 0.2736 |
385
+ | 7.0470 | 1050 | 0.3086 |
386
+ | 7.1141 | 1060 | 0.2781 |
387
+ | 7.1812 | 1070 | 0.2201 |
388
+ | 7.2483 | 1080 | 0.1738 |
389
+ | 7.3154 | 1090 | 0.244 |
390
+ | 7.3826 | 1100 | 0.2045 |
391
+ | 7.4497 | 1110 | 0.2975 |
392
+ | 7.5168 | 1120 | 0.1997 |
393
+ | 7.5839 | 1130 | 0.3476 |
394
+ | 7.6510 | 1140 | 0.3303 |
395
+ | 7.7181 | 1150 | 0.1835 |
396
+ | 7.7852 | 1160 | 0.2045 |
397
+ | 7.8523 | 1170 | 0.2652 |
398
+ | 7.9195 | 1180 | 0.2576 |
399
+ | 7.9866 | 1190 | 0.2281 |
400
+ | 8.0537 | 1200 | 0.2479 |
401
+ | 8.1208 | 1210 | 0.2592 |
402
+ | 8.1879 | 1220 | 0.1455 |
403
+ | 8.2550 | 1230 | 0.3126 |
404
+ | 8.3221 | 1240 | 0.1814 |
405
+ | 8.3893 | 1250 | 0.175 |
406
+ | 8.4564 | 1260 | 0.1712 |
407
+ | 8.5235 | 1270 | 0.2163 |
408
+ | 8.5906 | 1280 | 0.2158 |
409
+ | 8.6577 | 1290 | 0.2192 |
410
+ | 8.7248 | 1300 | 0.22 |
411
+ | 8.7919 | 1310 | 0.3208 |
412
+ | 8.8591 | 1320 | 0.1588 |
413
+ | 8.9262 | 1330 | 0.2477 |
414
+ | 8.9933 | 1340 | 0.1668 |
415
+ | 9.0604 | 1350 | 0.2315 |
416
+ | 9.1275 | 1360 | 0.2444 |
417
+ | 9.1946 | 1370 | 0.363 |
418
+ | 9.2617 | 1380 | 0.1657 |
419
+ | 9.3289 | 1390 | 0.2557 |
420
+ | 9.3960 | 1400 | 0.2392 |
421
+ | 9.4631 | 1410 | 0.1683 |
422
+ | 9.5302 | 1420 | 0.1472 |
423
+ | 9.5973 | 1430 | 0.1653 |
424
+ | 9.6644 | 1440 | 0.2288 |
425
+ | 9.7315 | 1450 | 0.1856 |
426
+ | 9.7987 | 1460 | 0.1729 |
427
+ | 9.8658 | 1470 | 0.2009 |
428
+ | 9.9329 | 1480 | 0.1241 |
429
+ | 10.0 | 1490 | 0.1783 |
430
+
431
+ </details>
432
+
433
+ ### Framework Versions
434
+ - Python: 3.11.12
435
+ - Sentence Transformers: 4.1.0
436
+ - Transformers: 4.51.3
437
+ - PyTorch: 2.6.0+cu124
438
+ - Accelerate: 1.6.0
439
+ - Datasets: 3.6.0
440
+ - Tokenizers: 0.21.1
441
+
442
+ ## Citation
443
+
444
+ ### BibTeX
445
+
446
+ #### Sentence Transformers
447
+ ```bibtex
448
+ @inproceedings{reimers-2019-sentence-bert,
449
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
450
+ author = "Reimers, Nils and Gurevych, Iryna",
451
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
452
+ month = "11",
453
+ year = "2019",
454
+ publisher = "Association for Computational Linguistics",
455
+ url = "https://arxiv.org/abs/1908.10084",
456
+ }
457
+ ```
458
+
459
+ <!--
460
+ ## Glossary
461
+
462
+ *Clearly define terms in order to be accessible across audiences.*
463
+ -->
464
+
465
+ <!--
466
+ ## Model Card Authors
467
+
468
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
469
+ -->
470
+
471
+ <!--
472
+ ## Model Card Contact
473
+
474
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
475
+ -->
config.json ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "BertForSequenceClassification"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "classifier_dropout": null,
7
+ "hidden_act": "gelu",
8
+ "hidden_dropout_prob": 0.1,
9
+ "hidden_size": 384,
10
+ "id2label": {
11
+ "0": "LABEL_0"
12
+ },
13
+ "initializer_range": 0.02,
14
+ "intermediate_size": 1536,
15
+ "label2id": {
16
+ "LABEL_0": 0
17
+ },
18
+ "layer_norm_eps": 1e-12,
19
+ "max_position_embeddings": 512,
20
+ "model_type": "bert",
21
+ "num_attention_heads": 12,
22
+ "num_hidden_layers": 12,
23
+ "pad_token_id": 0,
24
+ "position_embedding_type": "absolute",
25
+ "sentence_transformers": {
26
+ "activation_fn": "torch.nn.modules.activation.Sigmoid",
27
+ "version": "4.1.0"
28
+ },
29
+ "torch_dtype": "float32",
30
+ "transformers_version": "4.51.3",
31
+ "type_vocab_size": 2,
32
+ "use_cache": true,
33
+ "vocab_size": 30522
34
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1a0dcd0ef77f2d9db437fd75114e5711516cbcdc9feac71195deceb31835e4af
3
+ size 133464836
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,58 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": true,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "extra_special_tokens": {},
49
+ "mask_token": "[MASK]",
50
+ "model_max_length": 512,
51
+ "never_split": null,
52
+ "pad_token": "[PAD]",
53
+ "sep_token": "[SEP]",
54
+ "strip_accents": null,
55
+ "tokenize_chinese_chars": true,
56
+ "tokenizer_class": "BertTokenizer",
57
+ "unk_token": "[UNK]"
58
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff