update model

Browse files

Files changed (12) hide show

README.md +148 -58
config.json +4 -2
model.safetensors +1 -1
onnx/model.onnx +3 -0
onnx/model_bnb4.onnx +3 -0
onnx/model_fp16.onnx +3 -0
onnx/model_int8.onnx +3 -0
onnx/model_q4.onnx +3 -0
onnx/model_q4f16.onnx +3 -0
onnx/model_quantized.onnx +3 -0
onnx/model_uint8.onnx +3 -0
quantize_config.json +18 -0

README.md CHANGED Viewed

@@ -4,37 +4,56 @@ tags:
 - sentence-similarity
 - feature-extraction
 - generated_from_trainer
-- dataset_size:49800
 - loss:CosineSimilarityLoss
 base_model: sentence-transformers/all-MiniLM-L6-v2
 widget:
-- source_sentence: 99designs - Logo, Web, and Graphic Design Contests for Freelancers
   sentences:
-  - Twitch - Live Streaming Platform for Gamers, Creators, and Communities
-  - Yandex Mail - Russian Email Service with Built-In Translation
-  - arXiv.org - Preprint Server for Physics, Mathematics, and Computer Science
-- source_sentence: BBC News - Comprehensive Global Reporting and Analysis
   sentences:
-  - Indeed - Aggregated Job Listings, Company Reviews, and Salaries
-  - ClickUp - All-in-One Productivity and Project Management Platform
-  - Motherly - Empowering Articles and Classes for Modern Mothers
-- source_sentence: Trivago - Compare Accommodation Prices Across Multiple Sites
   sentences:
-  - Mint - Budgeting App and Bill Tracking with Automatic Bank Sync
-  - Dailymotion - Global Video Hosting and Sharing Platform
-  - Zero to Three - Early Childhood Development and Care Information
-- source_sentence: ProtonMail - End-to-End Encrypted Email for Enhanced Privacy
   sentences:
-  - Kelley Blue Book - Vehicle Valuations, Consumer Reviews, and Insights
-  - GitHub - Code Hosting, Pull Requests, and Collaborative Development
-  - Delish - Fun, Creative Recipes and Easy Meal Inspiration
-- source_sentence: AngelList Talent - Startups Hiring Engineers, Designers, and Marketers
   sentences:
-  - Stitcher - Podcast App for Comedy, News, True Crime, and More
-  - TaskRabbit - Hire Freelancers for Home Improvement and Errands
-  - Dribbble - Discover UI Shots, Branding Projects, and Design Concepts
 pipeline_tag: sentence-similarity
 library_name: sentence-transformers
 ---
 # SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
@@ -63,7 +82,7 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [s
 ```
 SentenceTransformer(
-  (0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: BertModel
   (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
   (2): Normalize()
 )
@@ -84,12 +103,12 @@ Then you can load this model and run inference.
 from sentence_transformers import SentenceTransformer
 # Download from the 🤗 Hub
-model = SentenceTransformer("vazish/all-MiniLM-L6-v2-fine-tuned")
 # Run inference
 sentences = [
-    'AngelList Talent - Startups Hiring Engineers, Designers, and Marketers',
-    'Stitcher - Podcast App for Comedy, News, True Crime, and More',
-    'Dribbble - Discover UI Shots, Branding Projects, and Design Concepts',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -125,6 +144,19 @@ You can finetune this model on your own dataset.
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
 <!--
 ## Bias, Risks and Limitations
@@ -166,7 +198,8 @@ You can finetune this model on your own dataset.
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
-- `num_train_epochs`: 2
 - `multi_dataset_batch_sampler`: round_robin
 #### All Hyperparameters
@@ -176,8 +209,8 @@ You can finetune this model on your own dataset.
 - `do_predict`: False
 - `eval_strategy`: no
 - `prediction_loss_only`: True
-- `per_device_train_batch_size`: 8
-- `per_device_eval_batch_size`: 8
 - `per_gpu_train_batch_size`: None
 - `per_gpu_eval_batch_size`: None
 - `gradient_accumulation_steps`: 1
@@ -189,7 +222,7 @@ You can finetune this model on your own dataset.
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1
-- `num_train_epochs`: 2
 - `max_steps`: -1
 - `lr_scheduler_type`: linear
 - `lr_scheduler_kwargs`: {}
@@ -264,7 +297,7 @@ You can finetune this model on your own dataset.
 - `fp16_backend`: auto
 - `push_to_hub_model_id`: None
 - `push_to_hub_organization`: None
-- `mp_parameters`:
 - `auto_find_batch_size`: False
 - `full_determinism`: False
 - `torchdynamo`: None
@@ -291,32 +324,89 @@ You can finetune this model on your own dataset.
 </details>
 ### Training Logs
-| Epoch  | Step  | Training Loss |
-|:------:|:-----:|:-------------:|
-| 0.0803 | 500   | 0.0197        |
-| 0.1606 | 1000  | 0.0153        |
-| 0.2410 | 1500  | 0.0069        |
-| 0.3213 | 2000  | 0.0028        |
-| 0.4016 | 2500  | 0.0012        |
-| 0.4819 | 3000  | 0.0008        |
-| 0.5622 | 3500  | 0.0007        |
-| 0.6426 | 4000  | 0.0006        |
-| 0.7229 | 4500  | 0.0006        |
-| 0.8032 | 5000  | 0.0005        |
-| 0.8835 | 5500  | 0.0004        |
-| 0.9639 | 6000  | 0.0004        |
-| 1.0442 | 6500  | 0.0004        |
-| 1.1245 | 7000  | 0.0003        |
-| 1.2048 | 7500  | 0.0002        |
-| 1.2851 | 8000  | 0.0002        |
-| 1.3655 | 8500  | 0.0005        |
-| 1.4458 | 9000  | 0.0001        |
-| 1.5261 | 9500  | 0.0001        |
-| 1.6064 | 10000 | 0.0001        |
-| 1.6867 | 10500 | 0.0001        |
-| 1.7671 | 11000 | 0.0001        |
-| 1.8474 | 11500 | 0.0001        |
-| 1.9277 | 12000 | 0.0001        |
 ### Framework Versions
@@ -361,4 +451,4 @@ You can finetune this model on your own dataset.
 ## Model Card Contact
 *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
--->

 - sentence-similarity
 - feature-extraction
 - generated_from_trainer
+- dataset_size:429643
 - loss:CosineSimilarityLoss
 base_model: sentence-transformers/all-MiniLM-L6-v2
 widget:
+- source_sentence: Oracle Cloud - Infrastructure and Platform Services for Enterprises
   sentences:
+  - PulseAudio - Ubuntu Wiki
+  - Documentation page not found - Read the Docs
+  - Dwarf Fortress beginner tips - Video Games on Sports Illustrated
+- source_sentence: Suggest opt in User Test - Google Slides
   sentences:
+  - ReleaseEngineering/TryServer - MozillaWiki
+  - Dwarf Fortress beginner tips - Video Games on Sports Illustrated
+  - Tutanota - Private Mailbox with End-to-End Encryption and Calendar
+- source_sentence: https://portal.naviabenefits.com/part/prioritytasks.aspx
   sentences:
+  - What to Expect - Pregnancy and Parenting Tips, Week-by-Week Guides
+  - Parents.com - Articles, Recipes, and Ideas for Family Activities
+  - Pinterest - Boards for Collecting and Sharing Inspiration on Any Topic
+- source_sentence: ‎Apple Music - Web Player
   sentences:
+  - BMW Connected Drive - Home Assistant
+  - Mary Stewart Phillips (1862-1928) - Find a Grave Memorial
+  - Sky Sports - Football, Formula 1, Cricket, and More
+- source_sentence: Tidal - High-Fidelity Music Streaming with Master Quality Audio
   sentences:
+  - Walmart - Everyday Low Prices on Groceries, Electronics, and More
+  - Notion - Integrated Workspace for Notes, Tasks, Databases, and Wikis
+  - Ambient Dreams Playlist on Amazon Music
 pipeline_tag: sentence-similarity
 library_name: sentence-transformers
+metrics:
+- pearson_cosine
+- spearman_cosine
+model-index:
+- name: SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
+  results:
+  - task:
+      type: semantic-similarity
+      name: Semantic Similarity
+    dataset:
+      name: Unknown
+      type: unknown
+    metrics:
+    - type: pearson_cosine
+      value: 0.9822505655251419
+      name: Pearson Cosine
+    - type: spearman_cosine
+      value: 0.2607864200673379
+      name: Spearman Cosine
 ---
 # SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
 ```
 SentenceTransformer(
+  (0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: BertModel
   (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
   (2): Normalize()
 )
 from sentence_transformers import SentenceTransformer
 # Download from the 🤗 Hub
+model = SentenceTransformer("vazish/all-MiniLM-L6-v2-fine-tuned_0")
 # Run inference
 sentences = [
+    'Tidal - High-Fidelity Music Streaming with Master Quality Audio',
+    'Walmart - Everyday Low Prices on Groceries, Electronics, and More',
+    'Notion - Integrated Workspace for Notes, Tasks, Databases, and Wikis',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
+## Evaluation
+### Metrics
+#### Semantic Similarity
+* Evaluated with [<code>EmbeddingSimilarityEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.EmbeddingSimilarityEvaluator)
+| Metric              | Value      |
+|:--------------------|:-----------|
+| pearson_cosine      | 0.9823     |
+| **spearman_cosine** | **0.2608** |
 <!--
 ## Bias, Risks and Limitations
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
+- `per_device_train_batch_size`: 32
+- `per_device_eval_batch_size`: 32
 - `multi_dataset_batch_sampler`: round_robin
 #### All Hyperparameters
 - `do_predict`: False
 - `eval_strategy`: no
 - `prediction_loss_only`: True
+- `per_device_train_batch_size`: 32
+- `per_device_eval_batch_size`: 32
 - `per_gpu_train_batch_size`: None
 - `per_gpu_eval_batch_size`: None
 - `gradient_accumulation_steps`: 1
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1
+- `num_train_epochs`: 3
 - `max_steps`: -1
 - `lr_scheduler_type`: linear
 - `lr_scheduler_kwargs`: {}
 - `fp16_backend`: auto
 - `push_to_hub_model_id`: None
 - `push_to_hub_organization`: None
+- `mp_parameters`:
 - `auto_find_batch_size`: False
 - `full_determinism`: False
 - `torchdynamo`: None
 </details>
 ### Training Logs
+| Epoch  | Step  | Training Loss | spearman_cosine |
+|:------:|:-----:|:-------------:|:---------------:|
+| 0.0372 | 500   | 0.0218        | -               |
+| 0.0745 | 1000  | 0.0151        | -               |
+| 0.1117 | 1500  | 0.0113        | -               |
+| 0.1490 | 2000  | 0.0076        | -               |
+| 0.1862 | 2500  | 0.0063        | -               |
+| 0.2234 | 3000  | 0.0054        | -               |
+| 0.2607 | 3500  | 0.0045        | -               |
+| 0.2979 | 4000  | 0.0041        | -               |
+| 0.3351 | 4500  | 0.0027        | -               |
+| 0.3724 | 5000  | 0.0028        | -               |
+| 0.4096 | 5500  | 0.0026        | -               |
+| 0.4469 | 6000  | 0.0021        | -               |
+| 0.4841 | 6500  | 0.0019        | -               |
+| 0.5213 | 7000  | 0.0022        | -               |
+| 0.5586 | 7500  | 0.0017        | -               |
+| 0.5958 | 8000  | 0.0018        | -               |
+| 0.6331 | 8500  | 0.0015        | -               |
+| 0.6703 | 9000  | 0.0015        | -               |
+| 0.7075 | 9500  | 0.0018        | -               |
+| 0.7448 | 10000 | 0.0014        | -               |
+| 0.7820 | 10500 | 0.0017        | -               |
+| 0.8192 | 11000 | 0.0012        | -               |
+| 0.8565 | 11500 | 0.0014        | -               |
+| 0.8937 | 12000 | 0.001         | -               |
+| 0.9310 | 12500 | 0.0011        | -               |
+| 0.9682 | 13000 | 0.001         | -               |
+| 1.0054 | 13500 | 0.0009        | -               |
+| 1.0427 | 14000 | 0.0011        | -               |
+| 1.0799 | 14500 | 0.001         | -               |
+| 1.1172 | 15000 | 0.0009        | -               |
+| 1.1544 | 15500 | 0.0008        | -               |
+| 1.1916 | 16000 | 0.001         | -               |
+| 1.2289 | 16500 | 0.0011        | -               |
+| 1.2661 | 17000 | 0.0011        | -               |
+| 1.3033 | 17500 | 0.0006        | -               |
+| 1.3406 | 18000 | 0.0011        | -               |
+| 1.3778 | 18500 | 0.0008        | -               |
+| 1.4151 | 19000 | 0.0011        | -               |
+| 1.4523 | 19500 | 0.0009        | -               |
+| 1.4895 | 20000 | 0.0011        | -               |
+| 1.5268 | 20500 | 0.0009        | -               |
+| 1.5640 | 21000 | 0.0009        | -               |
+| 1.6013 | 21500 | 0.0008        | -               |
+| 1.6385 | 22000 | 0.0005        | -               |
+| 1.6757 | 22500 | 0.001         | -               |
+| 1.7130 | 23000 | 0.0008        | -               |
+| 1.7502 | 23500 | 0.0007        | -               |
+| 1.7874 | 24000 | 0.0007        | -               |
+| 1.8247 | 24500 | 0.0008        | -               |
+| 1.8619 | 25000 | 0.001         | -               |
+| 1.8992 | 25500 | 0.0009        | -               |
+| 1.9364 | 26000 | 0.0008        | -               |
+| 1.9736 | 26500 | 0.0009        | -               |
+| 2.0109 | 27000 | 0.0007        | -               |
+| 2.0481 | 27500 | 0.0006        | -               |
+| 2.0854 | 28000 | 0.0007        | -               |
+| 2.1226 | 28500 | 0.0006        | -               |
+| 2.1598 | 29000 | 0.0007        | -               |
+| 2.1971 | 29500 | 0.001         | -               |
+| 2.2343 | 30000 | 0.0006        | -               |
+| 2.2715 | 30500 | 0.0006        | -               |
+| 2.3088 | 31000 | 0.001         | -               |
+| 2.3460 | 31500 | 0.0007        | -               |
+| 2.3833 | 32000 | 0.0008        | -               |
+| 2.4205 | 32500 | 0.0006        | -               |
+| 2.4577 | 33000 | 0.0007        | -               |
+| 2.4950 | 33500 | 0.0007        | -               |
+| 2.5322 | 34000 | 0.001         | -               |
+| 2.5694 | 34500 | 0.0007        | -               |
+| 2.6067 | 35000 | 0.0007        | -               |
+| 2.6439 | 35500 | 0.0008        | -               |
+| 2.6812 | 36000 | 0.0007        | -               |
+| 2.7184 | 36500 | 0.0006        | -               |
+| 2.7556 | 37000 | 0.0007        | -               |
+| 2.7929 | 37500 | 0.0007        | -               |
+| 2.8301 | 38000 | 0.0005        | -               |
+| 2.8674 | 38500 | 0.0009        | -               |
+| 2.9046 | 39000 | 0.0006        | -               |
+| 2.9418 | 39500 | 0.0007        | -               |
+| 2.9791 | 40000 | 0.0008        | -               |
+| -1     | -1    | -             | 0.2608          |
 ### Framework Versions
 ## Model Card Contact
 *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
+-->

config.json CHANGED Viewed

@@ -1,10 +1,12 @@
 {
-  "_name_or_path": "sentence-transformers/all-MiniLM-L6-v2",
   "architectures": [
     "BertModel"
   ],
   "attention_probs_dropout_prob": 0.1,
   "classifier_dropout": null,
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
   "hidden_dropout_prob": 0.1,
@@ -19,7 +21,7 @@
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
-  "transformers_version": "4.48.2",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

 {
+  "_attn_implementation_autoset": true,
+  "_name_or_path": "/content/model",
   "architectures": [
     "BertModel"
   ],
   "attention_probs_dropout_prob": 0.1,
   "classifier_dropout": null,
+  "export_model_type": "transformer",
   "gradient_checkpointing": false,
   "hidden_act": "gelu",
   "hidden_dropout_prob": 0.1,
   "pad_token_id": 0,
   "position_embedding_type": "absolute",
   "torch_dtype": "float32",
+  "transformers_version": "4.46.3",
   "type_vocab_size": 2,
   "use_cache": true,
   "vocab_size": 30522

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2dd1d2164e062efaed33b1435c12840cf0469da12d8930615b4ab37b996ec8a1
 size 90864192

 version https://git-lfs.github.com/spec/v1
+oid sha256:cce795656a2f5901d70c4ea6568b7c98a136a9f653ec4e8e499ac26270beffcb
 size 90864192

onnx/model.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:506ac8fdb54c3d5401fc03fb7cc135553857779e2cc518dfb3341cf113ebe257
+size 90448255

onnx/model_bnb4.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:77f96020b60846dbd7f86538280364c6f5e7d1fce2a1753debe0fa28a5b2a1dc
+size 53958494

onnx/model_fp16.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:492935c401e7ae513a959c9433dd204f159b501593bf732507b94d73081ae14d
+size 45317631

onnx/model_int8.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1d8575a3ba5d8a44884f56579d2046d5883e4bbef1923840f7b6b561bc6a6918
+size 22999753

onnx/model_q4.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:235764145072d6bc5e6ab9f61f17e893f47dd02cc6eef3a5a1967fbfbae31523
+size 54621818

onnx/model_q4f16.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5a462c3bbf68ad9d80c3367342d67fba38b871f683bd78a741479e9df6017dba
+size 30061282

onnx/model_quantized.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1d8575a3ba5d8a44884f56579d2046d5883e4bbef1923840f7b6b561bc6a6918
+size 22999753

onnx/model_uint8.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3d985ea995e76faa0d7f542db38a3e08ecfedfc8b9cfa1143e8c13398803019d
+size 22999753

quantize_config.json ADDED Viewed

	@@ -0,0 +1,18 @@

+{
+    "modes": [
+        "fp16",
+        "q8",
+        "int8",
+        "uint8",
+        "q4",
+        "q4f16",
+        "bnb4"
+    ],
+    "per_channel": true,
+    "reduce_range": true,
+    "block_size": null,
+    "is_symmetric": true,
+    "accuracy_level": null,
+    "quant_type": 1,
+    "op_block_list": null
+}