ubitech-edg
/

commandr-35b-sft

Text Generation

instruction-tuning

supervised-fine-tuning

text-generation-inference

Model card Files Files and versions

kosmylo1992 commited on 29 days ago

Commit

d881dad

·

verified ·

1 Parent(s): 0362134

Update README.md

Files changed (1) hide show

README.md +45 -0

README.md CHANGED Viewed

@@ -1,3 +1,48 @@
 # Command-R 35B — SFT (Supervised Fine-Tuning on Synthetic QA)
 **Model type:** Causal Language Model

+---
+{
+  "language": ["en"],
+  "license": "apache-2.0",
+  "tags": [
+    "text-generation",
+    "causal-lm",
+    "instruction-tuning",
+    "supervised-fine-tuning",
+    "synthetic-qa",
+    "lora",
+    "axolotl",
+    "deepspeed",
+    "transformers",
+    "commandr",
+    "cohere",
+    "eu-hpc"
+  ],
+  "datasets": [
+    "axolotl_deduplicated_synthetic_qa"
+  ],
+  "metrics": [
+    "loss"
+  ],
+  "library_name": "transformers",
+  "framework": "pytorch",
+  "base_model": "CohereLabs/c4ai-command-r-v01",
+  "model_name": "commandr-35b-sft",
+  "pipeline_tag": "text-generation",
+  "task_categories": ["text-generation", "instruction-following"],
+  "model_type": "AutoModelForCausalLM",
+  "inference": {
+    "parameters": {
+      "max_new_tokens": 512,
+      "temperature": 0.7,
+      "top_p": 0.9
+    }
+  },
+  "trained_on": [
+    "Leonardo EuroHPC"
+  ],
+  "description": "Supervised fine-tuning (SFT) of Cohere Command-R 35B on the synthetic QA dataset using LoRA and Axolotl. The model improves conversational reasoning and instruction-following capabilities."
+}
+---
 # Command-R 35B — SFT (Supervised Fine-Tuning on Synthetic QA)
 **Model type:** Causal Language Model