kavin1337
/

Mistral-Small-3.1-DRAFT-0.5B-FP8-Dynamic

Text Generation

mistral-small-3.1

text-generation-inference

compressed-tensors

Model card Files Files and versions

kavin1337 commited on Apr 25

Commit

8d0904b

·

verified ·

1 Parent(s): f8968bd

Create README.md

Files changed (1) hide show

README.md +30 -0

README.md ADDED Viewed

	@@ -0,0 +1,30 @@

+---
+license: apache-2.0
+language:
+- en
+- fr
+- de
+- es
+- it
+- pt
+base_model:
+- alamios/Qwenstral-Small-3.1-0.5B
+datasets:
+- alamios/Mistral-Small-24B-Instruct-2501-Conversations
+pipeline_tag: text-generation
+library_name: transformers
+tags:
+- qwen
+- qwen2.5
+- mistral
+- mistral-small
+- mistral-small-3.1
+---
+# Mistral-Small-3.1-DRAFT-0.5B
+This model is meant to be used as draft model for speculative decoding with [mistralai/Mistral-Small-3.1-24B-Instruct-2503](https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503) or [mistralai/Mistral-Small-24B-Instruct-2501](https://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501)
+# Data info
+The data are Mistral's outputs and includes all kind of tasks from various datasets in English, French, German, Spanish, Italian and Portuguese. It has been trained for 2 epochs on 20k unique examples, for a total of 12 million tokens per epoch.