Inference Providers
Active filters: gkd, trl
kashif/gkd_openassistant-guanaco
Text Generation
• 0.1B • Updated • 7
Ellio98/mistral-0.5B-Instruct-v0.1
Text Generation
• 0.5B • Updated • 7
distillslm/alpaca_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct
Text Generation
• 3B • Updated • 4
distillslm/alpaca_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-32B-Instruct
Text Generation
• 3B • Updated • 5
distillslm/alpaca_seq_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct
Text Generation
• 3B • Updated • 10
distillslm/alpaca_seq_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-32B-Instruct
Text Generation
• 3B • Updated • 2
distillslm/alpaca_supervised_kd_sft_gemma-2-2b-it_from_gemma-2-9b-it
Text Generation
• 3B • Updated • 10
• distillslm/alpaca_seq_kd_sft_gemma-2-2b-it_from_gemma-2-9b-it
Text Generation
• 3B • Updated • 5
• distillslm/alpaca_supervised_kd_sft_gemma-2-2b-it_from_gemma-2-27b-it
Text Generation
• 3B • Updated • 7
distillslm/alpaca_seq_kd_sft_gemma-2-2b-it_from_gemma-2-27b-it
Text Generation
• 3B • Updated • 4
Ellio98/Qwen2.5-0.5B-Instruct-distill-Qwen2.5-3B
Text Generation
• 0.5B • Updated • 7
• Text Generation
• 0.6B • Updated • 16
tiao55/task-11-Qwen-Qwen2.5-1.5B-Instruct
Text Generation
• 2B • Updated • 4
Text Generation
• 0.5B • Updated • 11
burtenshaw/Qwen3-4B-GKD-Tulu
Text Generation
• 4B • Updated • 5
• • 2
mradermacher/Qwen3-4B-GKD-Tulu-GGUF
4B • Updated • 22
lidaiqiang/Qwen2-0.5B-GKD-math
Updated
baikhsam/gemma3-1b-12b-distilled
Text Generation
• 1.0B • Updated • 33
harisarang/msmarco-gkd-Qwen3-0.6B-20251205_140702-sft-checkpoint
Updated
rishabhrj11/distillspec-qwen
Text Generation
• 0.6B • Updated • 9
rishabhrj11/distillspec-qwen600m
Text Generation
• 0.6B • Updated • 183
rishabhrj11/distillspec-qwen600m-xsum
Text Generation
• 0.6B • Updated • 183
• rishabhrj11/distillspec-qwen600m-cnn
Text Generation
• 0.6B • Updated • 24
rishabhrj11/distillspec-smollm-cnn
Text Generation
• 0.4B • Updated • 3
rishabhrj11/distillspec-smollm-cnn-b5
Text Generation
• 0.4B • Updated • 4
harisarang/msmarco-gkd-Qwen3-0.6B-20251208_090255-sft-checkpoint
Updated
rishabhrj11/distillspec-qwen-gsm-beta0.5
Text Generation
• 0.6B • Updated • 8
rishabhrj11/distillspec-qwen-gsk-fkl
Text Generation
• 0.6B • Updated • 6
rishabhrj11/distillspec-smollm-cnn-fkl
Text Generation
• 0.4B • Updated • 10
rishabhrj11/distillspec-smollm-cnn-rkl
Text Generation
• 0.4B • Updated • 13