Inference Providers
Active filters: kto, trl
ericlewis/SmolLM-1.7B-Instruct-KTO-V6
Text Generation
• 2B • Updated • 3
ericlewis/SmolLM-1.7B-Instruct-KTO-V7
Text Generation
• 2B • Updated • 1
qgallouedec/kto-aligned-model
Text Generation
• 2B • Updated • 3
mahak1204/Mistral-2-7b-Instruct-v0.2-finetune-kto
Text Generation
• 7B • Updated • 1
PaulD/llama3_false_positives_1207_KTO_top_model
PaulD/llama3_false_positives_0609_KTO_hp_screening
PaulD/llama3_false_positives_0609_KTO_hp_screening_seeds
Huertas97/smollm-gec-sftt-kto
Text Generation
• 0.1B • Updated • 1
Text Generation
• 0.1B • Updated • 2
CharlesLi/OpenELM-1_1B-KTO
Text Generation
• 1B • Updated • 2
Text Generation
• 2B • Updated • 3
PaulD/llama3_false_positives_1609_KTO_optimised_model
PaulD/llama3_false_positives_1010_KTO_hp_screening_seeds
johnpaulbin/llama3.2-3b-tokipona-v3-chat-v3
Updated
johnpaulbin/llama3.2-3b-tokipona-v3-chat-v3-Q8_0-GGUF
4B • Updated • 6
Text Generation
• 0.5B • Updated • 2
PaulD/llama3_false_positives_0411_KTO_hp_screening_seeds
PaulD/llama3_false_positives_0312_KTO_optimised_model
Text Generation
• Updated • 1
PaulD/llama3_false_positives_1101_KTO_optimised_model
Updated
chchen/Llama-3.1-8B-Instruct-KTO-100
chchen/Llama-3.1-8B-Instruct-KTO-200
chchen/Llama-3.1-8B-Instruct-KTO-300
chchen/Llama-3.1-8B-Instruct-KTO-400
chchen/Llama-3.1-8B-Instruct-KTO-500
chchen/Llama-3.1-8B-Instruct-KTO-600
chchen/Llama-3.1-8B-Instruct-KTO-700
chchen/Llama-3.1-8B-Instruct-KTO-800
chchen/Llama-3.1-8B-Instruct-KTO-900
chchen/Llama-3.1-8B-Instruct-KTO-1000