Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Docker Model Runner
Lemonade
Inference Providers
Select all
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
Reset Misc
HumanLLMs/Human-Like-DPO-Dataset
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Mixture of Experts
Carbon Emissions
Apply filters
Models
244
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
HumanLLMs/Human-Like-DPO-Dataset
Clear all
Yegor25/llm-course-hw2-dpo
Text Generation
•
0.1B
•
Updated
Mar 19
Yegor25/llm-course-hw2-reward-model
Text Classification
•
0.1B
•
Updated
Mar 18
Yegor25/llm-course-hw2-ppo
Text Generation
•
0.1B
•
Updated
Mar 19
georgebu/dpo_model
Text Generation
•
0.1B
•
Updated
Mar 28
antoniorusan/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 28
•
1
georgebu/reward_model
Text Classification
•
0.1B
•
Updated
Mar 28
artarif/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 23
mzabelin8/reward_model_output
Text Classification
•
0.1B
•
Updated
Mar 23
kuklinmike/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 24
Grivind/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 29
X1716/llm-course-hw2-dpo
Text Generation
•
0.1B
•
Updated
Mar 29
•
1
X1716/test
Text Classification
•
0.1B
•
Updated
Mar 24
X1716/llm-course-hw2-reward-model
Text Classification
•
0.1B
•
Updated
Mar 29
xinyuema/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 30
•
1
liuhailin0123/llm-course-hw2-dpo
Text Generation
•
0.1B
•
Updated
Mar 27
alicebeth/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 28
georgebu/ppo_model
Text Generation
•
0.1B
•
Updated
Mar 28
fitkovskaja/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 25
liuhailin0123/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 30
•
1
fridalex/model
Text Classification
•
0.1B
•
Updated
Mar 28
neuralsrg/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 29
thsluck/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 29
liuhailin0123/llm-course-hw2-ppo
Text Generation
•
0.1B
•
Updated
Mar 30
bychkovgk/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 27
MurDanya/llm-course-hw2-dpo
0.1B
•
Updated
Mar 30
dmitrylala/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 27
Aurelianous/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 27
AnnaBurikova/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 27
kdtln/trainer_output
Text Classification
•
0.1B
•
Updated
Mar 27
Aleks2002SH/llm-course-hw2-reward-model_old
Text Classification
•
0.1B
•
Updated
Mar 28
Previous
1
...
3
4
5
6
7
...
9
Next