Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
trl-lib
/
qwen1.5-1.8b-dpo-cli
like
0
Follow
TRL
207
Transformers
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
qwen1.5-1.8b-dpo-cli
/
adapter_config.json
Commit History
Upload model
be448c6
verified
ybelkada
commited on
Mar 15, 2024