Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jz666
/
simpo-train-small-correct
like
0
Text Generation
Transformers
Safetensors
jz666/gemma2-ultrafeedback-ppl-split
gemma2
alignment-handbook
trl
simpo
Generated from Trainer
conversational
text-generation-inference
License:
gemma
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
1b4d855
simpo-train-small-correct
Commit History
Model save
1b4d855
verified
jz666
commited on
Oct 14
Training in progress, step 137
bc4984e
verified
jz666
commited on
Oct 14
initial commit
a4fdf0e
verified
jz666
commited on
Oct 14