Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
luckeciano
/
Qwen-2.5-Base-7B-GRPO-Base-v2_2771
like
0
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen-2.5-Base-7B-GRPO-Base-v2_2771
Commit History
End of training
6d5e0fd
verified
luckeciano
commited on
Sep 20
Model save
e559aec
verified
luckeciano
commited on
Sep 20
Training in progress, step 100
0952dff
verified
luckeciano
commited on
Sep 20
Training in progress, step 90
3a08985
verified
luckeciano
commited on
Sep 20
Training in progress, step 80
d0880fd
verified
luckeciano
commited on
Sep 20
Training in progress, step 70
cc00ddf
verified
luckeciano
commited on
Sep 20
Training in progress, step 60
15fc5c2
verified
luckeciano
commited on
Sep 20
Training in progress, step 50
4ac5339
verified
luckeciano
commited on
Sep 20
Training in progress, step 40
f03232f
verified
luckeciano
commited on
Sep 20
Training in progress, step 30
7ea6473
verified
luckeciano
commited on
Sep 20
Training in progress, step 20
b9a29da
verified
luckeciano
commited on
Sep 20
Training in progress, step 10
061a583
verified
luckeciano
commited on
Sep 20
initial commit
a15c0f7
verified
luckeciano
commited on
Sep 20