luckeciano/Qwen-2.5-Base-7B-GRPO-Base-v2_9382
Text Generation
Transformers
Safetensors
DigitalLearningGmbH/MATH-lighteval
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:2402.03300
main
Commit History
End of training
c6a38dc
verified
luckeciano
committed on
Sep 20
Model save
e4e03a0
verified
luckeciano
committed on
Sep 20
Training in progress, step 100
f137bab
verified
luckeciano
committed on
Sep 20
Training in progress, step 90
349f3c0
verified
luckeciano
committed on
Sep 20
Training in progress, step 80
24b8063
verified
luckeciano
committed on
Sep 20
Training in progress, step 70
c29d8bd
verified
luckeciano
committed on
Sep 20
Training in progress, step 60
ad4aece
verified
luckeciano
committed on
Sep 20
Training in progress, step 50
9f1d1ee
verified
luckeciano
committed on
Sep 20
Training in progress, step 40
e0b6750
verified
luckeciano
committed on
Sep 20
Training in progress, step 30
89b3e73
verified
luckeciano
committed on
Sep 20
Training in progress, step 20
d7e064a
verified
luckeciano
committed on
Sep 20
Training in progress, step 10
5cade0f
verified
luckeciano
committed on
Sep 20
initial commit
2a37936
verified
luckeciano
committed on
Sep 20