Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
AIcell
/
Qwen2.5-1.5B-Math-Instruct-GRPO-gsm8k
like
0
Text Generation
Transformers
Safetensors
openai/gsm8k
qwen2
Generated from Trainer
open-r1
trl
grpo
conversational
text-generation-inference
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
Qwen2.5-1.5B-Math-Instruct-GRPO-gsm8k
Commit History
End of training
5d8f23f
verified
AIcell
commited on
Sep 29
Model save
4787791
verified
AIcell
commited on
Sep 29
Training in progress, step 3600
1e442a1
verified
AIcell
commited on
Sep 29
Training in progress, step 3300
397482d
verified
AIcell
commited on
Sep 29
Training in progress, step 3000
2e0a9b7
verified
AIcell
commited on
Sep 28
Training in progress, step 2700
da60193
verified
AIcell
commited on
Sep 28
Training in progress, step 2400
07a9a59
verified
AIcell
commited on
Sep 28
Training in progress, step 2100
cfe6bff
verified
AIcell
commited on
Sep 28
Training in progress, step 1800
b51797c
verified
AIcell
commited on
Sep 28
Training in progress, step 1500
c9f3814
verified
AIcell
commited on
Sep 28
Training in progress, step 1200
7e7b495
verified
AIcell
commited on
Sep 28
Training in progress, step 900
b4f6ae9
verified
AIcell
commited on
Sep 28
Training in progress, step 600
61fe8cf
verified
AIcell
commited on
Sep 28
Training in progress, step 300
b195ea8
verified
AIcell
commited on
Sep 28
initial commit
7be1ea8
verified
AIcell
commited on
Sep 28