Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ermiaazarkhalili
/
qwen-2.5-3b-instruct_grpo-GSM8K
like
0
Text Generation
Transformers
Safetensors
openai/gsm8k
English
qwen2
text-generation-inference
unsloth
qwen2.5
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
qwen-2.5-3b-instruct_grpo-GSM8K
Commit History
Update README.md
41691b0
verified
ermiaazarkhalili
commited on
Jul 4
Update README.md
850aef3
verified
ermiaazarkhalili
commited on
Jul 4
Update README.md
005f8b1
verified
ermiaazarkhalili
commited on
Jun 30
(Trained with Unsloth)
b62a2be
verified
ermiaazarkhalili
commited on
Jun 30
(Trained with Unsloth)
801e73e
verified
ermiaazarkhalili
commited on
Jun 30
Unsloth Model Card
41d6809
verified
ermiaazarkhalili
commited on
Jun 30
Update README.md
a33aa65
verified
ermiaazarkhalili
commited on
Jun 29
Update README.md
2fba4d6
verified
ermiaazarkhalili
commited on
Jun 29
(Trained with Unsloth)
2d5c32c
verified
ermiaazarkhalili
commited on
Jun 28
(Trained with Unsloth)
1866dba
verified
ermiaazarkhalili
commited on
Jun 28
(Trained with Unsloth)
4574d47
verified
ermiaazarkhalili
commited on
Jun 28
(Trained with Unsloth)
972e791
verified
ermiaazarkhalili
commited on
Jun 28
Unsloth Model Card
7fd027b
verified
ermiaazarkhalili
commited on
Jun 28
initial commit
a746a16
verified
ermiaazarkhalili
commited on
Jun 28