Pierizvi/infused-reasoning-phi2
Tags: Text Generation, PEFT, Safetensors, gsm8k, English, reasoning, mathematics, grpo, reinforcement-learning, phi-2, step-by-step, mathematical-reasoning, rlhf
License: mit
Branch: main
Size: 131 MB
Contributors: 1
History: 5 commits
Latest commit: Update README.md (3b9a786, verified) by Pierizvi, 6 months ago
File                       Size       Scan    Storage  Last commit       Updated
.gitattributes             1.52 kB    Safe    -        initial commit    6 months ago
README.md                  15 kB      -       -        Update README.md  6 months ago
adapter_config.json        798 Bytes  -       -        epoch-639         6 months ago
adapter_model.safetensors  42 MB      -       xet      epoch-639         6 months ago
added_tokens.json          1.08 kB    Safe    -        epoch-639         6 months ago
merges.txt                 456 kB     Safe    -        epoch-639         6 months ago
optimizer.pt               84.1 MB    pickle  xet      epoch-639         6 months ago
special_tokens_map.json    473 Bytes  Safe    -        epoch-639         6 months ago
tokenizer.json             3.57 MB    -       -        epoch-639         6 months ago
tokenizer_config.json      7.44 kB    Safe    -        epoch-639         6 months ago
training_info.json         91 Bytes   -       -        epoch-639         6 months ago
vocab.json                 798 kB     Safe    -        epoch-639         6 months ago

optimizer.pt: Detected Pickle imports (3): "torch.FloatStorage", "torch._utils._rebuild_tensor_v2", "collections.OrderedDict"
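
As a quick sanity check on the listing above, the per-file sizes can be summed and compared with the 131 MB repository total. The sketch below is illustrative only: the helper names (`parse_size`, `total_megabytes`) are hypothetical, the sizes are copied by hand from the listing, and it assumes the page reports decimal units (1 kB = 1000 bytes), which the listing itself does not state.

```python
# Sizes copied by hand from the file listing above.
FILE_SIZES = {
    ".gitattributes": "1.52 kB",
    "README.md": "15 kB",
    "adapter_config.json": "798 Bytes",
    "adapter_model.safetensors": "42 MB",
    "added_tokens.json": "1.08 kB",
    "merges.txt": "456 kB",
    "optimizer.pt": "84.1 MB",
    "special_tokens_map.json": "473 Bytes",
    "tokenizer.json": "3.57 MB",
    "tokenizer_config.json": "7.44 kB",
    "training_info.json": "91 Bytes",
    "vocab.json": "798 kB",
}

# Assumption: decimal (SI) units, not 1024-based ones.
UNIT_BYTES = {"Bytes": 1, "kB": 1_000, "MB": 1_000_000}


def parse_size(text):
    """Convert a string such as '1.52 kB' into a number of bytes."""
    value, unit = text.split()
    return float(value) * UNIT_BYTES[unit]


def total_megabytes(sizes):
    """Sum all listed sizes and return the total in megabytes."""
    return sum(parse_size(s) for s in sizes.values()) / UNIT_BYTES["MB"]


if __name__ == "__main__":
    print(f"Total: {total_megabytes(FILE_SIZES):.2f} MB")
```

Under that assumption the rounded total agrees with the 131 MB shown for the repository, and the two largest files (optimizer.pt and adapter_model.safetensors) account for nearly all of it.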