TheFinAI/Fin-o1-8B


lfqian committed on May 15

Commit e30768c · verified · 1 Parent(s): 6dc9b74

Update README.md

Files changed (1)

README.md +0 -17


@@ -24,20 +24,3 @@ output = generator([{"role": "user", "content": question}], max_new_tokens=128,
 print(output["generated_text"])
 ```
-## Training procedure
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/wy2266336-yale-university/huggingface/runs/u00amzr7)
-This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
-### Framework versions
-- TRL: 0.17.0.dev0
-- Transformers: 4.51.2
-- Pytorch: 2.6.0
-- Datasets: 3.5.0
-- Tokenizers: 0.21.1
-## Citations

