lfqian committed on
Commit e30768c · verified · 1 Parent(s): 6dc9b74

Update README.md

Files changed (1)
  1. README.md +0 -17
README.md CHANGED
@@ -24,20 +24,3 @@ output = generator([{"role": "user", "content": question}], max_new_tokens=128,
  print(output["generated_text"])
  ```
 
- ## Training procedure
-
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/wy2266336-yale-university/huggingface/runs/u00amzr7)
-
-
- This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models](https://huggingface.co/papers/2402.03300).
-
- ### Framework versions
-
- - TRL: 0.17.0.dev0
- - Transformers: 4.51.2
- - Pytorch: 2.6.0
- - Datasets: 3.5.0
- - Tokenizers: 0.21.1
-
- ## Citations
-