stepfun-ai
/

Qwen2.5-32B-DialogueReason

Model card Files Files and versions

buyun commited on May 12

Commit

1fefda4

·

verified ·

1 Parent(s): e40c27b

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -1,3 +1,6 @@
 ## Introduction
 Qwen2.5-32B-DialogueReason is a dialogue-based reasoning model built on Qwen2.5-32B-Base.
 We train the model using [Open-Reasoner-Zero](https://github.com/Open-Reasoner-Zero/Open-Reasoner-Zero) data through rule-based reinforcement learning.

+---
+license: apache-2.0
+---
 ## Introduction
 Qwen2.5-32B-DialogueReason is a dialogue-based reasoning model built on Qwen2.5-32B-Base.
 We train the model using [Open-Reasoner-Zero](https://github.com/Open-Reasoner-Zero/Open-Reasoner-Zero) data through rule-based reinforcement learning.