Models

24

Full-text search

Active filters: X-R1

zhengComing/zhengComing_Qwen2.5_0dot5B_R1_zero

Text Generation • 0.5B • Updated Feb 22, 2025

smartrichard/X-R1-lora-7500

Text Generation • Updated Feb 27, 2025

watermelonhjg/Qwen2.5-3B-EN-Zero

Text Generation • 3B • Updated Mar 2, 2025

watermelonhjg/Qwen2.5-7B-EN-Zero

Text Generation • 8B • Updated Mar 1, 2025

watermelonhjg/Qwen2.5-3B-Instruct-CN-Math-Zero

Text Generation • 3B • Updated Mar 2, 2025

watermelonhjg/Qwen2.5-7B-Instruct-CN-Math-Zero

Text Generation • 8B • Updated Mar 2, 2025

watermelonhjg/Qwen2.5-7B-Instruct-EN-Zero

Text Generation • 8B • Updated Mar 2, 2025

watermelonhjg/Qwen2.5-3B-Instruct-EN-Zero

Text Generation • 3B • Updated Mar 2, 2025 • 1

watermelonhjg/Qwen2.5-7B-med

Text Generation • 8B • Updated Mar 30, 2025 • 1

watermelonhjg/Qwen2.5-7B-0.01KL

Text Generation • 8B • Updated Apr 5, 2025 • 1

watermelonhjg/Qwen2.5-7B-class5

Text Generation • 8B • Updated Apr 2, 2025

watermelonhjg/Qwen2.5-7B-cn-class2

Text Generation • 8B • Updated Apr 2, 2025 • 1

watermelonhjg/Qwen2.5-Math-7B-en-zero

Text Generation • 8B • Updated Apr 1, 2025

watermelonhjg/Qwen2.5-Math-7B-cn-zero-class2

Text Generation • 8B • Updated Mar 31, 2025

IDoNotHaveAName/origin_grpo_train_1_epoch

Text Generation • 2B • Updated Jul 10, 2025 • 3

IDoNotHaveAName/GRPO-qwen2.5-1.5B-reward-process

Text Generation • 2B • Updated Jul 15, 2025

IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-with-hint

Text Generation • 2B • Updated Jul 17, 2025

IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-without-hint

Text Generation • 2B • Updated Jul 18, 2025 • 2

IDoNotHaveAName/X-R1-3epoch

Text Generation • 2B • Updated Jul 18, 2025

IDoNotHaveAName/2epoch-experiment

Text Generation • 2B • Updated Jul 19, 2025

IDoNotHaveAName/model-trainby-mistake

Text Generation • 2B • Updated Jul 21, 2025

mradermacher/Hint-Informed-GRPO-1.5B-GGUF

2B • Updated Aug 8, 2025 • 17

GavinChan1105/X-R1-3B-cn-math

Text Generation • 3B • Updated Jan 20

mradermacher/X-R1-3B-cn-math-GGUF

3B • Updated Jan 21 • 31