Active filters: X-R1
zhengComing/zhengComing_Qwen2.5_0dot5B_R1_zero
Text Generation • 0.5B • Updated
smartrichard/X-R1-lora-7500
Text Generation • Updated
watermelonhjg/Qwen2.5-3B-EN-Zero
Text Generation • 3B • Updated
watermelonhjg/Qwen2.5-7B-EN-Zero
Text Generation • 8B • Updated
watermelonhjg/Qwen2.5-3B-Instruct-CN-Math-Zero
Text Generation • 3B • Updated
watermelonhjg/Qwen2.5-7B-Instruct-CN-Math-Zero
Text Generation • 8B • Updated
watermelonhjg/Qwen2.5-7B-Instruct-EN-Zero
Text Generation • 8B • Updated
watermelonhjg/Qwen2.5-3B-Instruct-EN-Zero
Text Generation • 3B • Updated • 1
watermelonhjg/Qwen2.5-7B-med
Text Generation • 8B • Updated • 1
watermelonhjg/Qwen2.5-7B-0.01KL
Text Generation • 8B • Updated • 1
watermelonhjg/Qwen2.5-7B-class5
Text Generation • 8B • Updated
watermelonhjg/Qwen2.5-7B-cn-class2
Text Generation • 8B • Updated • 1
watermelonhjg/Qwen2.5-Math-7B-en-zero
Text Generation • 8B • Updated
watermelonhjg/Qwen2.5-Math-7B-cn-zero-class2
Text Generation • 8B • Updated
IDoNotHaveAName/origin_grpo_train_1_epoch
Text Generation • 2B • Updated • 3
IDoNotHaveAName/GRPO-qwen2.5-1.5B-reward-process
Text Generation • 2B • Updated
IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-with-hint
Text Generation • 2B • Updated
IDoNotHaveAName/GRPO-1epoch-train-by-mistake-collections-without-hint
Text Generation • 2B • Updated • 2
IDoNotHaveAName/X-R1-3epoch
Text Generation • 2B • Updated
IDoNotHaveAName/2epoch-experiment
Text Generation • 2B • Updated
IDoNotHaveAName/model-trainby-mistake
Text Generation • 2B • Updated
mradermacher/Hint-Informed-GRPO-1.5B-GGUF
2B • Updated • 17
GavinChan1105/X-R1-3B-cn-math
Text Generation • 3B • Updated
mradermacher/X-R1-3B-cn-math-GGUF