Edit Models filters

Apps

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

agentica-org/DeepScaleR-Preview-Dataset

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

125

Full-text search

Active filters: agentica-org/DeepScaleR-Preview-Dataset

mradermacher/E1-Math-7B-GGUF

8B • Updated May 25 • 99

hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier

Reinforcement Learning • 8B • Updated May 28 • 2

mradermacher/E1-Math-7B-i1-GGUF

8B • Updated May 25 • 502

TingchenFu/coldrl_3k_qwen-2.5-1.5b_04232202

Text Generation • 2B • Updated May 26

TingchenFu/coldrl_3k_qwen-2.5-7b_04240151

Text Generation • 8B • Updated May 26

TingchenFu/coldrl_3k_qwen-2.5-math-1.5b_04201604

Text Generation • 2B • Updated May 26 • 1

TingchenFu/coldrl_qwen-2.5-math-7b_04252230

Text Generation • 8B • Updated May 26

TingchenFu/sft_8k_qwen-2.5-1.5b_05022300

Text Generation • 2B • Updated May 26

TingchenFu/sft_8k_qwen-2.5-7b_05021953

Text Generation • 8B • Updated May 26

TingchenFu/sft_8k_qwen-2.5-math-1.5b_05021751

Text Generation • 2B • Updated May 26

TingchenFu/sft_8k_qwen-2.5-math-7b_05021445

Text Generation • 8B • Updated May 26

TingchenFu/sftrl_7k_qwen-2.5-1.5b_05070032

Text Generation • 2B • Updated May 26

TingchenFu/sftrl_7k_qwen-2.5-math-1.5b_05052256

Text Generation • 2B • Updated May 26 • 1

TingchenFu/sftrl_7k_qwen-2.5-math-7b_05040001

Text Generation • 8B • Updated May 26

TingchenFu/sftrl_7k_qwen-2.5-7b_05042309

Text Generation • 8B • Updated May 26

Salesforce/E1-AceReason-14B

Text Generation • 15B • Updated Jun 1 • 7 • 12

mradermacher/E1-AceReason-14B-GGUF

15B • Updated Jun 1 • 144 • 2

mradermacher/E1-AceReason-14B-i1-GGUF

15B • Updated Jun 1 • 268

sizzlebop/E1-AceReason-14B-Q8_0-GGUF

15B • Updated Jun 1 • 5 • 1

sizzlebop/AdaptThink-7B-delta0.05-Q8_0-GGUF

8B • Updated Jun 1 • 3

sizzlebop/AdaptThink-7B-delta0.05-IQ4_XS-GGUF

8B • Updated Jun 1 • 9

Khurram123/E1-Math-1.5B-Q4_K_M-GGUF

2B • Updated Jun 5 • 6

Xuerui2312/DeepSeek-R1-Distill-Qwen-7B-TRPA-DeepScaleR-verl0326

Text Generation • 8B • Updated Jun 20 • 3 • 1

hdong0/deepseek-Llama-8B-Open-R1-GRPO_deepscaler_1000steps_lr1e-6_kl1e-3_acc

Text Generation • 8B • Updated Jun 15 • 3

tensorblock/Vinnnf_Thinkless-1.5B-RL-DeepScaleR-GGUF

Text Generation • 2B • Updated Jul 9 • 109

hdong0/deepseek-Qwen2.5-1.5B-baseline-Open-R1-GRPO_deepscaler_mu_8

Text Generation • 2B • Updated Jul 4 • 3

hdong0/deepseek-Qwen2.5-1.5B-Open-R1-GRPO_deepscaler_mu_8

Text Generation • 2B • Updated Jul 4 • 3

hdong0/Qwen2.5-Math-1.5B-Open-R1-GRPO_deepscaler_mu_8_constant_lr

Text Generation • 2B • Updated Jul 7 • 3

hdong0/deepseek-Qwen-1.5B-Open-R1-GRPO_deepscaler_mu_8_constant_lr

Text Generation • 2B • Updated Jul 7 • 7

hdong0/Qwen2.5-Math-1.5B-baseline-Open-R1-GRPO_deepscaler_mu_8_constant_lr

Text Generation • 2B • Updated Jul 8 • 3