Models

73,129

Full-text search

Active filters: reinforcement-learning

nvidia/NitroGen

Reinforcement Learning • Updated Feb 5 • 532

Adilbai/stock-trading-rl-agent

Reinforcement Learning • Updated Jan 8 • 403 • 143

mradermacher/PRIMO-COT-SFT-7B-GGUF

Reinforcement Learning • 8B • Updated 18 days ago • 699 • 2

mradermacher/Tifa-Deepsex-14b-CoT-i1-GGUF

Reinforcement Learning • 15B • Updated Feb 13, 2025 • 411 • 14

Veri-Code/ReForm-14B-RL-entropy

Text Generation • 15B • Updated 3 days ago • 26 • 3

InfiX-ai/InfiGUI-G1-7B

Image-Text-to-Text • 8B • Updated Aug 12, 2025 • 113 • 12

Schrieffer/Llama-SARM-4B

Reinforcement Learning • 5B • Updated Dec 11, 2025 • 23 • 2

mradermacher/ATLAS-8B-Thinking-GGUF

Reinforcement Learning • 8B • Updated Sep 13, 2025 • 259 • 2

JonusNattapong/AI-XAUUSD-Trading

Reinforcement Learning • Updated Oct 10, 2025 • 34

Freakz3z/Qwen-JSON

Text Generation • 4B • Updated Dec 3, 2025 • 48 • 3

zai-org/GLM-TTS

Text-to-Speech • Updated Jan 12 • 1.04k • 336

exla-ai/openpie-0.6

Robotics • Updated Feb 4 • 130 • 21

PrimeIntellect/INTELLECT-3.1

Text Generation • 107B • Updated Feb 18 • 216 • 43

OpenDataArena/ODA-Fin-RL-8B

Reinforcement Learning • 8B • Updated Mar 10 • 195 • 2

mradermacher/PulseMind-72B-i1-GGUF

Reinforcement Learning • 73B • Updated Jan 30 • 187 • 2

Dat1710/nexus-1.5b

Text Generation • 2B • Updated 3 days ago • 97 • 1

diasAiMaster/unitree-go2-velocity-flat

Reinforcement Learning • Updated about 6 hours ago • 2

nvidia/GEAR-SONIC

Reinforcement Learning • Updated 30 days ago • 42

nvidia/EGM-8B

Image-Text-to-Text • 9B • Updated about 1 month ago • 622 • 8

Tzafon/Northstar-CUA-Fast

Image-Text-to-Text • 5B • Updated Apr 2 • 1.99k • 5

jasonmsilvas1984/stock-trading-rl-agent

Reinforcement Learning • Updated Mar 6 • 1

LeonOverload/PRIMO-R1-7B

Video-Text-to-Text • 8B • Updated 19 days ago • 25 • 1

LeonOverload/PRIMO-COT-SFT-7B

Video-Text-to-Text • 849k • Updated 19 days ago • 40 • 1

Camais03/camie-crafter

Reinforcement Learning • Updated Mar 29 • 27 • 5

Accio-Lab/Metis-8B-RL

Image-Text-to-Text • 9B • Updated about 1 month ago • 558 • 4

waltgrace/poker-gemma4-26b-a4b-lora

Image-Text-to-Text • Updated 23 days ago • 2

mradermacher/PRIMO-R1-7B-GGUF

Reinforcement Learning • 8B • Updated 18 days ago • 599 • 1

Falconss1/VideoThinker-R1-3B

Video-Text-to-Text • 4B • Updated 5 days ago • 45 • 1

Falconss1/VideoThinker-R1-Bias-3B

Video-Text-to-Text • 4B • Updated 18 days ago • 20 • 1

mradermacher/VideoThinker-R1-3B-GGUF

Question Answering • 3B • Updated 4 days ago • 1.36k • 1