Beyond Reward Engineering: A Data Recipe for Long-Context Reinforcement Learning Paper • 2606.18831 • Published 7 days ago • 2
view article Article Jawbreaker: Private Scam Defense for Someone You Love build-small-hackathon • 11 days ago • 4
huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated-FP8 Image-Text-to-Text • 8B • Updated about 5 hours ago • 3
HauhauCS/Gemma4-12B-QAT-Uncensored-HauhauCS-Balanced Image-Text-to-Text • 12B • Updated about 22 hours ago • 2.06k • 60