- Your Group-Relative Advantage Is Biased
  Paper • 2601.08521 • Published • 157
- Agent Lightning: Train ANY AI Agents with Reinforcement Learning
  Paper • 2508.03680 • Published • 137
- mHC: Manifold-Constrained Hyper-Connections
  Paper • 2512.24880 • Published • 318
- BitNet Distillation
  Paper • 2510.13998 • Published • 59
Om Dehlan
immiscible-blade
AI & ML interests
LLMs and DDPMs
Recent Activity
Updated a collection about 1 month ago: Weekly1
Organizations
None yet