AndroTMem: From Interaction Trajectories to Anchored Memory in Long-Horizon GUI Agents Paper • 2603.18429 • Published 3 days ago • 19
Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens Paper • 2603.19232 • Published 2 days ago • 26
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs Paper • 2603.19217 • Published 2 days ago • 27
MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning Paper • 2603.16929 • Published 8 days ago • 5
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents Paper • 2603.18815 • Published 2 days ago • 6
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 2 days ago • 37
From Prior to Pro: Efficient Skill Mastery via Distribution Contractive RL Finetuning Paper • 2603.10263 • Published 11 days ago • 2
Video-CoE: Reinforcing Video Event Prediction via Chain of Events Paper • 2603.14935 • Published 5 days ago • 88
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild Paper • 2603.17187 • Published 4 days ago • 113
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs Paper • 2603.18004 • Published 3 days ago • 10
Stereo World Model: Camera-Guided Stereo Video Generation Paper • 2603.17375 • Published 3 days ago • 10
AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents Paper • 2603.16496 • Published 4 days ago • 10
MosaicMem: Hybrid Spatial Memory for Controllable Video World Models Paper • 2603.17117 • Published 4 days ago • 82
SegviGen: Repurposing 3D Generative Model for Part Segmentation Paper • 2603.16869 • Published 4 days ago • 16
ViT-AdaLA: Adapting Vision Transformers with Linear Attention Paper • 2603.16063 • Published 5 days ago • 2