SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning
Paper
•
2512.13874
•
Published
•
16
Computer Vision, AI, Machine Learning
IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance
OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation