Forest Before Trees: Latent Superposition for Efficient Visual Reasoning
Paper
•
2601.06803
•
Published
•
5
Natural Language Processing, Machine Learning, and Computer Vision
A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos
Robust and Calibrated Detection of Authentic Multimedia Content