Rethinking Training Dynamics in Scale-wise Autoregressive Generation Paper • 2512.06421 • Published 10 days ago
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts Paper • 2412.05552 • Published Dec 7, 2024
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation Paper • 2402.15852 • Published Feb 24, 2024
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models Paper • 2305.16986 • Published May 26, 2023
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models Paper • 2407.12366 • Published Jul 17, 2024