Gengze Zhou

ZGZzz

https://gengzezhou.github.io/

AI & ML interests

Embodied AI, Vision-and-Language Navigation, Computer Vision, Multimodal Learning, LLMs

Recent Activity

authored a paper 7 days ago

Rethinking Training Dynamics in Scale-wise Autoregressive Generation

upvoted a paper 7 days ago

Relational Visual Similarity

upvoted a paper 7 days ago

Rethinking Training Dynamics in Scale-wise Autoregressive Generation

Organizations

None yet

authored a paper 7 days ago

Rethinking Training Dynamics in Scale-wise Autoregressive Generation

Paper • 2512.06421 • Published 10 days ago • 5

authored a paper 12 months ago

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

Paper • 2412.05552 • Published Dec 7, 2024 • 6

authored 4 papers over 1 year ago

WebVLN: Vision-and-Language Navigation on Websites

Paper • 2312.15820 • Published Dec 25, 2023

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Paper • 2402.15852 • Published Feb 24, 2024

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Paper • 2305.16986 • Published May 26, 2023

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Paper • 2407.12366 • Published Jul 17, 2024 • 4