
Zhaoyang Liu PRO

zyliu

liu-zhy

AI & ML interests

Video understanding, 3D Perception, Autonomous driving, Foundation models, AIGC

Recent Activity

upvoted a paper 8 days ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos

liked a Space 9 days ago

Jiaqi-hkust/Robust-R1

liked a dataset 9 days ago

Jiaqi-hkust/Robust-R1



upvoted a paper 8 days ago

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Paper • 2512.20618 • Published 9 days ago • 52

upvoted a paper 9 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Paper • 2512.17532 • Published 13 days ago • 64

upvoted 2 papers 3 months ago

Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning

Paper • 2510.11027 • Published Oct 13, 2025 • 21

NaViL: Rethinking Scaling Properties of Native Multimodal Large Language Models under Data Constraints

Paper • 2510.08565 • Published Oct 9, 2025 • 19

upvoted a collection 3 months ago

ScaleCUA

Collection

7 items • Updated Nov 12, 2025 • 17

upvoted 2 papers 3 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Paper • 2509.15221 • Published Sep 18, 2025 • 111

upvoted 3 papers 7 months ago

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Paper • 2506.00123 • Published May 30, 2025 • 35

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published May 29, 2025 • 45

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26, 2025 • 104

upvoted a paper 8 months ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21, 2025 • 78

upvoted a paper 9 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14, 2025 • 306

upvoted a paper about 1 year ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

upvoted a paper about 2 years ago

ControlLLM: Augment Language Models with Tools by Searching on Graphs

Paper • 2310.17796 • Published Oct 26, 2023 • 18

