JunZhan

zhanjun · JunZhan2000

AI & ML interests

None yet

Recent Activity

commented on a paper about 2 months ago

VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions

Organizations

None yet

upvoted a paper 6 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 6 days ago • 186

upvoted a paper 27 days ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published 28 days ago • 43

upvoted a paper about 2 months ago

VStyle: A Benchmark for Voice Style Adaptation with Spoken Instructions

Paper • 2509.09716 • Published Sep 9 • 10

upvoted 2 papers 3 months ago

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Paper • 2402.12226 • Published Feb 19, 2024 • 45

DeepPHY: Benchmarking Agentic VLMs on Physical Reasoning

Paper • 2508.05405 • Published Aug 7 • 64

upvoted 2 papers 5 months ago

Beyond Homogeneous Attention: Memory-Efficient LLMs via Fourier-Approximated KV Cache

Paper • 2506.11886 • Published Jun 13 • 20

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17 • 44

upvoted 2 articles over 1 year ago

LoRA training scripts of the world, unite!

Article • Jan 2, 2024 • 73

Welcome Llama 3 - Meta's new open LLM

Article • Apr 18, 2024 • 292

upvoted a paper over 1 year ago

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Paper • 2403.18814 • Published Mar 27, 2024 • 47