Jason Liu

liuzhan22

AI & ML interests

None yet

Recent Activity

liked a dataset 41 minutes ago

MRSAudio/MRSAudio

upvoted a paper about 1 month ago

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

authored a paper about 1 month ago

Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing

Organizations

None yet

upvoted 2 papers about 1 month ago

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

Paper • 2406.15704 • Published Jun 22, 2024 • 6

Audio-Conditioned Diffusion LLMs for ASR and Deliberation Processing

Paper • 2509.16622 • Published Sep 20 • 1

upvoted a paper 2 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 254

upvoted a collection 2 months ago

audio

Collection

108 items • Updated 10 days ago • 5

upvoted a changelog 3 months ago

Changelog

Trending Papers

Jul 28 • 104

upvoted 2 papers 3 months ago

Extract and Diffuse: Latent Integration for Improved Diffusion-based Speech and Vocal Enhancement

Paper • 2409.09642 • Published Sep 15, 2024 • 1

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 72

upvoted 2 papers 4 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 157

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166

upvoted a collection 4 months ago

Qwen Papers

Collection

8 items • Updated Feb 13 • 2

upvoted a paper 4 months ago

MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark

Paper • 2410.19168 • Published Oct 24, 2024 • 22

upvoted an article 4 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024 • 385

upvoted a collection 4 months ago

Audio understanding benchmarks

Collection

1 item • Updated Oct 28, 2024 • 1

upvoted a paper 7 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 298
