arxiv:2509.14232
Zhaokai Wang
wzk1015
AI & ML interests
Computer Vision
Music Generation
Multimodal Large Language Models
Recent Activity
liked
a model
4 days ago
Zhenxin-Lei/MetaCaptioner
upvoted
a
paper
8 days ago
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding
LLM