Rui Sun PRO
ThreeSR
AI & ML interests
Vision and Language Multimodal Learning, CV, NLP, LLM
Recent Activity
upvoted
a
paper
19 days ago
Paper2Video: Automatic Video Generation from Scientific Papers
upvoted
a
paper
28 days ago
Video models are zero-shot learners and reasoners
upvoted
a
paper
about 1 month ago
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid
Vision Tokenizer