HangHua's picture

8 11 2

HangHua

hhua2

·

https://hanghuacs.notion.site

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

MIRA: Multimodal Iterative Reasoning Agent for Image Editing

commented on a paper 5 days ago

MIRA: Multimodal Iterative Reasoning Agent for Image Editing

authored a paper 5 days ago

Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination

View all activity

Organizations

upvoted a paper 5 days ago

MIRA: Multimodal Iterative Reasoning Agent for Image Editing

Paper • 2511.21087 • Published 7 days ago • 9

upvoted a paper 6 days ago

Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination

Paper • 2511.17490 • Published 11 days ago • 21

upvoted a paper 30 days ago

Latent Chain-of-Thought for Visual Reasoning

Paper • 2510.23925 • Published Oct 27 • 9

upvoted 2 papers about 2 months ago

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10 • 26

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

Paper • 2510.05034 • Published Oct 6 • 48

upvoted a paper 6 months ago

MMIG-Bench: Towards Comprehensive and Explainable Evaluation of Multi-Modal Image Generation Models

Paper • 2505.19415 • Published May 26 • 2

upvoted 2 papers 8 months ago

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published Apr 7 • 15

WikiVideo: Article Generation from Multiple Videos

Paper • 2504.00939 • Published Apr 1 • 37

upvoted a paper 11 months ago

Generative AI for Cel-Animation: A Survey

Paper • 2501.06250 • Published Jan 8 • 13

upvoted 2 papers about 1 year ago

FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity

Paper • 2411.15411 • Published Nov 23, 2024 • 8

MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models

Paper • 2410.09733 • Published Oct 13, 2024 • 9