Yang's picture

1 9 3

Yang

diddytpq

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

upvoted a paper 4 months ago

4KAgent: Agentic Any Image to 4K Super-Resolution

upvoted a paper 5 months ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

View all activity

Organizations

None yet

upvoted a paper about 2 months ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 50

upvoted a paper 4 months ago

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 104

upvoted a paper 5 months ago

GenRecal: Generation after Recalibration from Large to Small Vision-Language Models

Paper • 2506.15681 • Published Jun 18 • 39

upvoted an article 7 months ago

Article

Fine-Tuning SigLIP2 for Image Classification

Mar 5

•

16

upvoted 3 papers 11 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 98

EXAONE 3.5: Series of Large Language Models for Real-world Use Cases

Paper • 2412.04862 • Published Dec 6, 2024 • 50

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Paper • 2412.04467 • Published Dec 5, 2024 • 118

upvoted 2 papers over 1 year ago

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 28

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 84