6 16 29

Yunlong Lin PRO

LYL1015

https://lyl1015.github.io/

AI & ML interests

AI agent ｜multi-modal learning

Recent Activity

upvoted a paper 11 days ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

upvoted a paper 14 days ago

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

authored a paper 17 days ago

IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering

View all activity

Organizations

upvoted a paper 11 days ago

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

Paper • 2512.16793 • Published 15 days ago • 72

upvoted a paper 14 days ago

Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation

Paper • 2512.16913 • Published 14 days ago • 33

upvoted a paper 17 days ago

JarvisEvo: Towards a Self-Evolving Photo Editing Agent with Synergistic Editor-Evaluator Optimization

Paper • 2511.23002 • Published Nov 28, 2025 • 26

upvoted a paper 28 days ago

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published about 1 month ago • 36

upvoted a paper 29 days ago

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published about 1 month ago • 32

upvoted a paper 3 months ago

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26, 2025 • 17

upvoted a collection 3 months ago

Spark-Collection

Collection

Spark-Collection • 2 items • Updated Sep 29, 2025 • 2

upvoted 5 papers 6 months ago

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9, 2025 • 105

MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second

Paper • 2507.10065 • Published Jul 14, 2025 • 24

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30, 2025 • 89

JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Paper • 2506.17612 • Published Jun 21, 2025 • 64

PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5, 2025 • 82

upvoted 2 papers 7 months ago

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Paper • 2506.10741 • Published Jun 12, 2025 • 27

Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model

Paper • 2505.23606 • Published May 29, 2025 • 14

upvoted a paper 9 months ago

JarvisIR: Elevating Autonomous Driving Perception with Intelligent Image Restoration

Paper • 2504.04158 • Published Apr 5, 2025 • 2

upvoted a paper 10 months ago

MagicInfinite: Generating Infinite Talking Videos with Your Words and Voice

Paper • 2503.05978 • Published Mar 7, 2025 • 36

Yunlong Lin PRO

AI & ML interests

Recent Activity

Organizations

LYL1015's activity