2 68

Chi-Pin Huang

jasper0314-huang

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

upvoted a paper 5 days ago

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

upvoted a paper 13 days ago

TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

View all activity

Organizations

None yet

upvoted a paper 2 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published 3 days ago • 45

upvoted a paper 5 days ago

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Paper • 2602.02493 • Published 6 days ago • 38

upvoted a paper 13 days ago

TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers

Paper • 2601.14133 • Published 19 days ago • 60

upvoted a paper 20 days ago

ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models

Paper • 2601.11404 • Published 23 days ago • 25

upvoted an article 22 days ago

Article

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

Jan 5

•

upvoted a paper 23 days ago

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published 25 days ago • 32

authored a paper 24 days ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published 25 days ago • 51

upvoted 3 papers 24 days ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published Dec 23, 2025 • 27

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published 25 days ago • 25

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published 25 days ago • 51

submitted a paper to Daily Papers 24 days ago

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Paper • 2601.09708 • Published 25 days ago • 51

upvoted a paper 25 days ago

3AM: Segment Anything with Geometric Consistency in Videos

Paper • 2601.08831 • Published 26 days ago • 34

upvoted 5 papers about 1 month ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published about 1 month ago • 222

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Paper • 2601.02427 • Published Jan 4 • 44

Quantile Rendering: Efficiently Embedding High-dimensional Feature on 3D Gaussian Splatting

Paper • 2512.20927 • Published Dec 24, 2025 • 16

InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion

Paper • 2512.17504 • Published Dec 19, 2025 • 97

Spatia: Video Generation with Updatable Spatial Memory

Paper • 2512.15716 • Published Dec 17, 2025 • 33

upvoted 2 papers about 2 months ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published Dec 23, 2025 • 50

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

Paper • 2512.17012 • Published Dec 18, 2025 • 46

upvoted a paper 2 months ago

Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment

Paper • 2512.04356 • Published Dec 4, 2025 • 10

Chi-Pin Huang

AI & ML interests

Recent Activity

Organizations

jasper0314-huang's activity

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI