PICABench: How Far Are We from Physically Realistic Image Editing? • Paper • 2510.17681 • Published 14 days ago • 61
TrajSelector: Harnessing Latent Representations for Efficient and Effective Best-of-N in Large Reasoning Model • Paper • 2510.16449 • Published 16 days ago • 34
naver-hyperclovax/HyperCLOVAX-SEED-Vision-Instruct-3B • Text Generation • 4B • Updated Sep 16 • 64.9k • 211
GTR: Improving Large 3D Reconstruction Models through Geometry and Texture Refinement • Paper • 2406.05649 • Published Jun 9, 2024 • 12
ExtraNeRF: Visibility-Aware View Extrapolation of Neural Radiance Fields with Diffusion Models • Paper • 2406.06133 • Published Jun 10, 2024 • 12
MLCM: Multistep Consistency Distillation of Latent Diffusion Model • Paper • 2406.05768 • Published Jun 9, 2024 • 13
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization • Paper • 2406.05981 • Published Jun 10, 2024 • 16
Margin-aware Preference Optimization for Aligning Diffusion Models without Reference • Paper • 2406.06424 • Published Jun 10, 2024 • 16
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers • Paper • 2406.05370 • Published Jun 8, 2024 • 19
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis • Paper • 2406.06216 • Published Jun 10, 2024 • 23
Husky: A Unified, Open-Source Language Agent for Multi-Step Reasoning • Paper • 2406.06469 • Published Jun 10, 2024 • 29
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation • Paper • 2406.06525 • Published Jun 10, 2024 • 71