GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer Paper • 2510.16136 • Published 15 days ago • 2
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published 23 days ago • 33
Fast and Simple Explainability for Point Cloud Networks Paper • 2403.07706 • Published Mar 12, 2024
Whitened CLIP as a Likelihood Surrogate of Images and Captions Paper • 2505.06934 • Published May 11
CompassVerifier: A Unified and Robust Verifier for LLMs Evaluation and Outcome Reward Paper • 2508.03686 • Published Aug 5 • 36
Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis Paper • 2508.15754 • Published Aug 21 • 4
Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21 • 254
Precise Action-to-Video Generation Through Visual Action Prompts Paper • 2508.13104 • Published Aug 18 • 11
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published Aug 5 • 50
Context versus Prior Knowledge in Language Models Paper • 2404.04633 • Published Apr 6, 2024 • 5
DesignLab: Designing Slides Through Iterative Detection and Correction Paper • 2507.17202 • Published Jul 23 • 50
Post 541 Qwen 3 Coder is a personal attack to k2, and I love it. It achieves near SOTA on LCB while not having reasoning. Finally people are understanding that reasoning isn't necessary for high benches... Qwen ftw! DECENTRALIZE DECENTRALIZE DECENTRALIZE
CompassJudger-2: Towards Generalist Judge Model via Verifiable Rewards Paper • 2507.09104 • Published Jul 12 • 17
Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination Paper • 2507.10532 • Published Jul 14 • 88
PM4Bench: A Parallel Multilingual Multi-Modal Multi-task Benchmark for Large Vision Language Model Paper • 2503.18484 • Published Mar 24
Coding Triangle: How Does Large Language Model Understand Code? Paper • 2507.06138 • Published Jul 8 • 21
Rethinking Verification for LLM Code Generation: From Generation to Testing Paper • 2507.06920 • Published Jul 9 • 28
SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting Paper • 2506.03594 • Published Jun 4