Evolve the Method, Not the Prompts: Evolutionary Synthesis of Jailbreak Attacks on LLMs Paper • 2511.12710 • Published 2 days ago • 9
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation Paper • 2511.09611 • Published 6 days ago • 44
Part-X-MLLM: Part-aware 3D Multimodal Large Language Model Paper • 2511.13647 • Published 1 day ago • 62
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance Paper • 2511.13254 • Published 1 day ago • 75
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling Paper • 2511.11793 • Published 4 days ago • 81
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data Paper • 2511.12609 • Published 2 days ago • 86
Music Flamingo: Scaling Music Understanding in Audio Language Models Paper • 2511.10289 • Published 5 days ago • 8
AlphaResearch: Accelerating New Algorithm Discovery with Language Models Paper • 2511.08522 • Published 7 days ago • 13
Superpositional Gradient Descent: Harnessing Quantum Principles for Model Training Paper • 2511.01918 • Published 17 days ago • 11
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 5 days ago • 58
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist Paper • 2511.08521 • Published 7 days ago • 36
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation Paper • 2511.09057 • Published 7 days ago • 66
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper • 2511.10629 • Published 5 days ago • 103
Agentic Refactoring: An Empirical Study of AI Coding Agents Paper • 2511.04824 • Published 12 days ago • 4
WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation Paper • 2511.06251 • Published 10 days ago • 12
MathSE: Improving Multimodal Mathematical Reasoning via Self-Evolving Iterative Reflection and Reward-Guided Fine-Tuning Paper • 2511.06805 • Published 9 days ago • 11
WMPO: World Model-based Policy Optimization for Vision-Language-Action Models Paper • 2511.09515 • Published 6 days ago • 15