UniGame: Turning a Unified Multimodal Model Into Its Own Adversary • Paper • 2511.19413 • Published 9 days ago • 19
How Far Are Surgeons from Surgical World Models? A Pilot Study on Zero-shot Surgical Video Generation with Expert Assessment • Paper • 2511.01775 • Published 30 days ago • 6
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models • Paper • 2510.04618 • Published Oct 6 • 121
DeepAgent: A General Reasoning Agent with Scalable Toolsets • Paper • 2510.21618 • Published Oct 24 • 98
Glyph: Scaling Context Windows via Visual-Text Compression • Paper • 2510.17800 • Published Oct 20 • 67
FlashWorld: High-quality 3D Scene Generation within Seconds • Paper • 2510.13678 • Published Oct 15 • 70
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features • Paper • 2502.14786 • Published Feb 20 • 154
Stable Diffusion Segmentation for Biomedical Images with Single-step Reverse Process • Paper • 2406.18361 • Published Jun 26, 2024 • 1
Are Pixel-Wise Metrics Reliable for Sparse-View Computed Tomography Reconstruction? • Paper • 2506.02093 • Published Jun 2 • 1