SAM2S: Segment Anything in Surgical Videos via Semantic Long-term Tracking Paper • 2511.16618 • Published 3 days ago • 6 • 2
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations Paper • 2511.13703 • Published 6 days ago • 17 • 2
Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs Paper • 2511.16664 • Published 3 days ago • 19 • 2
NaTex: Seamless Texture Generation as Latent Color Diffusion Paper • 2511.16317 • Published 3 days ago • 11 • 2
MiMo-Embodied: X-Embodied Foundation Model Technical Report Paper • 2511.16518 • Published 3 days ago • 20 • 2
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Paper • 2511.16528 • Published 3 days ago • 12 • 2
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO Paper • 2511.16669 • Published 3 days ago • 28 • 3
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models Paper • 2511.15605 • Published 4 days ago • 16 • 2
FinTRec: Transformer Based Unified Contextual Ads Targeting and Personalization for Financial Applications Paper • 2511.14865 • Published 5 days ago • 3 • 2
EntroPIC: Towards Stable Long-Term Training of LLMs via Entropy Stabilization with Proportional-Integral Control Paper • 2511.15248 • Published 4 days ago • 4 • 2
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning Paper • 2511.16043 • Published 3 days ago • 64 • 3
First Frame Is the Place to Go for Video Content Customization Paper • 2511.15700 • Published 4 days ago • 46 • 3
Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation Paper • 2511.16671 • Published 3 days ago • 13 • 2
V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models Paper • 2511.16668 • Published 3 days ago • 48 • 2