EgMM-Corpus: A Multimodal Vision-Language Dataset for Egyptian Culture Paper • 2510.16198 • Published Oct 17
Robust and Calibrated Detection of Authentic Multimedia Content Paper • 2512.15182 • Published 12 days ago • 15
Robust and Calibrated Detection of Authentic Multimedia Content Paper • 2512.15182 • Published 12 days ago • 15
Robust and Calibrated Detection of Authentic Multimedia Content Paper • 2512.15182 • Published 12 days ago • 15
How Good are Foundation Models in Step-by-Step Embodied Reasoning? Paper • 2509.15293 • Published Sep 18
Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees Paper • 2506.14606 • Published Jun 17 • 11
Guaranteed Guess: A Language Modeling Approach for CISC-to-RISC Transpilation with Testing Guarantees Paper • 2506.14606 • Published Jun 17 • 11
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark Paper • 2505.16968 • Published May 22 • 40
Time Blindness: Why Video-Language Models Can't See What Humans Can? Paper • 2505.24867 • Published May 30 • 80
SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem Paper • 2505.21887 • Published May 28 • 14
CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark Paper • 2505.16968 • Published May 22 • 40
SALT: Singular Value Adaptation with Low-Rank Transformation Paper • 2503.16055 • Published Mar 20 • 8
Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark Paper • 2503.20786 • Published Mar 26 • 2
SALT: Singular Value Adaptation with Low-Rank Transformation Paper • 2503.16055 • Published Mar 20 • 8
Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models Paper • 2501.05478 • Published Jan 7 • 1
SALT: Singular Value Adaptation with Low-Rank Transformation Paper • 2503.16055 • Published Mar 20 • 8
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding Paper • 2502.14949 • Published Feb 20 • 9