TTRV: Test-Time Reinforcement Learning for Vision Language Models Paper • 2510.06783 • Published 20 days ago • 11
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published 22 days ago • 449
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published 22 days ago • 109
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper • 2509.18174 • Published Sep 17 • 124
EmbeddingGemma: Powerful and Lightweight Text Representations Paper • 2509.20354 • Published Sep 24 • 39
Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion Paper • 2501.17887 • Published Jan 27 • 1
FastVLM: Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 70
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers Paper • 2508.20453 • Published Aug 28 • 63
A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality Article • Mar 4 • 77