5 207 2

Bhimraj Yadav

bhimrazy

https://bhimraj.com.np

AI & ML interests

Computer Vision, Healthcare, Generative AI and NLP

Recent Activity

upvoted a paper 18 days ago

Agent Learning via Early Experience

upvoted a paper 18 days ago

TTRV: Test-Time Reinforcement Learning for Vision Language Models

upvoted a paper 18 days ago

Less is More: Recursive Reasoning with Tiny Networks

View all activity

Organizations

upvoted 9 papers 18 days ago

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published 22 days ago • 109

Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR

Paper • 2509.18174 • Published Sep 17 • 124

EmbeddingGemma: Powerful and Lightweight Text Representations

Paper • 2509.20354 • Published Sep 24 • 39

Docling Technical Report

Paper • 2408.09869 • Published Aug 19, 2024 • 2

Docling: An Efficient Open-Source Toolkit for AI-driven Document Conversion

Paper • 2501.17887 • Published Jan 27 • 1

Qwen3-Omni Technical Report

Paper • 2509.17765 • Published Sep 22 • 132

upvoted 4 papers about 2 months ago

FastVLM: Efficient Vision Encoding for Vision Language Models

Paper • 2412.13303 • Published Dec 17, 2024 • 70

MobileCLIP2: Improving Multi-Modal Reinforced Training

Paper • 2508.20691 • Published Aug 28 • 5

VibeVoice Technical Report

Paper • 2508.19205 • Published Aug 26 • 123

MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers

Paper • 2508.20453 • Published Aug 28 • 63

upvoted an article 6 months ago

Article

Vision Language Models (Better, Faster, Stronger)

May 12

• 553

upvoted 2 papers 7 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 200

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166

upvoted a paper 8 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 169

upvoted 2 articles 8 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

• 77

Article

The Beginners Guide to Cleaning a Dataset

•

Nov 18, 2024

• 24

upvoted a paper 8 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 207

Bhimraj Yadav

AI & ML interests

Recent Activity

Organizations

bhimrazy's activity

Vision Language Models (Better, Faster, Stronger)

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

The Beginners Guide to Cleaning a Dataset