Behrooz Azarkhalili's picture

57 499

Behrooz Azarkhalili

ermiaazarkhalili

·

AI & ML interests

LLMs, VLMs, PEFT, RL for LLMs and VLMs.

Recent Activity

upvoted an article 6 days ago

Supercharge your OCR Pipelines with Open Models

upvoted a collection 24 days ago

updated a collection 27 days ago

Multimodal Datasets

View all activity

Organizations

upvoted an article 6 days ago

Article

Supercharge your OCR Pipelines with Open Models

7 days ago

• 196

upvoted a collection 24 days ago

ExGRPO

Model collections trained using ExGRPO. • 7 items • Updated 25 days ago • 1

upvoted a paper 2 months ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Paper • 2508.14896 • Published Aug 20 • 22

upvoted 3 articles 2 months ago

Article

Upskill your LLMs with Gradio MCP Servers

Jul 9

• 20

Article

Generate Images with Claude and Hugging Face

Aug 19

• 35

Article

Multimodal RAG with Colpali, Milvus and VLMs

By

•

Dec 10, 2024

• 10

upvoted 2 articles 3 months ago

Article

How I Built 7 Custom Gradio Components in Just 12 Days!

By

•

Aug 12

• 7

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7

• 97

upvoted a collection 3 months ago

Qwen3-MegaScience

Qwen3-MegaScience • 5 items • Updated Jul 23 • 4

upvoted a paper 3 months ago

MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning

Paper • 2507.16812 • Published Jul 22 • 63

upvoted an article 3 months ago

Article

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Jul 29

• 190

upvoted a collection 4 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 3 items • Updated 1 day ago • 128

upvoted an article 4 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By

•

Feb 11

• 79

upvoted a collection 4 months ago

Qwen3

84 items • Updated Aug 6 • 1.37k

upvoted a paper 4 months ago

Understanding R1-Zero-Like Training: A Critical Perspective

Paper • 2503.20783 • Published Mar 26 • 56

upvoted an article 4 months ago

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Dec 9, 2022

• 361

upvoted 2 articles 7 months ago

Article

Multi-Label Classification Model From Scratch: Step-by-Step Tutorial

By

•

Jan 8, 2024

• 47

Article

Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳

By

•

Aug 25, 2023

• 37

upvoted an article 9 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 185

upvoted an article 12 months ago

Article

Introducing GGUF-my-LoRA

By

•

Nov 1, 2024

• 22