Intern-S1: A Scientific Multimodal Foundation Model Paper • 2508.15763 • Published Aug 21 • 254
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 243
Article 🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly! – Talking About It? By Kseniase • Mar 17 • 338
Article I trained a Language Model to schedule events with GRPO! By anakin87 • Apr 29 • 90
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16 • 268
TransMLA: Multi-head Latent Attention Is All You Need Paper • 2502.07864 • Published Feb 11 • 58
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574
Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x • Jun 23, 2024 • 36
FLAME: Factuality-Aware Alignment for Large Language Models Paper • 2405.01525 • Published May 2, 2024 • 28
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation Paper • 2405.01434 • Published May 2, 2024 • 56
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report Paper • 2405.00732 • Published Apr 29, 2024 • 121