Lewis Tunstall's picture

In a Training Loop 🔄

Lewis Tunstall PRO

lewtun

huggingface

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Recent Activity

new activity about 21 hours ago

lm-provers/QED-Nano:Add MathArena evaluation result for aime/aime_2026

new activity about 21 hours ago

lm-provers/QED-Nano:Add MathArena evaluation result for hmmt/hmmt_feb_2026

liked a dataset about 21 hours ago

HuggingFaceFW/finephrase

View all activity

Organizations

upvoted 2 articles 14 days ago

Article

Introducing Storage Buckets on the Hugging Face Hub

+10

14 days ago

•

183

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

14 days ago

•

73

upvoted a paper 14 days ago

Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

Paper • 2602.11149 • Published Feb 11 • 15

upvoted an article 14 days ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

15 days ago

•

23

upvoted an article 20 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

Feb 13

•

139

upvoted a changelog 22 days ago

Hugging Face Changelog

Public Storage Add-ons

26 days ago

• 169

upvoted an article 26 days ago

Article

Mixture of Experts (MoEs) in Transformers

+5

26 days ago

•

142

upvoted an article 30 days ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

Feb 20

•

490

upvoted 2 articles about 1 month ago

Article

Did GPT 5.2 make a breakthrough discovery in theoretical physics?

Feb 19

•

62

Article

I Let a Lobster Run My Jetson: What OpenClaw Taught Me About the Future of Computing

Feb 19

•

15

upvoted a paper about 1 month ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published Feb 3 • 12

upvoted a collection about 1 month ago

QED Nano

Artifacts for the QED Nano release • 9 items • Updated 22 days ago • 9

upvoted 2 papers about 1 month ago

Towards Robust Mathematical Reasoning

Paper • 2511.01846 • Published Nov 3, 2025 • 10

Single-minus gluon tree amplitudes are nonzero

Paper • 2602.12176 • Published Feb 12 • 8

upvoted an article about 1 month ago

Article

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

+3

Feb 12

•

31

upvoted a paper about 2 months ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published Jan 29 • 27

upvoted an article about 2 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

+2

Jan 28

•

150

upvoted 3 articles 2 months ago

Article

Introducing Waypoint-1: Real-time interactive video diffusion from Overworld

+3

Jan 20

•

40

Article

One Year Since the “DeepSeek Moment”

Jan 20

•

62

Article

M2.1: Multilingual and Multi-Task Coding with Strong Generalization

Jan 5

•

41