Paper: MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer (arXiv:2509.16197, published Sep 19)
Collection: Granite 2.0 Code Models — a series of code models trained by IBM, licensed under Apache 2.0. Both the base pretrained and instruct models are released (23 items).
Article: Saving Memory Using Padding-Free Transformer Layers during Finetuning (Jun 11, 2024)
Article: DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive (Apr 9, 2024)