Shizhe Diao
shizhediao2
7 followers · 12 following
https://shizhediao.github.io/
shizhediao
AI & ML interests
LLM pre-training and reasoning
Recent Activity
Upvoted a paper (4 days ago):
Unified Reinforcement and Imitation Learning for Vision-Language Models
Updated a dataset (6 days ago):
nvidia/Nemotron-ClimbMix
Upvoted a paper (6 days ago):
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Organizations
Models (3)
shizhediao2/ToolOrchestrator-8B (updated 12 days ago)
shizhediao2/Llama-Nemotron-8B-v1-Prorl (updated Aug 25)
shizhediao2/Nemotron-Research-Reasoning-Qwen-1.5B (updated May 14)
Datasets (0)
None public yet