ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
- Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-iter2k
  Text Generation • 2B • Updated • 178 • 1
- Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-2k
  Text Generation • 2B • Updated • 120
- Shiyu-Lab/DeepSeek-R1-Distill-Qwen-1.5B-thinkprune-4k
  Text Generation • 2B • Updated • 82
- Shiyu-Lab/QwQ-32B-thinkprune-4k
  Text Generation • 33B • Updated • 9
Trained models for the Prereq-Tune paper