arxiv:2502.06155
							
						Hangliang Ding
foreverpiano
		AI & ML interests
None yet
		Recent Activity
						upvoted 
								a
								paper
							
						11 days ago
						
					
						
						
						AdaSPEC: Selective Knowledge Distillation for Efficient Speculative
  Decoders
						
						upvoted 
								an
								article
							
						about 1 month ago
						
					
						
						
						Proximal Policy Optimization (PPO)
						
						upvoted 
								a
								paper
							
						about 2 months ago
						
					
						
						
						Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
						Organizations
None yet