- Less is More: Recursive Reasoning with Tiny Networks (Paper • 2510.04871 • Published • 459)
- Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play (Paper • 2509.25541 • Published • 137)
- Agent Learning via Early Experience (Paper • 2510.08558 • Published • 254)
- DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search (Paper • 2509.25454 • Published • 136)
shen
sean29
AI & ML interests
None yet
Recent Activity
- updated a collection 16 days ago: todo
- updated a collection 17 days ago: todo
- updated a collection 17 days ago: todo
Organizations
None yet