arxiv:2510.05592
							
						ZhuofengLi PRO
ZhuofengLi
		AI & ML interests
Agents, Reasoning LLMs/VLLMs, RL
		
		Organizations
			models
			13
		
			
	
	
	
	
	ZhuofengLi/torl-qwen2.5-7b-instruct
		
				8B
			• 
	
				Updated
					
				
				• 
					
					4
				
	
				
				
ZhuofengLi/octo-science-qwen2.5-7b-grpo-step-40-v2
		
				2B
			• 
	
				Updated
					
				
				• 
					
					6
				
	
				
				
ZhuofengLi/octo-search-qwen2.5-7b-grpo-155-step-v1
		
				8B
			• 
	
				Updated
					
				
				• 
					
					7
				
	
				
				
ZhuofengLi/octo-search-qwen2.5-7b-grpo-step-60-v1.5
		
				2B
			• 
	
				Updated
					
				
				• 
					
					8
				
	
				
				
ZhuofengLi/tool-n1-multi-turn-reason-lora-sft-1180-step
			Text Generation
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					3
				
	
				
				
ZhuofengLi/xlam-reason-lora-sft-1340-step
			Text Generation
			• 
		
				3B
			• 
	
				Updated
					
				
				• 
					
					4
				
	
				
				
ZhuofengLi/tool-n1-reason-lora-sft-800-step
			Text Generation
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					7
				
	
				
				
ZhuofengLi/pot-r1-grpo-qwen2.5-7b-Instruct
			Text Generation
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					2
				
	
				
				
ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct
			Text Generation
			• 
		
				2B
			• 
	
				Updated
					
				
				• 
					
					1
				
	
				
				
ZhuofengLi/pot-r1-grpo-qwen2.5-1.5b-Instruct-wo-warmup
			Text Generation
			• 
		
				2B
			• 
	
				Updated
					
				
				
				
	
				
				
			datasets
			9
		
			
	
	
	
	
	ZhuofengLi/deepreview-sft
			Viewer
			• 
	
				Updated
					
				• 
			
			41.4k
	
				• 
					
					42
				
				
				
ZhuofengLi/sft_data
			Viewer
			• 
	
				Updated
					
				• 
			
			8.4k
	
				• 
					
					2
				
				
				
ZhuofengLi/gpqa_mcq
			Viewer
			• 
	
				Updated
					
				• 
			
			198
	
				• 
					
					32
				
				
				
ZhuofengLi/Big-Math-RL-Verified
			Viewer
			• 
	
				Updated
					
				• 
			
			251k
	
				• 
					
					8
				
				
				
ZhuofengLi/rerank_public_dataset
	
				Updated
					
				
	
				• 
					
					2
				
				
				
ZhuofengLi/TEG-Datasets
			Preview
			• 
	
				Updated
					
				
	
				• 
					
					316
				
				• 
					
					4
				
ZhuofengLi/citation-network
			Preview
			• 
	
				Updated
					
				
	
				• 
					
					16
				
				
				
ZhuofengLi/MDS
			Viewer
			• 
	
				Updated
					
				• 
			
			97.5k
	
				• 
					
					9
				
				
				
ZhuofengLi/survey-sections-2k
			Viewer
			• 
	
				Updated
					
				• 
			
			2k
	
				• 
					
					9