Rin
hu5enpai
				AI & ML interests
None yet
		Recent Activity
Upvoted a paper about 2 months ago
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning
						
Commented on a paper about 2 months ago
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting