TianXiaoyu
Emperorizzis
AI & ML interests
Natural Language Processing, Large Language Model, Reinforcement Learning
Recent Activity
upvoted a paper, about 1 month ago
MAPO: Mixed Advantage Policy Optimization
						
upvoted a paper, about 2 months ago
Why Language Models Hallucinate
						
liked a dataset, 3 months ago
stanfordnlp/SHP-2