Xuanfei Ren
xuanfeiren
AI & ML interests
RL and LLM
Recent Activity
upvoted a paper about 6 hours ago
Provably Learning from Language Feedback upvoted a paper about 6 hours ago
Understanding the Challenges in Iterative Generative Optimization with LLMs upvoted a paper 4 days ago
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov StatesOrganizations
None yet