imontlaji
imontlaji
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with
Verifiable Rewards via Monte Carlo Tree Search
upvoted
a
paper
about 2 months ago
Multiplayer Nash Preference Optimization
Organizations
None yet