Jack
SixPlusSeven13
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning
new activity
3 months ago
AgentGym/AgentGym-RL-Data-ID:Upload webarena_train.json
Organizations
None yet