-
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning
Paper • 2505.24726 • Published • 274 -
Reinforcement Pre-Training
Paper • 2506.08007 • Published • 262 -
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Paper • 2507.01006 • Published • 236 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 257
Erik Thorelli
esthor
AI & ML interests
Quantifying Agent Experience
Recent Activity
liked
a dataset
27 days ago
google/simpleqa-verified
liked
a model
27 days ago
Qwen/Qwen3-VL-235B-A22B-Thinking
updated
a collection
about 1 month ago
papers-to-read
Organizations
None yet