Game-TARS: Pretrained Foundation Models for Scalable Generalist Multimodal Game Agents Paper • 2510.23691 • Published 25 days ago • 51
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective Paper • 2507.01925 • Published Jul 2 • 38
DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping Paper • 2502.20900 • Published Feb 28 • 9
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Paper • 2410.17856 • Published Oct 23, 2024 • 51
In-Context Editing: Learning Knowledge from Self-Induced Distributions Paper • 2406.11194 • Published Jun 17, 2024 • 20
Panacea: Pareto Alignment via Preference Adaptation for LLMs Paper • 2402.02030 • Published Feb 3, 2024 • 10