Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics Paper • 2605.12178 • Published 1 day ago • 30
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published Mar 25 • 98
view article Article A New Framework for Evaluating Voice Agents (EVA) ServiceNow-AI • Mar 24 • 92
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published Mar 13 • 149
view article Article AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems ServiceNow-AI • Dec 23, 2025 • 48
view article Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance ServiceNow-AI • Dec 9, 2025 • 84
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
Surfer 2: The Next Generation of Cross-Platform Computer Use Agents Paper • 2510.19949 • Published Oct 22, 2025 • 38
GRAFT: GRaPH and Table Reasoning for Textual Alignment -- A Benchmark for Structured Instruction Following and Visual Reasoning Paper • 2508.15690 • Published Aug 21, 2025 • 8
view article Article SyGra: The One-Stop Framework for Building Data for LLMs and SLMs ServiceNow-AI • Sep 22, 2025 • 14