AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games
Paper
• 2602.17594 • Published
• 9
None defined yet.
AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games
How Much Reasoning Do Retrieval-Augmented Models Add beyond LLMs? A Benchmarking Framework for Multi-Hop Inference over Hybrid Knowledge