TouchStone: Evaluating Vision-Language Models by Language Models Paper • 2308.16890 • Published Aug 31, 2023 • 1
NICO++: Towards Better Benchmarking for Domain Generalization Paper • 2204.08040 • Published Apr 17, 2022
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence Paper • 2509.03505 • Published Sep 3 • 6
Competing for Shareable Arms in Multi-Player Multi-Armed Bandits Paper • 2305.19158 • Published May 30, 2023