AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons • Paper • 2503.05731 • Published Feb 19
Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation • Paper • 2509.08825 • Published Sep 10