CLUE: Non-parametric Verification from Experience via Hidden-State Clustering Paper • 2510.01591 • Published Oct 2 • 26
Reward Inside the Model: A Lightweight Hidden-State Reward Model for LLM's Best-of-N sampling Paper • 2505.12225 • Published May 18 • 4
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 84
Model-Based Differentially Private Knowledge Transfer for Large Language Models Paper • 2410.10481 • Published Oct 14, 2024 • 1
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale Paper • 2409.17115 • Published Sep 25, 2024 • 63
Calibrating Reasoning in Language Models with Internal Consistency Paper • 2405.18711 • Published May 29, 2024 • 6
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116