Navigating the Alignment-Calibration Trade-off: A Pareto-Superior Frontier via Model Merging Paper • 2510.17426 • Published 8 days ago • 1
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors Paper • 2510.17516 • Published 8 days ago • 1