Kyle Montgomery

kylemontgomery

AI & ML interests

None yet

Recent Activity

updated a Space about 9 hours ago

ScalerLab/JudgeBench

updated a dataset 1 day ago

kylemontgomery/imo-1030-rollout-partial

upvoted 2 papers 14 days ago

Budget-aware Test-time Scaling via Discriminative Verification

Paper • 2510.14913 • Published 15 days ago • 4

Predicting Task Performance with Context-aware Scaling Laws

Paper • 2510.14919 • Published 15 days ago • 3

upvoted a paper about 1 year ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 48