DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion Paper β’ 2510.20766 β’ Published Oct 23 β’ 34
Advancing Speech Understanding in Speech-Aware Language Models with GRPO Paper β’ 2509.16990 β’ Published Sep 21 β’ 18
Story2Board: A Training-Free Approach for Expressive Storyboard Generation Paper β’ 2508.09983 β’ Published Aug 13 β’ 68
Auto-Regressive vs Flow-Matching: a Comparative Study of Modeling Paradigms for Text-to-Music Generation Paper β’ 2506.08570 β’ Published Jun 10 β’ 33
Debatable Intelligence: Benchmarking LLM Judges via Debate Speech Evaluation Paper β’ 2506.05062 β’ Published Jun 5 β’ 15
CHIMERA: A Knowledge Base of Idea Recombination in Scientific Literature Paper β’ 2505.20779 β’ Published May 27 β’ 15
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning Paper β’ 2505.17813 β’ Published May 23 β’ 57
WHISTRESS: Enriching Transcriptions with Sentence Stress Detection Paper β’ 2505.19103 β’ Published May 25 β’ 13
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation Paper β’ 2504.17502 β’ Published Apr 24 β’ 55