TAU: A Benchmark for Cultural Sound Understanding Beyond Semantics Paper • 2509.26329 • Published Sep 30 • 2 • 2
MI-Fuse: Label Fusion for Unsupervised Domain Adaptation with Closed-Source Large-Audio Language Model Paper • 2509.20706 • Published Sep 25 • 2 • 2
Do You Hear What I Mean? Quantifying the Instruction-Perception Gap in Instruction-Guided Expressive Text-To-Speech Systems Paper • 2509.13989 • Published Sep 17 • 3 • 2