d^2Cache: Accelerating Diffusion-Based LLMs via Dual Adaptive Caching Paper • 2509.23094 • Published Sep 27 • 3
Speculative Ensemble: Fast Large Language Model Ensemble via Speculation Paper • 2502.01662 • Published Feb 1 • 2