MDPO: Overcoming the Training-Inference Divide of Masked Diffusion Language Models Paper • 2508.13148 • Published Aug 18 • 3
HDT Collection Data and model weights for our COLM' 24 paper, HDT: Hierarchical Document Transformer. Project page https://cli212.github.io/HDT/ • 6 items • Updated Jul 14, 2024 • 1