Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models
Paper: arXiv:2503.11073
PURE is the first framework to employ a multimodal large language model (MLLM) as its backbone for addressing low-level vision tasks.
We provide the implementation of PURE, as well as sampling code, in our GitHub repository.