Models for the paper "What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models" [ICCV 2025]
Models and data for the paper "Recurrence Meets Transformers for Universal Multimodal Retrieval" (arXiv 2509.08897)
- aimagelab/ReT2-M2KR-CLIP-ViT-B • Visual Document Retrieval • 0.2B • Updated • 129 • 1
- aimagelab/ReT2-M2KR-CLIP-ViT-L • Visual Document Retrieval • 0.4B • Updated • 4
- aimagelab/ReT2-M2KR-SigLIP2-ViT-L • Visual Document Retrieval • 0.9B • Updated • 5 • 1
- aimagelab/ReT2-M2KR-ColBERT-CLIP-ViT-L • Visual Document Retrieval • 0.4B • Updated • 4
Models (37)
aimagelab/DICE_coherence_Idefics • Updated
aimagelab/DICE_differencedet_Idefics • Updated
aimagelab/ReT2-M2KR-ColBERT-SigLIP2-ViT-L • Visual Document Retrieval • 0.4B • Updated • 12
aimagelab/ReT2-MBEIR-SigLIP2-ViT-L • Visual Document Retrieval • 0.9B • Updated • 5
aimagelab/ReT2-MBEIR-CLIP-ViT-L • Visual Document Retrieval • 0.4B • Updated • 4
aimagelab/ReT2-M2KR-ColBERT-CLIP-ViT-L • Visual Document Retrieval • 0.4B • Updated • 4
aimagelab/ReT2-M2KR-SigLIP2-ViT-L • Visual Document Retrieval • 0.9B • Updated • 5 • 1
aimagelab/ReT2-M2KR-OpenCLIP-ViT-H • Visual Document Retrieval • 1B • Updated • 4
aimagelab/ReT2-M2KR-CLIP-ViT-L • Visual Document Retrieval • 0.4B • Updated • 4
aimagelab/ReT2-M2KR-CLIP-ViT-B • Visual Document Retrieval • 0.2B • Updated • 129 • 1