Video OWL-ViT: Temporally-consistent open-world localization in video Paper • 2308.11093 • Published Aug 22, 2023
Invariant Slot Attention: Object Discovery with Slot-Centric Reference Frames Paper • 2302.04973 • Published Feb 9, 2023 • 1
DORSal: Diffusion for Object-centric Representations of Scenes et al. Paper • 2306.08068 • Published Jun 13, 2023 • 6
AudioSlots: A slot-centric generative model for audio separation Paper • 2305.05591 • Published May 9, 2023 • 3
Simple Open-Vocabulary Object Detection with Vision Transformers Paper • 2205.06230 • Published May 12, 2022 • 3