vidore/colqwen-omni-v0.1
Visual Document Retrieval
•
Updated
•
9.07k
•
91
Generate speech from text using a reference audio
An interactive view of human heart
Real-time video captioning powered by FastVLM