Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
imnotkitty 's Collections
Text Generation
Image-To-Image
Image-Text-to-Text

Image-Text-to-Text

updated Feb 26

My Essential Image-Text-to-Text Toolkit

Upvote
-

  • moonshotai/Kimi-K2.5

    Image-Text-to-Text • 1.1T • Updated 18 days ago • 1.71M • • 2.79k

  • deepseek-ai/DeepSeek-OCR-2

    Image-Text-to-Text • 3B • Updated Feb 3 • 1.61M • 954

  • google/gemma-3-27b-it

    Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 898k • • 1.97k

  • tencent/Youtu-VL-4B-Instruct

    Image-Text-to-Text • 5B • Updated Feb 10 • 500 • 155

  • Qwen/Qwen3-VL-8B-Instruct

    Image-Text-to-Text • 9B • Updated Oct 15, 2025 • 6.19M • • 907

  • zai-org/GLM-5

    Text Generation • 754B • Updated Apr 5 • 213k • • 2.09k

  • Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

    Text-to-Speech • 2B • Updated Jan 29 • 1.52M • 1.49k

  • Qwen/Qwen3.5-397B-A17B

    Image-Text-to-Text • 403B • Updated 24 days ago • 1.05M • • 1.48k

  • Qwen/Qwen3.5-35B-A3B

    Image-Text-to-Text • 36B • Updated 24 days ago • 3.11M • • 1.43k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs