alan
jason500
AI & ML interests
None yet
Organizations
None yet
siweilian
text_gen_img
duomotai&tuxiangbianji
grounding
Quant
video_preprocess
mutilmodal_video2text
mutil big modal image2text
- OpenGVLab/InternVL-14B-224px
  Image Feature Extraction • 14B • Updated • 534 • 35
- openbmb/MiniCPM-V-2_6
  Image-Text-to-Text • 8B • Updated • 85.3k • 1.01k
- RhapsodyAI/MiniCPM-V-Embedding-preview
  Feature Extraction • Updated • 19 • 52
- meta-llama/Llama-3.2-11B-Vision-Instruct
  Image-Text-to-Text • 11B • Updated • 324k • 1.54k
caption
MMLM