One Vision-Language-Action Model for GUI Agent
Qinghong (Kevin) Lin
KevinQHLin
AI & ML interests
Vision-Language Model, Video Understanding, Human-AI Interaction
Recent Activity
liked
a Space
about 6 hours ago
camel-ai/Paper2Poster
published
a Space
2 days ago
camel-ai/Paper2Poster
upvoted
a
paper
10 days ago
See the Text: From Tokenization to Visual Reading