Salesforce/GTA1-7B
Image-Text-to-Text
Transformers
Safetensors
English
qwen2_5_vl
image-to-text
VLM
Computer-Use-Agent
OS-Agent
GUI
Grounding
conversational
text-generation-inference
arxiv: 2507.05791
License: mit
GTA1-7B/README.md (main)
Commit History
Update README.md (53c9211, verified), HelloKKMe committed on Oct 3
Update README.md (48cd50e, verified), HelloKKMe committed on Oct 3
Update README.md (fbd3f88, verified), HelloKKMe committed on Oct 2
Update README.md (59550b2, verified), HelloKKMe committed on Oct 1
Update README.md (a0125f9, verified), HelloKKMe committed on Oct 1
Upload folder using huggingface_hub (d43fbcb, verified), HelloKKMe committed on Oct 1