openfacades-internvl-models
Collection
finetuned InternVL models for building annotations
•
2 items
•
Updated
A vision-language model fine-tuned on building facade analysis tasks, based on InternVL3-2B architecture.
This model is designed for analyzing building facades and architectural features. It combines computer vision and natural language processing capabilities to understand and describe architectural elements in images.
from transformers import AutoModel, AutoTokenizer
# Load model and tokenizer
model = AutoModel.from_pretrained("seshing/openfacades-internvl3-2b", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("seshing/openfacades-internvl3-2b", trust_remote_code=True)
This project is released under the Apache-2.0 License. This project uses the pre-trained InternVL, which is licensed under the Apache-2.0 License.
Built upon the InternVL3-2B model architecture and trained on building facade datasets.
Base model
OpenGVLab/InternVL2_5-2B