OpenFACADES-InternVL3-2B

A vision-language model fine-tuned on building facade analysis tasks, based on InternVL3-2B architecture.

Model Description

This model is designed for analyzing building facades and architectural features. It combines computer vision and natural language processing capabilities to understand and describe architectural elements in images.

Usage

from transformers import AutoModel, AutoTokenizer

# Load model and tokenizer
model = AutoModel.from_pretrained("seshing/openfacades-internvl3-2b", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("seshing/openfacades-internvl3-2b", trust_remote_code=True)

License

This project is released under the Apache-2.0 License. This project uses the pre-trained InternVL, which is licensed under the Apache-2.0 License.

Acknowledgments

Built upon the InternVL3-2B model architecture and trained on building facade datasets.

Downloads last month
25
Safetensors
Model size
2B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for seshing/openfacades-internvl3-2b

Finetuned
(10)
this model

Collection including seshing/openfacades-internvl3-2b