OpenFACADES-InternVL3-2B

A vision-language model fine-tuned on building facade analysis tasks, based on InternVL3-2B architecture.

Model Description

This model is designed for analyzing building facades and architectural features. It combines computer vision and natural language processing capabilities to understand and describe architectural elements in images.

Usage

from transformers import AutoModel, AutoTokenizer

# Load model and tokenizer
model = AutoModel.from_pretrained("seshing/openfacades-internvl3-2b", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("seshing/openfacades-internvl3-2b", trust_remote_code=True)

License

This project is released under the Apache-2.0 License. This project uses the pre-trained InternVL, which is licensed under the Apache-2.0 License.

Acknowledgments

Built upon the InternVL3-2B model architecture and trained on building facade datasets.

Downloads last month: 25

Safetensors

Model size

2B params

Tensor type

BF16

Model tree for seshing/openfacades-internvl3-2b

Base model

OpenGVLab/InternVL2_5-2B

Finetuned

(10)

this model

Collection including seshing/openfacades-internvl3-2b

openfacades-internvl-models

Collection

finetuned InternVL models for building annotations • 2 items • Updated Oct 13