AI4SDG-BERT Model Card

Model Overview

AI4SDG-BERT is a fine-tuned BERT model for detecting disaster-related content in tweets. It supports real-time monitoring of social media data to identify and classify potential urban disasters. This model contributes to Sustainable Development Goal 11 (Sustainable Cities and Communities) by facilitating early warning and emergency response systems.

Intended Use

This model is intended for:

  • Disaster detection from Twitter or other social platforms
  • Urban emergency alert systems
  • Smart city infrastructure monitoring
  • Use by NGOs, civil defense units, or emergency data pipelines

Model Architecture

  • Base model: bert-base-uncased
  • Fine-tuned on: Disaster Tweet Classification Dataset (binary classification: disaster vs. not disaster)
  • Framework: Hugging Face Transformers + PyTorch
  • Post-BERT Classifier: Random Forest trained on BERT [CLS] embeddings

Training Dataset

  • Class distribution: 43% disaster, 57% non-disaster
  • Preprocessing: Lowercase, punctuation/emoji removal, URL stripping

Evaluation

Evaluated on held-out test set from the dataset:

  • Accuracy: 92%
  • F1-score: 91%
  • AUC: 94%

Limitations

  • English-only model
  • May not generalize well to sarcasm, spam, or very informal text
  • Lacks direct geolocation unless paired with NER/metadata
  • Relies on Twitter API access for live inference

Ethical Considerations

  • May reflect bias in who reports disasters and how
  • Not intended as a sole source for life-critical decision making
  • Requires moderation to avoid misinformation risks

Example Usage

from transformers import pipeline
classifier = pipeline("text-classification", model="elam2909/bert-disaster-classifier")
classifier("Huge earthquake just hit southern Turkey. Buildings are collapsing!")
Downloads last month
31
Safetensors
Model size
0.1B params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support