Product Classifier B2

This model predicts product categories from a product's name or description...

🏍️ Amazon Product Classifier (Balanced B2)

This is a fine-tuned DistilBERT model for multi-class classification of product titles into Amazon-like product categories.
The model is based on distilbert-base-uncased and was trained on a balanced subset of the Amazon Products dataset.

🧠 Model Architecture

  • Base: distilbert-base-uncased (6-layer, 768 hidden size)
  • Classification Head: 2 dense layers with dropout + ReLU
  • Output: softmax over 19 product categories
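
To illustrate the output layer, here is a minimal pure-Python sketch of how a softmax turns 19 raw scores (logits) into the probabilities reported as `score`. The logit values are hypothetical and chosen only for illustration; the real model computes this step internally.

```python
import math

NUM_CATEGORIES = 19  # from the model card: softmax over 19 product categories


def softmax(logits):
    """Convert raw scores (logits) into probabilities that sum to 1."""
    # Subtract the largest logit first for numerical stability.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits: one category clearly ahead of the other 18.
example_logits = [0.0] * NUM_CATEGORIES
example_logits[3] = 5.0

probs = softmax(example_logits)
predicted_index = max(range(NUM_CATEGORIES), key=lambda i: probs[i])
print(predicted_index, round(probs[predicted_index], 3))
```

The index with the highest probability is the predicted category, and that probability is what the pipeline reports as the confidence score.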

📊 Training Data

The model was trained on a balanced subset (≈40k samples) of the Amazon Products Dataset, which contains product titles and their corresponding categories.

Preprocessing included:

  • Removing empty/missing titles
  • Keeping top-level categories only
  • Balancing the dataset to avoid category bias
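
The preprocessing steps above could be sketched roughly as follows. This is not the original training script: the `title`/`category` field names, the `>` separator for nested categories, and balancing by downsampling to the smallest class are all assumptions made for illustration.

```python
import random
from collections import defaultdict


def preprocess(rows, seed=0):
    """rows: iterable of dicts with 'title' and 'category' keys (assumed schema)."""
    by_category = defaultdict(list)
    for row in rows:
        title = (row.get("title") or "").strip()
        category = (row.get("category") or "").strip()
        if not title or not category:
            continue  # drop rows with an empty/missing title (or no category)
        # Keep only the top-level category; the ">" separator is an assumption.
        top_level = category.split(">")[0].strip().lower()
        by_category[top_level].append({"title": title, "category": top_level})

    if not by_category:
        return []

    # Balance by downsampling every category to the size of the smallest one
    # (one possible strategy; the model card does not say which was used).
    target = min(len(items) for items in by_category.values())
    rng = random.Random(seed)
    balanced = []
    for items in by_category.values():
        balanced.extend(rng.sample(items, target))
    rng.shuffle(balanced)
    return balanced


rows = [
    {"title": "Yoga mat", "category": "sports & fitness > yoga"},
    {"title": "Dumbbell set", "category": "sports & fitness > strength"},
    {"title": "", "category": "appliances"},
    {"title": "Air fryer", "category": "appliances > kitchen"},
    {"title": None, "category": "appliances"},
]
balanced = preprocess(rows)
print(balanced)
```

Downsampling discards data from large categories; oversampling or class weights are alternatives when the smallest category is very small.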

🍿 Example Categories

  • beauty & health
  • home & kitchen
  • tv, audio & cameras
  • computers & accessories
  • clothing & accessories
  • appliances
  • sports & fitness
  • grocery & gourmet foods
  • ... (total 19)

🧪 Example Usage (Python)

from transformers import pipeline

classifier = pipeline("text-classification", model="your-username/product-classifier-model-B2")

result = classifier("Smartwatch with heart rate monitor and GPS tracking")
print(result)
# [{'label': 'stores', 'score': 0.94}]

🚀 Intended Use

The model helps developers classify product titles into e-commerce categories. It is useful for:

  • Auto-tagging items in online stores
  • Cleaning and organizing product catalogs
  • Building recommendation engines (in combination with embeddings)
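
As a sketch of the auto-tagging use case, the helper below attaches a predicted category to each catalog item. The `auto_tag` function, the `title` field, and the `stub_classify` stand-in are hypothetical; in practice `classify` would be a `transformers` text-classification pipeline, which returns one `{'label', 'score'}` dict per title when called with a list of strings.

```python
def auto_tag(products, classify, batch_size=32):
    """Attach a predicted category and score to each product dict.

    products: list of dicts with a 'title' key (assumed schema).
    classify: callable that takes a list of titles and returns one
              {'label': ..., 'score': ...} dict per title.
    """
    tagged = []
    for start in range(0, len(products), batch_size):
        batch = products[start:start + batch_size]
        predictions = classify([p["title"] for p in batch])
        for product, pred in zip(batch, predictions):
            tagged.append({**product, "category": pred["label"], "score": pred["score"]})
    return tagged


def stub_classify(titles):
    """Stand-in for the real pipeline so the sketch runs without downloading a model."""
    return [{"label": "placeholder", "score": 0.0} for _ in titles]


catalog = [{"title": "Air fryer 4L"}, {"title": "Yoga mat"}, {"title": "USB-C hub"}]
tagged = auto_tag(catalog, stub_classify, batch_size=2)
print(tagged)
```

Passing the classifier in as an argument keeps the tagging logic easy to test and lets the stub be swapped for the real pipeline object.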

📌 Limitations

  • English-only (fine-tuned from distilbert-base-uncased, an English, lowercased model)
  • May not perform well on very short or ambiguous product names
  • Not suitable for legal/medical/financial applications
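
Because short or ambiguous titles may be misclassified, one possible mitigation is to send low-confidence predictions to manual review. The sketch below assumes the `{'label', 'score'}` prediction shape shown in the usage example; the `route_prediction` name, the 0.6 threshold, and the fallback label are hypothetical and should be tuned on your own data.

```python
def route_prediction(prediction, min_score=0.6, fallback="needs review"):
    """Return the predicted label, or a fallback when confidence is too low.

    prediction: dict with 'label' and 'score' keys.
    min_score:  hypothetical threshold, not a value from the model card.
    """
    if prediction["score"] >= min_score:
        return prediction["label"]
    return fallback


confident = route_prediction({"label": "appliances", "score": 0.94})
uncertain = route_prediction({"label": "stores", "score": 0.31})
print(confident, "|", uncertain)
```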

📄 License & Source

  • Model: MIT License
  • Training Data: Amazon Products Dataset on Kaggle
    (check the license and attribution requirements on the Kaggle page)