flask
flask-cors
torch>=2.0.0
transformers>=4.36.0
accelerate
safetensors
datasets
# sqlite3 and datetime are part of the Python standard library, so they need no pip install
bitsandbytes  # Optional, only needed for 4-bit quantized models
💡 Notes:
If you are not using 4-bit quantization, you can remove bitsandbytes.

For best performance on CUDA-enabled GPUs, manually install the torch build that matches your CUDA version, e.g.:
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
Replace cu118 with the tag for your CUDA version (cu121, cu117, etc.).
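
The tag substitution described above can be sketched in a few lines of Python. This is only an illustration: `cuda_tag` and `torch_install_command` are hypothetical helper names, not part of pip or PyTorch, and the sketch assumes the tag is simply "cu" followed by the major and minor version digits. Not every CUDA version has a published wheel index, so check the PyTorch install page before relying on the generated command.

```python
import re

# Base URL taken from the pip command in the notes above.
PYTORCH_WHEEL_BASE = "https://download.pytorch.org/whl"


def cuda_tag(cuda_version: str) -> str:
    """Turn a CUDA version such as '11.8' into a wheel tag such as 'cu118'.

    Hypothetical helper: assumes the tag is 'cu' + major + minor digits.
    A patch component (e.g. '11.7.1') is accepted and ignored.
    """
    match = re.fullmatch(r"(\d+)\.(\d+)(?:\.\d+)?", cuda_version.strip())
    if match is None:
        raise ValueError(f"unrecognized CUDA version: {cuda_version!r}")
    major, minor = match.groups()
    return f"cu{major}{minor}"


def torch_install_command(cuda_version: str) -> str:
    """Build the pip command from the notes for the given CUDA version."""
    tag = cuda_tag(cuda_version)
    return (
        "pip install torch torchvision torchaudio "
        f"--index-url {PYTORCH_WHEEL_BASE}/{tag}"
    )


if __name__ == "__main__":
    # Prints the command to run; it does not install anything itself.
    print(torch_install_command("11.8"))
```

The helper only builds the command string, so you can review it before running it yourself.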