---
emoji: ๐
colorTo: purple
colorFrom: purple
---
# 🧠 YouTube Transcript Chatbot (RAG): LLaMA, Gemma, DeepSeek, OpenAI
An advanced **Retrieval-Augmented Generation (RAG)** chatbot that answers questions from YouTube video transcripts using top open-source LLMs: **Llama 3.2**, **Gemma 2**, **DeepSeek**, and **OpenAI GPT-OSS**.
Built with **Streamlit**, **LangChain**, **Hugging Face Hub**, and **FAISS**.
---
## 🚀 Features
- ✅ Fetch YouTube transcripts via **RapidAPI**
- ✅ Embed and store transcript chunks using **FAISS**
- ✅ Query understanding through a **LangChain retrieval pipeline**
- ✅ Supports **multiple Hugging Face chat-based models**:
  - 🦙 **Llama-3.2-1B-Instruct**
  - 💎 **Gemma-2-2B-it**
  - 🧠 **DeepSeek-V3.2-Exp**
  - 🔮 **OpenAI GPT-OSS-20B**
- ✅ Adjustable **temperature** for creative or factual outputs
- ✅ Seamless **Streamlit UI**
- ✅ Caching for faster repeat queries
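
To make the temperature setting concrete, here is a minimal sketch of how sampling temperature is commonly applied: the model's raw token scores are divided by the temperature before being turned into probabilities. The function name and the example scores are made up for illustration, and a hosted endpoint may apply temperature differently in its details.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities, sharpened or flattened by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up scores for three candidate tokens (illustrative only).
logits = [2.0, 1.0, 0.5]
for t in (0.2, 1.0, 2.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs])
```

Low values concentrate probability on the top-scoring token, which suits factual answers; higher values spread it across alternatives, which gives more varied output.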
---
## 🧩 Architecture
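
The components named in this README suggest a pipeline of roughly this shape: the transcript is split into chunks, each chunk is embedded, the chunks closest to the question are retrieved, and those chunks are passed to the LLM as context. The following is a minimal, dependency-free sketch of that flow, not the app's actual code. The toy bag-of-words `embed` stands in for the embedding model, the brute-force `retrieve` stands in for a FAISS index, and every function name, chunk size, and the sample transcript are assumptions made for illustration.

```python
import math
import re
from collections import Counter


def chunk_text(text, chunk_size=40, overlap=10):
    """Split text into overlapping word windows (sizes are illustrative)."""
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last window already reaches the end of the text
    return chunks


def embed(text):
    """Toy bag-of-words vector; a stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, chunks, k=2):
    """Return the k chunks most similar to the question (brute force)."""
    q = embed(question)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:k]


def build_prompt(question, context_chunks):
    """Assemble the text that would be sent to the chat model."""
    context = "\n\n".join(context_chunks)
    return (
        "Answer the question using only the transcript excerpts below.\n\n"
        f"Transcript excerpts:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


# Hypothetical transcript text, standing in for a fetched YouTube transcript.
transcript = (
    "Welcome to the channel. Today we bake sourdough bread. "
    "First, feed the starter with flour and water. "
    "Later we will talk about oven temperature, which should be 230 degrees. "
    "Finally, let the loaf cool for an hour before slicing."
)
question = "What oven temperature should I use?"
chunks = chunk_text(transcript, chunk_size=12, overlap=3)
top = retrieve(question, chunks, k=1)
prompt = build_prompt(question, top)
print(prompt)
```

In a real deployment the prompt would be sent to one of the selected chat models; the sketch stops at prompt construction so that it runs without network access or API keys.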
---
## 🧠 Tech Stack
| Component | Technology |
|------------|-------------|
| Frontend | Streamlit |
| Backend | LangChain |
| LLMs | Hugging Face Endpoints |
| Embeddings | intfloat/multilingual-e5-base |
| Vector Store | FAISS |
| API | YouTube Transcript API (RapidAPI) |
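
One detail worth knowing about the embedding model listed above: the `intfloat/multilingual-e5-base` model card asks that every input start with `query: ` or `passage: `. Whether this app adds those prefixes is not stated here, so the helper names below are hypothetical and the snippet is only a sketch of the convention.

```python
def as_passage(chunk: str) -> str:
    """Prefix a transcript chunk before it is embedded and indexed."""
    return f"passage: {chunk.strip()}"


def as_query(question: str) -> str:
    """Prefix a user question before it is embedded for retrieval."""
    return f"query: {question.strip()}"


print(as_passage("The oven should be set to 230 degrees."))
print(as_query("What oven temperature should I use?"))
```
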
---