File size: 1,245 Bytes
7797b88
 
 
11f71f1
7797b88
74ef794
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ea1c842
74ef794
 
 
 
ea1c842
 
74ef794
 
 
 
 
 
 
 
 
 
 
7797b88
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
---
emoji: ๐Ÿ‘€
colorTo: purple
colorFrom: purple
---
# ๐Ÿง  YouTube Transcript Chatbot (RAG) โ€” LLaMA, Gemma, DeepSeek, OpenAI

An advanced **Retrieval-Augmented Generation (RAG)** chatbot that answers questions from YouTube video transcripts using top open-source LLMs โ€” **Llama 3.2**, **Gemma 2**, **DeepSeek**, and **OpenAI GPT-OSS**.  

Built with **Streamlit**, **LangChain**, **Hugging Face Hub**, and **FAISS**.

---

## ๐Ÿš€ Features

โœ… Fetch YouTube transcripts via **RapidAPI**  
โœ… Embed and store transcript chunks using **FAISS**  
โœ… Query understanding through **LangChain retrieval pipeline**  
โœ… Supports **multiple Hugging Face chat-based models**:
- ๐Ÿฆ™ **Llama-3.2-1B-Instruct**
- ๐Ÿ’Ž **Gemma-2-2B-it**
- ๐Ÿง  **DeepSeek-V3.2-Exp**
- ๐Ÿ”ฎ **OpenAI GPT-OSS-20B**

โœ… Adjustable **temperature** for creative or factual outputs  
โœ… Seamless **Streamlit UI**  
โœ… Caching for faster repeat queries  

---

## ๐Ÿงฉ Architecture


---

## ๐Ÿง  Tech Stack

| Component | Technology |
|------------|-------------|
| Frontend | Streamlit |
| Backend | LangChain |
| LLMs | Hugging Face Endpoints |
| Embeddings | intfloat/multilingual-e5-base |
| Vector Store | FAISS |
| API | YouTube Transcript API (RapidAPI) |

---