Supabase Vector Database Setup
Your SAP Chatbot now uses Supabase + pgvector for production-grade vector search!
Architecture
GitHub Actions (Ingestion)
        │ (SUPABASE_SERVICE_ROLE_KEY)
        ▼
ingest.py
  ├─ Load SAP documents
  ├─ Compute embeddings (sentence-transformers)
  └─ Insert into Supabase (pgvector)
        │
        ▼
HuggingFace Spaces (Streamlit App)
  ├─ User asks question
  ├─ HF Inference API computes embedding
  ├─ Supabase RPC search_documents()
  ├─ Retrieve top-k results
  └─ Generate answer with HF Inference API
Quick Setup
1. Create Supabase Project
- Go to https://supabase.com
- Sign up (free tier available)
- Create new project
- Wait for database initialization (~2 min)
2. Enable pgvector
-- In Supabase SQL Editor:
CREATE EXTENSION IF NOT EXISTS vector;
3. Create Documents Table
CREATE TABLE documents (
  id BIGSERIAL PRIMARY KEY,
  source TEXT,
  url TEXT,
  title TEXT,
  content TEXT,
  chunk_id INT,
  embedding VECTOR(384),
  created_at TIMESTAMPTZ DEFAULT NOW()
);
CREATE INDEX ON documents USING ivfflat (embedding vector_cosine_ops) WITH (lists = 100);
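Once the table exists, a quick sanity check is to insert a single test row from Python with the supabase-py client. This is a minimal sketch, assuming `SUPABASE_URL` and `SUPABASE_SERVICE_ROLE_KEY` are exported; the all-zeros vector is a placeholder, not a real embedding:

```python
# Sketch: insert one test row into the documents table via supabase-py.
import os

def make_row(source, url, title, content, chunk_id, embedding):
    """Build the dict supabase-py expects for an insert into documents."""
    assert len(embedding) == 384, "column is declared VECTOR(384)"
    return {
        "source": source, "url": url, "title": title,
        "content": content, "chunk_id": chunk_id, "embedding": embedding,
    }

if __name__ == "__main__":
    from supabase import create_client  # pip install supabase
    client = create_client(os.environ["SUPABASE_URL"],
                           os.environ["SUPABASE_SERVICE_ROLE_KEY"])
    row = make_row("test", "https://example.com", "Test doc",
                   "Hello pgvector", 0, [0.0] * 384)
    client.table("documents").insert(row).execute()
```

Delete the test row afterwards so it doesn't pollute search results.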
4. Create Search Function
-- Note: <=> is pgvector's cosine *distance* operator, so
-- 1 - (a <=> b) is cosine similarity; the column is named accordingly.
CREATE OR REPLACE FUNCTION search_documents(query_embedding VECTOR(384), k INT DEFAULT 5)
RETURNS TABLE(id BIGINT, source TEXT, url TEXT, title TEXT, content TEXT, chunk_id INT, similarity FLOAT8) AS $$
BEGIN
  RETURN QUERY
  SELECT
    documents.id,
    documents.source,
    documents.url,
    documents.title,
    documents.content,
    documents.chunk_id,
    1 - (documents.embedding <=> query_embedding) AS similarity
  FROM documents
  ORDER BY documents.embedding <=> query_embedding
  LIMIT k;
END;
$$ LANGUAGE plpgsql;
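The function above can be called from Python through PostgREST's RPC interface. A minimal sketch with supabase-py, assuming the query embedding is a 384-float list produced by the same model used at ingestion time (PostgREST casts the JSON array to `vector`):

```python
# Sketch: call the search_documents RPC from Python via supabase-py.
import os

def rpc_params(query_embedding, k=5):
    """Build the parameter dict for the search_documents RPC call."""
    assert len(query_embedding) == 384, "must match the VECTOR(384) column"
    return {"query_embedding": query_embedding, "k": k}

if __name__ == "__main__":
    from supabase import create_client  # pip install supabase
    client = create_client(os.environ["SUPABASE_URL"],
                           os.environ["SUPABASE_ANON_KEY"])
    # All-zeros vector is just a placeholder query embedding.
    hits = client.rpc("search_documents",
                      rpc_params([0.0] * 384, k=3)).execute()
    for row in hits.data:
        print(row["title"], row["content"][:80])
```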
5. Get Credentials
In Supabase dashboard:
- Go to Settings → API
- Copy:
  - Project URL → SUPABASE_URL
  - anon public key → SUPABASE_ANON_KEY (for the Streamlit app)
  - service_role key → SUPABASE_SERVICE_ROLE_KEY (for GitHub Actions only!)
⚠️ NEVER put the service_role key in Space Secrets! Only in GitHub Actions.
6. Run Local Ingestion (Optional)
# Set env vars locally
export SUPABASE_URL="https://your-project.supabase.co"
export SUPABASE_SERVICE_ROLE_KEY="your-service-role-key"
export EMBEDDING_MODEL="sentence-transformers/all-MiniLM-L6-v2"
# Run ingestion
python ingest.py
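The ingestion flow can be sketched roughly as follows. The word-based chunker and its `size`/`overlap` defaults are illustrative assumptions; the real ingest.py may chunk differently:

```python
# Sketch of the ingestion flow: chunk documents, embed, insert into Supabase.
def chunk_words(text, size=200, overlap=40):
    """Split text into chunks of `size` words, adjacent chunks sharing `overlap` words."""
    words = text.split()
    step = size - overlap
    return [" ".join(words[i:i + size])
            for i in range(0, max(len(words) - overlap, 1), step)]

if __name__ == "__main__":
    import os, json
    from supabase import create_client                 # pip install supabase
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer(os.environ.get(
        "EMBEDDING_MODEL", "sentence-transformers/all-MiniLM-L6-v2"))
    client = create_client(os.environ["SUPABASE_URL"],
                           os.environ["SUPABASE_SERVICE_ROLE_KEY"])
    with open("data/sap_dataset.json") as f:
        docs = json.load(f)
    for doc in docs:
        for i, chunk in enumerate(chunk_words(doc["content"])):
            emb = model.encode(chunk).tolist()         # 384 floats for MiniLM-L6-v2
            client.table("documents").insert({
                "source": doc.get("source"), "url": doc.get("url"),
                "title": doc.get("title"), "content": chunk,
                "chunk_id": i, "embedding": emb,
            }).execute()
```

For large datasets, batch the inserts (e.g. `insert([...rows...])` with a list) instead of one request per chunk.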
7. Configure GitHub Actions Secrets
In your GitHub repo:
- Settings → Secrets and variables → Actions
- Add new secrets:
  - SUPABASE_URL = your Supabase URL
  - SUPABASE_SERVICE_ROLE_KEY = service role key (for ingestion)
8. Configure HF Space Secrets
In HuggingFace Space Settings → Secrets:
- HF_API_TOKEN = your HF token
- SUPABASE_URL = your Supabase URL
- SUPABASE_ANON_KEY = anon public key (safe to expose)
- EMBEDDING_MODEL = (optional) embedding model ID
- RESULTS_K = (optional) number of results (default: 5)
File Structure
sap-chatbot/
├── app.py               # Streamlit UI (uses HF API + Supabase RPC)
├── ingest.py            # Ingestion script (uses sentence-transformers)
├── Dockerfile           # Docker config for HF Spaces
├── requirements.txt     # Python dependencies (includes supabase, sentence-transformers)
├── .github/
│   └── workflows/
│       └── deploy.yml   # GitHub Actions: ingest + deploy
└── data/
    └── sap_dataset.json # Source documents
Deployment Flow
First Deployment
- GitHub: Push code to the main branch
- GitHub Actions:
  - Runs ingest.py with SUPABASE_SERVICE_ROLE_KEY
  - Ingests documents into Supabase
  - Workflow completes
- HF Spaces:
  - Auto-syncs from GitHub (Linked Repository)
  - Launches Streamlit app
  - App connects to Supabase with SUPABASE_ANON_KEY
Update Knowledge Base
To add more SAP documents:
- Update data/sap_dataset.json with new documents
- Push to GitHub
- GitHub Actions auto-runs ingestion
- New documents become available in Supabase
- The HF Spaces app immediately sees the new data
API Endpoints
Streamlit App (HF Spaces)
- Uses HF Inference API for embeddings
- Calls the Supabase RPC search_documents(query_embedding, k)
- Generates answers with HF Inference API
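The embedding step in the app can be sketched with a plain HTTP call. The feature-extraction endpoint URL and the response shape (a flat float list for sentence-transformers models) are assumptions; verify them against the HF Inference API docs for your model:

```python
# Sketch: embed the user's question with the HF Inference API.
import requests

API_URL = ("https://api-inference.huggingface.co/pipeline/"
           "feature-extraction/{model}")

def embedding_url(model):
    """Build the (assumed) feature-extraction endpoint URL for a model ID."""
    return API_URL.format(model=model)

def embed(question, model, token):
    resp = requests.post(
        embedding_url(model),
        headers={"Authorization": f"Bearer {token}"},
        json={"inputs": question, "options": {"wait_for_model": True}},
        timeout=30,
    )
    resp.raise_for_status()
    return resp.json()  # expected: a 384-float list for all-MiniLM-L6-v2
```

The returned list is what gets passed as `query_embedding` to the search_documents RPC.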
ingest.py (GitHub Actions)
- Uses local sentence-transformers for embeddings
- Inserts directly into Supabase with the service role key
- Runs on schedule or manual trigger
Performance
| Operation | Time | Notes |
|---|---|---|
| Compute embedding | 50-100ms | Local sentence-transformers |
| Vector search | 10-50ms | pgvector with IVFFlat index |
| HF Inference (answer) | 10-30s | Cloud API |
| Total response | 10-30s | Dominated by LLM generation |
Cost Analysis
| Component | Cost | Notes |
|---|---|---|
| Supabase (free tier) | FREE | 500MB DB + 2GB file storage |
| Supabase (paid) | $25+/mo | More storage, more API calls |
| HF Inference API | FREE | Rate limited, generous |
| GitHub Actions | FREE | 2000 min/month |
| HF Spaces | FREE | 5+ concurrent users |
| TOTAL | $0-25/mo | Scales with usage |
Upgrade to paid Supabase when:
- Dataset grows beyond 500MB
- Vector searches become slow
- Need higher API rate limits
Troubleshooting
"pgvector not found"
- Enable pgvector extension in Supabase SQL Editor
- Run:
CREATE EXTENSION IF NOT EXISTS vector;
"RPC function not found"
- Paste the search_documents SQL function from step 4 into the Supabase SQL Editor
- Run it; the function is available immediately once CREATE FUNCTION succeeds
"Embedding dimension mismatch"
- The default model sentence-transformers/all-MiniLM-L6-v2 produces 384-dim embeddings
- If you switch models, recreate the embedding column as VECTOR(new_dim)
"Ingestion too slow"
- Increase BATCH_SIZE in ingest.py
- Run on larger GitHub Actions runner
- Consider async ingestion
"Search results irrelevant"
- Check embedding model matches
- Verify documents chunked correctly
- Try different chunk_size/overlap in ingest.py
Advanced: Custom Embeddings
To use a different embedding model:
Local (ingest.py)
EMBEDDING_MODEL = "sentence-transformers/all-mpnet-base-v2" # 768 dims
Recreate the embedding column with the new dimensions, then re-run ingestion (existing 384-dim embeddings cannot be converted in place, so empty the table first):
TRUNCATE documents;
ALTER TABLE documents ALTER COLUMN embedding TYPE vector(768);
Update app.py
EMBEDDING_MODEL = "sentence-transformers/all-mpnet-base-v2"
Next Steps
- ✅ Create Supabase project
- ✅ Enable pgvector and create table
- ✅ Add GitHub Actions secrets
- ✅ Push code (triggers ingestion)
- ✅ Configure HF Space secrets
- ✅ Test: "How do I monitor SAP jobs?"
- ✅ Share with team!
Resources
- Supabase Docs
- pgvector Docs
- HF Inference API
- Supabase Security Best Practices
Your production-grade SAP chatbot is ready!