---
title: FutureCafe Voice Core
emoji: ☎️
colorFrom: indigo
colorTo: blue
sdk: gradio
app_file: app.py
pinned: false
license: mit
---

# ☎️ FutureCafe Voice & Chat Assistant

Welcome to **FutureCafe Voice & Chat Assistant** — an AI-powered demo that lets you interact with a virtual cafe agent using either **voice calls** or **chat messages**.  
The assistant can answer questions about the cafe, help place food orders, confirm reservations, and provide opening hours or location info — all through a simple web interface.

---

## 🎯 Aim of the Project

The goal of this Space is to demonstrate how modern AI components (speech recognition, language models, text-to-speech) can be combined into a **realistic customer service experience** for restaurants and cafes.

FutureCafe Assistant acts like a friendly staff member:
- Answers menu or dietary questions.
- Helps with table reservations.
- Supports order placement and price calculation.
- Provides hours, address, and contact info.
- Handles both **chat** and **voice calls**.

---

## 🚀 How to Use

No installation or setup required — everything runs in the browser.

1. **Voice Call (left panel):**
   - Press **Record**, speak your request, then stop recording.
   - The assistant transcribes your voice, replies with text, and speaks the response back.

2. **Chat / SMS (right panel):**
   - Type a message in the textbox and press Enter.
   - The assistant replies in the chat window.

That’s it! 🎉 You can switch freely between chat and voice.

---

## 🛠️ Tools & Technologies

This demo integrates several lightweight but powerful AI tools:

- **[Gradio](https://gradio.app/)** – User interface for voice and chat.
- **Automatic Speech Recognition (ASR)** – Converts microphone input to text using [faster-whisper](https://github.com/SYSTRAN/faster-whisper).
- **Large Language Model (LLM)** – Core conversational logic (via [OpenAI API](https://openai.com) in this build).
- **Text-to-Speech (TTS)** – Synthesizes assistant replies into natural voice using [Piper](https://github.com/rhasspy/piper) or system TTS.
- **Python & Hugging Face Spaces** – Deployment environment.


---

## 👩‍💻 Credits

Developed as part of a portfolio project to explore **multimodal AI assistants** that combine speech, text, and reasoning for practical real-world scenarios.