WJ88's picture
Title and Description update about no download of model needed
1433cd3 verified

A newer version of the Gradio SDK is available: 5.49.1

Upgrade
metadata
title: NVIDIA Parakeet TDT 0.6B V2 Real Time Mic Transcription ASR STT
emoji: 📊
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 5.31.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: Real-Time, Speak to Mic, NO MODEL DOWNLOAD NEEDED!!
language: en
inference: true
tags:
  - audio
  - speech-recognition
  - asr
  - real-time
  - cpu
  - nvidia
  - parakeet
  - microphone
  - voice
  - speech
  - browser
  - gradio
  - nemo
  - huggingface

Real-time English speech-to-text in your browser — no GPU required.

This Space runs the 600 M-parameter nvidia/parakeet-tdt-0.6b-v2 model that fits comfortably on the CPU Basic (2 vCPU) tier.

  1. Click “Record”
  2. Allow microphone access and start speaking.
  3. Watch live text appear in the Transcription box.

Stalled UI? Refresh the browser tab — this fully restarts the Space and clears any stuck threads.

Technique Why it matters
OMP_NUM_THREADS=2 & torch.set_num_threads(2) Matches the 2 vCPUs for optimal throughput
FBGEMM backend Fastest kernels on x86
4-second streaming window Low latency & small memory footprint
Gradio stream_every=0.5 Updates the transcript twice per second for real-time feel
Item Licence
Demo code (this repo) Apache-2.0
Model weightsnvidia/parakeet-tdt-0.6b-v2 CC-BY-4.0 (© NVIDIA)

If you redistribute transcripts or fine-tuned weights, please retain the CC-BY-4.0 attribution notice.