Spaces:
Running
on
Zero
Running
on
Zero
A newer version of the Gradio SDK is available:
5.49.1
metadata
title: Medical Transcription Notes
emoji: π
colorFrom: gray
colorTo: yellow
sdk: gradio
sdk_version: 5.35.0
app_file: app.py
pinned: false
license: apache-2.0
short_description: 'OnDevice medical transcription and note creation '
Medical Transcription & Note Generation
This Hugging Face Space provides an end-to-end solution for medical transcription and structured note generation from patient-doctor conversations.
Paper
https://www.medrxiv.org/content/10.1101/2025.07.01.25330679v1
Features
ποΈ Audio Recording & Transcription
- Multi-source Audio Input: Record directly via microphone or upload audio files
- Multiple Whisper Models: Choose from all available Whisper models (tiny to large-v3)
- GPU-Accelerated: Utilizes GPU for fast transcription when available
- Streaming Output: Real-time transcription display with editable text
- Audio Download: Save recorded conversations as WAV files
π Medical Note Generation
- Side-by-side View: Transcription and medical notes displayed together
- Multiple LLM Models: Select from various medical-focused language models
- Dual Note Types:
- SOAP Notes: Structured Subjective, Objective, Assessment, Plan format
- Full Structured Notes: Comprehensive 14-section clinical documentation
- Copy Functionality: Easy copying of transcription and generated notes
Note Types
SOAP Note Format
- S (Subjective): Patient's reported symptoms and history
- O (Objective): Clinical findings, vital signs, lab results
- A (Assessment): Diagnosis and clinical reasoning
- P (Plan): Treatment plan and management
Full Structured Medical Note
- Presenting Illness
- History of Presenting Illness
- Past Medical History
- Surgical History
- Family History
- Social History
- Allergy History
- Medication History
- Dietary History
- Review of Systems
- Physical Exam Findings
- Labs and Imaging
- Assessment
- Plan
Models Available
Whisper Models
- openai/whisper-tiny (Default - fastest)
- openai/whisper-base
- openai/whisper-small
- openai/whisper-medium
- openai/whisper-large-v2
- openai/whisper-large-v3
Medical LLM Models
- OnDeviceMedNotes/Struct_Med_Note_v01 (Default)
- microsoft/DialoGPT-medium
- meta-llama/Llama-2-7b-chat-hf
- mistralai/Mistral-7B-Instruct-v0.1
Usage Instructions
- Record/Upload Audio: Use the microphone to record or upload an audio file
- Select Whisper Model: Choose transcription model based on speed/accuracy needs
- Transcribe: Click "Transcribe Audio" to convert speech to text
- Edit Transcription: Review and edit the transcribed text if needed
- Choose Note Type: Select between SOAP note or full structured note
- Select LLM Model: Choose the medical language model
- Generate Note: Click "Generate Medical Note" to create structured documentation
- Copy/Download: Use copy buttons or download audio file as needed
Technical Details
- GPU Support: Automatic GPU utilization for both transcription and note generation
- Memory Optimization: Efficient model loading and inference
- Error Handling: Robust error handling for various input formats
- Audio Processing: Automatic resampling and format conversion
Privacy & Security
β οΈ Important: This application is for demonstration purposes. Do not upload actual patient data or PHI (Protected Health Information). Always ensure compliance with HIPAA and local privacy regulations when handling medical data.
License
This project is open source. Please ensure appropriate licensing for any models used in production environments.