Whisper Hebrew Nikud v1

Model Description

whisper-hebrew-nikud-v1 directly transcribes Hebrew speech to text with full diacritical marks (niqqud) in a single step, eliminating the traditional two-step pipeline of transcription followed by nikud restoration.

  • Developed by: Maayan Bogin
  • Model type: Automatic Speech Recognition
  • Language(s): Hebrew (注讘专讬转)
  • License: MIT
  • Finetuned from model: ivrit-ai/whisper-large-v3-turbo

Model Sources

How to Get Started with the Model

from transformers import pipeline
import torch

device = 0 if torch.cuda.is_available() else "cpu"
pipe = pipeline(
    task="automatic-speech-recognition",
    model="MayBog/whisper-hebrew-nikud-v1",
    chunk_length_s=30,
    device=device,
)

result = pipe(
    "audio.wav",
    generate_kwargs={"language": "hebrew", "task": "transcribe"}
)

print(result["text"])  # Hebrew text with niqqud

See the GitHub repository for ready-to-use scripts including inference, Gradio interface, and streaming examples.

Acknowledgments

This work builds upon:

Downloads last month
83
Safetensors
Model size
0.8B params
Tensor type
F32
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support

Model tree for MayBog/whisper-hebrew-nikud-v1

Finetuned
(6)
this model