Skip to main content

Whisper

Speech-to-text transcription service.

Features

  • High-Accuracy Transcription: Handles accents, background noise, and technical vocabulary well
  • 99+ Languages: Automatic language detection with multilingual transcription
  • Timestamps: Word- and segment-level timestamps for alignment and captioning
  • Multiple Model Sizes: Tiny to large variants trade off speed vs. accuracy
  • REST API: Various server wrappers (faster-whisper, whisper.cpp) expose an OpenAI-compatible endpoint

References