Skip to main content

Parakeet ASR

GPU-accelerated speech-to-text built on NVIDIA's Parakeet-TDT model.

Features

  • High-Quality Transcription: NVIDIA Parakeet-TDT 0.6B model
  • REST API: Simple HTTP interface for audio transcription
  • GPU Acceleration: Optimized for NVIDIA GPUs, falls back to CPU
  • S3 Integration: Transcribe audio files directly from S3/MinIO

API

Health Check

GET /

Transcribe Audio

POST /transcribe
curl -X POST -F "file=@audio.wav" http://localhost:8000/transcribe

Transcribe from S3

POST /transcribe-s3
{
"bucket": "my-audio-bucket",
"key": "recordings/meeting.mp4",
"endpoint_url": "http://minio:9000"
}

Configuration

VariableDescription
TRANSCRIBE_DEVICEcuda or cpu
S3_ENDPOINTS3/MinIO URL
S3_ACCESS_KEYS3 access key
S3_SECRET_KEYS3 secret key