AI Audio Plugins

Inputs: text (text)
Outputs: audio (audio)

Text-to-speech with ElevenLabs and EdgeTTS. Speech-to-text with Whisper.

ElevenLabs

High-quality, natural-sounding text-to-speech with multiple voices and languages.

{
  "model_id": "eleven_flash_v2_5",
  "voice_id": "21m00Tcm4TlvDq8ikWAM",
  "stability": 0.5,
  "similarity_boost": 0.75
}

Free neural text-to-speech using Microsoft's Edge voices. 0-1 credits. Great for prototyping and budget workflows.

Speech-to-text transcription. Upload audio and get text back. 1-2 credits.

Last updated: 2026-03-27