AI Audio Plugins

Text-to-speech with ElevenLabs and EdgeTTS. Speech-to-text with Whisper.

ElevenLabs

High-quality, natural-sounding text-to-speech with multiple voices and languages.

Models

ModelCreditsBest for
eleven_v35Best quality
eleven_flash_v2_52Low latency (default)
eleven_multilingual_v23Multi-language
eleven_turbo_v2_52Fast

Popular voices

VoiceIDCharacter
Rachel21m00Tcm4TlvDq8ikWAMFemale, warm
BellaEXAVITQu4vr4xnSDxMaLFemale, soft
AntoniErXwobaYiN019PkySvjVMale
JoshTxGEqnHWrfWFTfGW9XjXMale, deep
AdampNInz6obpgDQGcFmaJgBMale, narrator

Config

{
  "model_id": "eleven_flash_v2_5",
  "voice_id": "21m00Tcm4TlvDq8ikWAM",
  "stability": 0.5,
  "similarity_boost": 0.75
}

Ports

  • Inputs: text (text)
  • Outputs: audio (audio)

EdgeTTS (Microsoft)

Free neural text-to-speech using Microsoft's Edge voices. 0-1 credits. Great for prototyping and budget workflows.


Whisper (OpenAI)

Speech-to-text transcription. Upload audio and get text back. 1-2 credits.

Ports

  • Inputs: audio (audio)
  • Outputs: text (text)

Last updated: 2026-03-27

    AI Audio Plugins | Zephly