Coqui TTS is an open-source, high-quality text-to-speech engine built on deep learning.
It provides natural-sounding voices and supports multiple languages, making it a great option for local deployments where flexibility and privacy are priorities.

AVR integrates with CoquiTTS to enable real-time speech synthesis inside your telephony infrastructure.
| Variable | Description | Example Value |
|---|---|---|
PORT |
Port where the CoquiTTS service runs | 6032 |
Example .env file:
PORT=6032
Below is a sample configuration using AVR with CoquiTTS:
avr-tts-coquitts:
image: agentvoiceresponse/avr-tts-coquitts
platform: linux/x86_64
container_name: avr-tts-coquitts
restart: always
environment:
- PORT=6032
- COQUI_AI_TTS_URL=http://avr-coqui-ai-tts:5002/api/tts
ports:
- 6032:6032
networks:
- avr
avr-coqui-ai-tts:
image: ghcr.io/coqui-ai/tts-cpu
platform: linux/x86_64
container_name: avr-coqui-ai-tts
entrypoint: "python3 TTS/server/server.py --model_name tts_models/en/vctk/vits"
restart: always
environment:
- PORT=5002
ports:
- 5002:5002
networks:
- avr
Notes:
curl -X POST http://localhost:6032/text-to-speech-stream \
-H "Content-Type: application/json" \
-d '{"text": "Hello, this is CoquiTTS running with Agent Voice Response!"}' \
--output response.raw
The resulting response.raw file will contain PCM audio at 8kHz.