
Local Llama TTS


v1.0.0



---
name: local-llama-tts
description: Local text-to-speech using llama-tts (llama.cpp) and the OuteTTS-1.0-0.6B model.
metadata: { "openclaw": { "emoji": "🔊", "requires": { "bins": ["llama-tts"] } } }
---


Synthesize speech locally using llama-tts and the OuteTTS-1.0-0.6B model.

Usage

You can use the wrapper script:

  • scripts/tts-local.sh [options] "<text>"

Options

  • -o, --output <file>: Output WAV file (default: output.wav)
  • -s, --speaker <file>: Speaker reference file (optional)
  • -t, --temp <value>: Temperature (default: 0.4)
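For instance, combining the options above (output and speaker file names here are illustrative):

```shell
# Default output (output.wav) at the default temperature
scripts/tts-local.sh "Hello from a local model."

# Custom output file, speaker reference, and a lower temperature
scripts/tts-local.sh -o greeting.wav -s speaker.json -t 0.3 "Welcome back."
```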

Scripts

  • Location: scripts/tts-local.sh (inside skill folder)
  • Model: /data/public/machine-learning/models/text-to-speach/OuteTTS-1.0-0.6B-Q4_K_M.gguf
  • Vocoder: /data/public/machine-learning/models/text-to-speach/WavTokenizer-Large-75-Q4_0.gguf
  • GPU: Enabled via llama-tts.
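A minimal sketch of what a wrapper like this might do: the model and vocoder paths come from this README, but the llama-tts flag names (`-m`, `-mv`, `-p`, `-o`, `--temp`) are assumptions; verify them against `llama-tts --help` for your build. The sketch only builds the command line rather than executing it, so it can be inspected before running.

```shell
#!/usr/bin/env bash
# Sketch of a tts-local.sh-style wrapper. Paths are taken from this README;
# the llama-tts flag names are ASSUMPTIONS -- check `llama-tts --help`.
set -euo pipefail

MODEL=/data/public/machine-learning/models/text-to-speach/OuteTTS-1.0-0.6B-Q4_K_M.gguf
VOCODER=/data/public/machine-learning/models/text-to-speach/WavTokenizer-Large-75-Q4_0.gguf

# Build (but do not execute) the llama-tts command line for a given text.
# Arguments: <text> [output.wav] [temperature]
build_tts_cmd() {
  local text=$1 output=${2:-output.wav} temp=${3:-0.4}
  printf 'llama-tts -m %q -mv %q -p %q -o %q --temp %s\n' \
    "$MODEL" "$VOCODER" "$text" "$output" "$temp"
}

# Print the command; pipe it to `bash` (or eval it) to actually synthesize.
build_tts_cmd "Hello from a local model." hello.wav 0.4
```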

Setup

  1. Model: Download from OuteAI/OuteTTS-1.0-0.6B-GGUF on Hugging Face.
  2. Vocoder: Download from ggml-org/WavTokenizer (note: Felix uses a Q4_0 version; Q5_1 is linked here as a higher-quality alternative).

Place files in /data/public/machine-learning/models/text-to-speach/ or update scripts/tts-local.sh.
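One way to fetch both files is with the Hugging Face CLI; the repository IDs come from the setup steps above, but the exact artifact file names are assumptions and should be confirmed against each repository's file listing.

```shell
# Download the TTS model (file name assumed from the path used by the script)
huggingface-cli download OuteAI/OuteTTS-1.0-0.6B-GGUF \
  OuteTTS-1.0-0.6B-Q4_K_M.gguf \
  --local-dir /data/public/machine-learning/models/text-to-speach/

# Download the vocoder (Q4_0 variant; file name assumed from the script path)
huggingface-cli download ggml-org/WavTokenizer \
  WavTokenizer-Large-75-Q4_0.gguf \
  --local-dir /data/public/machine-learning/models/text-to-speach/
```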

Sampling Configuration

The model card recommends the following settings (hardcoded in the script):

  • Temperature: 0.4
  • Repetition Penalty: 1.1
  • Repetition Range: 64
  • Top-k: 40
  • Top-p: 0.9
  • Min-p: 0.05
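Expressed as llama.cpp-style sampler flags (flag names assumed to match mainline llama.cpp; verify against your llama-tts build), the same settings would look like:

```shell
# Sampling flags only; model, vocoder, prompt, and output
# arguments are omitted for brevity.
llama-tts \
  --temp 0.4 \
  --repeat-penalty 1.1 \
  --repeat-last-n 64 \
  --top-k 40 \
  --top-p 0.9 \
  --min-p 0.05
```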

