🧪 Skills

voiceclaw

Voice conversation interface for OpenClaw using wake word detection, streaming LLM responses, and text-to-speech. Use when a user wants to talk to their Open...

v1.0.6
❤️ 0
⬇️ 255
👁 1
Share

Description


name: voiceclaw-jp description: Voice conversation interface for OpenClaw using wake word detection, streaming LLM responses, and text-to-speech. Use when a user wants to talk to their OpenClaw agent by voice, set up a voice assistant, or add speech input/output to OpenClaw. Supports configurable wake words, VOICEVOX TTS, and sentence-level streaming for low-latency responses.

voiceclaw

Voice conversation skill for OpenClaw: wake word → STT → LLM (streaming) → TTS → playback.

Requirements

  • OpenClaw running locally (gateway with chatCompletions enabled)
  • Node.js 18+
  • VOICEVOX running on localhost:50021 (download)
  • Chrome/Edge (Web Speech API for STT)
  • HTTPS for remote mic access (localhost works without HTTPS)

Quick Start

# Install
git clone https://github.com/kentoku24/voiceclaw.git
cd voiceclaw
npm install

# Start (no .env needed if OpenClaw is running locally)
npm start
# → [voiceclaw] OpenClaw config loaded from ~/.openclaw/openclaw.json
# → [voiceclaw] listening on http://127.0.0.1:8788

# Open browser
open http://127.0.0.1:8788

Press 開始, say the wake word (default: アリス), then speak your command.

Configuration

All settings are optional. Set in .env or environment variables:

Variable Default Description
WAKE_WORDS アリスちゃん,アリス,... Comma-separated wake words
STT_LANG ja-JP Speech recognition language
OPENCLAW_MODEL openclaw LLM model name
VOICEVOX_URL http://127.0.0.1:50021 VOICEVOX endpoint
VOICEVOX_SPEAKER 1 VOICEVOX speaker ID
HOST 127.0.0.1 Server bind address
PORT 8788 Server port

Gateway token is auto-detected from ~/.openclaw/openclaw.json. Override with OPENCLAW_GATEWAY_TOKEN if needed.

Architecture

Wake word (browser STT) → voiceclaw server → OpenClaw Gateway (streaming)
                                           → sentence-level TTS (VOICEVOX)
                                           → audio playback (Web Audio API)

See docs/architecture.md for the full sequence diagram.

API Endpoints

Method Path Description
GET /health Health check
GET /api/config Client-safe settings (wake words, STT lang)
POST /api/chat-stream Streaming LLM → sentence-level SSE
POST /api/chat Non-streaming LLM (fallback)
POST /api/tts Text → VOICEVOX → WAV audio

Reviews (0)

Sign in to write a review.

No reviews yet. Be the first to review!

Comments (0)

Sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Compatible Platforms

Pricing

Free

Related Configs