Search

649 results for "speech"

Pocket TTS Complete Documentation

Generate speech from text using Kyutai Pocket TTS: a lightweight, CPU-friendly, streaming TTS with voice cloning. English only. Runs at roughly 6x real time on an M4 MacBook Air.

❤️ 0 ⬇️ 702

🧪 Skill

Pub Gog

Free

Google Workspace CLI for Gmail, Calendar, Drive, Contacts, Sheets, and Docs. And also 50+ models for image generation, video generation, text-to-speech, spee...

❤️ 0 ⬇️ 43

🧪 Skill

Alicloud Ai Audio Livetranslate

Free

Use when live speech translation is needed with Alibaba Cloud Model Studio Qwen LiveTranslate models, including bilingual meetings, realtime interpretation,...

❤️ 0 ⬇️ 46

🧪 Skill

Kokoro TTS

Free

Generate spoken audio from text using the local Kokoro TTS engine. Use when the user asks to "say" something, requests a voice message, or wants text converted to speech.

❤️ 1 ⬇️ 3.9k

🧪 Skill

Pub Autoupd

Free

Automatically update Clawdbot and all installed skills once daily via cron. And also 50+ models for image generation, video generation, text-to-speech, speec...

❤️ 0 ⬇️ 45

🧪 Skill

Voice (Edge TTS)

Free

Convert text to speech using Microsoft Edge TTS with real-time streaming, customizable voice settings, and support for multiple languages including Chinese a...

❤️ 2 ⬇️ 462

🧪 Skill

Pub Obsidian

Free

Work with Obsidian vaults (plain Markdown notes) and automate via obsidian-cli. And also 50+ models for image generation, video generation, text-to-speech, s...

❤️ 0 ⬇️ 48

🧪 Skill

dy-video-to-text

Free

Transcribe speech to text from Douyin (TikTok China) videos, get watermark-free download links, and download videos. Use when the user shares a Douyin link, asks to...

❤️ 0 ⬇️ 74

🧪 Skill

Self Improving Agent

Free

Captures learnings, errors, and corrections to enable continuous improvement. And also 50+ models for image generation, video generation, text-to-speech, spe...

❤️ 2.0k ⬇️ 206k

🧪 Skill

Clonev

Free

Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) C...

❤️ 0 ⬇️ 1.6k

🧪 Skill

OpenClaw Tailnet TTS Endpoint

Free

Configure an OpenClaw instance to use a local OpenAI-compatible TTS backend (for example openedai-speech) with cloned voices. Use when users ask to wire loca...

❤️ 0 ⬇️ 136

🧪 Skill

Ai Podcast Creation

Free

Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabili...

❤️ 0 ⬇️ 1.0k

🧪 Skill

Pub Slack

Free

Control Slack from Clawdbot including reacting to messages and pinning items. And also 50+ models for image generation, video generation, text-to-speech, spe...

❤️ 0 ⬇️ 45

🧪 Skill

Addis Assistant

Free

Provides Speech-to-Text (STT) and text translation using the Addis Assistant API (api.addisassistant.com). Use when the user needs to convert an audio file to text (specifically Amharic), or translate...

❤️ 1 ⬇️ 1.8k

🧪 Skill

video-stt

Free

Extract audio from video URLs and transcribe using STT (Speech-to-Text). Supports local Whisper or cloud APIs. Use when: user provides a video URL and wants...

❤️ 0 ⬇️ 85

🧪 Skill

voiceclaw

Free

Voice conversation interface for OpenClaw using wake word detection, streaming LLM responses, and text-to-speech. Use when a user wants to talk to their Open...

❤️ 0 ⬇️ 255

🧪 Skill

Autonoannounce

Free

Build, operate, and troubleshoot Autonoannounce local speaker text-to-speech using the queued pipeline (enqueue → worker → ElevenLabs → playback backend)....

❤️ 0 ⬇️ 38

🧪 Skill

Faster Whisper Local Service

Free

OpenClaw local speech-to-text backend using faster-whisper over HTTP on 127.0.0.1:18790. Use when you want voice transcription without external APIs, without...

❤️ 0 ⬇️ 824

🧪 Skill

Elevenlabs Conversational

Free

Full ElevenLabs platform integration — text-to-speech, voice cloning, and Conversational AI agent creation. Not just TTS — build interactive voice agents wit...

❤️ 0 ⬇️ 135

🧪 Skill

Elevenlabs Tts

Free

ElevenLabs TTS (Text-to-Speech) with emotional audio tags for expressive voice synthesis. WhatsApp-compatible voice messages with Opus conversion. Supports 7...

❤️ 6 ⬇️ 4.9k

🧪 Skill

Sats4AI

Free

Bitcoin-powered AI tools marketplace via MCP. Generate images (Flux, Seedream, Recraft), text (Kimi K2.5, DeepSeek, GPT-OSS), video (Kling V3), music, speech...

❤️ 0 ⬇️ 82

🧪 Skill

Elevenlabs Toolkit

Free

ElevenLabs voice API integration — TTS, sound effects, music generation, speech-to-text, voice isolation, and streaming. Use when building voice-enabled apps...

❤️ 0 ⬇️ 245

🧪 Skill

SiliconFlow TTS Gen

Free

Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.

❤️ 0 ⬇️ 483

🧪 Skill

Pub Web Search

Free

Search the web for information, find current content, and look up news articles. And also 50+ models for image generation, video generation, text-to-speech,...

❤️ 0 ⬇️ 46