Search

649 results for "speech"

Voice Message

Send voice messages across chat channels (Telegram, Discord, Feishu/Lark, Signal, WhatsApp, and others) using edge-tts for text-to-speech and ffmpeg for audi...

❤️ 1 ⬇️ 338

🧪 Skill

Elevenlabs Pro

Free

ElevenLabs advanced TTS for converting text to speech, listing voices, and managing credits

❤️ 0 ⬇️ 234

🔌 MCP

kokoro-tts-mcp

Free

MCP server that uses the open-weight Kokoro TTS models to convert text to speech. Can write MP3 output to a local drive or auto-upload it to an S3 bucket.

❤️ 0 ⬇️ 0

🧪 Skill

openai-whisper

Free

Local speech-to-text with the Whisper CLI (no API key).

❤️ 0 ⬇️ 224

🧪 Skill

tts

Free

Convert text or subtitle files into speech audio with options for voice cloning, emotion control, speed, and timeline-accurate dubbing using Kokoro or Noiz b...

❤️ 0 ⬇️ 85

🧪 Skill

Pronunciation Coach

Free

Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonatio...

❤️ 0 ⬇️ 377

🧪 Skill

Openai Whisper 1.0.0

Free

Local speech-to-text with the Whisper CLI (no API key).

❤️ 0 ⬇️ 383

🧪 Skill

Kokoro Agent Voices

Free

Local zero-cost text-to-speech with per-agent voice profiles using Kokoro TTS (82M params). 54 voices available, named agent mappings, WAV output. Use when b...

❤️ 0 ⬇️ 168

🧪 Skill

Local Whisper

Free

Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...

❤️ 0 ⬇️ 253

🧪 Skill

deAPI - AI Media Generation Toolkit

Free

AI media generation via deAPI. Transcribe YouTube/audio/video, generate images from text, text-to-speech, OCR, remove backgrounds, upscale images, create vid...

❤️ 0 ⬇️ 37

🧪 Skill

Voice Recognition

Free

Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarization.

❤️ 1 ⬇️ 1.2k

🧪 Skill

Addis Assistant

Free

Provides Speech-to-Text (STT) and text Translation using the Addis Assistant API (api.addisassistant.com). Use when the user needs to convert an audio file to text (specifically Amharic), or translate...

❤️ 1 ⬇️ 1.8k

🧪 Skill

Elevenlabs

Free

Converts text to natural speech using ElevenLabs for clinical and healthcare use cases. Use when generating patient instructions, discharge summaries, medica...

❤️ 0 ⬇️ 124

🧪 Skill

Mac TTS

Free

Text-to-speech using the macOS built-in `say` command. Use for voice notifications, audio alerts, reading text aloud, or announcing messages through Mac speakers. Supports multiple languages including Chi...

❤️ 2 ⬇️ 3.9k

🧪 Skill

Miranda SAG (ElevenLabs TTS say-UX)

Free

ElevenLabs text-to-speech with mac-style say UX.

❤️ 0 ⬇️ 326

🧪 Skill

Agent Vibes OpenClaw Skill

Free

Stream free, professional text-to-speech from voiceless servers to Linux, macOS, or Android devices with 50+ voices in 30+ languages. Two architecture options for flexible deployment - server-side TTS...

❤️ 1 ⬇️ 1.4k

🧪 Skill

OpenClaw Tailnet TTS Endpoint

Free

Configure an OpenClaw instance to use a local OpenAI-compatible TTS backend (for example openedai-speech) with cloned voices. Use when users ask to wire loca...

❤️ 0 ⬇️ 122

🧪 Skill

Zhipu AI TTS

Free

Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Use when you need to convert text to audio files with various voice options. Supports Chin...

❤️ 0 ⬇️ 430

🧪 Skill

🗣️ Edge-TTS Skill using uvx

Free

Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rath...

❤️ 2 ⬇️ 858

🧪 Skill

transcription

Free

Transcribe audio and video files using OpenAI Whisper API. Use when user wants to transcribe audio/video files, extract speech from media, or get text from r...

❤️ 0 ⬇️ 46

🧪 Skill

Douyin Upload Skill

Free

Login and publish Douyin (China mainland) videos from local files with OAuth, local speech-to-text, and generated caption drafts. Use when users ask to autho...

❤️ 0 ⬇️ 160

🧪 Skill

Ai Sdk Core

Free

Build backend AI with Vercel AI SDK v6 stable. Covers Output API (replaces generateObject/streamObject), speech synthesis, transcription, embeddings, MCP tools with security guidance. Includes v4→v5...

❤️ 2 ⬇️ 1.6k

🧪 Skill

Transcribe Audio with Parakeet MLX

Free

Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).

❤️ 1 ⬇️ 1.7k

🧪 Skill

Truly Local Piper Multilang TTS (secure)

Free

Local offline text-to-speech via Piper TTS. Self-contained setup, automatic language detection, per-call voice selection. Extensible to any language. Writes...

❤️ 0 ⬇️ 313