Search

628 results for "speech"

Zvukogram TTS

Text-to-Speech via Zvukogram API with SSML support. Use when you need to generate speech from text, create podcasts, voice notifications, or work with audio....

❤️ 2 ⬇️ 442

🧪 Skill

Pub Vidframes

Free

Extract frames or short clips from videos using ffmpeg. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, ch...

❤️ 0 ⬇️ 27

🧪 Skill

Pub Gemini

Free

Gemini CLI for one-shot Q and A, summaries, and generation. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music...

❤️ 0 ⬇️ 26

🧪 Skill

speaker-local

Free

Text-to-speech using Kokoro local TTS. Use when the user wants to convert text to audio, read aloud, or generate speech.

❤️ 0 ⬇️ 154

🧪 Skill

Local Vosk STT

Free

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

❤️ 0 ⬇️ 739

🧪 Skill

Pub Clawdhub

Free

Use the ClawdHub CLI to search, install, update, and publish agent skills. And also 50+ models for image generation, video generation, text-to-speech, speech...

❤️ 0 ⬇️ 41

🧪 Skill

ElevenLabs

Free

ElevenLabs API integration with managed authentication. AI-powered text-to-speech, voice cloning, sound effects, and audio processing. Use this skill when users want to generate speech from text, clon

❤️ 3 ⬇️ 1.4k

🧪 Skill

Alicloud Ai Audio Tts

Free

Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). Use when converting text to speech,...

❤️ 0 ⬇️ 885

🧪 Skill

Argmax Transcription and TTS

Free

On-device speech-to-text (Whisper) + text-to-speech (Qwen3-TTS) CLI. Runs on the Apple Neural Engine (ANE), Apple's low-power, dedicated ML inference chip. M...

❤️ 0 ⬇️ 130

🧪 Skill

Pub Qmd

Free

Local search and indexing CLI (BM25 + vectors + rerank) with MCP mode. And also 50+ models for image generation, video generation, text-to-speech, speech-to-...

❤️ 0 ⬇️ 24

🧪 Skill

Pywayne Tts

Free

Text-to-speech conversion tool. Use when converting text to speech audio files (opus or mp3 format). Supports macOS native 'say' command and Google TTS (gTTS...

❤️ 0 ⬇️ 348

🧪 Skill

mmVoiceMaker

Free

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat...

❤️ 3 ⬇️ 359

🧪 Skill

Faster Whisper

Free

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...

❤️ 4 ⬇️ 5.0k

🧪 Skill

Doubao ASR / 豆包语音转写

Free

Transcribe audio files via Doubao Seed-ASR 2.0 (豆包录音文件识别模型2.0, recorded audio → text) API from ByteDance/Volcengine. Best-in-class Chinese speech recognition...

❤️ 1 ⬇️ 590

🧪 Skill

Willow Inference Server

Free

Local ASR and TTS inference server. Use when the user wants to transcribe audio to text (ASR) or convert text to speech (TTS). Requires a running Willow Infe...

❤️ 0 ⬇️ 100

🧪 Skill

Google Gemini Media

Free

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understand

❤️ 5 ⬇️ 3.0k

🧪 Skill

Pocket TTS Complete Documentation

Free

Generate speech from text using Kyutai Pocket TTS - lightweight, CPU-friendly, streaming TTS with voice cloning. English only. ~6x real-time on M4 MacBook Air.

❤️ 0 ⬇️ 714

🧪 Skill

characteristic-voice

Free

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...

❤️ 0 ⬇️ 100

🧪 Skill

Characteristic Voice

Free

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...

❤️ 0 ⬇️ 129

🧪 Skill

Voice.ai Voices

Free

High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API. Skill name: voice-ai-tts; version 1.1.5.

❤️ 0 ⬇️ 2.1k

🧪 Skill

Flow Voice

Free

Clone any voice from a short audio sample and generate speech with it. Powered by LuxTTS (150x realtime, local, free, no API key). Use when asked to clone a...

❤️ 0 ⬇️ 88

🧪 Skill

Groq API Inference

Free

Build and debug Groq API chat and speech workflows with low-latency routing, structured outputs, and production-safe patterns.

❤️ 0 ⬇️ 272

🧪 Skill

Qwen3 Tts Mlx

Free

Local Qwen3-TTS speech synthesis on Apple Silicon via MLX. Use for offline narration, audiobooks, video voiceovers, and multilingual TTS.

❤️ 0 ⬇️ 160

🧪 Skill

Voice

Free

Convert text to speech using Microsoft Edge's TTS engine with customizable voices, direct playback, and automatic temporary file cleanup.

❤️ 0 ⬇️ 1.8k