Search

649 results for "speech"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Eachlabs Voice Audio

Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS, Whisper transcription with diarization, and RVC voice conversion. Use when the

❤️ 0 ⬇️ 789

🧪 Skill

Ai Voice Cloning

Free

AI voice generation, text-to-speech, and voice synthesis via inference.sh CLI. Models: Kokoro TTS, DIA, Chatterbox, Higgs, VibeVoice for natural speech. Capa...

❤️ 0 ⬇️ 1.1k

🧪 Skill

Ai Task Hub

Free

AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. U...

❤️ 1 ⬇️ 58

🧪 Skill

AudioPod

Free

Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to

❤️ 3 ⬇️ 2.7k

🧪 Skill

macOS Local Voice

Free

Local STT and TTS on macOS using native Apple capabilities. Speech-to-text via yap (Apple Speech.framework), text-to-speech via say + ffmpeg. Fully offline, no API keys required. Includes voice qualit

❤️ 0 ⬇️ 1.0k

🧪 Skill

mmEasyVoice

Free

Simple text-to-speech skill using MiniMax Voice API. Converts text to audio with customizable voice selection. Use for generating speech audio from text.

❤️ 0 ⬇️ 268

🧪 Skill

Telnyx Tts

Free

Generate speech audio from text using Telnyx Text-to-Speech API. Use when you need to convert text to spoken audio, create voice messages, or generate audio content.

❤️ 0 ⬇️ 616

🧪 Skill

Tts Router

Free

Local TTS router for Apple Silicon — pull models, serve OpenAI-compatible API, synthesize speech, clone voices. Use when the user asks to "generate speech",...

❤️ 0 ⬇️ 64

🧪 Skill

Ressemble TTS e STT

Free

--- name: ressemble displayName: Ressemble - Adriano version: 1.0.0 description: Text-to-Speech and Speech-to-Text integration using Resemble AI HTTP API. author: Adriano Vargas tags: [tts, stt, audio

❤️ 0 ⬇️ 295

🧪 Skill

Groq Voice Transcribe

Free

Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio files and you need fast cloud speech-...

❤️ 0 ⬇️ 81

🧪 Skill

Pub Sonoscli

Free

Control Sonos speakers (discover, status, play, volume, group). And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, m...

❤️ 0 ⬇️ 39

🧪 Skill

Pub Banana

Free

Generate and edit images with Nano Banana Pro (Gemini 3 Pro Image). And also 50+ models for image generation, video generation, text-to-speech, speech-to-tex...

❤️ 0 ⬇️ 70

🧪 Skill

Pub Weather

Free

Get current weather and forecasts (no API key required). And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, c...

❤️ 0 ⬇️ 40

🧪 Skill

Pub Browserauto

Free

Automate web browser interactions using natural language via CLI commands. And also 50+ models for image generation, video generation, text-to-speech, speech...

❤️ 0 ⬇️ 47

🧪 Skill

Pub Gemini

Free

Gemini CLI for one-shot Q and A, summaries, and generation. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music...

❤️ 0 ⬇️ 42

🧪 Skill

Pub Nanopdf

Free

Edit PDFs with natural-language instructions using the nano-pdf CLI. And also 50+ models for image generation, video generation, text-to-speech, speech-to-te...

❤️ 0 ⬇️ 38

🧪 Skill

Local Voice (FluidAudio TTS/STT)

Free

Local text-to-speech (TTS) and speech-to-text (STT) using FluidAudio on Apple Silicon. Sub-second voice synthesis and transcription running entirely on-device via the Apple Neural Engine. Use when set

❤️ 1 ⬇️ 1.2k

🧪 Skill

Pub Clawdhub

Free

Use the ClawdHub CLI to search, install, update, and publish agent skills. And also 50+ models for image generation, video generation, text-to-speech, speech...

❤️ 0 ⬇️ 54

🧪 Skill

Local Vosk STT

Free

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

❤️ 0 ⬇️ 750

🧪 Skill

Zvukogram TTS

Free

Text-to-Speech via Zvukogram API with SSML support. Use when you need to generate speech from text, create podcasts, voice notifications, or work with audio....

❤️ 2 ⬇️ 450

🧪 Skill

Pub Modelusage

Free

Summarize per-model usage for Codex or Claude including cost tracking. And also 50+ models for image generation, video generation, text-to-speech, speech-to-...

❤️ 0 ⬇️ 46

🧪 Skill

Pub Brave

Free

Web search and content extraction via Brave Search API. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, ch...

❤️ 0 ⬇️ 38

🧪 Skill

Skillboss

Free

Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...

❤️ 0 ⬇️ 60

🧪 Skill

Qwen Audio

Free

--- name: qwen-audio description: "High-performance audio library with text-to-speech (TTS) and speech-to-text (STT)." version: "0.0.4" --- # Qwen-Audio ## Overview Qwen-Audio is a high-performance

❤️ 1 ⬇️ 169