Search

649 results for "speech"

🧪 Skill

Resemble TTS and STT

Free

Text-to-Speech and Speech-to-Text integration using Resemble AI HTTP API.

❤️ 0 ⬇️ 279

🧪 Skill

speaker-local

Free

Text-to-speech using Kokoro local TTS. Use when the user wants to convert text to audio, read aloud, or generate speech.

❤️ 0 ⬇️ 154

🧪 Skill

Local Vosk STT

Free

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

❤️ 0 ⬇️ 739

🧪 Skill

Parakeet Stt

Free

Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detection, OpenAI-compatible API. Use when transcribing audio files, converting speech

❤️ 1 ⬇️ 2.0k

🧪 Skill

Zvukogram TTS

Free

Text-to-Speech via Zvukogram API with SSML support. Use when you need to generate speech from text, create podcasts, voice notifications, or work with audio....

❤️ 2 ⬇️ 442

🧪 Skill

Sarvam AI

Free

Use Sarvam AI for Indian language Text-to-Speech (TTS), Speech-to-Text (STT), Translation, and Chat.

❤️ 2 ⬇️ 443

🧪 Skill

IMA Studio TTS — seed-tts, DouBao

Free

TTS (text-to-speech) via IMA Open API with seed-tts-2.0. Voice synthesis, speech from text, dubbing, audio content creation. Output: audio URL (mp3/wav). Flo...

❤️ 0 ⬇️ 141

🧪 Skill

Aliyun TTS

Free

Alibaba Cloud Text-to-Speech synthesis service.

❤️ 1 ⬇️ 2.6k

🧪 Skill

Skillboss

Free

Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...

❤️ 1 ⬇️ 64

🧪 Skill

Alicloud Ai Audio Tts

Free

Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). Use when converting text to speech,...

❤️ 0 ⬇️ 885

🧪 Skill

Pywayne Tts

Free

Text-to-speech conversion tool. Use when converting text to speech audio files (opus or mp3 format). Supports macOS native 'say' command and Google TTS (gTTS...

❤️ 0 ⬇️ 348

🧪 Skill

Skillboss

Free

Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...

❤️ 1 ⬇️ 61

🧪 Skill

Argmax Transcription and TTS

Free

On-device speech-to-text (Whisper) + text-to-speech (Qwen3-TTS) CLI. Runs on the Apple Neural Engine (ANE), Apple's low-power, dedicated ML inference chip. M...

❤️ 0 ⬇️ 141

🧪 Skill

hotbutter voice chat

Free

Enables local voice chat by embedding Hotbutter relay server and PWA, providing speech-to-text and text-to-speech via a secure, self-hosted connection.

❤️ 0 ⬇️ 237

🧪 Skill

mmVoiceMaker

Free

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat...

❤️ 3 ⬇️ 359

🧪 Skill

Faster Whisper

Free

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...

❤️ 4 ⬇️ 5.0k

🧪 Skill

Doubao ASR / 豆包语音转写

Free

Transcribe audio files via Doubao Seed-ASR 2.0 (豆包录音文件识别模型2.0, recorded audio → text) API from ByteDance/Volcengine. Best-in-class Chinese speech recognition...

❤️ 1 ⬇️ 590

🧪 Skill

Willow Inference Server

Free

Local ASR and TTS inference server. Use when the user wants to transcribe audio to text (ASR) or convert text to speech (TTS). Requires a running Willow Infe...

❤️ 0 ⬇️ 100

🧪 Skill

Google Gemini Media

Free

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understand

❤️ 5 ⬇️ 3.0k

🧪 Skill

Pocket TTS Complete Documentation

Free

Generate speech from text using Kyutai Pocket TTS - lightweight, CPU-friendly, streaming TTS with voice cloning. English only. ~6x real-time on M4 MacBook Air.

❤️ 0 ⬇️ 714

🧪 Skill

Voice.ai Voices

Free

High-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.

❤️ 0 ⬇️ 2.1k

🧪 Skill

Flow Voice

Free

Clone any voice from a short audio sample and generate speech with it. Powered by LuxTTS (150x realtime, local, free, no API key). Use when asked to clone a...

❤️ 0 ⬇️ 88

🧪 Skill

Characteristic Voice

Free

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...

❤️ 0 ⬇️ 129

🧪 Skill

Groq API Inference

Free

Build and debug Groq API chat and speech workflows with low-latency routing, structured outputs, and production-safe patterns.

❤️ 0 ⬇️ 272