Search

628 results for "speech"

LH Edge TTS

Text-to-speech conversion using Python edge-tts for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and sub...

❤️ 0 ⬇️ 164

🧪 Skill

Ningyao Voice Launcher

Free

Install and configure a local browser-based Chinese voice chat launcher with the Ning Yao persona, including one-click Windows launchers, browser speech I/O,...

❤️ 0 ⬇️ 31

🧪 Skill

Faster Whisper Gpu

Free

High-performance local speech-to-text transcription using Faster Whisper with NVIDIA GPU acceleration. Transcribe audio files locally without sending data to...

❤️ 0 ⬇️ 319

🧪 Skill

Gladia YouTube Transcription (Free)

Free

Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...

❤️ 1 ⬇️ 104

🧪 Skill

ListenHub Asr

Free

Transcribe audio files to text using local speech recognition. Triggers on: "转录" (transcribe), "transcribe", "语音转文字" (speech to text), "ASR", "识别音频" (recognize audio), "把这段音频转成文字" (convert this audio to text).

❤️ 0 ⬇️ 28

🧪 Skill

Zhipu Asr

Free

Automatic Speech Recognition (ASR) using Zhipu AI (BigModel) GLM-ASR model. Use when you need to transcribe audio files to text. Supports Chinese audio trans...

❤️ 0 ⬇️ 423

🧪 Skill

Elevenlabs

Free

Text-to-speech, sound effects, music generation, voice management, and quota checks via the ElevenLabs API. Use when generating audio with ElevenLabs or mana...

❤️ 1 ⬇️ 2.8k

🧪 Skill

SenseAudio-ASR

Free

Build and troubleshoot SenseAudio speech recognition integrations, including HTTP transcription (`/v1/audio/transcriptions`), realtime WebSocket ASR (`/ws/v1...

❤️ 0 ⬇️ 16

🧪 Skill

SenseAudio-TTS

Free

Build and debug SenseAudio text-to-speech integrations on `/v1/t2a_v2` and `/ws/v1/t2a_v2`, including sync HTTP, SSE stream, WebSocket event sequencing, hex...

❤️ 0 ⬇️ 9

🧪 Skill

bailian-tts

Free

Generate speech audio with Alibaba Cloud Bailian (阿里云百炼) TTS via the `bailian-cli` npm package. Use when users ask to convert text to voice, choose voices/languages, batch-generate...

❤️ 0 ⬇️ 111

🧪 Skill

Local TTS

Free

Local text-to-speech using Qwen3-TTS with mlx_audio (macOS Apple Silicon) or qwen-tts (Linux/Windows). Privacy-first offline TTS with natural, realistic voic...

❤️ 0 ⬇️ 72

🧪 Skill

Voice Reply

Free

Local text-to-speech using Piper voices via sherpa-onnx. 100% offline, no API keys required. Use when user asks for a voice reply, audio response, spoken answer, or wants to hear something read aloud.

❤️ 5 ⬇️ 3.1k

🧪 Skill

Whisper STT

Free

Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...

❤️ 0 ⬇️ 354

🧪 Skill

Alicloud Ai Entry Modelstudio Test

Free

Run a minimal test matrix for the Model Studio skills that exist in this repo, including image/video/audio, realtime speech, omni, visual reasoning, embeddin...

❤️ 0 ⬇️ 874

🧪 Skill

deAPI - AI Media Generation Toolkit

Free

AI media generation via deAPI. Transcribe YouTube/audio/video, generate images from text, text-to-speech, OCR, remove backgrounds, upscale images, create vid...

❤️ 0 ⬇️ 54

🧪 Skill

Telnyx Stt

Free

Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.

❤️ 0 ⬇️ 663

🧪 Skill

Webchat Voice Full Stack

Free

One-step full-stack installer for OpenClaw WebChat voice input with local speech-to-text. Orchestrates three focused skills in order: local STT backend (fast...

❤️ 4 ⬇️ 519

🧪 Skill

Simple sound-to-text skill locally

Free

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...

❤️ 0 ⬇️ 60

🧪 Skill

PCClaw

Free

PCClaw provides 16 native Windows AI skills for system control, automation, files, notifications, OCR, speech, LLM inference, and task management with minima...

❤️ 0 ⬇️ 227

🧪 Skill

Alicloud Ai Audio Asr Realtime

Free

Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li...

❤️ 0 ⬇️ 42

🧪 Skill

VectorClaw MCP

Free

MCP tools for Anki Vector: speech, motion, camera, sensors, and automation workflows. Requires `python3`.

❤️ 1 ⬇️ 159

🧪 Skill

Smart Speak Multilingual TTS

Free

Multilingual Text-to-Speech (TTS) with intelligent Pinyin-to-Hanzi conversion. Use when the user asks to generate audio for text that contains a mix of Vietn...

❤️ 0 ⬇️ 42

🧪 Skill

Feishu Voice Loop

Free

Accept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.

❤️ 0 ⬇️ 81

🧪 Skill

Agent Vibes OpenClaw Skill

Free

Stream free, professional text-to-speech from voiceless servers to Linux, macOS, or Android devices with 50+ voices in 30+ languages. Two architecture options for flexible deployment: server-side TTS...

❤️ 1 ⬇️ 1.4k