Search

628 results for "speech"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

deAPI - AI Media Generation Toolkit

AI media generation via deAPI. Transcribe YouTube/audio/video, generate images from text, text-to-speech, OCR, remove backgrounds, upscale images, create vid...

❤️ 0 ⬇️ 54

🧪 Skill

Telnyx Stt

Free

Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.

❤️ 0 ⬇️ 663

🧪 Skill

Feishu Voice Loop

Free

Accept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.

❤️ 0 ⬇️ 81

🧪 Skill

Simple sound-to-text skill locally

Free

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...

❤️ 0 ⬇️ 60

🧪 Skill

PCClaw

Free

PCClaw provides 16 native Windows AI skills for system control, automation, files, notifications, OCR, speech, LLM inference, and task management with minima...

❤️ 0 ⬇️ 227

🧪 Skill

VectorClaw MCP

Free

--- name: vectorclaw-mcp description: "MCP tools for Anki Vector: speech, motion, camera, sensors, and automation workflows." openclaw: emoji: "🤖" requires: bins: ["python3"] env: ["VEC

❤️ 1 ⬇️ 159

🧪 Skill

Alicloud Ai Audio Asr Realtime

Free

Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li...

❤️ 0 ⬇️ 42

🧪 Skill

🗣️ Edge-TTS Skill using uvx

Free

Text-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rath

❤️ 2 ⬇️ 886

🧪 Skill

Feishu Audio Message

Free

Send TTS audio as a proper playable audio message (not file attachment) to Feishu chats. Use when asked to send voice messages, TTS audio, speech announcemen...

❤️ 0 ⬇️ 117

🧪 Skill

Elevenlabs

Free

Converts text to natural speech using ElevenLabs for clinical and healthcare use cases. Use when generating patient instructions, discharge summaries, medica...

❤️ 0 ⬇️ 124

🧪 Skill

characteristic-voice

Free

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...

❤️ 0 ⬇️ 86

🧪 Skill

Volcengine TTS to TOS Agent

Free

Combined agent that synthesizes speech via Volcengine TTS, uploads the audio to TOS, and returns a presigned temporary URL. Use when users need a shareable a...

❤️ 0 ⬇️ 96

🧪 Skill

Local Whisper

Free

Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...

❤️ 10 ⬇️ 6.7k

🧪 Skill

Whisper Transcriber

Free

Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...

❤️ 1 ⬇️ 94

🧪 Skill

Phone Voice Assistant - Amber

Free

The most complete voice and phone calling skill for OpenClaw. Handles inbound and outbound phone calls over Twilio with OpenAI Realtime speech. Inbound outbo...

❤️ 5 ⬇️ 1.2k

🧪 Skill

Voice2text

Free

Offline speech-to-text conversion using Vosk local model; input audio file path, output transcript text.

❤️ 0 ⬇️ 141

🧪 Skill

Feishu Voice Clone TTS Skill

Free

Convert text to speech using Volcengine TTS with preset or cloned voices and send audio messages to Feishu chats or groups.

❤️ 1 ⬇️ 124

🧪 Skill

chat-with-anyone

Free

Chat with any real person or fictional character in their own voice by automatically finding their speech online, extracting a clean reference sample, and ge...

❤️ 1 ⬇️ 80

🧪 Skill

MLX Local Inference Stack

Free

Full local AI inference stack on Apple Silicon Macs via MLX. Includes: LLM chat (Qwen3-14B, Gemma3-12B), speech-to-text ASR (Qwen3-ASR, Whisper), text embedd...

❤️ 1 ⬇️ 293

🧪 Skill

Groq API Inference

Free

Build and debug Groq API chat and speech workflows with low-latency routing, structured outputs, and production-safe patterns.

❤️ 0 ⬇️ 254

🧪 Skill

Edge TTS

Free

Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) U

❤️ 20 ⬇️ 11k

🧪 Skill

Whisper Transcribe

Free

Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to

❤️ 2 ⬇️ 973

🧪 Skill

OpenAI Whisper Local

Free

--- name: openai-whisper description: Local speech-to-text with the Whisper CLI (no API key). homepage: https://openai.com/research/whisper metadata: {"clawdbot":{"emoji":"🎙️","requires":{"bins":

❤️ 0 ⬇️ 68

🧪 Skill

Pocket Tts

Free

Generate high-quality English speech offline on CPU using 8 built-in voices or custom voice cloning with Kyutai's Pocket TTS model.

❤️ 3 ⬇️ 1.8k