Search

628 results for "speech"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Volcengine TTS to TOS Agent

Combined agent that synthesizes speech via Volcengine TTS, uploads the audio to TOS, and returns a presigned temporary URL. Use when users need a shareable a...

❤️ 0 ⬇️ 96

🧪 Skill

Volcengine TTS to TOS Agent

Free

Combined agent that synthesizes speech via Volcengine TTS, uploads the audio to TOS, and returns a presigned temporary URL. Use when users need a shareable a...

❤️ 0 ⬇️ 106

🧪 Skill

Ai Sdk Core

Free

Build backend AI with Vercel AI SDK v6 stable. Covers Output API (replaces generateObject/streamObject), speech synthesis, transcription, embeddings, MCP tools with security guidance. Includes v4→v5

❤️ 2 ⬇️ 1.6k

🧪 Skill

Pocket Tts

Free

Generate high-quality English speech offline on CPU using 8 built-in voices or custom voice cloning with Kyutai's Pocket TTS model.

❤️ 3 ⬇️ 1.8k

🧪 Skill

Feishu Audio Message

Free

Send TTS audio as a proper playable audio message (not file attachment) to Feishu chats. Use when asked to send voice messages, TTS audio, speech announcemen...

❤️ 0 ⬇️ 117

🧪 Skill

Local Whisper

Free

Free local speech-to-text for Telegram and WhatsApp using MLX Whisper on Apple Silicon. Private, no API costs.

❤️ 9 ⬇️ 2.6k

🧪 Skill

Elevenlabs Integration with Openclaw

Free

ClawVox - ElevenLabs voice studio for OpenClaw. Generate speech, transcribe audio, clone voices, create sound effects, and more.

❤️ 3 ⬇️ 2.4k

🧪 Skill

Podcast Generation with Microsoft Foundry

Free

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation f

❤️ 2 ⬇️ 1.9k

🧪 Skill

transcription

Free

Transcribe audio and video files using OpenAI Whisper API. Use when user wants to transcribe audio/video files, extract speech from media, or get text from r...

❤️ 0 ⬇️ 46

🧪 Skill

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

Free

Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).

❤️ 1 ⬇️ 122

🧪 Skill

Edge Tts Unlimited

Free

Free, unlimited text-to-speech using Microsoft Edge neural voices via Python edge-tts. Use when generating long-form audio, podcasts, voice notes, spoken bri...

❤️ 0 ⬇️ 27

🧪 Skill

Qwen Asr Skill

Free

Provides high-accuracy speech-to-text conversion supporting 22 Chinese dialects and 30 languages with automatic language detection, running on CPU.

❤️ 0 ⬇️ 92

🧪 Skill

Telegram Voice Bot

Free

Telegram bot that transcribes voice messages using Whisper and replies in Chinese with Microsoft Edge text-to-speech.

❤️ 0 ⬇️ 17

🧪 Skill

Faster Whisper Transcription

Free

Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.

❤️ 0 ⬇️ 534

🧪 Skill

SOTA Zero-shot Voice Cloning TTS

Free

Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer...

❤️ 1 ⬇️ 158

🧪 Skill

SpeakNotes: YouTube, Audio & Document Summaries

Free

Use when OpenClaw needs to call SpeakNotes API routes directly using an API key and generate transcripts/summaries from YouTube URLs, media files, or documen...

❤️ 0 ⬇️ 106

🧪 Skill

Feishu Voice Clone TTS Skill

Free

Convert text to speech using Volcengine TTS with preset or cloned voices and send audio messages to Feishu chats or groups.

❤️ 1 ⬇️ 138

🧪 Skill

Otterai Cli

Free

Use when the user mentions Otter, Otter.ai, or wants to find, search, download, export, or manage meeting notes, transcripts, recordings, or audio from calls...

❤️ 0 ⬇️ 93

🧪 Skill

IMA Studio All-in-One — Image, Video, Music, SeeDream, Veo, Suno. Banana

Free

All-in-One AI creation: images (SeeDream 4.5, Midjourney, Nano Banana 2), videos (Wan 2.6, Kling, Veo 3.1, Sora, Pixverse, Hailuo, SeeDance, Vidu), music (Su...

❤️ 1 ⬇️ 347

🧪 Skill

MOA-Debate

Free

--- name: moa-debate description: Run an Oxford Union–style multi-agent debate on any motion using Mixture of Agents architecture --- # Oxford Union Multi-Agent Debate When the user wants to **deb

❤️ 0 ⬇️ 174

🧪 Skill

Clawhub Skill Video Shorts

Free

Generate branded AI avatar lip-sync video shorts for TikTok, Reels, and YouTube Shorts. Create 15-second talking-head videos with custom avatars, auto-genera...

❤️ 0 ⬇️ 90

🧪 Skill

paper claw

Free

Fetch, classify, and summarize papers from multiple sources (arXiv, etc.) with AI-powered multi-language summaries and email delivery.

❤️ 1 ⬇️ 31

🧪 Skill

Listenhub

Free

Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video",...

❤️ 0 ⬇️ 396

🧪 Skill

Listenhub

Free

Explain anything — turn ideas into podcasts, explainer videos, or voice narration. Use when the user wants to "make a podcast", "create an explainer video",...

❤️ 0 ⬇️ 173