Search

649 results for "speech"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

🧪 Skill

Local Whisper

Free

Free local speech-to-text for Telegram and WhatsApp using MLX Whisper on Apple Silicon. Private, no API costs.

❤️ 9 ⬇️ 2.6k

🧪 Skill

Pub Gog

Free

Google Workspace CLI for Gmail, Calendar, Drive, Contacts, Sheets, and Docs. And also 50+ models for image generation, video generation, text-to-speech, spee...

❤️ 0 ⬇️ 27

🧪 Skill

Podcast Generation with Microsoft Foundry

Free

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation f

❤️ 1 ⬇️ 1.9k

🧪 Skill

Self Improving Agent

Free

Captures learnings, errors, and corrections to enable continuous improvement. And also 50+ models for image generation, video generation, text-to-speech, spe...

❤️ 0 ⬇️ 51

🧪 Skill

characteristic-voice

Free

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...

❤️ 0 ⬇️ 86

🧪 Skill

Local Whisper

Free

Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...

❤️ 10 ⬇️ 6.7k

🧪 Skill

Pub Obsidian

Free

Work with Obsidian vaults (plain Markdown notes) and automate via obsidian-cli. And also 50+ models for image generation, video generation, text-to-speech, s...

❤️ 0 ⬇️ 30

🧪 Skill

Whisper Transcriber

Free

Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...

❤️ 1 ⬇️ 94

🧪 Skill

Alicloud Ai Audio Asr

Free

Transcribe non-realtime speech with Alibaba Cloud Model Studio Qwen ASR models (`qwen3-asr-flash`, `qwen-audio-asr`, `qwen3-asr-flash-filetrans`). Use when c...

❤️ 0 ⬇️ 48

🧪 Skill

Voice2text

Free

Offline speech-to-text conversion using Vosk local model; input audio file path, output transcript text.

❤️ 0 ⬇️ 141

🧪 Skill

OpenAI Whisper Local

Free

--- name: openai-whisper description: Local speech-to-text with the Whisper CLI (no API key). homepage: https://openai.com/research/whisper metadata: {"clawdbot":{"emoji":"🎙️","requires":{"bins":

❤️ 0 ⬇️ 68

🧪 Skill

Openai Whisper 1.0.0

Free

Local speech-to-text with the Whisper CLI (no API key).

❤️ 0 ⬇️ 383

🧪 Skill

Phone Voice Assistant - Amber

Free

The most complete voice and phone calling skill for OpenClaw. Handles inbound and outbound phone calls over Twilio with OpenAI Realtime speech. Inbound outbo...

❤️ 5 ⬇️ 1.2k

🧪 Skill

Zhipu AI TTS

Free

Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Use when you need to convert text to audio files with various voice options. Supports Chin...

❤️ 0 ⬇️ 443

🧪 Skill

deAPI - AI Media Generation Toolkit

Free

AI media generation via deAPI. Transcribe YouTube/audio/video, generate images from text, text-to-speech, OCR, remove backgrounds, upscale images, create vid...

❤️ 0 ⬇️ 37

🧪 Skill

deAPI AI Media Suite (Community)

Free

The cheapest AI media API on the market. Generate images (Flux), music (AceStep), speech with voice cloning, transcribe video/audio, OCR, video generation, b...

❤️ 1 ⬇️ 48

🧪 Skill

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

Free

Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).

❤️ 1 ⬇️ 122

🧪 Skill

Douyin Upload Skill

Free

Login and publish Douyin (China mainland) videos from local files with OAuth, local speech-to-text, and generated caption drafts. Use when users ask to autho...

❤️ 0 ⬇️ 160

🧪 Skill

Deepdub TTS

Free

Generate speech audio using Deepdub and attach it as a MEDIA file (Telegram-compatible).

❤️ 9 ⬇️ 1.6k

🧪 Skill

Feishu Audio Message

Free

Send TTS audio as a proper playable audio message (not file attachment) to Feishu chats. Use when asked to send voice messages, TTS audio, speech announcemen...

❤️ 0 ⬇️ 117

🧪 Skill

Qwen Asr Skill

Free

Provides high-accuracy speech-to-text conversion supporting 22 Chinese dialects and 30 languages with automatic language detection, running on CPU.

❤️ 0 ⬇️ 92

🧪 Skill

Faster Whisper Transcription

Free

Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.

❤️ 0 ⬇️ 534

🧪 Skill

SpeakNotes: YouTube, Audio & Document Summaries

Free

Use when OpenClaw needs to call SpeakNotes API routes directly using an API key and generate transcripts/summaries from YouTube URLs, media files, or documen...

❤️ 0 ⬇️ 106

🧪 Skill

SOTA Zero-shot Voice Cloning TTS

Free

Voice-first OpenClaw skill powered by MOSS APIs. Use when a user wants spoken replies in a preferred timbre, either from an existing voice_id or from a refer...

❤️ 1 ⬇️ 158