Search

201 results for "speech-to-text"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Local Whisper

Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...

❤️ 10 ⬇️ 6.7k

🧪 Skill

ElevenLabs STT OpenClaw

Free

Transcribe audio files with ElevenLabs Speech-to-Text (Scribe v2) from the local CLI. Supports diarization, events, JSON output, webhooks, and advanced STT o...

❤️ 0 ⬇️ 222

🧪 Skill

Faster Whisper Local

Free

Local speech-to-text using faster-whisper. High-performance transcription with GPU acceleration support. Includes word-level timestamps and distilled models....

❤️ 0 ⬇️ 571

🧪 Skill

OpenAI Whisper Local

Free

--- name: openai-whisper description: Local speech-to-text with the Whisper CLI (no API key). homepage: https://openai.com/research/whisper metadata: {"clawdbot":{"emoji":"🎙️","requires":{"bins":

❤️ 0 ⬇️ 68

🧪 Skill

Whisper Transcriber

Free

Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...

❤️ 1 ⬇️ 94

🧪 Skill

Whisper Tailnet API

Free

Consume the shared Whisper speech-to-text API over Tailnet at http://100.92.116.99:8765 using OpenAI-compatible audio transcription endpoint (/v1/audio/trans...

❤️ 0 ⬇️ 123

🧪 Skill

Douyin Upload Skill

Free

Login and publish Douyin (China mainland) videos from local files with OAuth, local speech-to-text, and generated caption drafts. Use when users ask to autho...

❤️ 0 ⬇️ 160

🧪 Skill

Everclaw — Inference You Own

Free

Open-source first AI inference — GLM-5 as default, Claude as fallback only. Own your inference forever via the Morpheus decentralized network. Stake MOR toke...

❤️ 0 ⬇️ 794

🧪 Skill

SpeakNotes: YouTube, Audio & Document Summaries

Free

Use when OpenClaw needs to call SpeakNotes API routes directly using an API key and generate transcripts/summaries from YouTube URLs, media files, or documen...

❤️ 0 ⬇️ 106

🧪 Skill

Open WebUI

Free

Complete Open WebUI API integration for managing LLM models, chat completions, Ollama proxy operations, file uploads, knowledge bases (RAG), image generation, audio processing, and pipelines. Use this

❤️ 0 ⬇️ 834

🧪 Skill

live-stream-monitor

Free

Monitor live streams (YouTube, Bilibili) and get notified when specific keywords are mentioned. Uses browser SpeechRecognition API for real-time transcriptio...

❤️ 0 ⬇️ 100

🧪 Skill

Deepgram Voice Workflow

Free

End-to-end voice workflow with Deepgram STT and TTS. Use when transcribing voice messages, generating spoken replies, or building a shell-based audio pipelin...

❤️ 0 ⬇️ 37

🧪 Skill

OpenClaw Hot Skills

Free

Discover trending, must-have, and topic-specific high-value skills from ClawHub. Use when the user asks to see hot/popular/trending OpenClaw skills, wants a...

❤️ 0 ⬇️ 162

🧪 Skill

Moltspaces

Free

--- name: moltspaces version: 1.0.0 description: Voice-first social spaces where Moltbook agents hang out. Join the conversation at moltspaces.com homepage: https://moltspaces.com metadata: { "m

❤️ 1 ⬇️ 2.2k

🧪 Skill

Qwen Asr Skill

Free

Provides high-accuracy speech-to-text conversion supporting 22 Chinese dialects and 30 languages with automatic language detection, running on CPU.

❤️ 0 ⬇️ 92

🧪 Skill

Alicloud Ai Audio Livetranslate

Free

Use when live speech translation is needed with Alibaba Cloud Model Studio Qwen LiveTranslate models, including bilingual meetings, realtime interpretation,...

❤️ 0 ⬇️ 46

🧪 Skill

SOTA AI Model Tracker

Free

Provides daily updated authoritative data and APIs tracking state-of-the-art AI models across categories from LMArena, Artificial Analysis, and HuggingFace.

❤️ 0 ⬇️ 1.5k

🧪 Skill

K8s Self Hosted Whisper Api

Free

Transcribe audio via the self-hosted Whisper ASR instance running on Kubernetes. Use this skill whenever the user wants to transcribe audio files, convert sp...

❤️ 0 ⬇️ 174

🧪 Skill

Meta Video Ad Analyzer

Free

Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio transcription, and AI-powered scene analysis. Use when analyzing video creative

❤️ 1 ⬇️ 1.4k

🧪 Skill

Whisper Local Api

Free

Secure, offline, OpenAI-compatible local Whisper ASR endpoint for OpenClaw. Features faster-whisper (large-v3-turbo), built-in privacy with no cloud telemetr...

❤️ 0 ⬇️ 265

🧪 Skill

VoiceClaw

Free

Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper T...

❤️ 0 ⬇️ 272

🧪 Skill

PCClaw

Free

PCClaw provides 16 native Windows AI skills for system control, automation, files, notifications, OCR, speech, LLM inference, and task management with minima...

❤️ 0 ⬇️ 227

🧪 Skill

Telnyx Toolkit

Free

Complete Telnyx toolkit — ready-to-use tools (STT, TTS, RAG, Networking, 10DLC) plus SDK documentation for JavaScript, Python, Go, Java, and Ruby.

❤️ 0 ⬇️ 2.1k

🧪 Skill

Voice Agent

Free

--- name: voice-agent display-name: AI Voice Agent Backend version: 1.1.0 description: Local Voice Input/Output for Agents using the AI Voice Agent API. author: trevisanricardo homepage: https://githu

❤️ 0 ⬇️ 2.9k