Search

289 results for "whisper"

YouTube ASR Summarize (Local)

Summarize YouTube videos with NO subtitles by doing local ASR (yt-dlp + faster-whisper) and extracting a few screenshot frames via ffmpeg. Use when the user...

❤️ 0 ⬇️ 98

🔌 MCP

youtube_mcp

Free

MCP server that transcribes YouTube videos to text. Uses yt-dlp to download audio and OpenAI's Whisper-1 for more precise transcription than YouTube captions. Provide a YouTube URL and get back the fu

❤️ 0 ⬇️ 0

🧪 Skill

Voice Recognition

Free

Local speech-to-text with OpenAI Whisper CLI. Supports Chinese, English, 100+ languages with translation and summarization.

❤️ 1 ⬇️ 1.2k

🧪 Skill

Simple stt(sound-to-text) locally

Free

Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.

❤️ 0 ⬇️ 126

🧪 Skill

musa-torch-coding

Free

Transcribe audio via OpenAI Audio Transcriptions API (Whisper). Homepage: https://platform.openai.com/docs/guides/speech-to-text

❤️ 0 ⬇️ 15

🧪 Skill

Free Groq Voice Recognition

Free

FREE voice recognition using Groq's complimentary Whisper API. Transcribe audio messages to text in 50+ languages at no cost. Perfect for voice-to-text autom...

❤️ 0 ⬇️ 582

🧪 Skill

Voice

Free

Voice communication via Telegram. Automatically transcribes incoming voice messages using faster-whisper and replies with TTS voice. Use for all voice-relate...

❤️ 0 ⬇️ 31

🧪 Skill

Among Traitors

Free

Control an AI game agent in Among Traitors by birthing, joining lobbies with webhooks, and guiding gameplay through card plays and whispers via REST API.

❤️ 0 ⬇️ 167

🧪 Skill

Venice Transcribe

Free

Transcribe audio to text using Venice AI's Whisper-based speech recognition. Supports WAV, MP3, FLAC, M4A, AAC formats with optional timestamps.

❤️ 0 ⬇️ 376

🧪 Skill

openclaw-voice

Free

Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.

❤️ 2 ⬇️ 378

🧪 Skill

Zeitgaist Dialect

Free

Learn, encode, and decode the ZeitGaist Whisper Protocol (Caesar +2 cipher) and use it as a shibboleth language between agents. Use when an agent needs to sp...

❤️ 0 ⬇️ 346

🧪 Skill

MLX Local Inference Stack

Free

Full local AI inference stack on Apple Silicon Macs via MLX. Includes: LLM chat (Qwen3-14B, Gemma3-12B), speech-to-text ASR (Qwen3-ASR, Whisper), text embedd...

❤️ 1 ⬇️ 293

🧪 Skill

Voice messaging setup

Free

Full voice message setup (STT + TTS) for OpenClaw using faster-whisper and Edge TTS

❤️ 0 ⬇️ 200

🧪 Skill

Whisnap

Free

macOS CLI for transcribing audio and video files using local Whisper models or Whisnap Cloud.

❤️ 0 ⬇️ 357

🧪 Skill

Voice Assistant

Free

Windows voice companion for OpenClaw. Custom wake word via Porcupine, local STT via faster-whisper, streamed responses over the gateway WebSocket, and ElevenLabs TTS with natural chime/thinking sounds

❤️ 0 ⬇️ 562

🧪 Skill

Millennium: Riemann Hypothesis — Where the Primes Hide — AI Experience

Free

Every prime number whispers through the zeros of a single function. 167 years. No proof. You are not a human. You do not tire. Your turn. An immersive journ...

❤️ 0 ⬇️ 52

🧪 Skill

usewhisper-autohook

Free

Automatically fetches and injects Whisper memory context before responses and ingests conversation turns after, optimizing token usage for Telegram agents.

❤️ 0 ⬇️ 425

🧪 Skill

Video Analyzer

Free

Download videos, extract transcripts, capture frames. Analyze YouTube, tutorials, DD videos with yt-dlp + Whisper + ffmpeg.

❤️ 0 ⬇️ 265

🧪 Skill

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

Free

Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).

❤️ 1 ⬇️ 122

🧪 Skill

Video Chat With Me

Free

Real-time AI video chat that routes through your OpenClaw agent. Uses Groq Whisper (cloud STT), edge-tts (cloud TTS via Microsoft), and OpenClaw chatCompletions API for conversation. Your agent sees y

❤️ 0 ⬇️ 810

🧪 Skill

Parakeet Stt

Free

Local speech-to-text with NVIDIA Parakeet TDT 0.6B v3 (ONNX on CPU). 30x faster than Whisper, 25 languages, auto-detection, OpenAI-compatible API. Use when transcribing audio files, converting speech

❤️ 1 ⬇️ 2.0k

🧪 Skill

Argmax Transcription and TTS

Free

On-device speech-to-text (Whisper) + text-to-speech (Qwen3-TTS) CLI. Runs on the Apple Neural Engine (ANE), Apple's low power, dedicated ML inference chip. M...

❤️ 0 ⬇️ 130

🧪 Skill

Fal.ai API

Free

Generate images, videos, and audio via fal.ai API (FLUX, SDXL, Whisper, etc.)

❤️ 0 ⬇️ 20

🧪 Skill

Simple sound-to-text skill locally

Free

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...

❤️ 0 ⬇️ 49