Search

1144 results for "audio"

Transcribee 🐝

Transcribe YouTube videos and local audio/video files with speaker diarization. Use when the user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts

❤️ 5 ⬇️ 2.7k

🧪 Skill

BibiGPT Skill

Free

BibiGPT CLI for summarizing videos, audio, and podcasts directly in the terminal. Use when the user wants to summarize a URL (YouTube, Bilibili, podcast, etc...

❤️ 0 ⬇️ 135

🧪 Skill

sense-music

Free

Analyzes audio to detect BPM, key, structure, genre, mood, transcribe lyrics, and generate visual and textual summaries of music tracks.

❤️ 0 ⬇️ 11

🧪 Skill

Edge TTS English

Free

Generate high-quality English (and multilingual) audio using Microsoft Edge TTS. Use when the user asks to "speak this", "pronounce", "read aloud", "say this...

❤️ 1 ⬇️ 136

🧪 Skill

Multimodal Asset Tagger

Free

Generate AI-optimized Alt Text, file names, captions, and Schema markup for images, videos, and audio assets. Improves AI discoverability on Google Lens, Cha...

❤️ 0 ⬇️ 174

🧪 Skill

Voice Transcribe

Free

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

❤️ 12 ⬇️ 4.3k

🧪 Skill

Qwen3-tts

Free

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium s

❤️ 9 ⬇️ 2.6k

🧪 Skill

FFHub FFmpeg Skill

Free

Process video/audio files using FFHub.io cloud FFmpeg API. Use when the user wants to convert, compress, trim, resize, extract audio, generate thumbnails, or...

❤️ 0 ⬇️ 97

🧪 Skill

Free Resource

Free

Search and retrieve royalty-free media from Pixabay (images/videos), Freesound (audio effects), and Jamendo (music/BGM). Use when the user needs to find stoc...

❤️ 0 ⬇️ 158

🧪 Skill

Meta Video Ad Analyzer

Free

Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio transcription, and AI-powered scene analysis. Use when analyzing video creative

❤️ 1 ⬇️ 1.4k

🔌 MCP

sonos-ts-mcp

Free

Comprehensive Sonos audio system control through pure TypeScript implementation. Features complete device discovery, multi-room playback management, queue control, music library browsing, alarm manage

❤️ 0 ⬇️ 0

🧪 Skill

Podcast Generation from PDF, Text, and Links

Free

Generate AI podcast episodes from PDFs, text, notes, and links using MagicPodcast in OpenClaw. Creates natural two-person dialogue audio, supports custom lan...

❤️ 2 ⬇️ 638

🧪 Skill

AssemblyAI Transcriber

Free

Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Req

❤️ 0 ⬇️ 1.1k

🧪 Skill

ANY WHISPER API

Free

Transcribe audio via the Whisper API with any compatible local server. Homepage: https://platform.openai.com/docs/guides/speech-to-text

❤️ 2 ⬇️ 230

🧪 Skill

clawdio

Free

Analyze Twitter Spaces and voice conversations to extract market intelligence, crypto alpha, sentiment analysis, and speaker-attributed insights. Transforms spoken audio into structured reports, full

❤️ 2 ⬇️ 777

🧪 Skill

Phone Voice Agent

Free

Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to

❤️ 6 ⬇️ 2.3k

🧪 Skill

OpenClaw YouTube Transcript

Free

Transcribe YouTube videos to text by extracting captions and subtitles directly from the video URL using yt-dlp without audio processing.

❤️ 18 ⬇️ 32k

🧪 Skill

multimodal-parser

Free

Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing

❤️ 0 ⬇️ 98

🧪 Skill

openclaw-voice

Free

Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.

❤️ 2 ⬇️ 378

🧪 Skill

video-stt

Free

Extract audio from video URLs and transcribe using STT (Speech-to-Text). Supports local Whisper or cloud APIs. Use when: user provides a video URL and wants...

❤️ 0 ⬇️ 85

🧪 Skill

Airfoil

Free

Control AirPlay speakers via Airfoil from the command line. Connect, disconnect, set volume, and manage multi-room audio with simple CLI commands.

❤️ 0 ⬇️ 1.8k

🧪 Skill

Otterai Cli

Free

Use when the user mentions Otter, Otter.ai, or wants to find, search, download, export, or manage meeting notes, transcripts, recordings, or audio from calls...

❤️ 0 ⬇️ 93

🧪 Skill

Yt Dlp Downloader

Free

Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a video URL and wants to download it, extract audio (MP3), download subtitles, or

❤️ 8 ⬇️ 5.2k

🧪 Skill

Speechall command-line tool for fast speech-to-text transcription using multiple providers

Free

Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list avai

❤️ 0 ⬇️ 1.1k