Search

1099 results for "audio"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Podcast Generation from PDF, Text, and Links

Generate AI podcast episodes from PDFs, text, notes, and links using MagicPodcast in OpenClaw. Creates natural two-person dialogue audio, supports custom lan...

❤️ 2 ⬇️ 638

🧪 Skill

sense-music

Free

Analyzes audio to detect BPM, key, structure, genre, mood, transcribe lyrics, and generate visual and textual summaries of music tracks.

❤️ 0 ⬇️ 11

🧪 Skill

Voice Transcribe

Free

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

❤️ 12 ⬇️ 4.3k

🧪 Skill

Multimodal Asset Tagger

Free

Generate AI-optimized Alt Text, file names, captions, and Schema markup for images, videos, and audio assets. Improves AI discoverability on Google Lens, Cha...

❤️ 0 ⬇️ 174

🧪 Skill

Meta Video Ad Analyzer

Free

Extract and analyze content from video ads using Gemini Vision AI. Supports frame extraction, OCR text detection, audio transcription, and AI-powered scene analysis. Use when analyzing video creative

❤️ 1 ⬇️ 1.4k

🧪 Skill

Qwen3-tts

Free

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium s

❤️ 9 ⬇️ 2.6k

🧪 Skill

Dream LipSync

Free

Video lip synchronization using LipSync 2.0 API. Automatically synchronizes audio with lip movements in videos. Powered by Dreamface - AI tools for everyone....

❤️ 0 ⬇️ 105

🧪 Skill

video-stt

Free

Extract audio from video URLs and transcribe using STT (Speech-to-Text). Supports local Whisper or cloud APIs. Use when: user provides a video URL and wants...

❤️ 0 ⬇️ 85

🧪 Skill

Phone Voice Agent

Free

Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to

❤️ 6 ⬇️ 2.3k

🧪 Skill

Webchat Voice Gui

Free

Voice input and microphone button for OpenClaw WebChat Control UI. Adds a mic button to chat, records audio via browser MediaRecorder, transcribes locally vi...

❤️ 0 ⬇️ 128

🧪 Skill

ElevenLabs

Free

ElevenLabs API integration with managed authentication. AI-powered text-to-speech, voice cloning, sound effects, and audio processing. Use this skill when users want to generate speech from text, clon

❤️ 3 ⬇️ 1.4k

🧪 Skill

openclaw-voice

Free

Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.

❤️ 2 ⬇️ 378

🧪 Skill

ANY WHISPER API

Free

--- name: any-whisper-api description: Transcribe audio via API Whisper with any compatible local servers. homepage: https://platform.openai.com/docs/guides/speech-to-text metadata: {"clawdbot":{"emoj

❤️ 2 ⬇️ 230

🧪 Skill

clawdio

Free

Analyze Twitter Spaces and voice conversations to extract market intelligence, crypto alpha, sentiment analysis, and speaker-attributed insights. Transforms spoken audio into structured reports, full

❤️ 2 ⬇️ 777

🧪 Skill

multimodal-parser

Free

Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing

❤️ 0 ⬇️ 98

🧪 Skill

Airfoil

Free

Control AirPlay speakers via Airfoil from the command line. Connect, disconnect, set volume, and manage multi-room audio with simple CLI commands.

❤️ 0 ⬇️ 1.8k

🧪 Skill

OpenClaw YouTube Transcript

Free

Transcribe YouTube videos to text by extracting captions and subtitles directly from the video URL using yt-dlp without audio processing.

❤️ 18 ⬇️ 32k

🧪 Skill

Kokoro TTS

Free

Generate spoken audio from text using the local Kokoro TTS engine. Use when the user asks to "say" something, requests a voice message, or wants text converted to speech.

❤️ 1 ⬇️ 3.9k

🧪 Skill

Otterai Cli

Free

Use when the user mentions Otter, Otter.ai, or wants to find, search, download, export, or manage meeting notes, transcripts, recordings, or audio from calls...

❤️ 0 ⬇️ 93

🧪 Skill

Voice2text

Free

Offline speech-to-text conversion using Vosk local model; input audio file path, output transcript text.

❤️ 0 ⬇️ 141

🧪 Skill

Yt Dlp Downloader

Free

Download videos from YouTube, Bilibili, Twitter, and thousands of other sites using yt-dlp. Use when the user provides a video URL and wants to download it, extract audio (MP3), download subtitles, or

❤️ 8 ⬇️ 5.2k

🧪 Skill

Speechall command-line tool for fast speech-to-text transcription using multiple providers

Free

Install and use the speechall CLI tool for speech-to-text transcription. Use when the user wants to: (1) transcribe audio or video files to text, (2) install speechall on macOS or Linux, (3) list avai

❤️ 0 ⬇️ 1.1k

🧪 Skill

Edge Tts Unlimited

Free

Free, unlimited text-to-speech using Microsoft Edge neural voices via Python edge-tts. Use when generating long-form audio, podcasts, voice notes, spoken bri...

❤️ 0 ⬇️ 27

🧪 Skill

Whisper Tailnet API

Free

Consume the shared Whisper speech-to-text API over Tailnet at http://100.92.116.99:8765 using OpenAI-compatible audio transcription endpoint (/v1/audio/trans...

❤️ 0 ⬇️ 123