1099 results for "audio"

Phaya Media API

Use the Phaya SaaS backend to generate images, videos, audio, music, and run LLM chat completions via simple REST API calls. Use when the user wants to gener...

❤️ 0 ⬇️ 100

🧪 Skill

Openai

Free

OpenAI API integration — chat completions, embeddings, image generation, audio transcription, file management, fine-tuning, and assistants via the OpenAI RES...

❤️ 0 ⬇️ 216

🧪 Skill

Zoom Meeting Assistance Rtms Unofficial Community

Free

Zoom RTMS Meeting Assistant — start on-demand to capture meeting audio, video, transcript, screenshare, and chat via Zoom Real-Time Media Streams. Handles meeting.rtms_started and meeting.rtms_stopp

❤️ 1 ⬇️ 2.0k

🧪 Skill

SenseVoice Transcribe

Free

Transcribe audio files (WAV/MP3/M4A/FLAC) to timestamped text using SenseVoice-Small + FSMN-VAD. Supports single-file and batch mode with VAD-anchored per-se...

❤️ 0 ⬇️ 34

🧪 Skill

Firm Spec Compliance Pack

Free

MCP 2025-11-25 specification compliance audit pack. Validates elicitation, tasks, resources/prompts, audio content, JSON Schema 2020-12, SSE transport, and i...

❤️ 0 ⬇️ 137

🧪 Skill

Douyin Video Transcribe

Free

Extract audio from Douyin (抖音/TikTok China) videos and transcribe to text using Whisper. Trigger when user sends a Douyin link (v.douyin.com or www.douyin.co...

❤️ 0 ⬇️ 29

🧪 Skill

Gladia YouTube Transcription (Free)

Free

Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...

❤️ 1 ⬇️ 104

🧪 Skill

Dream Talking Image

Free

Generate talking videos from images using Talking Image API. Create talking videos from audio and images, supporting non-human faces like pets or animated ch...

❤️ 0 ⬇️ 80

🧪 Skill

Supplier Video Ad Builder

Free

Transforms supplier or CJ source videos into 1080×1920 TikTok/Instagram Reels ads with clean zone detection, Pillow text overlays, CTA card, and trending audio.

❤️ 0 ⬇️ 90

🧪 Skill

ComfyUI TTS

Free

Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options.

❤️ 0 ⬇️ 478

🧪 Skill

Smart Speak Multilingual TTS

Free

Multilingual Text-to-Speech (TTS) with intelligent Pinyin-to-Hanzi conversion. Use when the user asks to generate audio for text that contains a mix of Vietn...

❤️ 0 ⬇️ 42

🧪 Skill

Speak Turbo - Talk to your Claude 90ms latency!

Free

Give your agent the ability to speak to you in real time. Talk to your Claude! Ultra-fast TTS, text-to-speech, voice synthesis, audio output with ~90ms latency....

❤️ 0 ⬇️ 422

🧪 Skill

Podcast Generation from PDF, Text, and Links

Free

Generate AI podcast episodes from PDFs, text, notes, and links using MagicPodcast in OpenClaw. Creates natural two-person dialogue audio, supports custom lan...

❤️ 2 ⬇️ 620

🧪 Skill

Faster Whisper Transcription

Free

Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.

❤️ 0 ⬇️ 514

🧪 Skill

TTS AutoPlay with Wake Word

Free

Auto-play TTS voice files with wake word detection. Only plays audio when user message contains wake words like "语音", "念出来", "voice", etc. Perfect for Webcha...

❤️ 0 ⬇️ 174

🧪 Skill

Gladia YouTube Transcription (Free)

Free

Transcribe speech from YouTube videos or audio URLs into text using Gladia API with up to 10 free hours of monthly transcription. Use when: you need to summa...

❤️ 1 ⬇️ 91

🧪 Skill

FFHub FFmpeg Skill

Free

Process video/audio files using FFHub.io cloud FFmpeg API. Use when the user wants to convert, compress, trim, resize, extract audio, generate thumbnails, or...

❤️ 0 ⬇️ 82

🧪 Skill

WiiM

Free

Control WiiM audio devices (play, pause, stop, next, prev, volume, mute, play URLs, presets). Use when the user wants to control music playback, adjust volume, discover WiiM/LinkPlay speakers on the n

❤️ 3 ⬇️ 841

🧪 Skill

Whisper Transcribe

Free

Transcribe audio files to text using OpenAI Whisper. Supports speech-to-text with auto language detection, multiple output formats (txt, srt, vtt, json), batch processing, and model selection (tiny to

❤️ 2 ⬇️ 973

🧪 Skill

Youtube Transcript Api

Free

Extract, transcribe, and translate YouTube video transcripts using the YouTubeTranscript.dev V2 API. Supports captions, ASR audio transcription, batch proces...

❤️ 0 ⬇️ 424

🧪 Skill

Whisper STT

Free

Free local speech-to-text transcription using OpenAI Whisper. Transcribe audio files (mp3, wav, m4a, ogg, etc.) to text without API costs. Use when: (1) User...

❤️ 0 ⬇️ 354

🧪 Skill

clawdio

Free

Analyze Twitter Spaces and voice conversations to extract market intelligence, crypto alpha, sentiment analysis, and speaker-attributed insights. Transforms spoken audio into structured reports, full

❤️ 2 ⬇️ 786

🧪 Skill

Local Vosk STT

Free

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

❤️ 0 ⬇️ 750

🧪 Skill

rupali

Free

Playful virtual girlfriend voice companion. Use when the user wants short, flirty, friendly text replies returned as Bulbul v3 audio across chat channels (Discord/Telegram/WhatsApp). Generate a brief

❤️ 0 ⬇️ 640