Search

1099 results for "audio"


IMA Studio TTS — seed-tts, DouBao

TTS (text-to-speech) via IMA Open API with seed-tts-2.0. Voice synthesis, speech from text, dubbing, audio content creation. Output: audio URL (mp3/wav). Flo...

❤️ 0 ⬇️ 141

🧪 Skill

Super-Transcribe — Unified Speech-to-Text

Free

Unified speech-to-text skill. Use when the user asks to transcribe audio or video, generate subtitles, identify speakers, translate speech, search transcript...

❤️ 0 ⬇️ 212

🧪 Skill

mmVoiceMaker

Free

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat...

❤️ 3 ⬇️ 359

🧪 Skill

Azure Ai Voicelive Py

Free

Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication with Az...

❤️ 2 ⬇️ 1.8k

🧪 Skill

Freepik

Free

Generate images, videos, icons, audio, and more using Freepik's AI API. Supports Mystic, Flux, Kling, Hailuo, Seedream, RunWay, Magnific upscaling, stock con...

❤️ 2 ⬇️ 572

🧪 Skill

Fizzread

Free

Instant access to 100K+ nonfiction book summaries with 1-minute audio previews. Free demo key included — no signup needed. Search, browse, and listen via Fiz...

❤️ 1 ⬇️ 180

🧪 Skill

Google Gemini Media

Free

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understand...

❤️ 5 ⬇️ 3.0k

🧪 Skill

tts

Free

Convert text or subtitle files into speech audio with options for voice cloning, emotion control, speed adjustment, and timeline-accurate dubbing using Kokor...

❤️ 1 ⬇️ 161

🧪 Skill

tts

Free

Convert text or subtitle files into speech audio with options for voice cloning, emotion control, speed, and timeline-accurate dubbing using Kokoro or Noiz b...

❤️ 1 ⬇️ 2.5k

🧪 Skill

openlesson

Free

Interact with the openLesson tutoring API to generate learning plans, start audio-based sessions, analyze reasoning gaps, and manage tutoring workflows.

❤️ 2 ⬇️ 297

🧪 Skill

Record screen, microphone or camera from macOS terminal

Free

macOS CLI tool to record microphone audio, screen video or screenshot, and camera video or photo from the terminal with device listing and output control.

❤️ 5 ⬇️ 1.1k

🧪 Skill

Seedance 2.0 — AI Video by ByteDance

Free

Generate AI videos using ByteDance's Seedance 1.5 Pro — a native audio-visual joint generation model with cinematic camera control, multi-language lip-sync,...

❤️ 1 ⬇️ 270

🧪 Skill

Speech To Text

Free

Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation,...

❤️ 0 ⬇️ 1.7k

🧪 Skill

AudioPod

Free

Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to...

❤️ 3 ⬇️ 2.7k

🧪 Skill

Language Tutor

Free

Create language learning audio with SenseAudio TTS, including pronunciation drills, bilingual lessons, slowed speech practice, and dialogue exercises. Use wh...

❤️ 0 ⬇️ 16

🧪 Skill

video-translation

Free

Translate and dub videos from one language to another, replacing the original audio with TTS while keeping the video intact.

❤️ 0 ⬇️ 92

🧪 Skill

feishu-video

Free

Send voice/audio messages to Feishu (Lark) users. Converts audio files to OPUS format and sends them as voice messages, not file attachments. Created by Alex.

❤️ 0 ⬇️ 0

🧪 Skill

Subtitle Generator

Free

Generate synchronized subtitles (SRT/VTT/ASS) from video audio with precise timestamps. Use when users need subtitles, captions, or video transcription with...

❤️ 0 ⬇️ 53

🧪 Skill

Briefing Room

Free

Daily news briefing generator — produces a conversational radio-host-style audio briefing + DOCX document covering weather, X/Twitter trends, web trends, world news, politics, tech, local news, spor...

❤️ 0 ⬇️ 1.1k

🧪 Skill

Skill Tiktok Video Pipeline

Free

End-to-end TikTok ad video pipeline. Product script → Veo base video → animated caption overlay → audio mix → final MP4. One command, full automation.

❤️ 0 ⬇️ 208

🧪 Skill

WaveSpeedAI Infinitetalk Talking Avatar Video Generation

Free

Generate talking head videos from a portrait image and audio using WaveSpeed AI's InfiniteTalk model. Produces lip-synced video up to 10 minutes long at 480p...

❤️ 0 ⬇️ 138

🧪 Skill

ClawHub - YouTube Downloader & Clipper

Free

Clip and download specific time ranges or full YouTube videos in various qualities, including audio-only MP3 extraction, using precise timestamps.

❤️ 0 ⬇️ 1.9k

🧪 Skill

Voice Assistant

Free

Real-time voice assistant for OpenClaw. Streams mic audio through configurable STT (Deepgram or ElevenLabs) into your OpenClaw agent, then speaks the response via configurable TTS (Deepgram Aura or El...

❤️ 4 ⬇️ 1.3k

🧪 Skill

Talking Head Production

Free

Talking head video production with AI avatars, lipsync, and voiceover. Covers portrait requirements, audio quality, OmniHuman, PixVerse lipsync, Dia TTS. Use...

❤️ 0 ⬇️ 477