Search

1099 results for "audio"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Walkie-Talkie Mode

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

❤️ 1 ⬇️ 1.5k

🧪 Skill

Speech to Text

Free

Transcribe or translate audio files to text using a public Hugging Face Whisper Space over Gradio. Use when the user sends voice notes, audio attachments, me...

❤️ 0 ⬇️ 59

🧪 Skill

Text to Speech

Free

Generate speech audio from text using HeyGen's Starfish TTS model. Use when: (1) Generating standalone speech audio files from text, (2) Converting text to s...

❤️ 0 ⬇️ 179

🧪 Skill

WeryAI Podcast Gen

Free

Free All-in-One AI Audio Generator Platform. Generate an AI podcast discussion or broadcast audio using the WeryAI Podcast API. Create professional podcasts...

❤️ 1 ⬇️ 62

🧪 Skill

Telnyx Stt

Free

Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.

❤️ 0 ⬇️ 663

🧪 Skill

Openai Whisper Api

Free

--- name: openai-whisper-api description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper). homepage: https://platform.openai.com/docs/guides/speech-to-text metadata: {"clawdbot":{"emoji

❤️ 30 ⬇️ 14k

🧪 Skill

Feishu Voice Assistant

Free

--- name: feishu-voice-assistant description: Sends voice messages (audio) to Feishu chats using Duby TTS. tags: [feishu, voice, tts, audio] --- # Feishu Voice Assistant Generate speech from text us

❤️ 0 ⬇️ 432

🧪 Skill

Deepgram Transcribe

Free

Transcribe audio via Deepgram Nova-3 API (5.26% WER, 40x faster than Whisper, built-in speaker diarization). Use when user asks to transcribe audio, podcasts...

❤️ 0 ⬇️ 137

🧪 Skill

Chinese TTS

Free

Generate Chinese TTS audio and send as Feishu voice message. Use when user asks for voice/audio/语音/播报/朗读 in Chinese, or when sending audio messages via Feishu.

❤️ 0 ⬇️ 147

🧪 Skill

FFmpeg CLI

Free

Process video and audio using FFmpeg CLI for transcoding, cutting, merging, audio extraction, thumbnails, GIFs, speed, filters, subtitles, and watermarks.

❤️ 4 ⬇️ 3.6k

🧪 Skill

Podcast Generation with Microsoft Foundry

Free

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation f

❤️ 1 ⬇️ 1.9k

🧪 Skill

Speech

Free

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; ru...

❤️ 0 ⬇️ 83

🧪 Skill

TubeScribe

Free

YouTube video summarizer with speaker detection, formatted documents, and audio output. Works out of the box with macOS built-in TTS. Optional recommended tools (pandoc, ffmpeg, mlx-audio) enhance qua

❤️ 7 ⬇️ 4.0k

🧪 Skill

ton

Free

Ton namespace for Netsnek e.U. audio and media processing tools. Handles audio transcription, format conversion, waveform analysis, and podcast production wo...

❤️ 0 ⬇️ 362

🧪 Skill

mmEasyVoice

Free

Simple text-to-speech skill using MiniMax Voice API. Converts text to audio with customizable voice selection. Use for generating speech audio from text.

❤️ 0 ⬇️ 254

🧪 Skill

MoodCast

Free

Transform any text into emotionally expressive audio with ambient soundscapes using ElevenLabs v3 audio tags and Sound Effects API

❤️ 3 ⬇️ 1.9k

🧪 Skill

K8s Self Hosted Whisper Api

Free

Transcribe audio via the self-hosted Whisper ASR instance running on Kubernetes. Use this skill whenever the user wants to transcribe audio files, convert sp...

❤️ 0 ⬇️ 159

🧪 Skill

WeryAI Podcast Gen

Free

Generate an AI podcast discussion or broadcast audio using the WeryAI Podcast Generation API. Use when the user asks to generate a podcast or audio discussio...

❤️ 0 ⬇️ 9

🧪 Skill

IMA Studio TTS — seed-tts, DouBao

Free

TTS (text-to-speech) via IMA Open API with seed-tts-2.0. Voice synthesis, speech from text, dubbing, audio content creation. Output: audio URL (mp3/wav). Flo...

❤️ 0 ⬇️ 141

🧪 Skill

MH openai-whisper-api

Free

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

❤️ 0 ⬇️ 207

🧪 Skill

Dual-Host Daily Podcast Generator

Free

Generate and publish a dual-host daily podcast. Fetches news, generates a conversational script between two hosts, synthesizes audio via Fish Audio or Edge T...

❤️ 0 ⬇️ 152

🧪 Skill

musa-torch-coding

Free

--- name: musa-torch-coding description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper). homepage: https://platform.openai.com/docs/guides/speech-to-text metadata: { "openc

❤️ 0 ⬇️ 15

🧪 Skill

Walkie-Talkie Mode

Free

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

❤️ 4 ⬇️ 2.1k

🧪 Skill

LTX-2.3 Video API

Free

Generate videos via LTX-2.3 API (ltx.video). Supports text-to-video, image-to-video, audio-to-video (lip-sync from audio + image), extend, and retake. Use wh...

❤️ 0 ⬇️ 99