Search

1144 results for "audio"

🧪 Skill

Transcribe audio via Groq API (~10x cheaper than OpenAI API)

Free

Transcribe audio via Groq Automatic Speech Recognition (ASR) Models (Whisper).

❤️ 1 ⬇️ 122

🧪 Skill

🎤 Transcribe audio files using Qwen ASR. 千问STT

Free

Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.

❤️ 1 ⬇️ 139

🧪 Skill

Audio Visualization

Free

Generate audio visualization videos using each::sense AI. Create waveforms, spectrum analyzers, particle effects, 3D visualizations, and beat-synced animatio...

❤️ 2 ⬇️ 447

🧪 Skill

Audio Reply

Free

Generate audio replies using TTS. Trigger with "read it to me [public URL]" to fetch and read content aloud, or "talk to me [topic]" to generate a spoken res...

❤️ 3 ⬇️ 2.1k

🧪 Skill

Alicloud Ai Audio Cosyvoice Voice Clone

Free

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...

❤️ 0 ⬇️ 45

🧪 Skill

Alicloud Ai Audio Livetranslate

Free

Use when live speech translation is needed with Alibaba Cloud Model Studio Qwen LiveTranslate models, including bilingual meetings, realtime interpretation,...

❤️ 0 ⬇️ 46

🧪 Skill

Volcengine TTS Audio Synthesis

Free

Text-to-speech generation on Volcengine (ByteDance) speech services. Use when users need narration, multi-language speech output, voice selection, or TTS tro...

❤️ 0 ⬇️ 108

🧪 Skill

Audio Broadcast

Free

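Control the 小播鼠 broadcast system for audio playback and broadcast announcements. Use when the user needs to play audio on broadcast devices, set the volume, manage scheduled broadcast tasks, or check device status. Supports playing...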

❤️ 0 ⬇️ 371

🧪 Skill

Alicloud Ai Audio Tts Realtime

Free

Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instructi...

❤️ 0 ⬇️ 491

🧪 Skill

Alicloud Ai Audio Tts Voice Design

Free

Voice design workflows with Alibaba Cloud Model Studio Qwen TTS VD models. Use when creating custom synthetic voices from text descriptions and using them fo...

❤️ 0 ⬇️ 487

🧪 Skill

Audio Video To Text

Free

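Audio/video-to-text skill that uses Whisper for speech recognition. Supports a variety of audio and video formats and can output plain text, SRT/VTT subtitles, or JSON. Suitable for meeting notes, video subtitle generation, interview...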

❤️ 0 ⬇️ 119

🧪 Skill

Sense Audio

Free

Integration guide for SenseAudio Open Platform APIs, including TTS (sync/SSE/WebSocket), ASR (HTTP/WebSocket), realtime Agents, video generation/storyboard,...

❤️ 0 ⬇️ 29

🧪 Skill

Alicloud Ai Audio Asr Realtime

Free

Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li...

❤️ 0 ⬇️ 42

🧪 Skill

Alicloud Ai Audio Cosyvoice Voice Design

Free

Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...

❤️ 0 ⬇️ 47

🧪 Skill

Sense Audio

Free

Integration guide for SenseAudio Open Platform APIs, including TTS (sync/SSE/WebSocket), ASR (HTTP/WebSocket), realtime Agents, video generation/storyboard,...

❤️ 0 ⬇️ 11

🧪 Skill

Transcribe Audio with Parakeet MLX

Free

Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).

❤️ 1 ⬇️ 1.7k

🧪 Skill

Volcengine TTS Audio Synthesis

Free

Text-to-speech generation on Volcengine (ByteDance) speech services. Use when users need narration, multi-language speech output, voice selection, or TTS tro...

❤️ 0 ⬇️ 96

🧪 Skill

Audio Transcribe

Free

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.

❤️ 0 ⬇️ 750

🧪 Skill

SpeakNotes: YouTube, Audio & Document Summaries

Free

Use when OpenClaw needs to call SpeakNotes API routes directly using an API key and generate transcripts/summaries from YouTube URLs, media files, or documen...

❤️ 0 ⬇️ 117

🧪 Skill

Telnyx Tts

Free

Generate speech audio from text using Telnyx Text-to-Speech API. Use when you need to convert text to spoken audio, create voice messages, or generate audio content.

❤️ 0 ⬇️ 616

🧪 Skill

Transcribe

Free

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

❤️ 2 ⬇️ 2.6k

🧪 Skill

Fish Tts

Free

Generate high-quality speech from text using Fish Audio S1 and optionally upload the MP3 audio file to NextCloud via WebDAV.

❤️ 1 ⬇️ 389

🧪 Skill

Talking Circle

Free

Create animated talking-circle videos (Telegram-style round video messages) from avatar frame images and audio. Supports audio-to-video and text-to-video via...

❤️ 1 ⬇️ 129