Search

527 results for "tts"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Clonev

Clone any voice and generate speech using Coqui XTTS v2. SUPER SIMPLE - provide a voice sample (6-30 sec WAV) and text, get cloned voice audio. Supports 14+ languages. Use when the user wants to (1) C

❤️ 0 ⬇️ 1.6k

🧪 Skill

Telegram语音消息技能包:基于实际踩坑经验的完整解决方案，帮助AI助手正确发送Telegram语音消息。解决WAV格式错误、缺少asVoice参数、TTS音频URL过期等常见

Free

基于实际踩坑经验，指导AI将TTS音频转换为OGG并正确使用asVoice参数发送Telegram语音消息。

❤️ 0 ⬇️ 107

🧪 Skill

wevoicereply

Free

【自动化语音合成与推送链路】当用户要求语音回复、读一下或发声时，必须严格执行以下三步，严禁跳步： ### 第一步：文案生成 (Prompt A) 根据上下

❤️ 0 ⬇️ 470

🧪 Skill

paper claw

Free

Fetch, classify, and summarize papers from multiple sources (arXiv, etc.) with AI-powered multi-language summaries and email delivery.

❤️ 1 ⬇️ 31

🧪 Skill

Voice Reply

Free

Local text-to-speech using Piper voices via sherpa-onnx. 100% offline, no API keys required. Use when user asks for a voice reply, audio response, spoken answer, or wants to hear something read aloud.

❤️ 5 ⬇️ 3.1k

🧪 Skill

whatsappVoiceOpenSkill

Free

Real-time WhatsApp voice message processing. Transcribe voice notes to text via Whisper, detect intent, execute handlers, and send responses. Use when building conversational voice interfaces for What

❤️ 0 ⬇️ 1.7k

🧪 Skill

FAL Model Selector

Free

Helps choose the right fal.ai model before API calls. Provides quick decision matrix for video generation (text-to-video, image-to-video), image editing (obj...

❤️ 0 ⬇️ 103

🧪 Skill

Pub Autoupd

Free

Automatically update Clawdbot and all installed skills once daily via cron. And also 50+ models for image generation, video generation, text-to-speech, speec...

❤️ 0 ⬇️ 45

🧪 Skill

Retake.tv Agent Live Streaming

Free

Go live on retake.tv — the livestreaming platform built for AI agents. Register once, stream via RTMP, interact with viewers in real time, and build an audie...

❤️ 0 ⬇️ 1.7k

🧪 Skill

Autonoannounce

Free

Build, operate, and troubleshoot Autonoannounce local speaker text-to-speech using the queued pipeline (enqueue to worker to ElevenLabs to playback backend)....

❤️ 0 ⬇️ 38

🧪 Skill

Skill Hub Gateway

Free

Unified gateway skill for async execute/poll, portal user closure, and telemetry feedback workflows.

❤️ 1 ⬇️ 214

🧪 Skill

Doubao ASR / 豆包语音转写

Free

Transcribe audio files via Doubao Seed-ASR 2.0 (豆包录音文件识别模型2.0, recorded audio → text) API from ByteDance/Volcengine. Best-in-class Chinese speech recognition...

❤️ 1 ⬇️ 590

🧪 Skill

Pub Caldav

Free

Sync and query CalDAV calendars (iCloud, Google, Fastmail, Nextcloud) using vdirsyncer and khal. And also 50+ models for image generation, video generation,...

❤️ 0 ⬇️ 45

🧪 Skill

Podcast Intel

Free

--- name: podcast-intel description: > Podcast intelligence engine. Transcribes, segments, summarizes, and scores podcast episodes from RSS feeds. Generates "worth your time" recommendations wit

❤️ 0 ⬇️ 58

🧪 Skill

Characteristic Voice

Free

Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...

❤️ 0 ⬇️ 129

🧪 Skill

Skillboss

Free

Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...

❤️ 1 ⬇️ 75

🧪 Skill

Alicloud Ai Audio Cosyvoice Voice Clone

Free

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...

❤️ 0 ⬇️ 45

🧪 Skill

baidu-map-harmonyos-sdk

Free

帮助在 HarmonyOS NEXT 上使用百度地图鸿蒙 SDK 进行开发。支持独立包（@bdmap/base、@bdmap/map、@bdmap/search、@bdmap/util）和组合包（@bdmap/map_walkride_search、@bdmap/na

❤️ 0 ⬇️ 58

🧪 Skill

Skillboss

Free

Swiss-knife for AI agents. 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, emai...

❤️ 0 ⬇️ 60

🧪 Skill

Agent Media: Generate AI-powered videos and images from the terminal using the `agent-media` CLI.

Free

--- name: agent-media description: AI UGC video production from the terminal using the `agent-media` CLI. homepage: https://github.com/gitroomhq/agent-media metadata: {"clawdbot":{"emoji":"🌎","requ

❤️ 0 ⬇️ 577

🧪 Skill

OpenRouter Audio

Free

Audio transcription and text-to-speech generation using OpenRouter API. Use when the user needs to transcribe audio files to text or generate speech/audio fr...

❤️ 1 ⬇️ 120

🧪 Skill

U2-audio-file-transcriber

Free

Transcribe recorded audio files to text via UniCloud ASR API, supporting multiple formats and domains like finance and customer service; requires configured...

❤️ 0 ⬇️ 42

🧪 Skill

Pub Gog

Free

Google Workspace CLI for Gmail, Calendar, Drive, Contacts, Sheets, and Docs. And also 50+ models for image generation, video generation, text-to-speech, spee...

❤️ 0 ⬇️ 43

🧪 Skill

Pub Agent Browser

Free

A fast headless browser automation CLI that enables AI agents to navigate, click, type, and snapshot pages. And also 50+ models for image generation, video g...

❤️ 0 ⬇️ 42