Search

279 results for "text-to-speech"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

fal.ai

fal.ai API integration with managed API key authentication. Run AI models for image generation, video generation, audio processing, and more. Use this skill...

❤️ 1 ⬇️ 3.1k

🧪 Skill

deAPI AI Media Suite (Community)

Free

The cheapest AI media API on the market. Generate images (Flux), music (AceStep), speech with voice cloning, transcribe video/audio, OCR, video generation, b...

❤️ 1 ⬇️ 62

🧪 Skill

Ai Marketing Videos

Free

Create AI marketing videos for ads, promos, product launches, and brand content. Models: Veo, Seedance, Wan, FLUX for visuals, Kokoro for voiceover. Types: p...

❤️ 4 ⬇️ 1.2k

🧪 Skill

Elevenlabs Toolkit

Free

ElevenLabs voice API integration — TTS, sound effects, music generation, speech-to-text, voice isolation, and streaming. Use when building voice-enabled apps...

❤️ 0 ⬇️ 245

🧪 Skill

Deepgram Voice Workflow

Free

End-to-end voice workflow with Deepgram STT and TTS. Use when transcribing voice messages, generating spoken replies, or building a shell-based audio pipelin...

❤️ 0 ⬇️ 37

🧪 Skill

Everclaw — Inference You Own

Free

Open-source first AI inference — GLM-5 as default, Claude as fallback only. Own your inference forever via the Morpheus decentralized network. Stake MOR toke...

❤️ 0 ⬇️ 794

🧪 Skill

Open WebUI

Free

Complete Open WebUI API integration for managing LLM models, chat completions, Ollama proxy operations, file uploads, knowledge bases (RAG), image generation, audio processing, and pipelines. Use this

❤️ 0 ⬇️ 834

🧪 Skill

Telegram Voice Bot

Free

Telegram bot that transcribes voice messages using Whisper and replies in Chinese with Microsoft Edge text-to-speech.

❤️ 0 ⬇️ 17

🧪 Skill

Ai Video Generation

Free

Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine...

❤️ 2 ⬇️ 1.2k

🧪 Skill

Doubao ASR / 豆包语音转写

Free

Transcribe audio files via Doubao Seed-ASR 2.0 (豆包录音文件识别模型2.0, recorded audio → text) API from ByteDance/Volcengine. Best-in-class Chinese speech recognition...

❤️ 1 ⬇️ 590

🧪 Skill

PCClaw

Free

PCClaw provides 16 native Windows AI skills for system control, automation, files, notifications, OCR, speech, LLM inference, and task management with minima...

❤️ 0 ⬇️ 227

🧪 Skill

Telnyx Toolkit

Free

Complete Telnyx toolkit — ready-to-use tools (STT, TTS, RAG, Networking, 10DLC) plus SDK documentation for JavaScript, Python, Go, Java, and Ruby.

❤️ 0 ⬇️ 2.1k

🧪 Skill

Voice Agent

Free

--- name: voice-agent display-name: AI Voice Agent Backend version: 1.1.0 description: Local Voice Input/Output for Agents using the AI Voice Agent API. author: trevisanricardo homepage: https://githu

❤️ 0 ⬇️ 2.9k

🧪 Skill

VoiceClaw

Free

Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper T...

❤️ 0 ⬇️ 272

🧪 Skill

Voice messaging setup

Free

--- name: voice-stt-tts description: Full voice message setup (STT + TTS) for OpenClaw using faster-whisper and Edge TTS homepage: https://docs.openclaw.ai/nodes/audio metadata: { "openclaw":

❤️ 0 ⬇️ 216

🧪 Skill

Homeassistant Skill

Free

Control Home Assistant devices and automations via REST API. 25 entity domains including lights, climate, locks, presence, weather, calendars, notifications, scripts, and more. Use when the user asks

❤️ 6 ⬇️ 3.2k

🧪 Skill

Avatar

Free

--- name: avatar description: Interactive AI avatar with Simli video rendering and ElevenLabs TTS emoji: "\U0001F9D1\u200D\U0001F4BB" homepage: https://github.com/Johannes-Berggren/openclaw-avatar met

❤️ 0 ⬇️ 1.0k

🧪 Skill

code2animation

Free

Produce complete code-based animated videos by scripting, generating narration, creating visual assets, and rendering final MP4s using the code2animation fra...

❤️ 2 ⬇️ 263

🧪 Skill

Decentralized Agent Cloud

Free

Decentralized compute and data marketplace for AI agents with spot pricing | 去中心化 AI Agent 计算和数据市场，支持 Spot 动态定价

❤️ 0 ⬇️ 37

🧪 Skill

Alicloud Ai Entry Modelstudio

Free

Route Alibaba Cloud Model Studio requests to the right local skill (Qwen Image, Qwen Image Edit, Wan Video, Wan R2V, Qwen TTS, Qwen ASR and advanced TTS vari...

❤️ 0 ⬇️ 880

🧪 Skill

SOTA AI Model Tracker

Free

Provides daily updated authoritative data and APIs tracking state-of-the-art AI models across categories from LMArena, Artificial Analysis, and HuggingFace.

❤️ 0 ⬇️ 1.5k

🧪 Skill

Google Gemini Media

Free

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understand

❤️ 5 ⬇️ 3.0k

🧪 Skill

Novita AI Multimodal

Free

Execute multimodal tasks using Novita AI: text-to-image, image-to-image, text-to-video, image-to-video, TTS, STT. Use for: generating images, generating vide...

❤️ 0 ⬇️ 33

🧪 Skill

U2-audio-file-transcriber

Free

Transcribe recorded audio files to text via UniCloud ASR API, supporting multiple formats and domains like finance and customer service; requires configured...

❤️ 0 ⬇️ 42