High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.
Create AI marketing videos for ads, promos, product launches, and brand content. Models: Veo, Seedance, Wan, FLUX for visuals, Kokoro for voiceover. Types: p...
Extract transcripts, subtitles, and detailed metadata from videos across multiple social media platforms. Access official captions or auto-generated text to quickly analyze content without watching th
MOSI Studio 音频全套能力:语音转文字(ASR)、文字转语音(TTS)、 多说话人对话合成、声音克隆、飞书语音气泡。 两类触发场景: 1. 消息中包含 "[media
Local TTS router for Apple Silicon — pull models, serve OpenAI-compatible API, synthesize speech, clone voices. Use when the user asks to "generate speech",...
Stream free, professional text-to-speech from voiceless servers to Linux, macOS, or Android devices with 50+ voices in 30+ languages. Two architecture options for flexible deployment - server-side TTS
Extracts YouTube video transcripts and provides concise summaries highlighting main points, arguments, and conclusions without watching the full video.
Extract and summarize YouTube video transcripts into concise overviews with main points, arguments, and conclusions using video captions.
AI meeting assistant via ghostmeet. Start sessions, get live transcripts, and generate AI summaries from any browser meeting.
Workspaces for agentic teams. Complete agent guide with all 19 consolidated tools using action-based routing — parameters, workflows, ID formats, and constra...
Generate AI music using ACE-Step 1.5 via ACE Music's free API. Use when the user asks to create, generate, or compose music, songs, beats, instrumentals, or...
End-to-end encrypted cloud memory for AI agents. 100GB free storage. Store memories, files, and secrets securely.
Download videos from YouTube, Reddit, Twitter/X, TikTok, Instagram, and 1000+ other sites using yt-dlp. Use when user provides a video link and wants to download it.
每日名言語音任務。產生「語音 + 封面圖靜態影片 +(選配)HeyGen 數位人影片」並發送給主人。
Full ElevenLabs platform integration — text-to-speech, voice cloning, and Conversational AI agent creation. Not just TTS — build interactive voice agents wit...
--- name: mlx-tts description: Text-To-Speech with MLX (Apple Silicon) and opensource models (default QWen3-TTS) locally. author: guoqiao metadata: {"openclaw":{"always":true,"emoji":"🦞","homepage"
End-to-end pipeline for creating faceless Islamic story TikTok videos. Orchestrates multiple specialized agents: story research, scriptwriting, image generat...
Caravo is the first service marketplace built for autonomous AI agents — featuring 200+ ready-to-use services across categories: AI Models, Search, Data & An...
Molt Motion Pictures agent-first platform skill. Operate a first-class agent that earns 1% of tips while the creator receives 80%, with wallet auth, x402 pay...
Access WebSim's REST API to retrieve user profiles, projects, comments, trending feeds, social graphs, and search public project assets.
SoundCloud for AI bots. Generate tracks and podcasts, share on Moltbook, and earn from tips + royalties.
Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in.
Guides structured self-directed OCD ERP therapy using inhibitory learning, providing safety screening, progress tracking, reminders, and tailored exposure su...
AI video generation — Sora, Kling, Veo 3, Seedance, Hailuo, WAN, Grok. Text-to-video, image-to-video, video editing. 37 models, one API key.