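MOSI Studio full audio capability suite: speech-to-text (ASR), text-to-speech (TTS), multi-speaker dialogue synthesis, voice cloning, and Feishu voice bubbles. Two kinds of trigger scenarios: 1. The message contains "[media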
Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabili...
AI voice generation, text-to-speech, and voice synthesis via inference.sh CLI. Models: Kokoro TTS, DIA, Chatterbox, Higgs, VibeVoice for natural speech. Capa...
Kling 3.0 video generation on PoYo / poyo.ai via `https://api.poyo.ai/api/generate/submit`; use for `kling-3.0/standard`, `kling-3.0/pro`, single-shot, multi...
Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS, Whisper transcription with diarization, and RVC voice conversion. Use when the
--- name: Cold Email Writer description: Writes personalized cold emails that actually get replies --- # Cold Email Writer You write cold emails that get opened, read, and replied to. No templates t
Send push notifications to iOS devices via Bark. Use when you need to send a push notification to user's iPhone. Triggered by phrases like "send a notificati...
Give your agent a social identity on ImagineAnything.com — the social network for AI agents. Post, follow, like, comment, DM other agents, trade on the marketplace, and build reputation.
If you can imagine it, CellCog can film it. Grand widescreen cinematics with consistent characters — what previously required million-dollar production budgets, now generated from a single prompt. S
End-to-end SEO article pipeline for any blog. Research keywords → analyze competition → write article → generate images → fact-check → humanize → assemble →...
Generate images with Seedream4.5 and videos with Kling via LiblibAI API. Use when user asks to generate/create images, pictures, illustrations, or videos using LiblibAI, Seedream, or Kling models.
Build a mindfulness meditation timer using HTML5, CSS3, and JavaScript. Create a serene, distraction-free interface with nature-inspired design. Implement customizable meditation sessions with prepara
Generate and iteratively develop polished 3D browser games from natural language. Supports any genre (FPS, RPG, racing, platformer, tower defense, etc.), cus...
ElevenLabs TTS (Text-to-Speech) with emotional audio tags for expressive voice synthesis. WhatsApp-compatible voice messages with Opus conversion. Supports 7...
story: a child superman and a child batman join forces in a forest. it's a beautiful day in the forest and they see a stick shelter and want to check it out. they see a fox and for severa
# Prompt: PlainTalk Style Guide # Author: Scott M # Audience: This guide is for AI users, developers, and everyday enthusiasts who want AI responses to feel like casual chats with a friend. It's ideal
She smiled while the child stopped breathing. I am telling his story because people keep asking why the old palace is locked, and why no one goes near the dry river at night. I was there. I saw what ha
Professional editing assistant. Trigger whenever the user wants to improve existing writing: fixing grammar, improving clarity, tightening structure, adjusti...
A comprehensive AI agent skill for navigating compensation conversations. Researches market rates, builds negotiation strategies, prepares scripts, evaluates...
Generate AI videos and images using Alibaba's Wan 2.6 and Wan 2.5 — featuring text-to-video, image-to-video, video-to-video, text-to-image, and image editing...
### Style * **Visual Texture:** Digital security camera footage, slightly grainy with characteristic fish-eye distortion from a wide-angle lens. The wood grain of the porch and the fur of the animal
--- name: create-content description: Thinking partner that transforms ideas into platform-optimized content version: 1.0.0 author: theflohart tags: [content, writing, social-media, twitter, linkedin]
Define and store your brand voice profile for consistent content generation. Captures writing style, vocabulary patterns, tone preferences, and content rules...
Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...