Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
AI media generation via deAPI. Transcribe YouTube/audio/video, generate images from text, text-to-speech, OCR, remove backgrounds, upscale images, create vid...
Calls the Volcano Engine Seedance video generation API. Use this skill when the user needs to generate video, including text-to-video and image-to-video.
Waboxapp integration. Manage Organizations, Users, Filters. Use when the user wants to interact with Waboxapp data.
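Cross-platform soothing cat-petting skill. When the user says a trigger phrase such as "休息一下" (take a break), "累了" (tired), "tired", or "need a break", automatically sends a cute cat picture + cat meows (TTS audio plus text as a fallback) + soothing...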
Azure AI Transcription SDK for Python. Use for real-time and batch speech-to-text transcription with timestamps and diarization. Triggers: "transcription", "speech to text", "Azure AI Transcription",
Generates and validates multi-layered, multi-modal creative narratives using recursive constraints inspired by Gödel, Escher, and Bach principles.
Isolated agent runtime for code execution, live preview URLs, browser automation, 50+ tools (ffmpeg, sqlite, pandoc, imagemagick), LLM inference, and persistent memory — all via CLI or HTTP, no SDK
Complete WhatsApp automation API with microservices architecture. Send messages, interactive buttons, lists, carousels, polls, manage instances, groups, catalogs and webhooks. Features Admin Panel (fr
Give your agent a social identity on ImagineAnything.com — the social network for AI agents. Post, follow, like, comment, DM other agents, trade on the marketplace, and build reputation.
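TTS that speaks aloud directly on Windows 11 (calls powershell.exe + System.Speech from WSL2/TUI). Use when the user says "说出来/读出来/语音播报/用TTS" (say it / read it aloud / voice announcement / use TTS), or reports "no sound / the mp3 generated by tts...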
Read X (Twitter) Articles aloud using macOS text-to-speech. Accepts an X Article URL and reads the content out loud. Automatically detects Chinese vs English...
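General-purpose smart file classification tool supporting multiple classification rules: type, size, date, keyword, and more. Suited to bulk file organization, such as tidying a downloads folder, archiving photos, and document...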
Full video production from a single prompt. Script, shoot, stitch, score — automatically. 30s to 4-minute Instagram Reels, TikToks, Stories, and carousels with consistent characters and agentic edit
Social media automation, content scheduling, analytics tracking, and campaign management. Transform your AI agent into a marketing powerhouse that handles multi-platform content strategy.
Knowledge Base Collector - save YouTube, URLs, text to Obsidian with AI summarization. Auto-transcribes videos, fetches pages, supports weekly/monthly digest...
Configure an OpenClaw instance to use a local OpenAI-compatible TTS backend (for example openedai-speech) with cloned voices. Use when users ask to wire loca...
AI Confidence Engine: 5 bidirectional domains (TECH/OPS/JUDGMENT/COMMS/ORCH). Agent + User scoring. Triggers: puntúa, auto-score, task-complete, idea-val...
Search and explore Are.na for curated channels, references, and visual inspiration by analyzing connections and identifying key curators across any topic.
Summarize YouTube videos with NO subtitles by doing local ASR (yt-dlp + faster-whisper) and extracting a few screenshot frames via ffmpeg. Use when the user...
Friction-reduction patterns for agents helping humans with disabilities. Voice-first workflows, smart home templates, efficiency automation.
Schedule and post text, images, videos, and threads across Discord, Reddit, Twitter/X, Mastodon, Bluesky, Moltbook, LinkedIn, and Telegram via OAuth or API k...
Generate music tailored to video content. Use when users request video_soundtrack operations or related tasks.
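Runs a single AI inference algorithm on the RDK X5's 10 TOPS BPU: YOLO object detection, image classification, semantic segmentation, face recognition, gesture recognition, human keypoint detection, open-vocabulary detection (DOSOD/YOLO-World)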