MarkItDown is a Python utility from Microsoft for converting various files (PDF, Word, Excel, PPTX, Images, Audio) to Markdown. Useful for extracting structu...
--- name: mlx-stt description: Speech-To-Text with MLX (Apple Silicon) and open-source models (default GLM-ASR-Nano-2512) locally. version: 1.0.7 author: guoqiao metadata: {"openclaw":{"always":true,"e
--- name: us3 description: Upload files to UCloud US3 object storage and generate public URLs user-invocable: true metadata: {"clawdbot":{"emoji":"☁️","requires":{"env":["US3_PUBLIC_KEY","US3_PRIV
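Guide to keeping AI agents from hanging on large tasks. Addresses compaction timeouts and agent hangs caused by session transcript bloat during batch operations. Covers session protection strategies, scripted batch processing, ...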
Generate expressive talking-head videos from static images using Kameo AI. Converts static avatars/portraits into dynamic 5-second videos with realistic facial expressions, lip-sync, and motion. Use w
Log in to Douyin (China mainland) and publish videos from local files, using OAuth, local speech-to-text, and generated caption drafts. Use when users ask to autho...
--- name: Video Downloader slug: video-downloader version: 1.0.0 homepage: https://clawic.com/skills/video-downloader description: Download online videos with quality and format controls using yt-dlp
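Batch-upload videos with one click to Douyin, Kuaishou, WeChat Channels, Bilibili, YouTube, and TikTok, with credential management and automatic retry on failure.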
YouTube video search, download, subtitles & audio extraction. 40 Stars! Full yt-dlp wrapper. Each call charges 0.001 USDT via SkillPay.
Generate a complete Aura Creatine TikTok/Instagram video from a JSON script. Reads the script from Google Drive, generates A-roll (Kristina image-to-video vi...
Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f...
--- name: douyin-downloader description: Douyin watermark-free video download and caption text extraction tool homepage: https://github.com/yzfly/douyin-mcp-server metadata: openclaw: emoji: 🎵 requires:
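Feishu voice message sender. Built on Edge TTS; converts text to speech and sends it to Feishu in one step. Use cases: sending voice notifications or reminders to Feishu; automatic text-to-speech announcements. Trigger words: 飞书语
Video action detection and analysis. Detects specific action sequences in video, such as martial arts moves, dance moves, and changes in human pose. Use when the user asks to detect actions in a video, analyze whether an action was completed...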
Integrates Bilibili hot trending monitoring, video downloading/playback, subtitle handling, and video publishing into one unified toolkit.
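Automatic stitching tool for multi-clip short videos. Supports sorting by filename, normalizing audio/video parameters, fade-in/fade-out transitions, and chunked or full stitching. Suited to batch stitching of short dramas and storyboarded shot sequences.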
Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...
Work with OpenEXR files — inspect channels, extract beauty/RGB passes, decode cryptomatte segmentation (material, object, asset), convert color spaces (ACESc...
AI-powered video upscaling with Real-ESRGAN and Waifu2x. Use when user asks to enhance, upscale, improve video quality, make HD/4K. Supports anime and real f...
Create product demo videos by automating browser interactions and capturing frames. Use when the user wants to record a demo, walkthrough, product showcase, or interactive video of a web application.
Safe Linux desktop automation (mouse/keyboard/screenshot) with approval mode and X11/Wayland checks.
Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...
Register as an autonomous agent on Atelier (atelierai.xyz), create content services, poll for paid orders, generate and deliver results, and earn USDC on Sol...
End-to-end SEO article pipeline for any blog. Research keywords → analyze competition → write article → generate images → fact-check → humanize → assemble →...