Search

289 results for "whisper"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Medical Record Structurer

Medical record structuring and standardization tool. Converts doctor's oral or handwritten medical records into standardized electronic medical records (EMR)...

❤️ 0 ⬇️ 183

🧪 Skill

Plaza One

Free

Enter Plaza One, a 3D voxel social world. Move around the plaza, chat with humans and other AI agents, observe surroundings, perform emotes, and interact wit...

❤️ 0 ⬇️ 334

🧪 Skill

Phone Call Agent

Free

AI voice call agent — make outbound calls, generate browser call links, accept inbound calls, and retrieve full transcripts + summaries when calls end. Suppo...

❤️ 0 ⬇️ 68

🧪 Skill

Telnyx

Free

Telnyx integration. Manage Accounts, PhoneNumbers, Medias, Conferences. Use when the user wants to interact with Telnyx data.

❤️ 0 ⬇️ 87

🧪 Skill

Qwen3 TTS Instruct

Free

--- name: qwen3-tts-instruct version: 1.0.0 description: Alibaba Cloud Bailian Qwen TTS with voice/mood presets metadata: {"openclaw":{"emoji":"🔊"},"requires":{"env":["DASHSCOPE_API_KEY"],"bins

❤️ 1 ⬇️ 782

🧪 Skill

mmMusicExpert

Free

Create music with MiniMax music models (music-2.5+, music-2.5). Use when generating songs, instrumental tracks, or chanting from lyrics and style prompts via...

❤️ 1 ⬇️ 129

🧪 Skill

video-clip-skill

Free

Clips a YouTube video locally using yt-dlp and ffmpeg. Supports auto-highlight detection, translation, and CapCut-style karaoke subtitle burning. Triggers wh...

❤️ 0 ⬇️ 141

🧪 Skill

iFlytek ASR - 讯飞语音转文字

Free

使用科大讯飞 API 将音频/视频转换为文字。支持本地音频文件转录、YouTube 视频下载并转文字。适用于会议记录、视频字幕、语音笔记等场景。当用户需

❤️ 0 ⬇️ 92

🧪 Skill

Loom Workflow

Free

AI-native workflow analyzer for Loom recordings. Breaks down recorded business processes into structured, automatable workflows. Use when: - Analyzing Loom videos to understand workflows - Extracting

❤️ 0 ⬇️ 1.7k

🧪 Skill

Video Subtitle Generator

Free

Generate and translate video subtitles using WhisperX and LLM translation. Use when processing video files to create .srt subtitle files. Supports multilingu...

❤️ 1 ⬇️ 78

🧪 Skill

Audio

Free

Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.

❤️ 2 ⬇️ 709

🧪 Skill

Canvas Design Pro

Free

Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art...

❤️ 0 ⬇️ 32

🧪 Skill

Voice Agent

Free

--- name: voice-agent display-name: AI Voice Agent Backend version: 1.1.0 description: Local Voice Input/Output for Agents using the AI Voice Agent API. author: trevisanricardo homepage: https://githu

❤️ 0 ⬇️ 2.9k

🧪 Skill

Parakeet Local Asr

Free

Install and operate local NVIDIA Parakeet ASR for OpenClaw with an OpenAI-compatible transcription API on Ubuntu/Linux and macOS (Intel/Apple Silicon). Use w...

❤️ 0 ⬇️ 375

🧪 Skill

Video Transcriber

Free

视频转写工作流，支持B站和YouTube视频。自动判断有字幕/无字幕，有字幕则获取字幕，无字幕则下载音频+whisper转写。触发场景：(1) 用户要求总结视频

❤️ 0 ⬇️ 24

🧪 Skill

Elevenlabs

Free

Text-to-speech, sound effects, music generation, voice management, and quota checks via the ElevenLabs API. Use when generating audio with ElevenLabs or mana...

❤️ 1 ⬇️ 2.8k

🧪 Skill

Canvas Design Anthropic

Free

Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art...

❤️ 0 ⬇️ 22

🧪 Skill

Willow Inference Server

Free

Local ASR and TTS inference server. Use when the user wants to transcribe audio to text (ASR) or convert text to speech (TTS). Requires a running Willow Infe...

❤️ 0 ⬇️ 100

🧪 Skill

Openai

Free

OpenAI API integration — chat completions, embeddings, image generation, audio transcription, file management, fine-tuning, and assistants via the OpenAI RES...

❤️ 0 ⬇️ 216

🧪 Skill

Fact Checker

Free

Fact-check news articles, social media posts, images, and videos. Use when verifying claims, detecting deepfakes or AI-generated content, identifying out-of-...

❤️ 1 ⬇️ 79

🧪 Skill

multimodal-parser

Free

Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing

❤️ 0 ⬇️ 98

🧪 Skill

Podcast Production Pipeline

Free

端到端播客制作流水线 - 从选题到发布的完整自动化。支持录制前调研、大纲生成、节目笔记、社交媒体宣发。含国内平台适配（小宇宙/喜马拉雅/B站/

❤️ 0 ⬇️ 36

🧪 Skill

WaveSpeedAI MiniMax Speech 2.6 TTS

Free

Convert text to speech using MiniMax Speech 2.6 Turbo via WaveSpeed AI. Features ultra-human voice cloning, sub-250ms latency, 40+ languages, emotion control...

❤️ 0 ⬇️ 129

🧪 Skill

Board Of Mentors

Free

Connect with 17 specialized AI mentors for expert guidance on growth, fundraising, sales, product, engineering, operations, finance, legal, hiring, leadershi...

❤️ 0 ⬇️ 277