Search

289 results for "transcribe"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

sense-music

Analyzes audio to detect BPM, key, structure, genre, mood, transcribe lyrics, and generate visual and textual summaries of music tracks.

❤️ 0 ⬇️ 11

🧪 Skill

SpeakNotes: YouTube, Audio & Document Summaries

Free

Use when OpenClaw needs to call SpeakNotes API routes directly using an API key and generate transcripts/summaries from YouTube URLs, media files, or documen...

❤️ 0 ⬇️ 106

🧪 Skill

Faster Whisper

Free

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...

❤️ 4 ⬇️ 5.0k

🧪 Skill

GifHorse

Free

Search video dialogue and create reaction GIFs with timed subtitles. Perfect for creating meme-worthy clips from movies and TV shows.

❤️ 1 ⬇️ 2.0k

🧪 Skill

Argmax Transcription and TTS

Free

On-device speech-to-text (Whisper) + text-to-speech (Qwen3-TTS) CLI. Runs on the Apple Neural Engine (ANE), Apple's low power, dedicated ML inference chip. M...

❤️ 0 ⬇️ 141

🧪 Skill

2nd Brain

Free

Personal knowledge base for capturing and retrieving information about people, places, restaurants, games, tech, events, media, ideas, and organizations. Use...

❤️ 0 ⬇️ 767

🧪 Skill

Voice messaging setup

Free

--- name: voice-stt-tts description: Full voice message setup (STT + TTS) for OpenClaw using faster-whisper and Edge TTS homepage: https://docs.openclaw.ai/nodes/audio metadata: { "openclaw":

❤️ 0 ⬇️ 216

🧪 Skill

AudioPod

Free

Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to

❤️ 3 ⬇️ 2.7k

🧪 Skill

autoglmasr

Free

AutoGLM ASR MCP 服务：长音频并发转录、上下文传递、时间戳分段。基于智谱 GLM-ASR-2512。触发词：语音识别、ASR、转录、转录音频、长音频

❤️ 0 ⬇️ 131

🧪 Skill

Subtitle Generator

Free

Generate synchronized subtitles (SRT/VTT/ASS) from video audio with precise timestamps. Use when users need subtitles, captions, or video transcription with...

❤️ 0 ⬇️ 53

🧪 Skill

Venice AI

Free

Complete Venice AI platform — text generation, web search, embeddings, TTS, speech-to-text, image generation, video creation, upscaling, and AI editing. Private, uncensored AI inference for everythi

❤️ 2 ⬇️ 2.3k

🧪 Skill

MLX Local Inference Stack

Free

Full local AI inference stack on Apple Silicon Macs via MLX. Includes: LLM chat (Qwen3-14B, Gemma3-12B), speech-to-text ASR (Qwen3-ASR, Whisper), text embedd...

❤️ 1 ⬇️ 307

🧪 Skill

Youmind Youtube Transcript

Free

Extract YouTube video transcripts and subtitles via YouMind API — no yt-dlp, no proxy, no local dependencies. Batch extract up to 5 videos at once with paral...

❤️ 2 ⬇️ 165

🧪 Skill

Voice Agent

Free

--- name: voice-agent display-name: AI Voice Agent Backend version: 1.1.0 description: Local Voice Input/Output for Agents using the AI Voice Agent API. author: trevisanricardo homepage: https://githu

❤️ 0 ⬇️ 2.9k

🧪 Skill

dy-video-to-text

Free

Extract speech-to-text from Douyin (TikTok China) videos, get watermark-free download links, and download videos. Use when user shares a Douyin link, asks to...

❤️ 0 ⬇️ 74

🧪 Skill

ClawdBites

Free

Extract recipes from Instagram reels. Use when a user sends an Instagram reel link and wants to get the recipe from the caption. Parses ingredients, instructions, and macros into a clean format.

❤️ 0 ⬇️ 1.7k

🧪 Skill

Openai

Free

OpenAI API integration — chat completions, embeddings, image generation, audio transcription, file management, fine-tuning, and assistants via the OpenAI RES...

❤️ 0 ⬇️ 216

🧪 Skill

video-download

Free

Download videos from 1800+ websites and generate subtitles using Faster Whisper AI. Use when user wants to download videos from YouTube, Bilibili, Twitter, T...

❤️ 2 ⬇️ 1.2k

🧪 Skill

Video Captions

Free

Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in.

❤️ 2 ⬇️ 481

🧪 Skill

Bilibili Up To Kb

Free

Convert Bilibili (B站) videos into a searchable text knowledge base. Supports single videos and batch processing of entire UP主 channels. Uses local whisper.cp...

❤️ 0 ⬇️ 177

🧪 Skill

Faster Whisper Local Service

Free

OpenClaw local speech-to-text backend using faster-whisper over HTTP on 127.0.0.1:18790. Use when you want voice transcription without external APIs, without...

❤️ 0 ⬇️ 824

🧪 Skill

Google Gemini Media

Free

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understand

❤️ 5 ⬇️ 3.0k

🧪 Skill

Auto Subtitle

Free

视频自动字幕生成器，批量为视频生成字幕文件（SRT/VTT），结合视频帧提取和语音转文字，预览模式和撤销功能！

❤️ 0 ⬇️ 0

🧪 Skill

Agent Memory Architecture

Free

Complete zero-dependency memory system for AI agents — file-based architecture, daily notes, long-term curation, context management, heartbeat integration, a...

❤️ 2 ⬇️ 175