MCP server for video analysis — extracts transcripts, key frames, OCR text, and annotated timelines from video URLs. Supports Loom and direct video files (.mp4, .webm). Zero auth required.
Interact with YouTube to search videos, get details, and manage transcripts.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Stock sentiment analysis and financial data for US equity research. Analyze multi-engine AI sentiment (Grok, DeepSeek, GPT-5), earnings call transcripts, soc...
AI podcast editing as a service. Upload raw audio or submit a URL, get back edited episodes with filler words removed, noise reduction, transcripts, show notes, and social clips. Includes webhooks for
Organize voice message transcripts into a structured, searchable knowledge base with tags, links, and progressive note-taking.
Query MeetGeek meeting intelligence from CLI - list meetings, get AI summaries, transcripts, action items, and search across all your calls with natural language.
Extract and summarize YouTube video transcripts into concise overviews with main points, arguments, and conclusions using video captions.
OpenClaw agent skill for converting documents to Markdown. Documentation and utilities for Microsoft's MarkItDown library. Supports PDF, Word, PowerPoint, Excel, images (OCR), audio (transcription), H
--- name: musa-torch-coding description: Transcribe audio via OpenAI Audio Transcriptions API (Whisper). homepage: https://platform.openai.com/docs/guides/speech-to-text metadata: { "openc
Ton namespace for Netsnek e.U. audio and media processing tools. Handles audio transcription, format conversion, waveform analysis, and podcast production wo...
Headless Google Meet bot that joins meetings and captures live captions as transcripts.
This official skill from the Voicenotes team gives OpenClaw access to new APIs and the ability to search semantically, retrieve full transcripts, filter by t...
--- name: aliyun-asr description: "Pure Aliyun ASR skill for voice message transcription, supports multiple channels including Feishu" metadata: { "openclaw": { "emoji": "🎙️",
Send bots to Zoom, Google Meet, and Microsoft Teams meetings. Get live transcripts, recordings, and reports. Works with Vexa Cloud or your own self-hosted in...
An MCP server to make phone calls using VoIP/SIP and OpenAI's Realtime API and observe the transcript.
Transcribe meetings with speaker identification and generate summaries with action items. Use when users need meeting transcription, meeting notes, or want t...
Access Granola AI meeting notes via MCP (mcporter). Query meetings, list by date range, get full details, and pull verbatim transcripts. Use when the user as...
Fetch and read transcripts from YouTube videos. Use when you need to summarize a video, answer questions about its content, or extract information from it.
Provide advanced YouTube data extraction and analysis capabilities including multi-language transcript extraction, comprehensive search, and trend detection. Enable efficient and quota-friendly access
Ensures AI agents maintain consistent identity by auditing soul rules, detecting behavioral drift in transcripts, and injecting context-aware reminders.
Generate synchronized subtitles (SRT/VTT/ASS) from video audio with precise timestamps. Use when users need subtitles, captions, or video transcription with...
Local text-to-speech (TTS) and speech-to-text (STT) using FluidAudio on Apple Silicon. Sub-second voice synthesis and transcription running entirely on-device via the Apple Neural Engine. Use when set
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.