Search

297 results for "transcribe"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

dy-video-to-text

Extract speech-to-text from Douyin (TikTok China) videos, get watermark-free download links, and download videos. Use when user shares a Douyin link, asks to...

❤️ 0 ⬇️ 74

🧪 Skill

Bilibili Up To Kb

Free

Convert Bilibili (B站) videos into a searchable text knowledge base. Supports single videos and batch processing of entire UP主 channels. Uses local whisper.cp...

❤️ 0 ⬇️ 177

🧪 Skill

Adb Claw

Free

Your eyes, hands, and ears on Android. See the screen (screenshot + indexed UI tree), interact (tap, swipe, scroll, type, clear-field), navigate via deep lin...

❤️ 1 ⬇️ 143

🧪 Skill

Ai Task Hub

Free

AI task hub for image analysis, background removal, speech-to-text, text-to-speech, markdown conversion, and async execute/poll/presentation orchestration. U...

❤️ 1 ⬇️ 58

🧪 Skill

Auto Subtitle

Free

视频自动字幕生成器，批量为视频生成字幕文件（SRT/VTT），结合视频帧提取和语音转文字，预览模式和撤销功能！

❤️ 0 ⬇️ 0

🧪 Skill

Faster Whisper Local Service

Free

OpenClaw local speech-to-text backend using faster-whisper over HTTP on 127.0.0.1:18790. Use when you want voice transcription without external APIs, without...

❤️ 0 ⬇️ 824

🧪 Skill

Feishu Voice (ElevenLabs)

Free

Send and receive voice messages on Feishu (Lark) using ElevenLabs TTS and STT. Activate when user asks to send a voice message on Feishu, or when receiving a...

❤️ 0 ⬇️ 179

🧪 Skill

Doc Process

Free

Document intelligence: categorize, autofill forms, analyze contracts, scan receipts/invoices, analyze bank statements, parse resumes/CVs, scan IDs/passports...

❤️ 1 ⬇️ 270

🔌 MCP

Transcriptor

Free

Extract transcripts, subtitles, and detailed metadata from videos across multiple social media platforms. Access official captions or auto-generated text to quickly analyze content without watching th

❤️ 0 ⬇️ 154

🧪 Skill

Google Gemini Media

Free

Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understand

❤️ 5 ⬇️ 3.0k

🧪 Skill

Agent Memory Architecture

Free

Complete zero-dependency memory system for AI agents — file-based architecture, daily notes, long-term curation, context management, heartbeat integration, a...

❤️ 2 ⬇️ 175

🧪 Skill

Whisper Tailnet API

Free

Consume the shared Whisper speech-to-text API over Tailnet at http://100.92.116.99:8765 using OpenAI-compatible audio transcription endpoint (/v1/audio/trans...

❤️ 0 ⬇️ 123

🧪 Skill

WA Relay

Free

WhatsApp message relay and firewall for OpenClaw agents. Intercepts messages from third parties (non-owner contacts), notifies the owner, and sends replies o...

❤️ 0 ⬇️ 72

🧪 Skill

Loom Workflow

Free

AI-native workflow analyzer for Loom recordings. Breaks down recorded business processes into structured, automatable workflows. Use when: - Analyzing Loom videos to understand workflows - Extracting

❤️ 0 ⬇️ 1.7k

🧪 Skill

Elevenlabs Toolkit

Free

ElevenLabs voice API integration — TTS, sound effects, music generation, speech-to-text, voice isolation, and streaming. Use when building voice-enabled apps...

❤️ 0 ⬇️ 245

🧪 Skill

Voice Memo

Free

Send native iMessage voice bubbles with ElevenLabs TTS via BlueBubbles. Use when: user asks to send a voice message, wants something spoken aloud, storytelli...

❤️ 0 ⬇️ 246

🧪 Skill

Clawhub Skill Content Ingestion

Free

Turn any URL into structured content — YouTube videos (via Gemini Video API), web articles, PDFs, and audio files. Extract transcripts, summaries, and metada...

❤️ 0 ⬇️ 81

🧪 Skill

Jetson CUDA Voice Pipeline

Free

Fully offline, CUDA-accelerated local voice assistant pipeline for NVIDIA Jetson. Wake word (openWakeWord) → real-time VAD → whisper.cpp GPU STT → LLM → Pipe...

❤️ 0 ⬇️ 286

🧪 Skill

Elevenlabs Tts

Free

ElevenLabs TTS (Text-to-Speech) with emotional audio tags for expressive voice synthesis. WhatsApp-compatible voice messages with Opus conversion. Supports 7...

❤️ 6 ⬇️ 4.9k

🧪 Skill

Venice API Kit

Free

Complete Venice AI API toolkit - image generation, video, audio, embeddings, transcription, characters, models, and admin functions. Privacy-focused inferenc...

❤️ 0 ⬇️ 456

🧪 Skill

PaperPod

Free

Isolated agent runtime for code execution, live preview URLs, browser automation, 50+ tools (ffmpeg, sqlite, pandoc, imagemagick), LLM inference, and persistent memory — all via CLI or HTTP, no SDK

❤️ 2 ⬇️ 1.1k

🧪 Skill

Video Subtitle Generator

Free

Generate and translate video subtitles using WhisperX and LLM translation. Use when processing video files to create .srt subtitle files. Supports multilingu...

❤️ 1 ⬇️ 78

🧪 Skill

DeepThink

Free

Manage your personal knowledge, store insights, track tasks, and stay accountable by syncing and updating your DeepThink user data and todos.

❤️ 3 ⬇️ 2.1k

🧪 Skill

Bidirectional Voice Chat System

Free

双向语音对话系统 - 语音识别转文字 + Edge TTS语音合成 + Cloudflare Tunnel公网访问

❤️ 0 ⬇️ 166