Search

Transcribe, index, and semantically search all voice recordings, extracting action items and meeting insights for comprehensive conversation intelligence.

❤️ 0 ⬇️ 509

🧪 Skill

Percept Listen

Free

Captures ambient audio from wearable devices, transcribes locally, and streams searchable, speaker-tagged conversation data to your OpenClaw agent.

❤️ 0 ⬇️ 238

🧪 Skill

Faster Whisper Transcription

Free

Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.

❤️ 0 ⬇️ 514

🧪 Skill

X Reader

Free

Fetch, transcribe, and analyze content from URLs, files, or transcripts across multiple platforms, providing personalized, multi-dimensional insights.

❤️ 0 ⬇️ 535

🧪 Skill

asr-file-transfer

Free

Transcribe recorded audio files to text via UniSound UniCloud ASR API, supporting multiple formats and optimized for finance and customer service domains.

❤️ 0 ⬇️ 99

🧪 Skill

deAPI AI Media Suite (Community)

Free

The cheapest AI media API on the market. Generate images (Flux), music (AceStep), speech with voice cloning, transcribe video/audio, OCR, video generation, b...

❤️ 1 ⬇️ 48

🔌 MCP

voicesphere-mcp

Free

Launch voice collection campaigns for feature phones, list active tasks, and monitor campaign stats. Validate and transcribe audio samples automatically to ensure high-quality datasets. Credit mobile

❤️ 0 ⬇️ 10

🧪 Skill

Elevenlabs Integration with Openclaw

Free

ClawVox - ElevenLabs voice studio for OpenClaw. Generate speech, transcribe audio, clone voices, create sound effects, and more.

❤️ 3 ⬇️ 2.4k

🧪 Skill

deAPI - AI Media Generation Toolkit

Free

AI media generation via deAPI. Transcribe YouTube/audio/video, generate images from text, text-to-speech, OCR, remove backgrounds, upscale images, create vid...

❤️ 0 ⬇️ 37

🧪 Skill

Gemini STT

Free

Transcribe audio files using Google's Gemini API or Vertex AI

❤️ 2 ⬇️ 2.7k

🧪 Skill

ElevenLabs Speech-to-Text

Free

Transcribe audio files using ElevenLabs Speech-to-Text (Scribe v2).

❤️ 5 ⬇️ 3.3k

🧪 Skill

YouTube Transcript Pipeline Lite

Free

Run a lightweight YouTube transcript workflow: transcribe, attribution cleanup, translation, and packaging with minimal tooling. Use for repeatable transcrip...

❤️ 0 ⬇️ 380

🧪 Skill

Speech to Text

Free

Transcribe or translate audio files to text using a public Hugging Face Whisper Space over Gradio. Use when the user sends voice notes, audio attachments, me...

❤️ 0 ⬇️ 71

🧪 Skill

ElevenLabs STT OpenClaw

Free

Transcribe audio files with ElevenLabs Speech-to-Text (Scribe v2) from the local CLI. Supports diarization, events, JSON output, webhooks, and advanced STT o...

❤️ 0 ⬇️ 222

🧪 Skill

Video Subtitles

Free

Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captio

❤️ 11 ⬇️ 6.1k

🧪 Skill

AssemblyAI advanced speech transcription

Free

Transcribe audio/video with AssemblyAI (local upload or URL), plus subtitles + paragraph/sentence exports.

❤️ 3 ⬇️ 2.5k

🧪 Skill

Youtube Editor

Free

Automate YouTube video editing: download videos, transcribe with Whisper, analyze content using GPT-4, and create Korean SEO-optimized metadata plus consiste...

❤️ 0 ⬇️ 2.1k

🧪 Skill

Agentic Calling

Free

Enable AI agents to autonomously make, receive, transcribe, route, and record phone calls using Twilio with customizable voice messages and IVR support.

❤️ 3 ⬇️ 2.1k

🧪 Skill

Facticity.AI Complete Integration

Free

Complete Facticity.AI integration - fact-check claims, extract claims from content, transcribe links, check link reliability, check credits, and monitor task...

❤️ 0 ⬇️ 265

🧪 Skill

Elevenlabs Integration with Openclaw

Free

ClawVox - ElevenLabs voice studio for OpenClaw. Generate speech, transcribe audio, clone voices, create sound effects, and more.

❤️ 3 ⬇️ 2.4k

🧪 Skill

Walkie-Talkie Mode

Free

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

❤️ 4 ⬇️ 2.1k