🧪 Skills
K8s Self Hosted Whisper Api
Transcribe audio via the self-hosted Whisper ASR instance running on Kubernetes. Use this skill whenever the user wants to transcribe audio files, convert sp...
v0.1.0
Description
name: self-hosted-whisper-api description: Transcribe audio via the self-hosted Whisper ASR instance running on Kubernetes. Use this skill whenever the user wants to transcribe audio files, convert speech to text, generate subtitles, or translate audio. Triggers on audio transcription, speech-to-text, whisper, voice-to-text, subtitle generation, or audio translation requests. version: 1.0.0 user-invocable: true metadata: {"openclaw": {"emoji": "🎙️", "requires": {"bins": ["curl"]}}}
Self-Hosted Whisper API (curl)
Transcribe an audio file via the Whisper ASR webservice at http://whisper-asr.whisper-asr.svc.cluster.local:9000.
Uses the onerahmet/openai-whisper-asr-webservice API (/asr endpoint).
Quick start
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
Defaults:
- Endpoint:
http://whisper-asr.whisper-asr.svc.cluster.local:9000/asr - Task:
transcribe - Output:
txt
Useful flags
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --language en --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language de
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --output srt --out /tmp/subtitles.srt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --output vtt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --translate
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --vad-filter --json
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --word-timestamps --json
Notes
- Supported
--outputformats:txt,json,vtt,srt,tsv --translateproduces an English transcript regardless of source language--vad-filterenables voice activity detection to skip silent sections--word-timestampsadds word-level timing (use with--json)- The model is configured on the server side (ASR_MODEL env var), not per request
- Swagger docs available at
http://whisper-asr.whisper-asr.svc.cluster.local:9000/docs - No authentication required
Reviews (0)
Sign in to write a review.
No reviews yet. Be the first to review!
Comments (0)
No comments yet. Be the first to share your thoughts!