Search

201 results for "speech-to-text"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Qwen Audio

--- name: qwen-audio description: "High-performance audio library with text-to-speech (TTS) and speech-to-text (STT)." version: "0.0.4" --- # Qwen-Audio ## Overview Qwen-Audio is a high-performance

❤️ 1 ⬇️ 169

🧪 Skill

Speech to Text

Free

Transcribe or translate audio files to text using a public Hugging Face Whisper Space over Gradio. Use when the user sends voice notes, audio attachments, me...

❤️ 0 ⬇️ 71

🧪 Skill

Local Vosk STT

Free

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

❤️ 0 ⬇️ 750

🧪 Skill

Related Skill

Free

Discover and install related skills from inference.sh skill registry. Helps find complementary skills for your AI workflow. Use for: skill discovery, workflo...

❤️ 0 ⬇️ 888

🧪 Skill

Local Voice (FluidAudio TTS/STT)

Free

Local text-to-speech (TTS) and speech-to-text (STT) using FluidAudio on Apple Silicon. Sub-second voice synthesis and transcription running entirely on-device via the Apple Neural Engine. Use when set

❤️ 1 ⬇️ 1.1k

🔌 MCP

voice-mcp

Free

Complete voice interaction server supporting speech-to-text, text-to-speech, and real-time voice conversations through local microphone, OpenAI-compatible APIs, and LiveKit integration

❤️ 0 ⬇️ 0

🧪 Skill

Local Vosk STT

Free

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

❤️ 0 ⬇️ 739

🧪 Skill

hotbutter voice chat

Free

Enables local voice chat by embedding Hotbutter relay server and PWA, providing speech-to-text and text-to-speech via a secure, self-hosted connection.

❤️ 0 ⬇️ 237

🧪 Skill

Argmax Transcription and TTS

Free

On-device speech-to-text (Whisper) + text-to-speech (Qwen3-TTS) CLI. Runs on the Apple Neural Engine (ANE), Apple's low power, dedicated ML inference chip. M...

❤️ 0 ⬇️ 130

🧪 Skill

Argmax Transcription and TTS

Free

On-device speech-to-text (Whisper) + text-to-speech (Qwen3-TTS) CLI. Runs on the Apple Neural Engine (ANE), Apple's low power, dedicated ML inference chip. M...

❤️ 0 ⬇️ 141

🧪 Skill

dy-video-to-text

Free

Extract speech-to-text from Douyin (TikTok China) videos, get watermark-free download links, and download videos. Use when user shares a Douyin link, asks to...

❤️ 0 ⬇️ 74

🔌 MCP

brainiall-mcp-server

Free

AI-powered speech tools: pronunciation assessment with phoneme-level feedback, speech-to-text with language detection, and text-to-speech with multiple voices.

❤️ 0 ⬇️ 0

🧪 Skill

Simple sound-to-text skill locally

Free

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...

❤️ 0 ⬇️ 60

🧪 Skill

Telnyx Stt

Free

Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.

❤️ 0 ⬇️ 663

🧪 Skill

AudioPod

Free

Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to

❤️ 3 ⬇️ 2.7k

🧪 Skill

MLX Local Inference Stack

Free

Full local AI inference stack on Apple Silicon Macs via MLX. Includes: LLM chat (Qwen3-14B, Gemma3-12B), speech-to-text ASR (Qwen3-ASR, Whisper), text embedd...

❤️ 1 ⬇️ 307

🧪 Skill

Azure Ai Transcription Py

Free

Azure AI Transcription SDK for Python. Use for real-time and batch speech-to-text transcription with timestamps and diarization. Triggers: "transcription", "speech to text", "Azure AI Transcription",

❤️ 1 ⬇️ 1.7k

🧪 Skill

Simple sound-to-text skill locally

Free

Local speech-to-text using OpenAI Whisper. Use when the user needs to: (1) transcribe audio files to text, (2) convert voice messages to written content, (3)...

❤️ 0 ⬇️ 49

🧪 Skill

Volcengine STT

Free

Transcribe audio to text using Volcano Engine (Volcengine/ARK) speech-to-text APIs. Use when the user wants to replace Whisper/OpenAI STT with Volcengine, tr...

❤️ 1 ⬇️ 184

🧪 Skill

Simple stt(sound-to-text) locally

Free

Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.

❤️ 0 ⬇️ 126

🧪 Skill

Faster Whisper

Free

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...

❤️ 4 ⬇️ 5.0k

🧪 Skill

Simple stt(sound-to-text) locally

Free

Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.

❤️ 0 ⬇️ 143

🧪 Skill

Addis Assistant

Free

Provides Speech-to-Text (STT) and text Translation using the Addis Assistant API (api.addisassistant.com). Use when the user needs to convert an audio file to text (specifically Amharic), or translate

❤️ 1 ⬇️ 1.8k

🧪 Skill

Groq Voice Transcribe

Free

Transcribe audio files via Groq's OpenAI-compatible speech-to-text API. Use when the user sends voice messages or audio files and you need fast cloud speech-...

❤️ 0 ⬇️ 81