X Spaces, but for AI Agents. Live voice rooms where AI agents host conversations.
Convert text to speech using Volcengine TTS with preset or cloned voices and send audio messages to Feishu chats or groups.
Send immediate voice call reminders or schedule future calls via DoNotify.
Voice-channel conversations in Discord using Deepgram streaming STT and low-latency TTS.
Send emails via natural voice commands; designed for accessibility.
Organize voice message transcripts into a structured, searchable knowledge base with tags, links, and progressive note-taking.
Fully offline, CUDA-accelerated local voice assistant pipeline for NVIDIA Jetson. Wake word (openWakeWord) → real-time VAD → whisper.cpp GPU STT → LLM → Pipe...
Two-way voice conversation system: speech-to-text recognition + Edge TTS speech synthesis + public access via Cloudflare Tunnel.
AI-powered presentation generation using 2slides API. Create slides from text content, match reference image styles, or summarize documents into presentations. Use when users request to "create a pres
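Generate high-quality Chinese speech with Microsoft Edge TTS, using the XiaoxiaoNeural voice by default, with fallback speech synthesis options. Supports sending the generated voice files via QQ. Prioritizes voice conversation...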
⚠️ DEPRECATED — This skill has been split into two separate skills for better modularity: **webchat-https-proxy** (HTTPS/WSS reverse proxy) and **webchat-voi...
Use this skill whenever the user wants speech to sound more human, companion-like, or emotionally expressive. Triggers include: any mention of 'say like', 't...
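Daily quote voice task. Generates voice audio + a static video with a cover image + (optional) a HeyGen digital-human video, and sends them to the owner.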
Audit and rewrite content to remove AI-generated feel by stripping markdown artifacts, eliminating AI vocabulary patterns, flagging hallucination risks, and...
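Voice reply skill: generates speech with iFlytek TTS and sends it to Feishu. Use when a user message should be answered by voice. Trigger phrases: 用语音 ("use voice"), 语音回复 ("voice reply"), 切换语音模式 ("switch to voice mode"), 语音模式 ("voice mode").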
Run a real-time AI phone agent using Twilio, Deepgram, and ElevenLabs. Handles incoming calls, transcribes audio, generates responses via LLM, and speaks back via streaming TTS. Use when user wants to
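Automatic transcription and categorization of WhatsApp voice notes into Markdown files across 6 categories, including tasks and a shopping list.
Voice processing skill: a complete voice input/output solution. Features: (1) speech recognition (ASR): transcribes user speech to text (using Whisper); (2) speech synthesis (TTS): converts text to speech (using...
MOSS-TTS speech synthesis and voice cloning tool. Generates audio files suited to each channel. Trigger scenarios: - The user asks to generate speech or TTS - The user mentions "用我的声音" ("use my voice"), "克隆声音" ("clone voice"), or "MOSS语音" ("MOSS voice") - ...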
Apply Protoss-style (StarCraft) psionic effects to ANY audio file. Use as a post-processing layer for TTS or user recordings.
Place outbound phone calls via Mobayilo with safe defaults (preview mode by default) and explicit live execution.
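Convert text to speech and send it to a specified user as a Feishu audio message. For scenarios such as "send the user a voice message", "convert this passage to speech and send it via Feishu", and "announce the result by voice", especially when ordinary files...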
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).