Download videos, extract transcripts, capture frames. Analyze YouTube, tutorials, DD videos with yt-dlp + Whisper + ffmpeg.
Run a unified SEO and GEO audit for a website, page, or domain. Use when the user asks for a full SEO audit, GEO audit, AI visibility review, EEAT review, en...
Local TTS router for Apple Silicon: pull models, serve an OpenAI-compatible API, synthesize speech, clone voices. Use when the user asks to "generate speech",...
Generate speech from text using Kyutai Pocket TTS - lightweight, CPU-friendly, streaming TTS with voice cloning. English only. ~6x real-time on M4 MacBook Air.
Freshsales integration. Manage CRM and sales data, records, and workflows. Use when the user wants to interact with Freshsales data.
Organizes family knowledge into Inbox (raw capture by scene), Cognition (distilled insights by 5 dimensions), and Guidebook (validated reusable methods) in O...
Lattice integration. Manage Persons, Organizations, Roles, Activities, Notes, Files. Use when the user wants to interact with Lattice data.
Lightning Network payments via Archon DIDs - create wallets, send/receive sats, verify payments, Lightning Address zaps
Save and retrieve "read it later" content - links, articles, notes, and ideas for future reference. Use when the user wants to (1) save a link, URL, article,...
Analyze audio quality, detect noise types, and provide improvement recommendations. Use when users need to check audio quality, validate recordings, or ident...
Transcribe audio files to text using local speech recognition. Triggers on: "转录", "transcribe", "语音转文字", "ASR", "识别音频", "把这段音频转成文字".
Integrate Google NotebookLM capabilities into your workflow via the unofficial notebooklm-py library. Use when you need to: create/manage notebooks, import s...
Create explainer videos with narration and AI-generated visuals. Triggers on: "解说视频", "explainer video", "explain this as a video", "tutorial video", "introd...
Free All-in-One AI Image Generator Platform. Access FLUX, Midjourney alternatives, Wan AI, and Qwen Image in one place. Generate photorealistic 8K images nat...
Free All-in-One AI Video Generator Platform. Access Kling AI, Google Veo, Sora 2, and Runway in one place. Generate cinematic Text-to-Video, Image-to-Video,...
Build a personal quotes system for saving, discovering, and automatically surfacing meaningful words.
Analyze Twitter Spaces and voice conversations to extract market intelligence, crypto alpha, sentiment analysis, and speaker-attributed insights. Transforms spoken audio into structured reports, full
Time-blind friendly planning, executive function support, and daily structure for ADHD brains. Specializes in realistic time estimation, dopamine-aware task...
Social media automation, content scheduling, analytics tracking, and campaign management. Transform your AI agent into a marketing powerhouse that handles multi-platform content strategy.
Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat...
Manage your personal knowledge, store insights, track tasks, and stay accountable by syncing and updating your DeepThink user data and todos.
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files.
Download music from YouTube/YouTube Music and stream to Chromecast via Home Assistant. Complete CLI toolset with web server integration, configuration wizard, and playback controls.
Give your agent a searchable knowledge brain - semantic search, topic synthesis, and action tracking across your saved YouTube videos, articles, Reddit threads, X posts, and PDFs