Simple local Speech-To-Text using Whisper. One-command install with auto model download. Supports 99+ languages.
Skill-aware agent routing with explicit competence/cost modeling. +22.5% accuracy, 700x cheaper than RL routers. Based on arXiv:2602.19672.
Build and deploy production ML pipelines with data processing, model training, evaluation, and deployment using TensorFlow, PyTorch, or Scikit-learn.
Design, configure, debug, and optimize OpenClaw AI agent deployments. Master guide for gateway configuration, openclaw.json settings, model routing and fallb...
Design, optimize, and communicate SaaS pricing — tier structure, value metrics, pricing pages, and price increase strategy. Use when building a pricing model...
Enable Azure OpenAI integration with OpenClaw via a lightweight local proxy. Use when configuring Azure OpenAI as a model provider, when encountering 404 errors with Azure OpenAI in OpenClaw, or when
Delegate longer user-facing text generation to Upstage Solar Pro3 while keeping the primary model for planning and tool calls. Use when drafting explanations...
Enable seamless integration of language models with external data sources and tools through a standardized protocol. Facilitate dynamic access to files, APIs, and custom operations to enhance AI capab
Automatic Speech Recognition (ASR) using Zhipu AI (BigModel) GLM-ASR model. Use when you need to transcribe audio files to text. Supports Chinese audio trans...
Use Renderful from OpenClaw for image/video/audio/3D creation with model discovery, quote-before-generate workflow, deterministic polling, and insufficient-funds/x402 fallback.
Generate production-grade 3D models from one or multiple images with Hitem3D. Use when users ask to turn photos, concept art, product shots, or portraits int...
# Git Commit Guidelines for AI Language Models ## Core Principles 1. **Follow Conventional Commits** (https://www.conventionalcommits.org/) 2. **Be concise and precise** - No flowery language, super
Model post-labor economies with automation shocks, distribution redesign, and policy portfolios across income, ownership, time, and services.
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
Complete startup workflow from 0 to 1, integrating founder, business-model-canvas, market-research-reports, competitive-analysis, business-plan, pitch-deck-v...
Generate images with Seedream4.5 and videos with Kling via LiblibAI API. Use when user asks to generate/create images, pictures, illustrations, or videos using LiblibAI, Seedream, or Kling models.
Build and maintain Convex backends with schema-safe modeling, query and mutation patterns, auth guards, and production rollout checks.
Calculates classification and regression metrics like accuracy, F1-score, RMSE, and provides confusion matrix to evaluate ML model performance.
Generate images with Alibaba Cloud Model Studio Z-Image Turbo (z-image-turbo) via DashScope multimodal-generation API. Use when creating text-to-image output...
--- name: tg-voice-whisper description: Auto-transcribe Telegram voice messages (.ogg Opus) to text using local OpenAI Whisper (tiny model). Reply with transcription + auto-delete file for privacy. No
Generate AI images from prompts using CivitAI's JavaScript SDK with customizable models, samplers, seeds, steps, LoRA, and output options.
--- name: gws-modelarmor version: 1.0.0 description: "Google Model Armor: Filter user-generated content for safety." metadata: openclaw: category: "productivity" requires: bins: ["gws"
Generate professional AI product photography and commercial images. Models: FLUX, Imagen 3, Grok, Seedream for product shots, lifestyle images, mockups. Capa...
Offline speech-to-text (ASR) using whisper.cpp (whisper-cli) + ffmpeg. Supports batch transcription, timestamps, SRT/TXT/JSON outputs, and model download. Cr...