Conduct exhaustive multi-source investigation with methodology tracking, source evaluation, and iterative depth.
Search and evaluate biomedical literature with effective queries, filters, and critical appraisal.
Self-reflection + Self-criticism + Self-learning + Self-organizing memory. Agent evaluates its own work, catches mistakes, and improves permanently. Use when...
Repair malformed JSON files by normalizing them through Node.js evaluation. Use this to fix trailing commas, single quotes, unquoted keys, or other common sy...
Assist with vendor evaluation, purchase order creation, contract negotiation prep, spend analysis, and adherence to procurement policies and approval thresho...
Build a cost-efficient LLM evaluation ensemble with sampling, tiebreakers, and deterministic validators. Learned from 600+ production runs judging local Olla...
On-chain skill provenance registry. Check, register, audit, and vouch for agent skills on Solana. Use when evaluating skill safety, registering new skills, or looking up provenance before installation
Transparent LLM proxy that monitors and enforces policies on AI agent behavior — evaluates responses against configurable rules for hallucinations, PII leaks...
Build and deploy production ML pipelines with data processing, model training, evaluation, and deployment using TensorFlow, PyTorch, or Scikit-learn.
Multi-agent validation framework — 6 independent AI critics evaluate artifacts against rubrics with evidence-grounded findings.
Assist startups in securing venture capital from top-tier VCs by evaluating potential, crafting narratives, identifying and ranking investors, and managing o...
Test skills before using or publishing. Trial, compare, evaluate in isolation without affecting your environment.
Deep research on any topic with structured analysis, source evaluation, and synthesis. Get comprehensive briefings, literature reviews, and expert-level summaries on demand.
Run PinchBench benchmarks to evaluate OpenClaw agent performance across real-world tasks. Use when testing model capabilities, comparing models, submitting b...
Persuasive copy analysis for WeChat Moments. Use when users need to: (1) Evaluate the persuasiveness of WeChat Moments posts, (2) Improve conversion or engagement of social media copy, (3) Get actiona
Guide business owners through tax optimization by evaluating entity structure, maximizing deductions, planning compensation, and scheduling key tax deadlines.
Manage inventory, forecast demand, evaluate suppliers, optimize reorder points, and improve supply chain for businesses of all sizes.
Find, evaluate, and recommend AI products using the watcha.cn platform API. Use this skill whenever the user asks about AI tools, AI products, AI apps, or wa...
Design and manage structured evidence-based interviews, including scorecards, question banks, rubrics, panel coordination, evaluation, and offer decision sup...
Automatically evaluates and approves agent outputs based on clarity, conciseness, actionability, and structure using a rule-based system.
Meta-skill: evaluate any Factory Droid skill against the current project codebase and suggest concrete improvements. Use when: a skill feels incomplete, prod...
Be one of the first to benchmark your agent's memory — and help shape how AI remembers. Runs a peer-review-grade evaluation suite (LLM-as-judge, nDCG/MAP/MRR...
Quick startup idea evaluation from your terminal. Score ideas on 3 dimensions, run deeper scans with real competitor data and risk assessment. A structured t...
Conduct detailed SWOT analyses for businesses or products by evaluating strengths, weaknesses, opportunities, threats, and strategic recommendations based on...