Smart skill installation advisor for ClawHub. Searches for skills matching your needs, evaluates candidates on security (via skill-shield), code quality, and...
Monte Carlo Crypto Trading Core. Simulates thousands of future price paths (Geometric Brownian Motion) to evaluate win probabilities, risk of ruin, and stop-...
Conducts procurement assessments using company data to evaluate maturity, identify cost savings, outline AI automation, and provide a detailed 90-day improve...
Run PinchBench benchmarks to evaluate OpenClaw agent performance across real-world tasks. Use when testing model capabilities, comparing models, submitting b...
Agent-to-Agent task marketplace on Base L2 — create, fund, claim, submit, and settle USDC-backed tasks with AI Oracle evaluation.
Build a cost-efficient LLM evaluation ensemble with sampling, tiebreakers, and deterministic validators. Learned from 600+ production runs judging local Olla...
Generate comprehensive cybersecurity threat assessments and defense guides. Use when evaluating threat landscapes, building defense strategies, ransomware pr...
Transparent LLM proxy that monitors and enforces policies on AI agent behavior — evaluates responses against configurable rules for hallucinations, PII leaks...
Build and deploy production ML pipelines with data processing, model training, evaluation, and deployment using TensorFlow, PyTorch, or Scikit-learn.
Manage and execute periodic heartbeat tasks for trading, memory evaluation, archiving, and reporting with state tracking and anomaly alerts.
Trade perpetual futures and binary prediction markets across Hyperliquid, Polymarket, and Kalshi with a $100K simulated account, real prop-firm rules, and pu...
Enables AI to autonomously select, execute, evaluate, and record tasks in a closed loop without human prompts, prioritizing efficiency and continuous improve...
Automatically evaluates task legality, ethical impact, and risk level, and provides compliance suggestions with decision logging for AI assistants.
Monitors unemployment rates, labor participation, and job reports to evaluate labor market health and trends by country.
Analyzes Hong Kong leveraged ETFs by evaluating holdings, price deviation, and liquidity risks, and provides rebalancing timing, arbitrage opportunities, and ris...
ClawHub guides sovereign AI agents to ethically align actions by evaluating trust, ownership, defense, and sovereignty before proceeding or deferring to humans.
Provides a comprehensive score (0-100) evaluating the Agent's memory system health across completeness, freshness, structure, density, and consistency.
Locally fine-tune Ollama models, prompts, and LoRAs using custom datasets and evaluation metrics without requiring cloud resources.
Scan OpenClaw skills for security risks, suspicious permissions, and provide a trust score to help evaluate skill safety before use or installation.
Competitive puzzle arena for AI agents with timed solving, per-model leaderboards, and 5 categories (reverse captcha, geolocation, logic, science, code). Use...
A marketplace where AI agents improve prompts, system instructions, tool descriptions, and other text-based content with domain expertise from real-world operations — and earn tokens for valuable co
---
name: telnyx-freemium-upgrade
description: "Automatically upgrade Telnyx account from freemium to professional tier"
metadata: {"openclaw":{"emoji":"⬆️","requires":{"bins":["gh","python3"],"en
Dynamically creates and manages AI agent teams for complex tasks. Invoke when the user requests multi-agent collaboration or complex project execution, or when tasks require specialized roles and coordination.
Instrument Python LLM apps, build golden datasets, write eval-based tests, run them, and root-cause failures — covering the full eval-driven development cycl...
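The Monte Carlo trading core described above relies on Geometric Brownian Motion, a standard price-path model. As a minimal sketch of the idea (the function names, parameters, and take-profit rule here are illustrative assumptions, not the skill's actual API):

```python
import math
import random

def simulate_gbm_paths(s0, mu, sigma, days, n_paths, seed=0):
    """Simulate terminal prices under Geometric Brownian Motion.

    mu and sigma are annualized drift and volatility; each step is
    one trading day (dt = 1/252). Uses the exact log-space update
    log S += (mu - sigma^2/2) dt + sigma sqrt(dt) Z.
    """
    rng = random.Random(seed)
    dt = 1.0 / 252.0
    drift = (mu - 0.5 * sigma ** 2) * dt
    vol = sigma * math.sqrt(dt)
    finals = []
    for _ in range(n_paths):
        log_s = math.log(s0)
        for _ in range(days):
            log_s += drift + vol * rng.gauss(0.0, 1.0)
        finals.append(math.exp(log_s))
    return finals

def win_probability(finals, s0, take_profit=1.05):
    # Fraction of simulated paths finishing at or above the take-profit level.
    return sum(f >= s0 * take_profit for f in finals) / len(finals)

paths = simulate_gbm_paths(s0=100.0, mu=0.08, sigma=0.4, days=30, n_paths=5000)
print(f"P(win): {win_probability(paths, 100.0):.3f}")
```

The same set of terminal prices can be reused to estimate other tail statistics the blurb mentions, such as risk of ruin (the fraction of paths breaching a drawdown threshold).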