Search

2165 results for "failures"

All 🧪 Skills 🔌 MCP Servers 📏 Rules 💬 Prompts

Incident Replay

Post-mortem analysis for AI agent failures. Capture state, reconstruct timelines, identify root causes. When your agent breaks, know what happened, why, and...

❤️ 0 ⬇️ 116

🧪 Skill

K8s Debug

Free

Diagnose and fix Kubernetes pods, CrashLoopBackOff, Pending, DNS, networking, storage, and rollout failures with kubectl.

❤️ 0 ⬇️ 12

🧪 Skill

Chaos Engineer

Free

Use when designing chaos experiments, implementing failure injection frameworks, or conducting game day exercises. Invoke for chaos experiments, resilience t...

❤️ 0 ⬇️ 96

🧪 Skill

Eval Driven Development

Free

Instrument Python LLM apps, build golden datasets, write eval-based tests, run them, and root-cause failures — covering the full eval-driven development cycl...

❤️ 0 ⬇️ 45

🧪 Skill

Cron Health Check

Free

--- name: cron-health-check displayName: Cron Health Check | OpenClaw Skill description: Monitors OpenClaw cron job health, identifies failures, timeouts, and delivery issues. version: 1.0.0 --- # Cr

❤️ 0 ⬇️ 242

🧪 Skill

Engineer

Free

Apply engineering judgment across systems, constraints, trade-offs, failure modes, and verification before acting.

❤️ 0 ⬇️ 14

🧪 Skill

Cron Doctor by Clawra

Free

Diagnose and triage cron job failures. Checks job states, identifies error patterns, prioritizes by criticality, generates health reports. Triggers on: cron...

❤️ 0 ⬇️ 80

🧪 Skill

Agent Regression Guard

Free

Prevent quality regressions after agent changes. Run targeted before/after checks for prompt, model, config, and tool updates; return pass rate, failure clus...

❤️ 0 ⬇️ 54

🧪 Skill

Agent Evals Lab

Free

Evaluate agent quality and reliability with practical scorecards: accuracy, relevance, actionability, risk flags, tool-call failures, regression checks, and...

❤️ 1 ⬇️ 58

🧪 Skill

Skill 107

Free

Design and apply replication, partitioning, consensus, failure recovery, and message ordering patterns for reliable, scalable distributed systems.

❤️ 0 ⬇️ 77

🧪 Skill

GitHub Actions Conclusion Volatility Audit

Free

Audit GitHub Actions workflow conclusion volatility to surface unstable pipelines before they become chronic failures.

❤️ 0 ⬇️ 97

🧪 Skill

Runtime Debugging Skill

Free

Diagnose and fix bugs using runtime execution traces. Use when debugging errors, analyzing failures, or finding root causes in Python, Node.js, or Java appli...

❤️ 0 ⬇️ 72

🧪 Skill

Test Sentinel

Free

--- name: test-sentinel description: Writes and runs tests (unit, integration, E2E), performs linting, and auto-fixes failures user-invocable: true --- # Test Sentinel You are a QA engineer responsi

❤️ 0 ⬇️ 439

🧪 Skill

Runtime Debug Skill

Free

Diagnose and fix bugs using runtime execution traces. Use when debugging errors, analyzing failures, or finding root causes in Python, Node.js, or Java appli...

❤️ 0 ⬇️ 71

🧪 Skill

GitHub Actions Merge Queue Health Audit

Free

Audit GitHub merge queue workflow health with failure-rate, queue-latency, and stale-success risk scoring.

❤️ 0 ⬇️ 80

🧪 Skill

Gateway Watchdog Lite

Free

Installs a macOS or Linux service that probes the OpenClaw gateway every 2 minutes and auto-recovers it on failure, sending Telegram alerts.

❤️ 0 ⬇️ 45

🧪 Skill

task-queue-by-model-source

Free

Multi-queue task orchestration system. Tasks are routed to queues by model source, with support for task dependencies, context passing, and failure handling....

❤️ 0 ⬇️ 131

🧪 Skill

GitHub Actions Branch Drift Audit

Free

Detect branch-level GitHub Actions reliability drift by comparing failure and runtime deltas against a mainline baseline.

❤️ 0 ⬇️ 81

🧪 Skill

databricks-helper

Free

Query and control Databricks jobs via text by checking status, listing recent runs, finding failures, and triggering pipelines using the REST API.

❤️ 0 ⬇️ 126

🧪 Skill

Cron Worker Guardrails

Free

Use when: hardening OpenClaw cron/background workers (POSIX shells: bash/sh) against brittle quoting, cwd/env drift, and false pipeline failures (SIGPIPE, pi...

❤️ 0 ⬇️ 617

🧪 Skill

GitHub Actions PR Gate Health Audit

Free

Audit pull-request and merge-queue GitHub Actions reliability by scoring failure rate, queue latency, and stale-success risk for merge gates.

❤️ 0 ⬇️ 83

🧪 Skill

Memory Self-Heal

Free

General-purpose self-healing loop that learns from past failures, retries safely, and records reusable fixes.

❤️ 1 ⬇️ 284

🧪 Skill

Clawhub Skill Smart Cron

Free

Schedule OpenClaw tasks using natural language with full cron lifecycle, timezone support, failure alerts, and execution logs without needing cron syntax.

❤️ 0 ⬇️ 224

🧪 Skill

OpenClaw Watch Dog

Free

Self-healing monitoring system for OpenClaw gateway. Auto-detects failures, fixes crashes, and sends Telegram alerts.

❤️ 1 ⬇️ 1.8k