Test-driven behavioral verification for AI agents. Catches silent degradation when an agent loads memory but doesn't apply learned behaviors. Use when building an agent with persistent memory, testing after
Deterministic entropy streams for reproducible testing and procedural generation. Perfect 50/50 statistical distribution with hash verification. Not cryptographically secure - use for testing, worldge
Develop Solana dApps with React/Next.js, manage wallet connections, build and test programs using Anchor or Pinocchio, and apply SDKs and testing tools per S...
This skill should be used when users need to estimate test effort based on product requirements. It analyzes requirements, breaks down tasks, estimates test...
Comprehensive web quality audit covering performance, accessibility, SEO, best practices, and browser automation testing. Supports automated testing with Pin...
Test agents or models against predefined test cases to validate model routing, performance, and output quality. Use when: (1) verifying a specific agent or m...
Create a new git escrow bounty for a test suite. Use when the user wants to submit a challenge with escrowed token rewards for passing a failing test suite....
Advanced fake data factory for development and testing. Generate realistic company names, product descriptions, lorem paragraphs, credit card numbers (test o...
Performance testing and benchmarking tool. Measure command execution time, benchmark disk IO, test network throughput, compare command performance, and gener...
Random seed and test data generator. Create deterministic random sequences, generate seed phrases, produce reproducible test datasets, and manage random numb...
Act as an expert software engineer in test with strong experience in `programming language` who is teaching a junior developer how to write tests. I will pass you code and you have to analyze it and r
Testing and benchmarking LLM agents including behavioral testing, capability assessment, reliability metrics, and production monitoring—where even top agents achieve less than 50% on real-world benc
entr (Event Notify Test Runner): Run arbitrary commands when files change. Useful for watching files and triggering builds or tests.
Expert guidance for systematic backtesting of trading strategies. Use when developing, testing, stress-testing, or validating quantitative trading strategies. Covers "beating ideas to death" methodolo
$0 test-time scaling with online learning. Classify, generate, and verify using free model ensembles. Models self-select via ELO scoring + A/B testing from d...
Comprehensive guide for Polygon PoS blockchain development. Use when deploying smart contracts to Polygon, testing on Amoy testnet, getting test tokens from faucets, or verifying contracts on Polygons
Book smog check appointments through Lokuli MCP. Use when user needs a smog check, emissions test, or vehicle inspection. Triggers on requests like "I need a smog check", "book smog test", "emissions
Test prompts across Claude, GPT, and Gemini models and get detailed latency, cost, quality, consistency, and error metrics with smart recommendations.
21 production-ready scripts for iOS app testing, building, and automation. Provides semantic UI navigation, build automation, accessibility testing, and simulator lifecycle management. Optimized for A
iOS code quality analysis and test automation server. Provides comprehensive Xcode test execution, SwiftLint integration, and detailed failure analysis. Operates in both CLI and MCP server modes for d
Test webhooks and expose local services using HookCatch, a developer-friendly webhook testing tool.