🧪 Skills
Arxiv Agentic Verifier
Actively verifies Python/JS code correctness by generating targeted test cases that expose logic flaws based on problem constraints.
v1.0.0
Description
ArXiv Agentic Verifier
Source Paper: Scaling Agentic Verifier for Competitive Coding (ID: 4a4c4dae6a5145ebc4d62eb2d64b0f0f) Type: Code Verification / Test Generation
Description
This skill implements an "Agentic Verifier" that actively reasons about code correctness by generating targeted, "discriminative" test cases. Instead of random sampling, it analyzes the problem constraints and code logic to find edge cases or logic flaws.
Features
- Analyze Code: Understands Python/JS code logic.
- Generate Tests: Creates specific inputs to break the code.
- Execute & Verify: Runs the code against generated tests (sandbox recommended for production).
Usage
const AgenticVerifier = require('./index');
const verifier = new AgenticVerifier(process.env.OPENAI_API_KEY);
const problem = "Given two integers A and B, output their sum.";
const code = "print(int(input().split()[0]) + int(input().split()[1]))";
verifier.verify(problem, code, 'python')
.then(result => console.log(result))
.catch(err => console.error(err));
Configuration
- OPENAI_API_KEY: Required for LLM reasoning.
Security Warning
This skill executes code provided to it. Use in a restricted environment or sandbox.
Reviews (0)
Sign in to write a review.
No reviews yet. Be the first to review!
Comments (0)
No comments yet. Be the first to share your thoughts!