🧪 Skills

FlowForge

Autonomously breaks coding tasks into spec, plan, code, and QA phases, executing all heavy work via Claude Code with multi-account rate limit rotation.

v1.0.0
❤️ 0
⬇️ 108
👁 1
Share

Description


name: flowforge description: Autonomous AI coding pipeline that breaks any task into a structured Spec → Plan → Code → QA loop and executes it via Claude Code. Use when: (1) starting any new feature, refactor, or bug fix, (2) given a GitHub issue to implement, (3) asked to "run FlowForge", "forge this", "plan and build", or "auto-implement". Routes ALL heavy work through Claude Code. Supports multi-account rotation to handle rate limits automatically.

FlowForge

Autonomous spec → plan → code → QA pipeline powered by Claude Code. All heavy computation runs through Claude Code (Max subscription). OpenClaw only orchestrates.

Architecture

Flo (minimal tokens) → shell pipeline → Claude Code (all heavy work)
                                              ↓
                                   Account rotation on rate limit

Workflow Types

Classify the task before planning — each type has a different phase structure:

Type When Phase Order
feature New capability Backend → Worker → Frontend → Integration
refactor Restructure existing code Add New → Migrate → Remove Old → Cleanup
investigation Bug hunt Reproduce → Investigate → Fix → Harden
migration Move data/infra Prepare → Test → Execute → Cleanup
simple Single-file change Just subtasks, no phases

Steps

1. Setup workspace

bash ~/clawd/skills/flowforge/scripts/init_forge.sh "<task_description>" "<repo_path>"

Creates ~/.forge/<timestamp>/ with task.md.

2. Run the pipeline

bash ~/clawd/skills/flowforge/scripts/run_forge.sh ~/.forge/<timestamp>/

This chains 4 Claude Code calls:

  1. Spec — generates spec.md (high thinking)
  2. Plan — generates implementation_plan.json (high thinking)
  3. Code — executes each subtask with verification (medium thinking)
  4. QA — reviews output, scores against spec (high thinking)

Each step saves output to the workspace directory. Claude Code does ALL the work.

3. Monitor

Poll workspace for completion:

tail -f ~/.forge/<timestamp>/progress.log
cat ~/.forge/<timestamp>/qa_report.md

Account Rotation

Three Claude Max accounts rotate automatically on rate limit:

account-1@gmail.com  →  account-2@gmail.com  →  account-3@gmail.com  →  retry

Configure your accounts in ~/.flowforge/accounts.txt (one email per line). Save credentials per account in ~/.claude/accounts/<email>.json. Switch accounts with: bash <skill-dir>/scripts/rotate_account.sh

GitHub Issues

To pull a task from a GitHub issue:

gh issue view <number> --repo <owner>/<repo> --json title,body | \
  jq -r '"# " + .title + "\n\n" + .body' > ~/.forge/<timestamp>/task.md

Then run the pipeline normally.

Output

On completion, workspace contains:

  • spec.md — full specification
  • implementation_plan.json — phases + subtasks with status
  • qa_report.md — QA review and score
  • progress.log — timestamped execution log

Optional: Rubric Scoring (200 criteria)

Add --rubric flag for high-stakes runs. Scores against a universal 200-criterion quality rubric after the spec-based QA pass:

bash ~/clawd/skills/flowforge/scripts/run_forge.sh ~/.forge/<timestamp>/ --rubric

Rubric covers: Architecture (40), Code Quality (40), Testing (40), Error Handling (30), Security (20), Documentation (15), Observability (15).

Verdict thresholds: ≥180 = Ship it | 150–179 = Needs work | <150 = Major rework

Skip --rubric for quick tasks. Use it before shipping to production.

Prompts

See references/spec-prompt.md, references/planner-prompt.md, references/qa-prompt.md, references/rubric-prompt.md for the full Claude Code prompts used at each stage.

Reviews (0)

Sign in to write a review.

No reviews yet. Be the first to review!

Comments (0)

Sign in to join the discussion.

No comments yet. Be the first to share your thoughts!

Compatible Platforms

Pricing

Free

Related Configs