Avenir Web
Execute and improve Avenir-Web autonomous web tasks end-to-end with mode selection, instruction validation, single or batch runs, and next-step recommendations.
Description
name: avenir-web description: Use this skill to run and improve Avenir-Web autonomous web tasks end-to-end: choose mode (headless/headed/demo), write clear task instructions, execute single or batch runs, and propose high-impact next iterations.
Avenir-Web
What this skill does
This skill operates Avenir-Web for reliable web-task execution and iteration.
Responsibilities:
- run single tasks and batch tasks
- choose mode (
headless/headed/demo) - improve instruction quality before execution
- analyze run outputs and recommend the next best change
- execute one atomic action without strategy/checklist overhead
- read the current page by screenshoting it and asking the main model a question
Use this skill for requests like:
- run a task on a website
- run a task list and summarize outcomes
- improve success rate with better instructions/config
Canonical entrypoints
Single task:
python example.py --task "<instruction>" --website "<url>" --mode headless
Atomic action:
python scripts/atomic_action.py --action CLICK --website "<url>" --coords "500,500"
Read page:
python scripts/read_page.py --website "<url>" --question "<question>"
Batch:
cd src
python run_agent.py -c config/batch_experiment.toml
Prefer these scripts over ad-hoc commands.
Quick usage example
Single task:
python example.py \
--task "On openrouter.ai, list image-input-capable distillable models sorted by price ascending." \
--website "https://openrouter.ai/" \
--mode demo
Batch:
cd src
python run_agent.py -c config/batch_experiment.toml
Atomic action:
python scripts/atomic_action.py \
--action TYPE \
--website "https://example.com/" \
--coords "500,420" \
--value "hello"
Read page:
python scripts/read_page.py \
--website "https://openrouter.ai/" \
--question "What models or prices are visible on this page?"
Run modes
| mode | behavior | best for |
|---|---|---|
headless |
no visible browser window | fast, reproducible runs and large batch jobs |
headed |
visible browser window | manual observation without demo overlay |
demo |
visible window + overlay/dashboard controls | live debugging and demonstrations |
Notes:
- if mode is missing, use
headless demoimproves observability, not model intelligence
Mode selection:
- benchmark/batch ->
headless - visual debugging ->
headed - demo/control flow visibility ->
demo
Instruction design
confirmed_task should include:
- objective
- constraints
- completion condition
Template:
On <website>, <objective>. Apply constraints: <constraints>. Finish when <observable completion state>.
Keep it single-goal, specific, and verifiable.
Single-task workflow
Input:
taskwebsite- optional
mode,task-id,output-dir
Steps:
- check environment and API key
- validate instruction quality
- run
example.py - inspect outputs
- report status + cause + next action
Recommended report fields:
task_id- status:
success/partial/failed - evidence summary
- one-line cause
- one recommended next step
Atomic action workflow
Use scripts/atomic_action.py when you need exactly one browser operation and do not want strategist/checklist generation.
Typical uses:
- one click
- one type
- one goto
- one scroll
Properties:
- disables strategy generation
- disables checklist generation
- executes exactly one action
- returns structured JSON with result, URL, screenshot path, and output directory
Read-page workflow
Use scripts/read_page.py when you want to inspect the current page by screenshot and ask the main model a direct question.
Properties:
- opens the page
- captures a screenshot
- sends the screenshot plus page metadata to the main model
- returns structured JSON with the answer and screenshot path
Batch workflow
Task file schema
[
{
"task_id": "example_task_001",
"confirmed_task": "Find image-input-capable distillable models sorted by price ascending.",
"website": "https://openrouter.ai/"
}
]
Required per task:
task_idconfirmed_taskwebsite
Config checklist (src/config/batch_experiment.toml)
[basic].save_file_dir[experiment].task_file_path[experiment].max_op[playwright].mode[model].name- API key source
Batch execution
- validate JSON schema and config paths
- choose mode and
max_op - run batch command
- summarize per-task outcomes
- provide one global improvement recommendation
Recommended batch report fields:
- total/completed/failed counts
- per-task status list
- recurring issue patterns
- one highest-impact next change
API requirements
Required credential:
OPENROUTER_API_KEY(preferred)
Resolution order:
- environment variable
OPENROUTER_API_KEY [api_keys].openrouter_api_keyin TOML (fallback)
Rules:
- never hardcode real keys in source files
- never print full keys in logs/outputs/reports
- fail fast if key is missing with an actionable message
Script usage rules
- script-first: use repository entrypoints before custom commands
- non-interactive CLI only
- explicit flags and paths
- deterministic behavior preferred
- clear, actionable error messages
If adding helper scripts:
- place under
scripts/ - use CLI flags (no prompts)
- return stable, parseable summaries
- document usage in this file
Environment checklist
Before running:
- Python environment available
- dependencies installed (
pip install -e src) - Playwright Chromium installed (
python -m playwright install chromium) - API key configured
- config/task paths valid
Output contract
Each run summary should include:
- execution metadata: run type, mode, task IDs
- outcome: status and evidence summary
- diagnosis: root-cause hypothesis
- next action: one highest-impact recommendation
Boundaries
- do not claim completion without evidence
- do not skip issue summary
- avoid large refactors before instruction/config fixes
- avoid interactive prompts in core workflow
One-line identity
Avenir-Web execution and reliability skill: mode selection + instruction design + run analysis + iteration planning.
Reviews (0)
No reviews yet. Be the first to review!
Comments (0)
No comments yet. Be the first to share your thoughts!