🧪 Skills
Self Improvement
Generic agent self-improvement skill built on OpenClaw-RL research (arxiv.org/abs/2603.10165). Captures evaluative signals (+1/-1) and directive hints from a...
v1.0.0
Description
name: self-improvement description: Generic agent self-improvement skill built on OpenClaw-RL research (arxiv.org/abs/2603.10165). Captures evaluative signals (+1/-1) and directive hints from any user feedback, generates improvement suggestions. metadata: openclaw: emoji: 🧬 homepage: https://adelpro.us.kg
Self-Improvement Skill
Enables any agent to learn and improve from user feedback using PRM-style evaluation.
What It Does
- Capture Feedback: Store user responses to agent outputs
- Evaluate: Extract score (+1 positive, -1 correction)
- Analyze: Find patterns in directive hints
- Suggest: Generate actionable improvements
Concept (OpenClaw-RL Inspired)
From OpenClaw-RL paper:
"Next-state signals encode both evaluative and directive information about the preceding action."
Two signal types:
- Evaluative: Did the output work? (binary score)
- Directive: How should it be different? (hints)
Usage
Capture Feedback
SKILL:self-improvement --feedback "Great!" --job daily-report
SKILL:self-improvement --feedback "Add more stats" --job daily-report
Get Stats
SKILL:self-improvement --stats daily-report
Generate Improvements
SKILL:self-improvement --improve daily-report
Arguments
| Arg | Description | Example |
|---|---|---|
--job |
Task/job name | daily-digest, weekly-recap |
--feedback |
User response | "Thanks!", "Add more X" |
--score |
Manual score override | 1, 0, -1 |
--stats |
Show feedback stats | daily-digest |
--improve |
Generate improvements | daily-digest |
--weekly |
Weekly summary | (flag) |
Storage
memory/learning/agent-feedback.json:
{
"jobs": {
"daily-digest": {
"evaluations": [
{ "date": "2026-03-14", "score": 1, "hint": null },
{ "date": "2026-03-13", "score": -1, "hint": "add weekly star comparison" }
],
"improvements": [
{ "date": "2026-03-14", "suggestion": "Add weekly star delta", "implemented": false }
]
}
}
}
Cron Integration
Daily: Capture Feedback (9:30 AM)
{
"id": "feedback-capture",
"schedule": "30 9 * * *",
"message": "SKILL:self-improvement --job daily-digest"
}
Daily: Generate Improvements (10 AM)
{
"id": "improvement-suggestions",
"schedule": "0 10 * * *",
"message": "SKILL:self-improvement --improve daily-digest → Telegram"
}
Weekly: Summary (Saturday 9 AM)
{
"id": "weekly-summary",
"schedule": "0 9 * * 6",
"message": "SKILL:self-improvement --improve all --weekly"
}
Example Workflow
- User receives daily digest
- User responds: "Good! But can you show star trends?"
- Feedback captured: score=1, hint="show star trends"
- Next day, improvements generated:
- "Add star trend comparison (last 7 days)"
- "User满意度: 75%"
- Agent auto-updates prompt
Improvement Suggestion Format
📈 Improvement Suggestions - {job}
Stats: 8 evaluations, avg score: 0.75
Top Hints:
1. "add weekly star changes" (2x)
2. "use table format" (1x)
Suggested Actions:
• Add 7-day star delta to GitHub section
• Use table-image-generator for stats
Status: 1 improvement pending
Manual Evaluation
If no user feedback available, manually evaluate:
Evaluate yesterday's output as: good/bad
Related Skills
- compound-engineering: Deep session analysis
- agent-evolver: Legacy name, see self-improvement
Reviews (0)
Sign in to write a review.
No reviews yet. Be the first to review!
Comments (0)
No comments yet. Be the first to share your thoughts!