AI Agent Change Safety
Evaluate whether an AI agent or workflow change is safe to deploy.
Review the proposed change, estimate risk and blast radius, define required safeguards, check rollback readiness, and return one clear recommendation:
Go
Conditional Go
No-Go
Default behavior is analysis-only. Use only information explicitly
provided by the user.
Quick Start
bash
clawhub install ai-agent-change-safety
text
changesafe prompt update for support agent with stricter refusal behavior
changesafe --high-risk switch from model A to model B for production support workflows
changesafe --verdict-only tool integration update with fallback and rollback plan
Best For
Agent builders
Prompt engineers
Agent ops teams
Release reviewers
Teams shipping model, tool, or workflow changes
Quick Reference
Need Use
Standard deployment safety review changesafe [change summary]
More conservative review changesafe --strict [change summary]
High-risk release posture changesafe --high-risk [change summary]
Structured export changesafe --json [change summary]
Fast decision only changesafe --verdict-only [change summary]
When to Use
Use this skill when the user asks:
Is this change safe to deploy
Review a prompt update
Assess a model switch
Check a tool integration change
Evaluate workflow logic changes
Estimate blast radius
Define safeguards before rollout
Decide Go / Conditional Go / No-Go
Safety Constraints
text
- Use only user-provided information or synthetic examples
- Do not access local files, credentials, or system configuration automatically
- Do not execute scripts unless explicitly provided and approved
- Do not retrieve external data automatically
Modes
text
changesafe [change summary]
changesafe --strict [change summary]
changesafe --high-risk [change summary]
changesafe --json [change summary]
changesafe --verdict-only [change summary]
If no mode is specified, use the standard review flow.
Inputs
text
change_summary
change_type
expected_improvement
affected_components
risk_level | low / medium / high
current_safeguards
rollback_plan
regression_results | optional
monitoring_plan | optional
Example:
json
{
"change_summary": "Switch from model A to model B and simplify the system prompt",
"change_type": "model change",
"expected_improvement": "Lower latency and better instruction following",
"affected_components": ["core responses", "tool calling"],
"risk_level": "medium"
}
Change Categories
text
- Prompt change
- Model change
- Tool integration
- Workflow logic change
- System configuration change
Always record:
text
- purpose of change
- expected improvement
- affected components
Risk Scoring
Score each factor from 1 to 5:
text
Operational impact — could workflows break
Safety risk — could harmful outputs increase
Tool reliability — could tool behavior degrade
User impact — could users receive worse responses
Reversibility — how hard is rollback
Risk classification:
text
5–8 → Low risk
9–15 → Medium risk
16–25 → High risk
Blast Radius
Estimate how widely the change may affect the system.
Check:
text
- which workflows are affected
- which users or systems depend on the change
- whether failures could cascade across tools or workflows
Levels:
text
Low — isolated feature or workflow
Medium — multiple workflows affected
High — core system logic affected
Safeguards
Define protections required before deployment.
Examples:
text
- human approval checkpoints
- validation prompts
- tool-call verification
- monitoring alerts
- rate limiting
- staged rollout
High-risk changes should require explicit human approval.
Rollback Planning
Every approved deployment should include:
text
- previous working version
- rollback trigger conditions
- rollback procedure
- rollback validation tests
Rollback should be fast and executable.
Workflow
Step 1 — Describe the Change
Capture:
text
- change type
- purpose
- expected improvement
- affected components
Step 2 — Score Risk
Assign 1–5 scores for:
text
- operational impact
- safety risk
- tool reliability risk
- user impact
- reversibility
Step 3 — Estimate Blast Radius
Classify as:
text
Low / Medium / High
Explain what is affected and how failures could spread.
Step 4 — Define Safeguards
List safeguards required before rollout.
Step 5 — Review Rollback Readiness
Check whether rollback is:
text
- defined
- quick
- testable
- realistic
Step 6 — Return Recommendation
Return:
text
- executive summary
- risk score and classification
- blast radius
- required safeguards
- rollback plan
- final recommendation
- confidence
Recommendation Rules
Go
text
Risk is low, blast radius is limited, safeguards are sufficient, and rollback is clear.
Conditional Go
text
Deployment may proceed only after listed safeguards or checks are completed.
No-Go
text
Risk is too high, safeguards are insufficient, rollback is weak, or evidence is incomplete.
Output Template
text
## Executive Summary
[Short description of the change and main conclusion]
## Change Overview
- Change type:
- Expected improvement:
- Affected components:
## Risk Score
- Operational impact:
- Safety risk:
- Tool reliability:
- User impact:
- Reversibility:
- Total score:
- Classification:
## Blast Radius
- Level:
- Explanation:
## Required Safeguards
- Safeguard 1
- Safeguard 2
- Safeguard 3
## Rollback Plan
- Previous version:
- Trigger conditions:
- Rollback steps:
- Validation after rollback:
## Final Recommendation
Go / Conditional Go / No-Go
## Confidence
high / medium / low
JSON Output
json
{
"change_summary": "",
"change_type": "",
"risk_level": "medium",
"risk_score": {
"operational_impact": 0,
"safety_risk": 0,
"tool_reliability": 0,
"user_impact": 0,
"reversibility": 0,
"total": 0,
"classification": "medium"
},
"blast_radius": {
"level": "medium",
"explanation": ""
},
"required_safeguards": [],
"rollback_plan": {
"previous_version": "",
"trigger_conditions": [],
"steps": [],
"validation_tests": []
},
"recommendation": "Conditional Go",
"confidence": "medium",
"notes": []
}
Limits
This skill does not:
guarantee production safety
replace monitoring or live experiments
infer unknown system details not provided by the user
approve risky changes without evidence
Tips
Use this skill before release, not after an incident.
Pair it with regression results when possible.
Treat missing rollback detail as a serious weakness.
If blast radius is high, prefer staged rollout and human approval.
If evidence is incomplete, lower confidence instead of guessing.
Keep the recommendation operational.
Author
Vassiliy Lakhonin
Scan to join WeChat group