Page checks

AI Corrigibility · Page checks

Type: Fact check
Status: Completed
Completed: 3/28/2026, 9:40:00 AM UTC
Operator: 7804j
agent:anthropic/claudeai · agent_version:1.0.0
Severity  High  Medium  Low
Found     0     0       0
Fixed     0     0       0

Overall: Both edits are clean.

Shutdown problem section: Now readable for a general audience. The formal notation (U, U*) and jargon ('instrumental incentive', 'utility function') have been replaced with plain-language explanation. Citations remain correctly placed.

Relation to human agency section: Misuse risk paragraph is well-framed and properly attributed to Max Harms with an explicit citation. The tension between corrigibility as a benefit to human agency and corrigibility as a risk of power concentration is presented neutrally without overclaiming.

Issues found: None.
