CITATION — REFERENCE ENTRY

deference-from-uncertainty · hadfield-menell2017offswitch

Revision de8af9db-d812-41bb-a699-0f5a6ab9ccd2 · 3/27/2026, 8:26:21 PM UTC
Claim ID
deference-from-uncertainty
Assertion
The incentives for a cooperative agent to defer to a human's decisions stem from the agent's uncertainty about the human's preferences and the assumption that the human is effective at choosing actions in accordance with those preferences.
Quote
The incentives for a cooperative agent to defer to another actor's (e.g., a human's) decisions stem from uncertainty about that actor's preferences and the assumption that actor is effective at choosing actions in accordance with those preferences.
Quote language
en
Locator
Section 3, Remark 1
Available in