References: beneficial AI and machine ethics

Primary source

Dan Hendrycks. Introduction to AI Safety, Ethics, and Society. Taylor & Francis, 2024. Center for AI Safety, free to read at aisafetybook.com. L7 draws from Chapter 6 (Beneficial AI and Machine Ethics), primarily sections 6.8 (Social Welfare Functions) and 6.9 (Moral Uncertainty) for the load-bearing arguments, with the broader Ch 6 catalog (sections 6.2 Law, 6.3 Fairness, 6.4 Economic Engine, 6.5 Wellbeing, 6.6 Preferences, 6.7 Happiness) informing the lesson’s discussion of value-type distinctions.

Chapter section	Topic	URL
Ch 6.3	Fairness	aisafetybook.com/textbook/fairness
Ch 6.5	Wellbeing	aisafetybook.com/textbook/wellbeing
Ch 6.8	Social Welfare Functions	aisafetybook.com/textbook/social-welfare-functions
Ch 6.9	Moral Uncertainty	aisafetybook.com/textbook/moral-uncertainty

Verbatim quotes used in the lesson

A1 discipline preserved: verbatim from cited sections, no paraphrasing inside quote marks.

§6.9 Moral Uncertainty, core framing: “AI systems should represent moral uncertainty to avoid acting on overconfidence, which could lead to outcomes that humans consider morally reprehensible.”
§6.8 Social Welfare Functions, core framing: “Social welfare functions aggregate individual wellbeing into overall societal wellbeing.”
§6.8 Social Welfare Functions, on cost-benefit analysis: “relies on financial proxies for wellbeing and neglects distributional impacts.”

The three strategies for moral uncertainty (My Favorite Theory, maximizing expected choiceworthiness, moral parliament) and the named SWF families (utilitarian, prioritarian) are paraphrased from the chapter’s structure; the not-jointly-satisfiable fairness result is from the formal-fairness literature that Ch 6.3 draws on rather than from the chapter directly.

Posture and license

Same posture as L1 through L6: the CAIS textbook is © 2026 Center for AI Safety, published by Taylor & Francis, free to read online with no explicit Creative Commons or reuse license. This lesson is a structural mirror with verbatim quotes anchored to specific chapter sections within fair-use limits, link-out only, no embed, no derivative runs.

What L8 builds on from here

L8 enters Hendrycks Chapter 7 (Collective Action Problems) and takes the multi-stakeholder framing L7 introduced and works it at the formal level. Game theory provides the analytical tools (Nash equilibria, prisoner’s dilemma, public goods); cooperation, conflict, and evolutionary pressures are the substantive topics. The moral-parliament approach from L7 becomes the coordination instrument in the multi-actor setting L8 names. The natural-selection-on-AIs sub-mechanism from L2 (race bucket) returns at L8 in formal dress.

References: beneficial AI and machine ethics

Primary source

Verbatim quotes used in the lesson

Posture and license

Suggested companion reading

What L8 builds on from here