Summary: AI governance

Summary

L9 closes Phase 3 and closes the track. L8 named institutional cooperation mechanisms as the structurally-correct response to multi-agent coordination failures and noted that the institutional structure must itself be designed, governed, and protected. L9 takes that question on. Hendrycks Chapter 8 organizes governance around a four-layer taxonomy that the lesson works in turn.

Corporate governance (Ch 8.4): what an individual AI organization commits to and internally enforces. Current state: Responsible Scaling Policies committing to capability-evaluation thresholds and deployment pauses at red lines; internal safety teams and board oversight; model cards and capability disclosures. Structural limit: unilateral commitments are undercut by competitor behavior under L8’s race-to-the-bottom dynamic. Necessary but not sufficient.

National governance (Ch 8.5): regulation by sovereign jurisdictions. Mechanisms span familiar regulatory instruments adapted from medical devices, financial services, and other high-stakes domains: mandatory pre-deployment evaluation (EU AI Act high-risk categorization), incident reporting, licensing for systems above capability thresholds, liability rules. Structural limit: jurisdictional. National regulation does not constrain developers in other jurisdictions, which produces incentive for nations to maintain lighter regulation to attract AI development. Addresses within-jurisdiction, leaves between-jurisdiction to international governance.

International governance (Ch 8.6): “international cooperation is important in order to manage risks from AI” (Hendrycks §8.6). The chapter draws explicit parallels to nuclear-weapons governance, noting both technologies are “offense-dominant” with identifiable supply-chain chokepoints (uranium for nuclear, computing power for AI). Mechanisms span the standard repertoire: unilateral national commitments that become reciprocal, bilateral and multilateral negotiations, summits, treaties, and dedicated international organizations modeled on the IAEA. The aviation-certification analogy illustrates market-based compliance pressure: “domestic regulators must have certain verification procedures” to maintain international airspace access. The chapter is honest about the verification asymmetry inherited from the nuclear precedent: violations are easier to detect than development is to confirm.

Compute governance (Ch 8.7): the most recent addition to the field’s governance vocabulary and the one Hendrycks foregrounds. “Compute is indispensable for developing and deploying AIs. Restricting access to compute allows control over what AIs are created and used.” And: “Compute is physical, excludable, and quantifiable which allows it to be tracked, restricted, and measured” (both §8.7). The three properties make compute a tractable regulatory lever in ways algorithm or data regulation are not. Mechanisms include compute reporting (training runs above a threshold disclosed), compute caps (limit total training compute for specific deployments), chip export controls (restrict cross-border AI-chip shipment), cloud-provider KYC (verify customer identity before granting large compute access). Limits: depends on supply-chain concentration, FLOP-as-capability-proxy reliability, and international coordination. The chapter treats compute governance as the most-tractable current lever, not the final answer.

The L9 capability is operational: situate a real governance proposal inside the four-layer taxonomy. The worked example in the lesson body decomposes the EU AI Act’s general-purpose-AI-with-systemic-risk provisions as primarily national governance with corporate-layer hooks and compute-governance scaffolding (the systemic-risk threshold operates through a 10^25 FLOP training-compute proxy). A second worked example decomposes a hypothetical compute-export multilateral as primarily international governance with compute as the underlying lever and national jurisdictions providing enforcement. The capability is the placement, not the endorsement; the lesson takes no position on whether specific proposals are the right policy.

Track closure. Nine lessons across three phases. Phase 1 gave field-framing and the four-bucket typology (vocabulary to classify any AI-harm headline). Phase 2 worked the deployment-time safety case (monitoring/robustness, alignment, safety engineering, complex systems). Phase 3 added the policy and coordination layer (moral uncertainty, collective action, governance). The track produces working vocabulary for a discipline that is not solved, attributed throughout, in a register that lets readers contest any claim using the same vocabulary. It does not produce a position on whether AI development should slow down or speed up; that is not the track’s job.

The closing thought from the chapter, which the track inherits: a safety case is a Swiss-cheese stack of partial defenses, the holes are largest where the field has the fewest tools, and the honest disposition is to keep filling holes rather than to declare any layer sufficient. The track ends where the field is: in motion.