AI Safety at UCLA Intro Fellowship: Governance Track

Understand the technical backbone of modern AI models
Build a sense of why these technical details are relevant to AI policy, in the context of the AI triad (compute, data, and algorithms)
Begin to think about what risks might be posed by AI

Week 2: Catastrophic Risk from AI

Core Content (~60 min):

Existential Risk from Power-Seeking AI (60 min)

Optional Additional Content:

First Principles AI Safety

Surveys of AI Risks

AI Risks that Could Lead to Catastrophe | CAIS (25 min)
Preventing an AI-Related Catastrophe - 80,000 Hours (60 min)

Concrete Scenarios

What Failure Looks Like by Paul Christiano (20 min)
Auto-GPT and AI Race Acceleration by The AI Beat (10 min)
The True Story of How GPT-2 Became Maximally Lewd (14 min)

Learning Goals:

Understand the core arguments for existential risk from AI
Begin to form an idea of the different paths to reducing AI risk
Visualize how the techniques used to train AI directly contribute to potential bad outcomes

Week 3: AI Safety — Goals and Challenges

Core Content (~60 min):

Paradigms of AI Alignment: Components and Enablers (34 min)
Avoiding Extreme Global Vulnerability as a Core AI Governance Problem (10 min)
AI Safety Seems Hard to Measure (16 min)

Optional Additional Content:

AI Safety Neglectedness

Nobody’s on the Ball on AI Alignment (15 min)
The Need for Work on Technical AI Alignment by Daniel Eth (25 min)

What Could Go Wrong

AGI Ruin: A List of Lethalities (20 min)
Rogue AIs by the Center for AI Safety (35 min)
Racing through a minefield: the AI deployment problem (18 min)

Paths to Success

Paradigms of AI Alignment: Components and Enablers (34 min)
Managing Extreme AI Risks Amid Rapid Progress (20 min)
What is AI Alignment? – BlueDot Impact (10 min)

Learning Goals:

Understand the term “AI alignment” — what it means, and paths to achieving it
Understand the difficulties that arise when trying to align powerful AI models
Build a framework for the various factors that exascerbate AI risk, and how each of these could potentially be mitigated

Part 2: Introduction to AI Governance

Week 4: AI Policy Levers

Core Content (~60 min):

The AI Triad and What It Means for National Security Strategy (45 min)
Artificial Intelligence Index Report 2025 section 6.1 - pgs 326-335 (15 min)

Optional Additional Content:

Learning Goals:

Understand the direct implications of the AI triad for policy
Learn about existing standards for AI
Gain historical context on the successes and failures of past technology governance

Week 5: Technical Governance and Concrete Scenarios

Core Content (~60 min):

AI 2027 (40 min)
Open Problems in Technical AI Governance (20 min)

Optional Additional Content:

Compute Governance

Computing Power and the Governance of AI (20 min)
Choking Off China’s Access to the Future of AI (15 min)
Computing Power and the Governance of AI (45 min)
Primer on AI Chips and AI Governance (20 min)

AI Control

Model Evaluation for Extreme Risks by Toby Shevlane (35 min)
Societal Adaptation to Advanced AI (40 min)

Learning Goals:

Ground AI policy goals in concrete AI scenarios and timelines.
Understand the landscape of technical AI governance problems and research agendas.

Week 6: International Governance and Ethical Standards

Core Content (~60 min):

Optional Additional Content:

Compute Governance

Ethical Standards

The Bletchley Declaration (10 min)
OECD AI Principles (10 min)

Institutions and Policies

China’s AI Regulations and How They Get Made (20 min)
Driving U.S. Innovation in Artificial Intelligence: A Roadmap for AI Policy (30 min)
High-Level Summary of the AI Act (10 min)
Vision Statement of the US AI Safety Institute (15 min)
International Institutions for Advanced AI (20 min)

Learning Goals:

Understand the conditions that warrant internationalized AI governance, and how it can be achieved.
Survey existing international recommendations for AI ethics.

Week 7: Looking Ahead

Core Content (~60 min):

Career Profile: AI Governance and Policy by 80000 Hours (15 min)
Advice for Undergraduates (15 min)
AI Safety Policy Can’t Go On Like This (30 min)

Optional Additional Content:

Skills to Build

Advice and Ideas

Advice for Seeking Full-Time Roles (8 min)
12 Tentative Ideas for U.S. AI Policy by Muehlhauser (5 min)
So You Want to Be a Policy Entrepreneur? by Michael Mintrom (40 min)
What’s next in AI Governance (58 min)
AI Governance Project Ideas – BlueDot Impact (10 min)
Collection of AI Governance Research Ideas - Markus Anderlung (20 min)

Career Resources

Learning Goals:

Understand what careers exist in the AI governance space, and the skills required for each
Browse proposals for AI governance and take note of the ones you may be interested in pursuing
Understand the resources available to you if you are interested in pursuing AI governance

AI Safety at UCLA Intro Fellowship: Governance Track

Table of Contents

Part 1: Introduction AI Safety

Part 2: Introduction to AI Governance

Part 1: Introduction to AI Safety

Week 0: Overview, Ethos, and Social

Core Content (~10 min):

Optional Additional Content:

Learning Goals:

Week 1: Artificial Intelligence — How it Works and What it Can Achieve

Core Content (~60 min):

Optional Additional Content:

Learning Goals:

Week 2: Catastrophic Risk from AI

Core Content (~60 min):

Optional Additional Content:

Learning Goals:

Week 3: AI Safety — Goals and Challenges

Core Content (~60 min):

Optional Additional Content:

Learning Goals:

Part 2: Introduction to AI Governance

Week 4: AI Policy Levers

Core Content (~60 min):

Optional Additional Content:

Learning Goals:

Week 5: Technical Governance and Concrete Scenarios

Core Content (~60 min):

Optional Additional Content:

Learning Goals:

Week 6: International Governance and Ethical Standards

Core Content (~60 min):

Optional Additional Content:

Learning Goals:

Week 7: Looking Ahead

Core Content (~60 min):

Optional Additional Content:

Learning Goals: