Videos from Center for AI Safety (CAIS)

1
channel
Lecture 7 | AI Safety, Ethics, & Society: Collective Action Problems
Lecture 7 | AI Safety, Ethics, & Society: Collective Action Problems
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook/collective-action-problems. Topics: Collect
Lecture 6 | AI Safety, Ethics, & Society: Beneficial AI and Machine Ethics
Lecture 6 | AI Safety, Ethics, & Society: Beneficial AI and Machine Ethics
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook/beneficial-ai. Topics: Beneficial AI and Ma
Lecture 5 | AI Safety, Ethics, & Society: Complex Systems
Lecture 5 | AI Safety, Ethics, & Society: Complex Systems
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook/complex-systems. Topics: Complex Systems Da
Lecture 4 | AI Safety, Ethics, & Society: Safety Engineering
Lecture 4 | AI Safety, Ethics, & Society: Safety Engineering
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook/safety-engineering. Topics: Safety Engineer
Lecture 3 | AI Safety, Ethics, & Society: Single Agent Safety
Lecture 3 | AI Safety, Ethics, & Society: Single Agent Safety
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook/single-agent-safety. Topics: Single Agent S
Lecture 2 | AI Safety, Ethics, & Society: AI Fundamentals
Lecture 2 | AI Safety, Ethics, & Society: AI Fundamentals
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook/ai-fundamentals Topics: AI Fundamentals Dan
Lecture 1 | AI Safety, Ethics, & Society: Introduction and Overview of AI Risks
Lecture 1 | AI Safety, Ethics, & Society: Introduction and Overview of AI Risks
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook/overview-of-catastrophic-ai-risks. Topics:
Representation Engineering
Representation Engineering
This is a presentation on Representation Engineering by Andy Zou from the Center for AI Safety. 2:49 references work from Barack and Krakauer (2021) Two views on the cognitive brain (https://www.n...
An Overview of Catastrophic Risks
An Overview of Catastrophic Risks
This is an audio version of an Overview of Catastrophic AI Risks by Dan Hendrycks, Mantas Mazeika, and Thomas Woodside. A link to the PDF version of the paper can be found below. https://arxiv.org...
WAIC 2023: AI Risks and Safety Forum
WAIC 2023: AI Risks and Safety Forum
This was a virtual symposium at the 2023 World AI Conference in Shanghai. Speakers: Dan Hendrycks 3:23 Stuart Russell 32:08 Dawn Song 59:26 Nicholas Carlini 1:23:38 David Krueger 1:52:04 Peter Rai...
Peter Railton - A World of Natural and Artificial Agents in a Shared Environment
Peter Railton - A World of Natural and Artificial Agents in a Shared Environment
This discussion is part of a series of guest speaker events as part of the CAIS Philosophy Fellowship. For more information, visit https://philosophy.safe.ai
Lecture 8 | AI Safety, Ethics, & Society: Governance
Lecture 8 | AI Safety, Ethics, & Society: Governance
You can find more information including the corresponding section of the AI Safety, Ethics, & Society textbook at https://www.aisafetybook.com/textbook.... Topics: Governance Dan Hendrycks, Direc
Shelly Kagan - The Moral Claims of AI
Shelly Kagan - The Moral Claims of AI
This discussion is part of a series of guest speaker events as part of the CAIS Philosophy Fellowship. For more information, visit https://philosophy.safe.ai
Peter Salib - AI Rights as a Safety Technology
Peter Salib - AI Rights as a Safety Technology
This discussion is part of a series of guest speaker events as part of the CAIS Philosophy Fellowship. For more information, visit https://philosophy.safe.ai
Ethan Perez - Discovering Language Model Behaviors with Model Written Evaluations
Ethan Perez - Discovering Language Model Behaviors with Model Written Evaluations
This discussion is part of a series of guest speaker events as part of the CAIS Philosophy Fellowship. For more information, visit https://philosophy.safe.ai