Databite No. 161: Red Teaming Generative AI Harm

What exactly is generative AI (genAI) red-teaming? What strategies and standards should guide its implementation? And how can it protect the public interest? In this conversation, Lama Ahmad, Camille François, Tarleton Gillespie, Briana Vecchione, and Borhane Blili-Hamelin examined red-teaming’s place in the evolving landscape of genAI evaluation and governance.

Our discussion drew on a new report by Data &amp; Society (D&amp;S) and AI Risk and Vulnerability Alliance (ARVA), a nonprofit that aims to empower communities to recognize, diagnose, and manage harmful flaws in AI. The report, Red-Teaming in the Public Interest, investigates how red-teaming methods are being adapted to confront uncertainty about flaws in systems and to encourage public engagement with the evaluation and oversight of genAI systems. Red-teaming offers a flexible approach to uncovering a wide range of problems with genAI models. It also offers new opportunities for incorporating diverse communities into AI governance practices.

Ultimately, we hope this report and discussion present a vision of red-teaming as an area of public interest sociotechnical experimentation.

00 Opening
12 Welcome and Framing
48 Panel Introductions
34 Discussion Overview
23 Lama Ahmad on The Value of Human Red-Teaming
37 Tarleton Gillespie on Labor and Content Moderation Antecedents
03 Briana Vecchione on Participation &amp; Accountability
25 Camille François on Global Policy and Open-source Infrastructure
09 Questions and Answers
39 Final Takeaways