AISU@NeurIPS

2018–2022 · 185+ researchers

The AI Safety Unconference at NeurIPS brought researchers together for lightning talks, facilitated discussions, and collaborations. Across three editions, we hosted participants from leading AI safety organizations around the world.

2022

New Orleans, LA · November 28, 2022 · ~85 participants

Participants from Mila, Stanford, Anthropic, OpenAI, UC Berkeley, U Toronto, ETH Zurich, Max Planck Institute, Cambridge, Vector Institute, NYU, DeepMind, Oxford, MIT.

Lightning talks

Haydn Belfield: What standard-setting in EU + US might mean for AI safety
Esben Kran: Hackathons in AI safety research
Franziska Boenisch: Privacy attacks against federated learning
Aaron Tucker: Bandits with Costly Reward Observations
Lewis Hammond: Cooperative AI
Adam Dziedzic: Stealing and defending self-supervised models
David Lindner: Active Learning for Reward Modelling
Lauro Langosco di Langosco: An empirical demonstration of deceptive alignment
Zhijing Jin: Causally aligning language models

Facilitated discussions

Haydn Belfield: AI governance
Adam Dziedzic: Is this model mine? On stealing and defending machine learning models
Lewis Hammond: Cooperative AI
Lauro Langosco di Langosco: Deceptive alignment

Testimonials

"This was a fascinating event that was helpful for keeping up with the cutting edge of the field, and for launching collaborations." — Haydn Belfield
"The AI safety unconference was very useful to meet and talk with the AI safety researchers at NeurIPS." — Esben Kran
"It was very reassuring to hear that diverse perspectives on AI risk are being studied seriously, including criticism of the AI safety community." — Arvind Raghavan

Organized by: Orpheus Lummis, Mauricio H. Luduena
Partners: Center for AI Safety
Supported by: Nisan Stiennon

View the archived site ↗

2019

Vancouver, BC · December 9, 2019 · ~50 participants

Participants from OpenAI, DeepMind, Cambridge, MIRI, Mila, FLI.

Organized by: David Krueger (Mila), Orpheus Lummis (EA Québec), Gretchen Krueger (OpenAI), Richard Mallah (FLI), Joe Collman
Supported by: Effective Altruism Foundation, Survival and Flourishing Fund, Future of Life Institute

View the archived site ↗

2018

Montréal, QC · December 8, 2018 · ~50 participants

Participants from UC Berkeley, Vector Institute, Mila, OpenAI, DeepMind, Oxford, CHAI, McGill, NYU, Partnership on AI.

Lightning talks

Adam Gleave
Jan Leike
David Krueger
Dan Hendrycks
Aaron Tucker
Victoria Krakovna

Testimonials

"A great way to meet the best people in the area and propel daring ideas forward." — Stuart Armstrong
"The event was a great place to meet others with shared research interests. I particularly enjoyed the small discussion groups that exposed me to new perspectives." — Adam Gleave