AISU@NeurIPS

2018–2022 · 185+ chercheurs

L'AI Safety Unconference à NeurIPS a réuni des chercheurs pour des présentations éclair, des discussions facilitées et des collaborations. Sur trois éditions, nous avons accueilli des participants des principales organisations de sûreté de l'IA du monde entier.

2022

New Orleans, LA November 28, 2022 ~85 participants

Participants de Mila, Stanford, Anthropic, OpenAI, UC Berkeley, U Toronto, ETH Zurich, Max Planck Institute, Cambridge, Vector Institute, NYU, DeepMind, Oxford, MIT.

Présentations éclair

Haydn Belfield What standard-setting in EU + US might mean for AI safety

Esben Kran Hackathons in AI safety research

Franziska Boenisch Privacy attacks against federated learning

Aaron Tucker Bandits with Costly Reward Observations

Lewis Hammond Cooperative AI

Adam Dziedzic Stealing and defending self-supervised models

David Lindner Active Learning for Reward Modelling

Lauro Langosco di Langosco An empirical demonstration of deceptive alignment

Zhijing Jin Causally aligning language models

Discussions facilitées

Haydn Belfield AI governance

Adam Dziedzic Is this model mine? On stealing and defending machine learning models

Lewis Hammond Cooperative AI

Lauro Langosco di Langosco Deceptive alignment

Témoignages

"This was a fascinating event that was helpful for keeping up with the cutting edge of the field, and for launching collaborations." — Haydn Belfield

"The AI safety unconference was very useful to meet and talk with the AI safety researchers at NeurIPS." — Esben Kran

"It was very reassuring to hear that diverse perspectives on AI risk are being studied seriously, including criticism of the AI safety community." — Arvind Raghavan

2019

Vancouver, BC December 9, 2019 ~50 participants

Participants de OpenAI, DeepMind, Cambridge, MIRI, Mila, FLI.

2018

Montréal, QC December 8, 2018 ~50 participants

Participants de UC Berkeley, Vector Institute, Mila, OpenAI, DeepMind, Oxford, CHAI, McGill, NYU, Partnership on AI.

Présentations éclair

Adam Gleave

Jan Leike

David Krueger

Dan Hendrycks

Aaron Tucker

Victoria Krakovna

Témoignages

"A great way to meet the best people in the area and propel daring ideas forward." — Stuart Armstrong

"The event was a great place to meet others with shared research interests. I particularly enjoyed the small discussion groups that exposed me to new perspectives." — Adam Gleave