Resources
These resources are intended to introduce newcomers to ML safety and to help researchers stay up to date with the latest research.
Online Course
This course covers technical topics in machine learning safety, including Risk Management, Robustness, Monitoring, Alignment, and Systemic Safety.
Readings by topic
Here are some papers that we recommend to researchers and practitioners who want to learn more about ML safety.
Robustness
Adversarial Robustness
- Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples
- Towards Deep Learning Models Resistant to Adversarial Attacks
- Universal Adversarial Triggers for Attacking and Analyzing NLP
- Data Augmentation Can Improve Robustness
- Adversarial Examples for Evaluating Reading Comprehension Systems
- BERT-ATTACK: Adversarial Attack Against BERT Using BERT (GitHub)
- Gradient-based Adversarial Attacks against Text Transformers
- Smooth Adversarial Training
- Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks (website)
- Certified Adversarial Robustness via Randomized Smoothing
- Adversarial Examples Are a Natural Consequence of Test Error in Noise
- Using Pre-Training Can Improve Model Robustness and Uncertainty
- Motivating the Rules of the Game for Adversarial Example Research
- Certified Defenses against Adversarial Examples
- Towards Evaluating the Robustness of Neural Networks
Long Tails and Distribution Shift
- The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
- Benchmarking Neural Network Robustness to Common Corruptions and Perturbations
- PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
- WILDS: A Benchmark of in-the-Wild Distribution Shifts
- ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models
- Adversarial NLI: A New Benchmark for Natural Language Understanding
- Natural Adversarial Examples
- ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness
Monitoring
OOD and Malicious Behavior Detection
- Deep Anomaly Detection with Outlier Exposure
- A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks
- ViM: Out-Of-Distribution with Virtual-logit Matching
- VOS: Learning What You Don’t Know by Virtual Outlier Synthesis
- Scaling Out-of-Distribution Detection for Real-World Settings
- A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks
Interpretable Uncertainty
- On Calibration of Modern Neural Networks
- Can You Trust Your Model’s Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift
- PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
- Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles
- Posterior calibration and exploratory analysis for natural language processing models
- Accurate Uncertainties for Deep Learning Using Calibrated Regression
Transparency
- The Mythos of Model Interpretability
- Sanity Checks for Saliency Maps
- Interpretable Explanations of Black Boxes by Meaningful Perturbation
- Locating and Editing Factual Knowledge in GPT
- Acquisition of Chess Knowledge in AlphaZero
- Feature Visualizations and OpenAI Microscope
- Exemplary Natural Images Explain CNN Activations Better than State-of-the-Art Feature Visualization
- Network Dissection: Quantifying Interpretability of Deep Visual Representations
- Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead
- Convergent Learning: Do different neural networks learn the same representations?
Trojans
- Poisoning and Backdooring Contrastive Learning
- Universal Litmus Patterns: Revealing Backdoor Attacks in CNNs
- Neural Cleanse: Identifying and Mitigating Backdoor Attacks in Neural Networks
- TrojAI
- Detecting AI Trojans Using Meta Neural Analysis
- STRIP: A Defence Against Trojan Attacks on Deep Neural Networks
- Targeted Backdoor Attacks on Deep Learning Systems Using Data Poisoning
- BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain
Detecting and Forecasting Emergent Behavior
Alignment
Honest AI
- TruthfulQA: Measuring How Models Mimic Human Falsehoods
- Truthful AI: Developing and governing AI that does not lie
Machine Ethics
- What Would Jiminy Cricket Do? Towards Agents That Behave Morally
- Ethics Background (Introduction through “Absolute Rights or Prima Facie Duties”)
- Aligning AI With Shared Human Values
- Avoiding Side Effects in Complex Environments
- Conservative Agency via Attainable Utility Preservation
- The Structure of Normative Ethics
Systemic Safety
Forecasting
- Forecasting Future World Events with Neural Networks
- On Single Point Forecasts for Fat-Tailed Variables
- On the Difference between Binary Prediction and True Exposure With Implications For Forecasting Tournaments and Decision Making Research
- Superforecasting – Philip Tetlock