Beyond Algorithms: Understanding the Challenges of AI Safety

Author

Mitrani, Nathaniel

Document type

Coursework

Date

2024

Rights

Open Access

Publisher

Universitat Politècnica de Catalunya

Academic year

2023/2024



Abstract

We survey the ways in which an AI system might fail to behave as intended, highlighting the importance of, and the growing need for, research in this direction. We introduce AI safety and its challenges in Reinforcement Learning and Deep Learning, and extend the discussion to the alignment of super-intelligent systems, or Superalignment.

Participating teacher

  • Mitrani, Nathaniel

Files