Paper Title

Negative Human Rights as a Basis for Long-term AI Safety and Regulation

Authors

Bajgar, Ondrej; Horenovsky, Jan

Abstract

If autonomous AI systems are to be reliably safe in novel situations, they will need to incorporate general principles guiding them to recognize and avoid harmful behaviours. Such principles may need to be supported by a binding system of regulation, which would need the underlying principles to be widely accepted. They should also be specific enough for technical implementation. Drawing inspiration from law, this article explains how negative human rights could fulfil the role of such principles and serve as a foundation both for an international regulatory system and for building technical safety constraints for future AI systems.
