Smart Paper Notes

Negative Human Rights as a Basis for Long-term AI Safety and Regulation

Ondrej Bajgar , Jan Horenovsky

Category: Artificial Intelligence

2022-08-31

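If future AI systems are to be reliably safe in novel situations, they will need to incorporate general principles that guide them to robustly recognize which outcomes and behaviours would be harmful. Such principles may need to be supported by a binding system of regulation, which in turn requires the underlying principles to be widely accepted. They should also be specific enough for technical implementation. Drawing inspiration from law, this article explains how negative human rights could fulfil the role of such principles and serve as a foundation both for an international regulatory system and for building technical safety constraints into future AI systems.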

translated by Google Translate

Law Informs Code: A Legal Informatics Approach to Aligning Artificial Intelligence with Humans

John J. Nay

Category: Artificial Intelligence | Machine Learning

2022-09-14

We are currently unable to specify human goals and societal values in a way that reliably directs AI behavior. Law-making and legal interpretation form a computational engine that converts opaque human values into legible directives. "Law Informs Code" is the research agenda capturing complex computational legal processes, and embedding them in AI. Similar to how parties to a legal contract cannot foresee every potential contingency of their future relationship, and legislators cannot predict all the circumstances under which their proposed bills will be applied, we cannot ex ante specify rules that provably direct good AI behavior. Legal theory and practice have developed arrays of tools to address these specification problems. For instance, legal standards allow humans to develop shared understandings and adapt them to novel situations. In contrast to more prosaic uses of the law (e.g., as a deterrent of bad behavior through the threat of sanction), leveraged as an expression of how humans communicate their goals, and what society values, Law Informs Code. We describe how data generated by legal processes (methods of law-making, statutory interpretation, contract drafting, applications of legal standards, legal reasoning, etc.) can facilitate the robust specification of inherently vague human goals. This increases human-AI alignment and the local usefulness of AI. Toward society-AI alignment, we present a framework for understanding law as the applied philosophy of multi-agent alignment. Although law is partly a reflection of historically contingent political power - and thus not a perfect aggregation of citizen preferences - if properly parsed, its distillation offers the most legitimate computational comprehension of societal values available. If law eventually informs powerful AI, engaging in the deliberative political process to improve law takes on even more meaning.

The European AI Liability Directives -- Critique of a Half-Hearted Approach and Lessons for the Future

Philipp Hacker

Category: Artificial Intelligence | Machine Learning

2022-11-25

The optimal liability framework for AI systems remains an unsolved problem across the globe. In a much-anticipated move, the European Commission advanced two proposals outlining the European approach to AI liability in September 2022: a novel AI Liability Directive and a revision of the Product Liability Directive. They constitute the final, and much-anticipated, cornerstone of AI regulation in the EU. Crucially, the liability proposals and the EU AI Act are inherently intertwined: the latter does not contain any individual rights of affected persons, and the former lack specific, substantive rules on AI development and deployment. Taken together, these acts may well trigger a Brussels effect in AI regulation, with significant consequences for the US and other countries. This paper makes three novel contributions. First, it examines in detail the Commission proposals and shows that, while making steps in the right direction, they ultimately represent a half-hearted approach: if enacted as foreseen, AI liability in the EU will primarily rest on disclosure of evidence mechanisms and a set of narrowly defined presumptions concerning fault, defectiveness and causality. Hence, second, the article suggests amendments, which are collected in an Annex at the end of the paper. Third, based on an analysis of the key risks AI poses, the final part of the paper maps out a road for the future of AI liability and regulation, in the EU and beyond. This includes: a comprehensive framework for AI liability; provisions to support innovation; an extension to non-discrimination/algorithmic fairness, as well as explainable AI; and sustainability. I propose to jump-start sustainable AI regulation via sustainability impact assessments in the AI Act and sustainable design defects in the liability regime. In this way, the law may help spur not only fair AI and XAI, but potentially also sustainable AI (SAI).

X-Risk Analysis for AI Research

Dan Hendrycks , Mantas Mazeika

Category: Artificial Intelligence

2022-06-13

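Artificial intelligence (AI) has the potential to greatly improve society, but as with any powerful technology, it comes with heightened risks and responsibilities. Current AI research lacks a systematic discussion of how to manage long-tail risks from AI systems, including speculative long-term risks. Keeping in mind that AI may be integral to improving humanity's long-term potential, there is some concern that building ever more intelligent and powerful AI systems could eventually result in systems that are more powerful than us. Some say this is like playing with fire and speculate that it could create existential risks (x-risks). To add to these discussions, we review a collection of time-tested concepts from hazard analysis and systems safety, which were designed to steer large processes in safer directions. We then discuss how AI researchers can have a long-term impact on the safety of AI systems. Finally, we discuss how to robustly shape the processes that will affect the balance between safety and general capabilities.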

Conformity Assessments and Post-market Monitoring: A Guide to the Role of Auditing in the Proposed European AI Regulation

Jakob Mokander , Maria Axente , Federico Casolari , Luciano Floridi

Category: Artificial Intelligence

2021-11-09

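The proposed European Artificial Intelligence Act (AIA) is the first attempt by any major global economy to elaborate a general legal framework for AI. As such, the AIA is likely to become a point of reference in the larger discourse on how AI systems can (and should) be regulated. In this article, we describe and discuss the two primary enforcement mechanisms proposed in the AIA: the conformity assessments that providers of high-risk AI systems are expected to conduct, and the post-market monitoring plans that providers must establish to document the performance of high-risk AI systems throughout their lifetimes. We argue that the AIA can be interpreted as a proposal to establish a Europe-wide ecosystem for AI auditing, albeit in other words. Our analysis offers two main contributions. First, by describing the enforcement mechanisms included in the AIA in terminology borrowed from the existing literature on AI auditing, we help providers of AI systems understand how they can demonstrate, in practice, adherence to the requirements set out in the AIA. Second, by examining the AIA from an auditing perspective, we seek to provide transferable lessons from previous research on how to further refine the regulatory approach outlined in the AIA. We conclude by highlighting seven aspects of the AIA where amendments (or simply clarifications) would be helpful. Above all, these include the need to translate vague concepts into verifiable criteria and to strengthen the institutional safeguards around conformity assessments based on internal checks.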

Contextualizing Artificially Intelligent Morality: A Meta-Ethnography of Top-Down, Bottom-Up, and Hybrid Models for Theoretical and Applied Ethics in Artificial Intelligence

Jennafer S. Roberts , Laura N. Montoya

Category: Artificial Intelligence | Neural and Evolutionary Computing

2022-04-15

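In this meta-ethnography, we explore three different angles on the implementation of ethical artificial intelligence (AI) design: the philosophical ethical viewpoint, the technical perspective, and framing through a political lens. Our qualitative research includes a literature review that highlights the cross-referencing of these angles by discussing the value and drawbacks of previously published, contrasting top-down, bottom-up, and hybrid approaches. The novel contribution to this framework is the political angle, in which ethics in AI is either determined by corporations and governments and imposed through policy or law (coming from the top), or called for by the people (coming from the bottom). We also consider the top-down, bottom-up, and hybrid technicalities of how AI is developed within a moral construct and in consideration of its users, with expected and unexpected consequences and long-term impact on the world. There is a focus on reinforcement learning as an example of a bottom-up applied technical approach, and on AI ethics principles as a practical top-down approach. This investigation includes real-world case studies, as well as philosophical debate on the ethics of AI and thought experiments about its theoretical future, grounded in historical facts, current world circumstances, and the realities that may follow.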

A Principles-based Ethical Assurance Argument for AI and Autonomous Systems

Zoe Porter , Ibrahim Habli , John McDermid

Category: Artificial Intelligence

2022-03-29

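An assurance case presents a clear and defensible argument, supported by evidence, that a system will operate as intended in a particular context. Typically, an assurance case argues that a system will be acceptably safe in its intended context. One emerging proposition within the trustworthy AI research community is to extend and apply this methodology to provide assurance that the use of an AI system or an autonomous system (AI/AS) will be acceptably ethical in a particular context. In this paper, we advance this proposal further. We do so by presenting a principles-based ethical assurance (PBEA) argument pattern for AI/AS. The PBEA argument pattern offers a framework for reasoning about the overall ethical acceptability of a given AI/AS, and it could serve as an early prototype template for specific ethical assurance cases. The four core ethical principles that form the basis of the PBEA argument pattern are: justice; beneficence; non-maleficence; and respect for personal autonomy. Throughout, we connect stages of the argument pattern to examples of AI/AS applications. This helps to show its initial plausibility.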

Systems Challenges for Trustworthy Embodied Systems

Harald Ruess

Category: Artificial Intelligence

2022-01-10

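A new generation of increasingly autonomous and self-learning systems, which we call embodied systems, is about to be developed. When deploying these systems in real-life contexts, we face various engineering challenges, as it is crucial to coordinate the behavior of embodied systems in a beneficial manner, ensure their compatibility with our human-centered social values, and design verifiably safe and reliable human-machine interaction. We argue that traditional systems engineering is reaching a turning point in the move from embedded to embodied systems, and in assuring the trustworthiness of dynamic federations of situationally aware, intent-driven, explorative, ever-evolving, largely unpredictable, and increasingly autonomous embodied systems in uncertain, complex, and unpredictable real-world contexts. We also identify a number of urgent systems challenges for trustworthy embodied systems, including robust and human-centric AI, cognitive architectures, uncertainty quantification, trustworthy self-integration, and continual analysis and assurance.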

Big data's disparate impact

Category:

Advocates of algorithmic techniques like data mining argue that these techniques eliminate human biases from the decision-making process. But an algorithm is only as good as the data it works with. Data is frequently imperfect in ways that allow these algorithms to inherit the prejudices of prior decision makers. In other cases, data may simply reflect the widespread biases that persist in society at large. In still others, data mining can discover surprisingly useful regularities that are really just preexisting patterns of exclusion and inequality. Unthinking reliance on data mining can deny historically disadvantaged and vulnerable groups full participation in society. Worse still, because the resulting discrimination is almost always an unintentional emergent property of the algorithm's use rather than a conscious choice by its programmers, it can be unusually hard to identify the source of the problem or to explain it to a court. This Essay examines these concerns through the lens of American antidiscrimination law, and more particularly, through Title

Manifestations of Xenophobia in AI Systems

Nenad Tomasev , Jonathan Leader Maynard , Iason Gabriel

Category: Artificial Intelligence

2022-12-15

Xenophobia is one of the key drivers of marginalisation, discrimination, and conflict, yet many prominent machine learning (ML) fairness frameworks fail to comprehensively measure or mitigate the resulting xenophobic harms. Here we aim to bridge this conceptual gap and help facilitate safe and ethical design of artificial intelligence (AI) solutions. We ground our analysis of the impact of xenophobia by first identifying distinct types of xenophobic harms, and then applying this framework across a number of prominent AI application domains, reviewing the potential interplay between AI and xenophobia on social media and recommendation systems, healthcare, immigration, employment, as well as biases in large pre-trained models. These help inform our recommendations towards an inclusive, xenophilic design of future AI systems.

Demystifying the Draft EU Artificial Intelligence Act

Michael Veale , Frederik Zuiderveen Borgesius

Category: Artificial Intelligence

2021-07-08

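In April 2021, the European Commission proposed a Regulation on Artificial Intelligence, known as the AI Act. We present an overview of the Act and analyse its implications, drawing on scholarship ranging from the study of contemporary AI practices to the structure of EU product safety regimes over the last four decades. Aspects of the AI Act, such as different rules for different risk levels of AI, make sense. But we also find that some provisions of the Draft AI Act have surprising legal implications, whilst others may be largely ineffective at achieving their stated goals. Several overarching aspects, including the enforcement regime and the risk that maximum harmonisation pre-empts legitimate national AI policy, raise significant concern. These issues should be addressed as a priority in the legislative process.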

The alignment problem from a deep learning perspective

Richard Ngo

Category: Artificial Intelligence | Machine Learning

2022-08-30

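Within the coming decades, artificial general intelligence (AGI) may surpass human capabilities at a wide range of important tasks. This report makes a case for why, without substantial action to prevent it, AGIs will likely use their intelligence to pursue goals that are very undesirable (in other words, misaligned) from a human perspective, with potentially catastrophic consequences. The report aims to cover the key arguments motivating concern about the alignment problem in a way that is as succinct, concrete, and technically grounded as possible. I argue that realistic training processes plausibly lead to the development of misaligned goals in AGIs, in particular because neural networks trained via reinforcement learning will learn to plan towards achieving a range of goals; gain more reward by deceptively pursuing misaligned goals; and generalize in ways that undermine obedience. As in an earlier report from Cotra (2022), I explain my claims with reference to an illustrative AGI training process, then outline possible research directions for addressing different aspects of the problem.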

AI Ethics Principles in Practice: Perspectives of Designers and Developers

Conrad Sanderson , David Douglas , Qinghua Lu , Emma Schleiger , Jon Whittle , Justine Lacey , Glenn Newnham , Stefan Hajkowicz , Cathy Robinson , David Hansen

Category: Artificial Intelligence

2021-12-14

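As consensus emerges across the various published AI ethics principles, a gap remains between high-level principles and practical techniques that can be readily adopted to design and develop responsible AI systems. We examine the practices and experiences of researchers and engineers from Australia's national scientific research agency (CSIRO) who are involved in designing and developing AI systems for a range of purposes. Semi-structured interviews were used to examine how the participants' practices relate to and align with a set of high-level AI ethics principles proposed by the Australian Government. The principles comprise: privacy protection and security; reliability and safety; transparency and explainability; fairness; contestability; accountability; human-centred values; and human, social, and environmental wellbeing. We examine the insights of the researchers and engineers, as well as the challenges that arose for them in applying the principles in practice. Finally, a set of organisational responses is provided to support the implementation of high-level AI ethics principles in practice.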

The Fallacy of AI Functionality

Inioluwa Deborah Raji , I. Elizabeth Kumar , Aaron Horowitz , Andrew D. Selbst

Category: Machine Learning

2022-06-20

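Deployed AI systems often do not work. They can be constructed haphazardly, deployed indiscriminately, and promoted deceptively. Yet despite this reality, scholars, the press, and policymakers pay too little attention to functionality. This leads to technical and policy solutions focused on "ethical" or value-aligned deployments, often skipping over the prior question of whether a given system functions, or provides any benefit at all. To describe the harms of various types of functionality failures, we analyze a set of case studies to create a taxonomy of known AI functionality issues. We then point to policy and organizational responses that are often overlooked and that become more readily available once functionality is brought into focus. We argue that functionality is a meaningful AI policy challenge, and a necessary first step towards protecting affected communities from algorithmic harm.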

Tackling problems, harvesting benefits -- A systematic review of the regulatory debate around AI

Anja Folberth , Jutta Jahnel , Jascha Bareis , Carsten Orwat , Christian Wadephul

Category: Artificial Intelligence

2022-09-07

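How to integrate an emerging and all-pervasive technology such as AI into the structures and operations of our society is a question for contemporary politics, science, and public debate. It has produced a considerable amount of international academic literature from different disciplines. This article analyzes the academic debate around the regulation of artificial intelligence (AI). The systematic review comprises a sample of 73 peer-reviewed journal articles published between January 1, 2016, and December 31, 2020. The analysis concentrates on societal risks and harms, questions of regulatory responsibility, and possible policy frameworks, including risk-based and principle-based approaches. The main interest is in proposed regulatory approaches and instruments. Various forms of intervention, such as bans, approvals, standard-setting, and disclosure, are presented. The assessments in the included papers indicate the complexity of the field, which reflects its immaturity and a remaining lack of clarity. By presenting a structured analysis of the academic debate, we contribute both empirically and conceptually to a better understanding of the nexus of AI and regulation and of the underlying normative decisions. A comparison of the scientific proposals with the proposed European AI regulation illustrates the specific approach of the regulation, along with its strengths and weaknesses.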

Randomized Classifiers vs Human Decision-Makers: Trustworthy AI May Have to Act Randomly and Society Seems to Accept This

Gábor Erdélyi , Olivia J. Erdélyi , Vladimir Estivill-Castro

Category: Artificial Intelligence | Machine Learning

2021-11-15

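As artificial intelligence (AI) systems are increasingly involved in decisions affecting our lives, ensuring that automated decision-making is fair and ethical has become a top priority. Intuitively, we feel that, akin to human decisions, the judgments of artificial agents should necessarily be grounded in some moral principles. Yet a decision-maker (whether human or artificial) can only make truly ethical (based on any ethical theory) and fair (according to any notion of fairness) decisions if full information on all the relevant factors on which the decision is based is available at the time of decision-making. This raises two problems: (1) in settings where we rely on AI systems that use classifiers obtained through supervised learning, some induction/generalization is present, and some relevant attributes may be absent even during learning; (2) modeling such decisions as games reveals that any pure strategy, however ethical, is inevitably susceptible to exploitation. Moreover, in many games a Nash equilibrium can only be obtained by using mixed strategies; that is, to achieve mathematically optimal outcomes, decisions must be randomized. In this paper, we argue that in supervised learning settings there exist randomized classifiers that perform at least as well as deterministic classifiers, and that may hence be the optimal choice in many circumstances. We support our theoretical results with an empirical study indicating a positive societal attitude towards randomized artificial decision-makers, and discuss some policy and implementation issues related to the use of randomized classifiers that are relevant to current AI policy and standardization initiatives.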

Fairness and Bias in Robot Learning

Laura Londoño , Juana Valeria Hurtado , Nora Hertz , Philipp Kellmeyer , Silja Voeneky , Abhinav Valada

Category: Robotics | Artificial Intelligence | Computer Vision | Machine Learning

2022-07-07

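Machine learning has significantly enhanced the abilities of robots, enabling them to perform a wide range of tasks in human environments and to adapt to our uncertain real world. Recent work in various domains of machine learning has highlighted the importance of accounting for fairness to ensure that these algorithms do not reproduce human biases and consequently lead to discriminatory outcomes. With robot learning systems performing more and more tasks in our everyday lives, it is crucial to understand the influence of such biases in order to prevent unintended behavior toward certain groups of people. In this work, we present the first survey on fairness in robot learning from an interdisciplinary perspective spanning technical, ethical, and legal challenges. We propose a taxonomy of sources of bias and the resulting types of discrimination. Using examples from different robot learning domains, we examine scenarios of unfair outcomes and strategies to mitigate them. We present early advances in the field by covering different fairness definitions, ethical and legal considerations, and methods for fair robot learning. With this work, we aim to pave the way for groundbreaking developments in fair robot learning.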

Current and Near-Term AI as a Potential Existential Risk Factor

Benjamin S. Bucknall , Shiri Dori-Hacohen

Category: Artificial Intelligence

2022-09-21

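There is a substantial and ever-growing body of evidence and literature exploring the impacts of artificial intelligence (AI) technologies on society, politics, and humanity as a whole. A separate, parallel body of work has explored existential risks to humanity, including but not limited to those stemming from unaligned artificial general intelligence (AGI). In this paper, we examine the notion that current and near-term AI technologies have the potential to contribute to existential risk by acting as intermediate risk factors, and that this potential is not limited to the unaligned AGI scenario. We propose the hypothesis that certain already-documented effects of AI can act as existential risk factors, magnifying the likelihood of previously identified sources of existential risk. Moreover, developments in the coming decade have the potential to significantly exacerbate these risk factors, even in the absence of artificial general intelligence. Our main contribution is a (non-exhaustive) exposition of potential AI risk factors and the causal relationships between them, focusing on how AI can affect power dynamics and information security. This exposition demonstrates that there exist causal pathways from AI systems to existential risks that do not presuppose hypothetical future AI capabilities.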

Worldwide AI Ethics: a review of 200 guidelines and recommendations for AI governance

Nicholas Kluge Corrêa , Camila Galvão , James William Santos , Carolina Del Pino , Edson Pontes Pinto , Camila Barbosa , Diogo Massmann , Rodrigo Mambrini , Luiza Galvão , Edmund Terem

Category: Artificial Intelligence

2022-06-23

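In the last decade, a great number of organizations have produced documents intended to standardize, in the normative sense, and provide guidance for our recent and rapid AI development. However, apart from a few meta-analyses and critical reviews of the field, the full content of the ideas presented in these documents, and the divergences between them, have not yet been analyzed. In this work, we seek to expand on the work done by past researchers and create a tool for better data visualization of the contents and nature of these documents. We also provide a critical analysis of the results obtained by applying the tool to a sample of 200 documents.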

Rebuilding Trust: Queer in AI Approach to Artificial Intelligence Risk Management

Ashwin , William Agnew , Umut Pajaro , Hetvi Jethwani , Arjun Subramonian

Category: Artificial Intelligence

2021-09-21

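Trustworthy artificial intelligence (AI) has become an important topic because trust in AI systems and their creators has been lost. Researchers, corporations, and governments have long and painful histories of excluding marginalized groups from technology development, deployment, and oversight. As a result, these technologies are less useful, and even harmful, to minoritized groups. We argue that any AI development, deployment, and monitoring framework that aspires to trust must incorporate both feminist, non-exploitative participatory design principles and strong, external, and continual monitoring and testing. We also explain the importance of considering aspects of trustworthiness beyond transparency, fairness, and accountability, and specifically of treating justice and the shifting of power to the disempowered as core values of any trustworthy AI system. Creating trustworthy AI starts with funding, supporting, and empowering grassroots organizations like Queer in AI, so that the field of AI has the diversity and inclusion needed to develop trustworthy AI credibly and effectively. We draw on the expert knowledge Queer in AI has developed through its years of work and advocacy to discuss whether and how gender, sexuality, and other aspects of queer identity should be used in datasets and AI systems, and how harms along these lines should be mitigated. Based on this, we share a gendered approach to AI, further propose a queer epistemology, and analyze the benefits it can bring to AI. We also discuss how to regulate AI with this queer epistemology in view, proposing frameworks for policy-making related to AI and gender diversity, and to privacy and queer data protection.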
