智能论文笔记

Modeling Human-AI Team Decision Making

Wei Ye , Francesco Bullo , Noah Friedkin , Ambuj K Singh

分类：人工智能

2022-01-08

AI和人类为团体审议带来了互补技能。在审议包括风险要素和评估人和AI代理商的能力的探索开采过程中，建模本集团决策尤其挑战。为了调查这个问题，我们向一系列智力展示了一系列的知识问题，通过不完美的AI代理商提供了一系列的人群。集团的目标是评估本集团成员及其可用AI代理商的相对专业知识，评估与不同行动相关的风险，并通过达成共识来最大限度地提高整体奖励。我们在这种不确定的情况下提出和经验验证了人类队决策的模型，并显示了前景理论，影响动态和贝叶斯学习的社会认知构建在预测人AI群体行为中的价值。

translated by 谷歌翻译

Randomized Classifiers vs Human Decision-Makers: Trustworthy AI May Have to Act Randomly and Society Seems to Accept This

Gábor Erdélyi , Olivia J. Erdélyi , Vladimir Estivill-Castro

分类：人工智能 | 机器学习

2021-11-15

\ EMPH {人工智能}（AI）系统越来越多地参与影响我们生活的决策，确保自动决策是公平的，道德已经成为最优先事项。直观地，我们觉得类似人的决定，人工代理人的判断应该必然地以一些道德原则为基础。然而，如果有关决定所基础的所有有关因素的全部信息，可以真正伦理（人类或人为）和公平（根据任何道德理论）和公平（根据公平的任何概念）的规定在决策时。这提出了两个问题：（1）在设置中，我们依赖使用通过监督学习获得的分类器的AI系统，存在一些感应/泛化，即使在学习期间也可能不存在一些相关属性。（2）根据游戏揭示任何 - 无论是道德的纯策略都不可避免地易于剥削，建模这些决定。此外，在许多游戏中，只能通过使用混合策略来获得纳什均衡，即实现数学上最佳结果，决定必须随机化。在本文中，我们认为，在监督学习设置中，存在至少以及确定性分类器的随机分类器，因此在许多情况下可能是最佳选择。我们支持我们的理论效果，具有一个实证研究，表明对随机人工决策者的积极社会态度，并讨论了与使用与当前的AI政策和标准化举措相关的随机分类器相关的一些政策和实施问题。

translated by 谷歌翻译

Explanations Can Reduce Overreliance on AI Systems During Decision-Making

Helena Vasconcelos , Matthew Jörke , Madeleine Grunde-McLaughlin , Tobias Gerstenberg , Michael Bernstein , Ranjay Krishna

分类：人工智能

2022-12-13

Prior work has identified a resilient phenomenon that threatens the performance of human-AI decision-making teams: overreliance, when people agree with an AI, even when it is incorrect. Surprisingly, overreliance does not reduce when the AI produces explanations for its predictions, compared to only providing predictions. Some have argued that overreliance results from cognitive biases or uncalibrated trust, attributing overreliance to an inevitability of human cognition. By contrast, our paper argues that people strategically choose whether or not to engage with an AI explanation, demonstrating empirically that there are scenarios where AI explanations reduce overreliance. To achieve this, we formalize this strategic choice in a cost-benefit framework, where the costs and benefits of engaging with the task are weighed against the costs and benefits of relying on the AI. We manipulate the costs and benefits in a maze task, where participants collaborate with a simulated AI to find the exit of a maze. Through 5 studies (N = 731), we find that costs such as task difficulty (Study 1), explanation difficulty (Study 2, 3), and benefits such as monetary compensation (Study 4) affect overreliance. Finally, Study 5 adapts the Cognitive Effort Discounting paradigm to quantify the utility of different explanations, providing further support for our framework. Our results suggest that some of the null effects found in literature could be due in part to the explanation not sufficiently reducing the costs of verifying the AI's prediction.

translated by 谷歌翻译

Role of Human-AI Interaction in Selective Prediction

Elizabeth Bondi , Raphael Koster , Hannah Sheahan , Martin Chadwick , Yoram Bachrach , Taylan Cemgil , Ulrich Paquet , Krishnamurthy Dvijotham

分类：人工智能

2021-12-13

最近的工作表明，当AI的预测不可靠时，可以学会推迟人类的选择性预测系统的潜在好处，特别是提高医疗保健等高赌注应用中AI系统的可靠性。然而，大多数事先工作假定当他们解决预测任务时，人类行为仍然保持不变，作为人类艾队团队的一部分而不是自己。我们表明，通过执行实验来规定在选择性预测的背景下量化人AI相互作用的实验并非如此。特别是，我们研究将不同类型信息传送给人类的影响，了解AI系统的决定推迟。使用现实世界的保护数据和选择性预测系统，可以在单独工作的人体或AI系统上提高预期准确性，我们表明，这种消息传递对人类判断的准确性产生了重大影响。我们的结果研究了消息传递策略的两个组成部分：1）人类是否被告知AI系统的预测和2）是否被告知选择性预测系统的决定推迟。通过操纵这些消息传递组件，我们表明，通过通知人类推迟的决定，可以显着提高人类的性能，但不透露对AI的预测。因此，我们表明，考虑在设计选择性预测系统时如何传送到人类的决定是至关重要的，并且必须使用循环框架仔细评估人类-AI团队的复合精度。

translated by 谷歌翻译

Frontiers in Collective Intelligence: A Workshop Report

Tyler Millhouse , Melanie Moses , Melanie Mitchell

分类：人工智能

2021-12-13

2021年8月，圣达菲研究所举办了一个关于集体智力的研讨会，是智力项目基础的一部分。该项目旨在通过促进智能性质的跨学科研究来推进人工智能领域。该研讨会汇集了计算机科学家，生物学家，哲学家，社会科学家和其他人，以分享他们对多种代理人之间的互动产生的洞察力的见解 - 是否这些代理商是机器，动物或人类。在本报告中，我们总结了每个会谈和随后的讨论。我们还借出了许多关键主题，并确定未来研究的重要前沿。

translated by 谷歌翻译

Explanation in Artificial Intelligence: Insights from the Social Sciences

Tim Miller

分类：

2017-06-22

There has been a recent resurgence in the area of explainable artificial intelligence as researchers and practitioners seek to make their algorithms more understandable. Much of this research is focused on explicitly explaining decisions or actions to a human observer, and it should not be controversial to say that looking at how humans explain to each other can serve as a useful starting point for explanation in artificial intelligence. However, it is fair to say that most work in explainable artificial intelligence uses only the researchers' intuition of what constitutes a 'good' explanation. There exists vast and valuable bodies of research in philosophy, psychology, and cognitive science of how people define, generate, select, evaluate, and present explanations, which argues that people employ certain cognitive biases and social expectations towards the explanation process. This paper argues that the field of explainable artificial intelligence should build on this existing research, and reviews relevant papers from philosophy, cognitive psychology/science, and social psychology, which study these topics. It draws out some important findings, and discusses ways that these can be infused with work on explainable artificial intelligence.

translated by 谷歌翻译

A Survey of Ad Hoc Teamwork Research

Reuth Mirsky , Ignacio Carlucho , Arrasy Rahman , Elliot Fosong , William Macke , Mohan Sridharan , Peter Stone , Stefano V. Albrecht

分类：人工智能

2022-02-16

临时团队合作是设计可以与新队友合作而无需事先协调的研究问题的研究问题。这项调查做出了两个贡献：首先，它提供了对临时团队工作问题不同方面的结构化描述。其次，它讨论了迄今为止该领域取得的进展，并确定了临时团队工作中需要解决的直接和长期开放问题。

translated by 谷歌翻译

Balancing Performance and Human Autonomy with Implicit Guidance Agent

Ryo Nakahashi , Seiji Yamada

分类：人工智能

2021-09-01

人类代理团队，这是一个问题，其中人类和自治机构合作实现一项任务，是人类AI协作的典型。为有效的合作，人类希望有一个有效的计划，而是在现实的情况下，他们可能难以计算由于认知限制的最佳计划。在这种情况下，具有许多计算资源的代理的指导可能是有用的。但是，如果代理人明确指导人类行为，人类可能会觉得他们已经失去了自主，并由代理商控制。因此，我们调查了通过代理人行为提供的隐性指导。通过这种指导，代理商以一种方式使人类能够易于找到合作任务的有效计划，然后可以改善计划。由于人类自愿改善了他们的计划，他或她保持自治。我们通过将贝叶斯思想理论集成到现有的协作规划算法中并通过行为实验进行了模拟了一个具有隐含指导，并通过隐性指导的行为实验证明了使人们能够在改善计划和保留自治之间保持平衡。

translated by 谷歌翻译

Free Will Belief as a consequence of Model-based Reinforcement Learning

Erik M. Rehn

分类：机器学习 | 人工智能

2021-11-14

关于人类是否有自由的辩论是几个世纪以来的争夺。虽然有良好的论据，基于我们目前对大自然法律的理解，虽然人类不可能自由的意志，但大多数人都相信他们。这种差异乞求解释。如果我们接受我们没有自由的意志，我们面临着两个问题：（1）虽然自由是一个非常常用的概念，每个人都直觉理解，我们实际提到的是，当我们说行动或选择时，我们实际上是什么？免费“或不是？而且，（2）为什么自由的信念会如此共同？这种信念来自哪里，它的目的是什么？在本文中，我们从加强学习（RL）的角度来看这些问题。 RL是最初为培训人工智能代理制定的框架。然而，它也可以用作人为决策和学习的计算模型，并通过这样做，我们建议通过观察人们的常识理解自由来回回答第一问题与信息熵密切相关RL代理的归一化行动值，而第二个可以通过代理人来制定本身的必要性，就像他们在处理时间信用分配问题时所做的那样做出决定。简而言之，我们建议通过将RL框架应用为人类学习的模型，这变得明显，为了让我们有效地学习并聪明，我们需要将自己视为自由意志。

translated by 谷歌翻译

Resonating Minds -- Emergent Collaboration Through Hierarchical Active Inference

Jan Pöppel , Sebastian Kahl , Stefan Kopp

分类：人工智能

2021-12-02

在复杂的协作任务上共同努力需要代理商协调他们的行为。在实际交互之前明确或完全执行此操作并不总是可能也不充分。代理人还需要不断了解他人的当前行动，并迅速适应自己的行为。在这里，我们调查我们称之为信仰共鸣的精神状态（意图，目标）的效率，自动协调过程如何导致协作的解决问题。我们为协作剂（HAICA）提出了分层有源推断的模型。它将高效的贝叶斯理论与基于预测处理和主动推断的感知动作系统相结合。通过让一个药物的推断精神状态影响另一个代理人的预测信念来实现信仰共振，从而实现了他自己的目标和意图。这样，推断的精神状态影响了代理人自己的任务行为，没有明确的协作推理。我们在超核域中实施和评估此模型，其中两个代理具有不同程度的信仰共振组合，以满足膳食订单。我们的结果表明，基于HAICA的代理商实现了与最近现有技术方法相当的团队表现，同时产生了更低的计算成本。我们还表明，信仰共振在环境中特别有益，代理商是对环境的不对称知识。结果表明，信仰共振和有效推断允许快速高效的代理协调，因此可以用作合作认知剂的结构块。

translated by 谷歌翻译

Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making

Kori Inkpen , Shreya Chappidi , Keri Mallari , Besmira Nushi , Divya Ramesh , Pietro Michelucci , Vani Mandava , Libuše Hannah Vepřek , Gabrielle Quinn

分类：人工智能

2022-08-16

人为决策的合作努力实现超出人类或人工智能表现的团队绩效。但是，许多因素都会影响人类团队的成功，包括用户的领域专业知识，AI系统的心理模型，对建议的信任等等。这项工作检查了用户与三种模拟算法模型的互动，所有这些模型都具有相似的精度，但对其真正的正面和真实负率进行了不同的调整。我们的研究检查了在非平凡的血管标签任务中的用户性能，参与者表明给定的血管是流动还是停滞。我们的结果表明，虽然AI-Assistant的建议可以帮助用户决策，但用户相对于AI的基线性能和AI错误类型的补充调整等因素会显着影响整体团队的整体绩效。新手用户有所改善，但不能达到AI的准确性。高度熟练的用户通常能够识别何时应遵循AI建议，并通常保持或提高其性能。与AI相似的准确性水平的表演者在AI建议方面是最大的变化。此外，我们发现用户对AI的性能亲戚的看法也对给出AI建议时的准确性是否有所提高产生重大影响。这项工作提供了有关与人类协作有关的因素的复杂性的见解，并提供了有关如何开发以人为中心的AI算法来补充用户在决策任务中的建议。

translated by 谷歌翻译

A Survey of Multi-Agent Human-Robot Interaction Systems

Abhinav Dahiya , Alexander M. Aroyo , Kerstin Dautenhahn , Stephen L. Smith

分类：机器人

2022-12-10

This article presents a survey of literature in the area of Human-Robot Interaction (HRI), specifically on systems containing more than two agents (i.e., having multiple humans and/or multiple robots). We identify three core aspects of ``Multi-agent" HRI systems that are useful for understanding how these systems differ from dyadic systems and from one another. These are the Team structure, Interaction style among agents, and the system's Computational characteristics. Under these core aspects, we present five attributes of HRI systems, namely Team size, Team composition, Interaction model, Communication modalities, and Robot control. These attributes are used to characterize and distinguish one system from another. We populate resulting categories with examples from recent literature along with a brief discussion of their applications and analyze how these attributes differ from the case of dyadic human-robot systems. We summarize key observations from the current literature, and identify challenges and promising areas for future research in this domain. In order to realize the vision of robots being part of the society and interacting seamlessly with humans, there is a need to expand research on multi-human -- multi-robot systems. Not only do these systems require coordination among several agents, they also involve multi-agent and indirect interactions which are absent from dyadic HRI systems. Adding multiple agents in HRI systems requires advanced interaction schemes, behavior understanding and control methods to allow natural interactions among humans and robots. In addition, research on human behavioral understanding in mixed human-robot teams also requires more attention. This will help formulate and implement effective robot control policies in HRI systems with large numbers of heterogeneous robots and humans; a team composition reflecting many real-world scenarios.

translated by 谷歌翻译

Humans decompose tasks by trading off utility and computational cost

Carlos G. Correa , Mark K. Ho , Frederick Callaway , Nathaniel D. Daw , Thomas L. Griffiths

分类：人工智能

2022-11-07

Human behavior emerges from planning over elaborate decompositions of tasks into goals, subgoals, and low-level actions. How are these decompositions created and used? Here, we propose and evaluate a normative framework for task decomposition based on the simple idea that people decompose tasks to reduce the overall cost of planning while maintaining task performance. Analyzing 11,117 distinct graph-structured planning tasks, we find that our framework justifies several existing heuristics for task decomposition and makes predictions that can be distinguished from two alternative normative accounts. We report a behavioral study of task decomposition ($N=806$) that uses 30 randomly sampled graphs, a larger and more diverse set than that of any previous behavioral study on this topic. We find that human responses are more consistent with our framework for task decomposition than alternative normative accounts and are most consistent with a heuristic -- betweenness centrality -- that is justified by our approach. Taken together, our results provide new theoretical insight into the computational principles underlying the intelligent structuring of goal-directed behavior.

translated by 谷歌翻译

Commander's Intent: A Dataset and Modeling Approach for Human-AI Task Specification in Strategic Play

Pradyumna Tambwekar , Nathan Vaska , Lakshita Dodeja , Matthew Gombolay

分类：人工智能 | 自然语言处理 | 机器学习

2022-08-17

有效的人类组合需要能够传达团队目标的能力和您需要代理商进行操作的约束。提供指定团队共享意图或操作标准的能力，可以使AI代理执行其主要功能，同时仍然能够满足当前团队的特定愿望。尽管已经开展了重要的工作来指导代理通过语言或演示执行任务，但先前的工作缺乏专注于可以在团队指定的参数中运行的建筑物。更糟糕的是，缺乏有关使人类通过非结构化的自然主义语言提供其规范的研究。在本文中，我们建议将目标和约束用作调节和评估自治药物的脚手架。我们通过介绍一个新颖的数据集和相关的数据收集协议来为这一领域做出贡献，该协议将语言描述映射到与人参与者为棋盘游戏风险开发的特定策略相对应的目标和约束。利用最先进的语言模型和增强程序，我们开发了一个机器学习框架，该框架可用于从非结构化策略描述中识别目标和约束。为了验证我们的方法，我们进行了一项人为主体研究，以建立我们的数据集的人类基础。我们的结果表明，与执行同一机器翻译任务的人类评估者相比，我们的机器学习体系结构能够更好地将非结构化语言描述解释为策略规范（F（1,272.53）= 17.025，p <0.001）。

translated by 谷歌翻译

Monte Carlo Tree Search: A Review of Recent Modifications and Applications

Maciej Świechowski , Konrad Godlewski , Bartosz Sawicki , Jacek Mańdziuk

分类：人工智能 | 机器学习

2021-03-08

蒙特卡洛树搜索（MCT）是设计游戏机器人或解决顺序决策问题的强大方法。该方法依赖于平衡探索和开发的智能树搜索。MCT以模拟的形式进行随机抽样，并存储动作的统计数据，以在每个随后的迭代中做出更有教育的选择。然而，该方法已成为组合游戏的最新技术，但是，在更复杂的游戏（例如那些具有较高的分支因素或实时系列的游戏）以及各种实用领域（例如，运输，日程安排或安全性）有效的MCT应用程序通常需要其与问题有关的修改或与其他技术集成。这种特定领域的修改和混合方法是本调查的主要重点。最后一项主要的MCT调查已于2012年发布。自发布以来出现的贡献特别感兴趣。

translated by 谷歌翻译

Two steps to risk sensitivity

Chris Gagne , Peter Dayan

分类：人工智能

2021-11-12

分类加强学习（RL） - 其中代理人了解其行动的所有可能的长期后果，而不仅仅是预期的价值 - 最近的兴趣。分配视图的最重要可接受性之一是在结果不完全确定的情况下促进现代，测量的，风险的风险。相比之下，在风险下决策的心理和神经科学调查利用了各种更令人尊敬的理论模型，例如缺乏公理理想的性质，例如连贯性。在这里，我们考虑了用于建模人类和动物规划的风险措施，称为有条件的价值 - 风险（CVAR），这量化了最坏情况结果（例如，车辆事故或捕食）。我们首先在连续的情况下采用传统的分布方法，在序列环境中，在众所周知的两步任务中重新分析人类决策者的选择，揭示了在粘性和坚持下潜伏的大量风险厌恶。然后，我们考虑风险敏感性的进一步关键特性，即时间一致性，显示出这种形式的CVAR的替代品，享受这种理想的特征。我们使用模拟来检查各种形式的设置，其中各种形式因对人类和动物规划和行为而产生影响的方式。

translated by 谷歌翻译

Coordination with Humans via Strategy Matching

Michelle Zhao , Reid Simmons , Henny Admoni

分类：机器人 | 机器学习

2022-10-27

Human and robot partners increasingly need to work together to perform tasks as a team. Robots designed for such collaboration must reason about how their task-completion strategies interplay with the behavior and skills of their human team members as they coordinate on achieving joint goals. Our goal in this work is to develop a computational framework for robot adaptation to human partners in human-robot team collaborations. We first present an algorithm for autonomously recognizing available task-completion strategies by observing human-human teams performing a collaborative task. By transforming team actions into low dimensional representations using hidden Markov models, we can identify strategies without prior knowledge. Robot policies are learned on each of the identified strategies to construct a Mixture-of-Experts model that adapts to the task strategies of unseen human partners. We evaluate our model on a collaborative cooking task using an Overcooked simulator. Results of an online user study with 125 participants demonstrate that our framework improves the task performance and collaborative fluency of human-agent teams, as compared to state of the art reinforcement learning methods.

translated by 谷歌翻译

Explainable Goal-Driven Agents and Robots -- A Comprehensive Review

Fatai Sado , Chu Kiong Loo , Wei Shiung Liew , Matthias Kerzel , Stefan Wermter

分类：机器人 | 人工智能

2020-04-21

最近的自主代理和机器人的应用，如自动驾驶汽车，情景的培训师，勘探机器人和服务机器人带来了关注与当前生成人工智能（AI）系统相关的至关重要的信任相关挑战。尽管取得了巨大的成功，基于连接主义深度学习神经网络方法的神经网络方法缺乏解释他们对他人的决策和行动的能力。没有符号解释能力，它们是黑色盒子，这使得他们的决定或行动不透明，这使得难以信任它们在安全关键的应用中。最近对AI系统解释性的立场目睹了可解释的人工智能（XAI）的几种方法;然而，大多数研究都专注于应用于计算科学中的数据驱动的XAI系统。解决越来越普遍的目标驱动器和机器人的研究仍然缺失。本文评论了可解释的目标驱动智能代理和机器人的方法，重点是解释和沟通代理人感知功能的技术（示例，感官和愿景）和认知推理（例如，信仰，欲望，意图，计划和目标）循环中的人类。审查强调了强调透明度，可辨与和持续学习以获得解释性的关键策略。最后，本文提出了解释性的要求，并提出了用于实现有效目标驱动可解释的代理和机器人的路线图。

translated by 谷歌翻译

The Utility of Explainable AI in Ad Hoc Human-Machine Teaming

Rohan Paleja , Muyleng Ghuy , Nadun Ranawaka Arachchige , Reed Jensen , Matthew Gombolay

分类：人工智能

2022-09-08

机器学习的最新进展导致人们对可解释的AI（XAI）的兴趣越来越大，使人类能够深入了解机器学习模型的决策。尽管最近有这种兴趣，但XAI技术的实用性尚未在人机组合中得到特征。重要的是，XAI提供了增强团队情境意识（SA）和共享心理模型发展的希望，这是有效的人机团队的关键特征。快速开发这种心理模型在临时人机团队中尤其重要，因为代理商对他人的决策策略没有先验知识。在本文中，我们提出了两个新颖的人类受试者实验，以量化在人机组合场景中部署XAI技术的好处。首先，我们证明XAI技术可以支持SA（$ P <0.05）$。其次，我们研究了通过协作AI政策抽象诱导的不同SA级别如何影响临时人机组合绩效。重要的是，我们发现XAI的好处不是普遍的，因为对人机团队的组成有很大的依赖。新手受益于XAI提供增加的SA（$ P <0.05 $），但容易受到认知开销的影响（$ P <0.05 $）。另一方面，专家性能随着基于XAI的支持（$ p <0.05 $）而降低，这表明关注XAI的成本超过了从提供的其他信息中获得的收益以增强SA所获得的收益。我们的结果表明，研究人员必须通过仔细考虑人机团队组成以及XAI方法如何增强SA来故意在正确的情况下设计和部署正确的XAI技术。

translated by 谷歌翻译

From partners to populations: A hierarchical Bayesian account of coordination and convention

Robert D. Hawkins , Michael Franke , Michael C. Frank , Adele E. Goldberg , Kenny Smith , Thomas L. Griffiths , Noah D. Goodman

分类：自然语言处理 | 人工智能

2021-04-12

语言是协调问题的强大解决方案：他们提供了稳定的，有关我们所说的单词如何对应于我们头脑中的信仰和意图的共同期望。然而，在变量和非静止社会环境中的语言使用需要语言表征来灵活：旧词在飞行中获取新的临时或合作伙伴特定含义。在本文中，我们介绍了柴（通过推理的连续分层适应），一个分层贝叶斯的协调理论和会议组织，旨在在这两个基本观察之间调和长期张力。我们认为，沟通的中央计算问题不仅仅是传输，如在经典配方中，而是在多个时间尺度上持续学习和适应。合作伙伴特定的共同点迅速出现在数型互动中的社会推论中，而社群范围内的社会公约是稳定的前锋，这些前锋已经抽象出与多个合作伙伴的互动。我们展示了新的实证数据，展示了我们的模型为多个现象提供了对先前账户挑战的计算基础：（1）与同一合作伙伴的重复互动的更有效的参考表达的融合（2）将合作伙伴特定的共同基础转移到陌生人，并（3）交际范围的影响最终会形成。

translated by 谷歌翻译