Cognitive psychology and related disciplines have identified several key mechanisms that enable intelligent biological agents to learn to solve complex problems. There is compelling evidence that the cognitive mechanisms underlying problem-solving skills in these species are grounded in hierarchical mental representations. Among the most promising computational approaches for providing artificial agents and robots with learning-based problem-solving abilities is hierarchical reinforcement learning. However, existing computational methods have so far failed to endow artificial agents with problem-solving abilities comparable to those of intelligent animals, including humans and non-human primates, crows, or octopuses. Here, we first survey the literature in cognitive psychology and related disciplines, finding that many important mental mechanisms involve compositional abstraction, curiosity, and forward models. We then relate these insights to contemporary hierarchical reinforcement learning methods and identify the key machine intelligence approaches that implement these mechanisms. As our main result, we show that all of the important cognitive mechanisms have been implemented independently in isolated computational architectures, and that approaches that properly integrate them are lacking. We hope our results will guide the development of more sophisticated cognitively inspired hierarchical methods, so that future artificial agents achieve problem-solving performance at the level of intelligent animals.
translated by Google Translate
According to cognitive psychology and related disciplines, the development of complex problem-solving behaviour in biological agents depends on hierarchical cognitive mechanisms. Hierarchical reinforcement learning is a promising computational approach that may eventually yield comparable problem-solving behaviour in artificial agents and robots. However, to date, the problem-solving abilities of many human and non-human animals are clearly superior to those of artificial systems. Here, we propose steps to integrate biologically inspired hierarchical mechanisms to enable advanced problem-solving skills in artificial agents. We first review the literature in cognitive psychology to highlight the importance of compositional abstraction and predictive processing. We then relate the gained insights to contemporary hierarchical reinforcement learning methods. Interestingly, our results suggest that all of the identified cognitive mechanisms have been implemented individually in isolated computational architectures, which raises the question of why no single unifying architecture exists that integrates them. As our final contribution, we address this question by providing an integrative perspective on the computational challenges of developing such a unifying architecture. We hope that our results will guide the development of more sophisticated cognitively inspired hierarchical machine learning architectures.
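The core idea behind the hierarchical decomposition discussed in the two abstracts above can be illustrated with a minimal sketch (our own construction, not the authors' architecture): a high-level policy selects subgoals, and a low-level policy issues primitive actions toward the current subgoal. The corridor environment and function names are illustrative assumptions.

```python
# Minimal sketch of hierarchical decomposition on a 1-D corridor:
# a high-level policy iterates over subgoals; a low-level policy
# issues primitive one-cell moves toward the active subgoal.

def low_level_step(pos, subgoal):
    """Primitive action: move one cell toward the subgoal."""
    if pos < subgoal:
        return pos + 1
    if pos > subgoal:
        return pos - 1
    return pos

def run_hierarchy(start, final_goal, subgoals):
    """High level hands subgoals to the low level, one at a time."""
    trajectory = [start]
    pos = start
    for sg in list(subgoals) + [final_goal]:
        while pos != sg:
            pos = low_level_step(pos, sg)
            trajectory.append(pos)
    return trajectory

# Decomposing the task 0 -> 6 via subgoal 3 reaches the same end state,
# but each low-level episode is shorter and individually reusable.
traj = run_hierarchy(start=0, final_goal=6, subgoals=[3])
```

The benefit hinted at by compositional abstraction is that the low-level skill ("reach a given cell") is learned once and recombined across high-level plans.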
In popular media, a connection is often drawn between the advent of awareness in artificial agents and those same agents simultaneously achieving human or superhuman levels of intelligence. In this work, we explore the validity and potential application of this seemingly intuitive link between consciousness and intelligence. We do so by examining the cognitive abilities associated with three contemporary theories of conscious function: Global Workspace Theory (GWT), Information Generation Theory (IGT), and Attention Schema Theory (AST). We find that all three theories specifically relate conscious function to some aspect of domain-general intelligence in humans. With this insight, we turn to the field of artificial intelligence (AI) and find that, while still far from demonstrating general intelligence, many state-of-the-art deep learning methods have begun to incorporate key aspects of each of the three functional theories. Having identified this trend, we use the motivating example of mental time travel in humans to propose ways in which insights from each of the three theories may be combined into a single unified and implementable model. Given that this is made possible by the cognitive abilities underlying each of the three functional theories, artificial agents capable of mental time travel would not only possess greater general intelligence than current approaches, but would also be more consistent with our current understanding of the functional role of consciousness in humans, making it a promising near-term goal for AI research.
Many theories, based on neuroscientific and psychological empirical evidence and on computational concepts, have been elaborated to explain the emergence of consciousness in the central nervous system. These theories propose key fundamental mechanisms to explain consciousness, but they only partially connect such mechanisms to the possible functional and adaptive role of consciousness. Recently, some cognitive and neuroscientific models have tried to bridge this gap by linking consciousness to various aspects of goal-directed behaviour, the pivotal cognitive process that allows mammals to act flexibly in challenging environments. Here we propose the Representation Internal-Manipulation (RIM) theory of consciousness, a theory that links the main elements of consciousness theories to components and functions of goal-directed behaviour, ascribing a central role for consciousness to the goal-directed manipulation of internal representations. This manipulation relies on four specific computational operations to perform the flexible internal adaptation of all key elements of goal-directed computation, from the representations of objects to those of goals, actions, and plans. Finally, we propose the concept of `manipulation agency', relating the sense of agency to the internal manipulation of representations. This allows us to propose that the subjective experience of consciousness is associated with the human capacity to generate and control a simulated internal reality that is vividly perceived and felt through the same perceptual and emotional mechanisms used to tackle the external world.
In this paper, we present a computational modelling account of an active self in artificial agents. In particular, we focus on how an agent can be equipped with a sense of control, how that sense arises in autonomous situated action, and how it, in turn, influences action control. We argue that this requires laying out an embodied cognitive model that combines bottom-up processes (sensorimotor learning and fine-grained adaptation of control) with top-down processes (cognitive processes for strategy selection and decision-making). We propose such a conceptual computational architecture based on principles of predictive processing and free energy minimization. Using this general model, we describe how a sense of control can arise across the levels of a control hierarchy and how it can support action control in an unpredictable environment. We present an implementation of the model as well as a first evaluation in a simulated task scenario, in which an autonomous agent has to cope with unpredictable situations and experiences a corresponding sense of control. We explore different model parameter settings that lead to different ways of combining low-level and high-level action control. The results show the importance of appropriately weighting information in situations where the need for low- or high-level action control varies, and they demonstrate how the sense of control can facilitate this.
Recent progress in artificial intelligence (AI) has renewed interest in building systems that learn and think like people. Many advances have come from using deep neural networks trained end-to-end in tasks such as object recognition, video games, and board games, achieving performance that equals or even beats humans in some respects. Despite their biological inspiration and performance achievements, these systems differ from human intelligence in crucial ways. We review progress in cognitive science suggesting that truly human-like learning and thinking machines will have to reach beyond current engineering trends in both what they learn, and how they learn it. Specifically, we argue that these machines should (a) build causal models of the world that support explanation and understanding, rather than merely solving pattern recognition problems; (b) ground learning in intuitive theories of physics and psychology, to support and enrich the knowledge that is learned; and (c) harness compositionality and learning-to-learn to rapidly acquire and generalize knowledge to new tasks and situations. We suggest concrete challenges and promising routes towards these goals that can combine the strengths of recent neural network advances with more structured cognitive models.
Building a human-like integrative artificial cognitive system, that is, artificial general intelligence (AGI), is the holy grail of the field of artificial intelligence (AI). Furthermore, a computational model that enables an artificial system to achieve cognitive development would be an excellent reference for brain and cognitive science. This paper describes an approach to developing a cognitive architecture by integrating elemental cognitive modules so that the modules can be trained as a whole. The approach is based on two ideas: (1) brain-inspired AI, learning from human brain architecture to build human-level intelligence, and (2) a probabilistic generative model (PGM)-based cognitive system, developing a cognitive system for developmental robots by integrating PGMs. The development framework is called a whole-brain PGM (WB-PGM), and it differs fundamentally from existing cognitive architectures in that it can learn continuously through a system grounded in sensorimotor information. In this study, we describe the rationale of WB-PGM, the current status of PGM-based elemental cognitive modules, their relationship with the human brain, the approach to integrating the cognitive modules, and future challenges. Our findings can serve as a reference for brain studies. Because PGMs describe explicit informational relationships between variables, this description provides interpretable guidance from computational science to brain science. By providing such information, researchers in neuroscience can give feedback to researchers in AI and robotics on what the current models lack with respect to the brain. Furthermore, this can facilitate collaboration between researchers in the neuro-cognitive sciences and those in AI and robotics.
Curiosity for machine agents has been a focus of lively research activity. The study of human and animal curiosity, particularly specific curiosity, has unearthed several properties that would offer important benefits for machine learners, but that have not yet been well-explored in machine intelligence. In this work, we conduct a comprehensive, multidisciplinary survey of the field of animal and machine curiosity. As a principal contribution of this work, we use this survey as a foundation to introduce and define what we consider to be five of the most important properties of specific curiosity: 1) directedness towards inostensible referents, 2) cessation when satisfied, 3) voluntary exposure, 4) transience, and 5) coherent long-term learning. As a second main contribution of this work, we show how these properties may be implemented together in a proof-of-concept reinforcement learning agent: we demonstrate how the properties manifest in the behaviour of this agent in a simple non-episodic grid-world environment that includes curiosity-inducing locations and induced targets of curiosity. As we would hope, our example of a computational specific curiosity agent exhibits short-term directed behaviour while updating long-term preferences to adaptively seek out curiosity-inducing situations. This work, therefore, presents a landmark synthesis and translation of specific curiosity to the domain of machine learning and reinforcement learning and provides a novel view into how specific curiosity operates and in the future might be integrated into the behaviour of goal-seeking, decision-making computational agents in complex environments.
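Two of the five properties defined above, directedness towards particular referents and cessation when satisfied, lend themselves to a tiny hedged sketch. This is our own toy rendering, not the paper's proof-of-concept agent; the class and method names (`SpecificCuriosity`, `bonus`, `visit`) are illustrative assumptions.

```python
# Toy sketch of two properties of specific curiosity: a bonus signal that is
# directed at particular curiosity-inducing states, and that ceases once
# the corresponding state has been visited ("cessation when satisfied").

class SpecificCuriosity:
    def __init__(self, induced_targets):
        self.targets = set(induced_targets)   # current referents of curiosity

    def bonus(self, state):
        # Directedness: only induced targets yield an intrinsic bonus.
        return 1.0 if state in self.targets else 0.0

    def visit(self, state):
        # Transience / cessation: satisfying curiosity removes the target.
        self.targets.discard(state)

cur = SpecificCuriosity(induced_targets={(2, 3)})
before = cur.bonus((2, 3))   # curiosity is active
cur.visit((2, 3))
after = cur.bonus((2, 3))    # curiosity has ceased
```

In a full agent, such a bonus would be added to the environment reward so that short-term behaviour is pulled toward the induced target until it is resolved.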
The recent hype surrounding increasingly complex language processing models has fuelled optimism that machines are acquiring a human-like command of natural language. The field of natural language understanding in artificial intelligence claims to have made great strides in this area; however, the lack of conceptual clarity in how 'understanding' is used in this and other disciplines makes it difficult to discern how close we actually are. A comprehensive, interdisciplinary overview of current approaches and remaining challenges has yet to be carried out. Beyond linguistic knowledge, this requires considering our species-specific ability to categorize, memorize, label, and communicate our (sufficiently similar) embodied and situated experiences. Moreover, gauging the actual constraints requires a rigorous analysis of the technical capacities of current models as well as deeper philosophical reflection on theoretical possibilities and limitations. In this paper, I bring all of these perspectives (philosophical, cognitive-linguistic, and technical) together to unmask the challenges involved in reaching true (human-like) language understanding. By unpacking the theoretical assumptions inherent in current approaches, I hope to illustrate how far we actually are from achieving this goal, if indeed it is a goal at all.
The reinforcement learning paradigm is a popular way to address problems that have only limited environmental feedback, rather than correctly labeled examples, as is common in other machine learning contexts. While significant progress has been made to improve learning in a single task, the idea of transfer learning has only recently been applied to reinforcement learning tasks. The core idea of transfer is that experience gained in learning to perform one task can help improve learning performance in a related, but different, task. In this article we present a framework that classifies transfer learning methods in terms of their capabilities and goals, and then use it to survey the existing literature, as well as to suggest future directions for transfer learning work.
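The core idea of transfer described above can be sketched with a deliberately small example (our own, not a method from the survey): Q-values learned on a source task warm-start a related target task, so the target learner begins with informed rather than zero-initialized estimates. The chain environment and hyperparameters are illustrative assumptions.

```python
def train_chain(q, n_states=5, episodes=20, alpha=0.5, gamma=0.9):
    """Tabular Q-learning on a chain MDP: actions 0=left, 1=right,
    reward 1.0 only on reaching the rightmost state."""
    for _ in range(episodes):
        s = 0
        while s != n_states - 1:
            a = 1 if q[s][1] >= q[s][0] else 0     # greedy, deterministic
            s2 = min(s + 1, n_states - 1) if a == 1 else max(s - 1, 0)
            r = 1.0 if s2 == n_states - 1 else 0.0
            q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])
            s = s2
    return q

# Source task: learn a value table from scratch.
source_q = train_chain([[0.0, 0.0] for _ in range(5)])

# Target task (same dynamics here, purely for illustration): instead of
# starting from zeros, copy the source values as the initial estimates.
target_q = [row[:] for row in source_q]
greedy = [1 if row[1] >= row[0] else 0 for row in target_q]
```

Because the transferred table already encodes the "move right" structure, the target learner's initial greedy policy is optimal before any target-task experience, which is exactly the jumpstart benefit transfer methods aim for.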
Active inference is a mathematical framework that originated in computational neuroscience as a theory of how the brain implements action, perception, and learning. Recently, it has been shown to be a promising approach to state estimation and control problems under uncertainty, as well as a foundation for goal-driven behaviour in robotics and artificial agents in general. Here, we review the state of the art in the theory and implementations of active inference for state estimation, control, planning, and learning, describing current achievements with a particular focus on robotics. We showcase relevant experiments that illustrate its potential in terms of adaptation, generalization, and robustness. Furthermore, we connect this approach to other frameworks and discuss its expected benefits and challenges: a unified framework with functional biological plausibility using variational Bayesian inference.
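The variational quantity at the heart of active inference can be made concrete with a small numerical sketch (ours, not from the review): for a discrete hidden state, the free energy F = E_q[ln q(s) - ln p(o, s)] is minimized exactly when the variational belief q equals the Bayesian posterior, at which point F equals the surprise -ln p(o).

```python
import math

def free_energy(q, prior, likelihood, obs):
    """Variational free energy F = sum_s q(s) * (ln q(s) - ln p(obs, s))
    for a discrete hidden state s and a fixed observation obs."""
    f = 0.0
    for s, qs in enumerate(q):
        if qs > 0:
            joint = prior[s] * likelihood[s][obs]
            f += qs * (math.log(qs) - math.log(joint))
    return f

prior = [0.5, 0.5]                     # p(s)
likelihood = [[0.9, 0.1], [0.2, 0.8]]  # p(o | s)
obs = 0

# Exact Bayesian posterior p(s | o) for comparison.
evidence = sum(prior[s] * likelihood[s][obs] for s in range(2))
posterior = [prior[s] * likelihood[s][obs] / evidence for s in range(2)]

f_posterior = free_energy(posterior, prior, likelihood, obs)  # = -ln p(o)
f_prior = free_energy(prior, prior, likelihood, obs)          # suboptimal belief
```

Perception, in this framing, is the descent of q toward the posterior; action then changes the observations themselves so that the same bound stays low.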
Building autonomous machines that can explore open-ended environments, discover possible interactions, and autonomously build repertoires of skills is a general objective of artificial intelligence. Developmental approaches argue that this can only be achieved by autonomous and intrinsically motivated learning agents that can generate, select, and learn to solve their own problems. In recent years, we have seen a convergence of developmental approaches, in particular developmental robotics, with deep reinforcement learning (RL) methods, forming the new domain of developmental machine learning. Within this new domain, we review a set of methods in which deep RL algorithms are trained to solve the developmental robotics problem of autonomously acquiring open-ended repertoires of skills. Intrinsically motivated goal-conditioned RL algorithms train agents to learn to represent, generate, and pursue their own goals. Self-generating goals requires learning compact goal encodings as well as their associated goal-achievement functions, which raises new challenges compared to traditional RL algorithms designed to solve predefined sets of goals using external reward signals. This paper proposes a typology of these methods at the intersection of deep RL and developmental approaches, surveys recent approaches, and discusses future avenues.
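The three ingredients named above, a goal representation, a goal-achievement function, and self-generated goals in place of external rewards, can be sketched in a toy loop. This is our own illustration, not a specific surveyed algorithm; the function names and the integer-line environment are assumptions.

```python
# Toy sketch of intrinsically motivated goal-conditioned learning:
# the agent samples its own goals and judges success with an internal
# goal-achievement function; no external reward signal is used.
import random

def achieved(state, goal):
    """Goal-achievement function: purely internal success criterion."""
    return state == goal

def act_toward(state, goal):
    """A trivially competent low-level policy on the integer line."""
    return state + 1 if state < goal else state - 1

random.seed(0)
skills = set()          # repertoire of mastered goals
state = 0
for _ in range(5):                      # self-generated curriculum
    goal = random.randint(-3, 3)        # the agent samples its own goal
    for _ in range(10):
        if achieved(state, goal):
            skills.add(goal)
            break
        state = act_toward(state, goal)
```

The open-ended flavour comes from the outer loop: nothing outside the agent specifies which goals matter, so the repertoire `skills` is shaped entirely by the agent's own sampling.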
Recent applications of autonomous agents and robots, such as self-driving cars, scenario-based trainers, exploration robots, and service robots, have drawn attention to crucial trust-related challenges associated with the current generation of artificial intelligence (AI) systems. Despite their great successes, AI systems based on the connectionist deep learning neural network approach lack the ability to explain their decisions and actions to others. Without symbolic interpretation capabilities, they are black boxes, which renders their decisions and actions opaque and makes them hard to trust in safety-critical applications. The recent stance on the explainability of AI systems has produced several approaches to eXplainable Artificial Intelligence (XAI); however, most of these studies have focused on data-driven XAI systems applied in the computational sciences. Studies addressing the increasingly pervasive goal-driven agents and robots are still missing. This paper reviews approaches to explainable goal-driven intelligent agents and robots, focusing on techniques for explaining and communicating agents' perceptual functions (e.g., senses and vision) and cognitive reasoning (e.g., beliefs, desires, intentions, plans, and goals) with humans in the loop. The review highlights key strategies that emphasize transparency, understandability, and continual learning for explainability. Finally, the paper presents requirements for explainability and suggests a roadmap for realizing effective goal-driven explainable agents and robots.
A fascinating hypothesis is that human and animal intelligence could be explained by a few principles, rather than an encyclopedic list of heuristics. If that hypothesis is correct, we could more easily both understand our own intelligence and build intelligent machines. Just as in physics, the principles themselves would not suffice to predict the behaviour of complex systems like brains, and substantial computation might be needed to simulate human-like intelligence. This hypothesis suggests that studying the kinds of inductive biases that humans and animals exploit could help both clarify these principles and provide inspiration for AI research and neuroscience theories. Deep learning already exploits several key inductive biases, and this work considers a larger list, focusing on those concerned mostly with higher-level and sequential conscious processing. The objective of clarifying these particular principles is that they could potentially help us build AI systems that benefit from humans' abilities in terms of flexible out-of-distribution and systematic generalization, an area where a large gap currently exists between state-of-the-art machine learning and human intelligence.
Reasoning with declarative knowledge (RDK) and sequential decision-making (SDM) are two key research areas in artificial intelligence. RDK methods reason with declarative domain knowledge, including commonsense knowledge, that is either provided a priori or acquired over time, while SDM methods (probabilistic planning and reinforcement learning) seek to compute action policies that maximize the expected cumulative utility over a time horizon; both classes of methods reason in the presence of uncertainty. Despite the rich literature in these two areas, researchers have not fully explored their complementary strengths. In this paper, we survey algorithms that leverage RDK methods while making sequential decisions under uncertainty. We discuss significant developments, open problems, and directions for future work.
Although deep reinforcement learning has become a promising machine learning approach for sequential decision-making problems, it is still not mature enough for high-stakes domains such as autonomous driving or medical applications. In such contexts, a learned policy needs, for instance, to be interpretable, so that it can be inspected before any deployment (e.g., for safety and verifiability reasons). This survey provides an overview of the various approaches to achieving higher interpretability in reinforcement learning (RL). To that end, we distinguish interpretability (as a property of a model) from explainability (as a post-hoc operation involving the intervention of a proxy) and discuss them in the context of RL, with an emphasis on the former notion. In particular, we argue that interpretable RL may embrace different facets: interpretable inputs, interpretable (transition/reward) models, and interpretable decision-making. Based on this scheme, we summarize and analyze recent work related to interpretable RL, with an emphasis on papers published in the past 10 years. We also briefly discuss some related research areas and point to potentially promising research directions.
Imitation learning techniques aim to mimic human behavior in a given task. An agent (a learning machine) is trained to perform a task from demonstrations by learning a mapping between observations and actions. The idea of teaching by imitation has been around for many years, however, the field is gaining attention recently due to advances in computing and sensing as well as rising demand for intelligent applications. The paradigm of learning by imitation is gaining popularity because it facilitates teaching complex tasks with minimal expert knowledge of the tasks. Generic imitation learning methods could potentially reduce the problem of teaching a task to that of providing demonstrations; without the need for explicit programming or designing reward functions specific to the task. Modern sensors are able to collect and transmit high volumes of data rapidly, and processors with high computational power allow fast processing that maps the sensory data to actions in a timely manner. This opens the door for many potential AI applications that require real-time perception and reaction such as humanoid robots, self-driving vehicles, human computer interaction and computer games to name a few. However, specialized algorithms are needed to effectively and robustly learn models as learning by imitation poses its own set of challenges. In this paper, we survey imitation learning methods and present design options in different steps of the learning process. We introduce a background and motivation for the field as well as highlight challenges specific to the imitation problem. Methods for designing and evaluating imitation learning tasks are categorized and reviewed. Special attention is given to learning methods in robotics and games as these domains are the most popular in the literature and provide a wide array of problems and methodologies. 
We extensively discuss combining imitation learning approaches using different sources and methods, as well as incorporating other motion learning methods to enhance imitation. We also discuss the potential impact on industry, present major applications and highlight current and future research directions.
This theoretical article examines how to build human-like working memory and thought processes in a computer. There should be two working memory stores, one analogous to persistent firing in the association cortex and the other to synaptic potentiation in the cerebral cortex. These stores must be constantly updated with new representations generated by environmental stimulation or internal processing. They should be updated continuously and in an iterative fashion, meaning that some items from the set currently held in working memory should always be retained in the next state. Thus, the set of concepts in working memory evolves gradually over time. This makes each state a revised version of the previous state and causes successive states to overlap and blend with respect to the representations they contain. As new representations are added and old ones subtracted, some remain active for several seconds across these changes. This persistent activity, reminiscent of that used in artificial recurrent neural networks, spreads activation energy throughout the global workspace to search for the next associative update. The result is a chain of associatively linked intermediate states capable of advancing toward a solution or goal. Iterative updating is conceptualized here as an information-processing strategy, as a computational and neurophysiological determinant of the stream of thought, and as an algorithm for designing and programming artificial intelligence.
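The iterative-updating idea above can be rendered schematically (our own toy, not the article's model): each working-memory state drops its oldest item, retains the rest, and adds one new associate cued by the most recent item, so successive states overlap like frames in a stream of thought. The associative table and names are illustrative assumptions.

```python
# Schematic sketch of iterative updating of a working-memory store:
# each state keeps all but the oldest item and adds one associate.

ASSOCIATIONS = {          # toy associative long-term memory
    "rain": "umbrella", "umbrella": "handle", "handle": "door",
    "door": "key", "key": "lock",
}

def iterate(state):
    """Next state retains state[1:] and appends an associate of the
    most recent item, so consecutive states partially overlap."""
    cue = state[-1]                    # most recent item drives recall
    return state[1:] + [ASSOCIATIONS[cue]]

stream = [["cloud", "rain"]]
for _ in range(3):
    stream.append(iterate(stream[-1]))
# stream drifts item by item: each state shares content with its neighbour.
```

The overlap between consecutive states is what makes the sequence a chain of associatively linked intermediates rather than a series of unrelated snapshots.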
Humans and animals have the ability to continually acquire, fine-tune, and transfer knowledge and skills throughout their lifespan. This ability, referred to as lifelong learning, is mediated by a rich set of neurocognitive mechanisms that together contribute to the development and specialization of our sensorimotor skills as well as to long-term memory consolidation and retrieval. Consequently, lifelong learning capabilities are crucial for computational systems and autonomous agents interacting in the real world and processing continuous streams of information. However, lifelong learning remains a long-standing challenge for machine learning and neural network models since the continual acquisition of incrementally available information from non-stationary data distributions generally leads to catastrophic forgetting or interference. This limitation represents a major drawback for state-of-the-art deep neural network models that typically learn representations from stationary batches of training data, thus without accounting for situations in which information becomes incrementally available over time. In this review, we critically summarize the main challenges linked to lifelong learning for artificial learning systems and compare existing neural network approaches that alleviate, to different extents, catastrophic forgetting. Although significant advances have been made in domain-specific learning with neural networks, extensive research efforts are required for the development of robust lifelong learning on autonomous agents and robots. We discuss well-established and emerging research motivated by lifelong learning factors in biological systems such as structural plasticity, memory replay, curriculum and transfer learning, intrinsic motivation, and multisensory integration.
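Memory replay, one of the biologically motivated mechanisms listed above, can be demonstrated in miniature (our own illustration, not an approach from the review): a learner trained sequentially on task B alone forgets task A, while interleaving replayed task-A samples during task-B training retains much more of A. The one-parameter model and data are deliberately artificial.

```python
# Illustrative sketch of catastrophic forgetting and replay-based mitigation
# using a 1-D least-squares learner y = w * x trained by gradient steps.

def train(weights, data, lr=0.1):
    """One gradient pass over (x, y) pairs for the model y = w * x."""
    w = weights
    for x, y in data:
        w += lr * (y - w * x) * x
    return w

task_a = [(1.0, 2.0)] * 30     # task A teaches w ~ 2
task_b = [(1.0, -1.0)] * 30    # task B teaches w ~ -1

w = train(0.0, task_a)                        # learn A first
w_sequential = train(w, task_b)               # then B only: A is overwritten
w_replay = train(w, [s for pair in zip(task_b, task_a) for s in pair])

error_a_sequential = abs(2.0 - w_sequential * 1.0)   # large: forgot A
error_a_replay = abs(2.0 - w_replay * 1.0)           # smaller: replay helps
```

Replay does not eliminate interference in this toy (the single weight settles near a compromise), which mirrors the review's point that current mitigations alleviate forgetting only to different extents.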
There has been a recent resurgence in the area of explainable artificial intelligence as researchers and practitioners seek to make their algorithms more understandable. Much of this research is focused on explicitly explaining decisions or actions to a human observer, and it should not be controversial to say that looking at how humans explain to each other can serve as a useful starting point for explanation in artificial intelligence. However, it is fair to say that most work in explainable artificial intelligence relies only on the researchers' intuition of what constitutes a 'good' explanation. There exist vast and valuable bodies of research in philosophy, psychology, and cognitive science on how people define, generate, select, evaluate, and present explanations, which argue that people employ certain cognitive biases and social expectations in the explanation process. This paper argues that the field of explainable artificial intelligence should build on this existing research, and it reviews relevant papers from philosophy, cognitive psychology/science, and social psychology that study these topics. It draws out some important findings and discusses ways in which these can be infused into work on explainable artificial intelligence.