智能论文笔记

Analyzing and Enhancing Closed-loop Stability in Reactive Simulation

Wei-Jer Chang , Yeping Hu , Chenran Li , Wei Zhan , Masayoshi Tomizuka

分类：机器人 | 人工智能 | 机器学习

2022-08-09

模拟在有效评估自动驾驶汽车方面发挥了重要作用。现有方法主要依赖于基于启发式的模拟，在该模拟中，交通参与者遵循某些无法产生复杂人类行为的人类编码的规则。因此，提出了反应性仿真概念，以通过利用现实世界数据来弥合模拟和现实世界交通情况之间的人类行为差距。但是，这些反应性模型可以在模拟几个步骤后轻松地产生不合理的行为，我们将模型视为失去其稳定性。据我们所知，没有任何工作明确讨论并分析了反应性仿真框架的稳定性。在本文中，我们旨在对反应性模拟进行彻底的稳定性分析，并提出一种增强稳定性的解决方案。具体而言，我们首先提出了一个新的反应模拟框架，在其中我们发现模拟状态序列的平滑度和一致性是稳定性的关键因素。然后，我们将运动学媒介物模型纳入框架中，以提高反应性模拟的闭环稳定性。此外，在本文中提出了一些新颖的指标，以更好地分析模拟性能。

translated by 谷歌翻译

Differentiable Integrated Motion Prediction and Planning with Learnable Cost Function for Autonomous Driving

Zhiyu Huang , Haochen Liu , Jingda Wu , Chen Lv

分类：机器人

2022-07-21

相应地预测周围交通参与者的未来状态，并计划安全，平稳且符合社会的轨迹对于自动驾驶汽车至关重要。当前的自主驾驶系统有两个主要问题：预测模块通常与计划模块解耦，并且计划的成本功能很难指定和调整。为了解决这些问题，我们提出了一个端到端的可区分框架，该框架集成了预测和计划模块，并能够从数据中学习成本函数。具体而言，我们采用可区分的非线性优化器作为运动计划者，该运动计划将神经网络给出的周围剂的预测轨迹作为输入，并优化了自动驾驶汽车的轨迹，从而使框架中的所有操作都可以在框架中具有可观的成本，包括成本功能权重。提出的框架经过大规模的现实驾驶数据集进行了训练，以模仿整个驾驶场景中的人类驾驶轨迹，并在开环和闭环界面中进行了验证。开环测试结果表明，所提出的方法的表现优于各种指标的基线方法，并提供以计划为中心的预测结果，从而使计划模块能够输出接近人类的轨迹。在闭环测试中，提出的方法表明能够处理复杂的城市驾驶场景和鲁棒性，以抵抗模仿学习方法所遭受的分配转移。重要的是，我们发现计划和预测模块的联合培训比在开环和闭环测试中使用单独的训练有素的预测模块进行计划要比计划更好。此外，消融研究表明，框架中的可学习组件对于确保计划稳定性和性能至关重要。

translated by 谷歌翻译

RITA: Boost Autonomous Driving Simulators with Realistic Interactive Traffic Flow

Zhengbang Zhu , Shenyu Zhang , Yuzheng Zhuang , Yuecheng Liu , Minghuan Liu , Liyuan Mao , Ziqing Gong , Weinan Zhang , Shixiong Kai , Qiang Gu

分类：人工智能 | 机器人

2022-11-07

High-quality traffic flow generation is the core module in building simulators for autonomous driving. However, the majority of available simulators are incapable of replicating traffic patterns that accurately reflect the various features of real-world data while also simulating human-like reactive responses to the tested autopilot driving strategies. Taking one step forward to addressing such a problem, we propose Realistic Interactive TrAffic flow (RITA) as an integrated component of existing driving simulators to provide high-quality traffic flow for the evaluation and optimization of the tested driving strategies. RITA is developed with fidelity, diversity, and controllability in consideration, and consists of two core modules called RITABackend and RITAKit. RITABackend is built to support vehicle-wise control and provide traffic generation models from real-world datasets, while RITAKit is developed with easy-to-use interfaces for controllable traffic generation via RITABackend. We demonstrate RITA's capacity to create diversified and high-fidelity traffic simulations in several highly interactive highway scenarios. The experimental findings demonstrate that our produced RITA traffic flows meet all three design goals, hence enhancing the completeness of driving strategy evaluation. Moreover, we showcase the possibility for further improvement of baseline strategies through online fine-tuning with RITA traffic flows.

translated by 谷歌翻译

GET-DIPP: Graph-Embedded Transformer for Differentiable Integrated Prediction and Planning

Jiawei Sun , Chengran Yuan , Shuo Sun , Zhiyang Liu , Terence Goh , Anthony Wong , Keng Peng Tee , Marcelo H. Ang Jr

分类：机器人

2022-11-11

Accurately predicting interactive road agents' future trajectories and planning a socially compliant and human-like trajectory accordingly are important for autonomous vehicles. In this paper, we propose a planning-centric prediction neural network, which takes surrounding agents' historical states and map context information as input, and outputs the joint multi-modal prediction trajectories for surrounding agents, as well as a sequence of control commands for the ego vehicle by imitation learning. An agent-agent interaction module along the time axis is proposed in our network architecture to better comprehend the relationship among all the other intelligent agents on the road. To incorporate the map's topological information, a Dynamic Graph Convolutional Neural Network (DGCNN) is employed to process the road network topology. Besides, the whole architecture can serve as a backbone for the Differentiable Integrated motion Prediction with Planning (DIPP) method by providing accurate prediction results and initial planning commands. Experiments are conducted on real-world datasets to demonstrate the improvements made by our proposed method in both planning and prediction accuracy compared to the previous state-of-the-art methods.

translated by 谷歌翻译

Interaction-Aware Trajectory Prediction and Planning for Autonomous Vehicles in Forced Merge Scenarios

Kaiwen Liu , Nan Li , H. Eric Tseng , Ilya Kolmanovsky , Anouck Girard

分类：机器人

2021-12-14

一般而言，融合是人类驱动因素和自治车辆的具有挑战性的任务，特别是在密集的交通中，因为合并的车辆通常需要与其他车辆互动以识别或创造间隙并安全合并。在本文中，我们考虑了强制合并方案的自主车辆控制问题。我们提出了一种新的游戏 - 理论控制器，称为领导者跟随者游戏控制器（LFGC），其中自主EGO车辆和其他具有先验不确定驾驶意图的车辆之间的相互作用被建模为部分可观察到的领导者 - 跟随游戏。 LFGC估计基于观察到的轨迹的其他车辆在线在线，然后预测其未来的轨迹，并计划使用模型预测控制（MPC）来同时实现概率保证安全性和合并目标的自我车辆自己的轨迹。为了验证LFGC的性能，我们在模拟和NGSIM数据中测试它，其中LFGC在合并中展示了97.5％的高成功率。

translated by 谷歌翻译

BITS: Bi-level Imitation for Traffic Simulation

Danfei Xu , Yuxiao Chen , Boris Ivanovic , Marco Pavone

分类：机器人 | 机器学习

2022-08-26

仿真是对机器人系统（例如自动驾驶汽车）进行扩展验证和验证的关键。尽管高保真物理和传感器模拟取得了进步，但在模拟道路使用者的现实行为方面仍然存在一个危险的差距。这是因为，与模拟物理和图形不同，设计人类行为的第一个原理模型通常是不可行的。在这项工作中，我们采用了一种数据驱动的方法，并提出了一种可以学会从现实世界驱动日志中产生流量行为的方法。该方法通过将交通仿真问题分解为高级意图推理和低级驾驶行为模仿，通过利用驾驶行为的双层层次结构来实现高样本效率和行为多样性。该方法还结合了一个计划模块，以获得稳定的长马行为。我们从经验上验证了我们的方法，即交通模拟（位）的双层模仿，并具有来自两个大规模驾驶数据集的场景，并表明位表明，在现实主义，多样性和长途稳定性方面可以达到平衡的交通模拟性能。我们还探索了评估行为现实主义的方法，并引入了一套评估指标以进行交通模拟。最后，作为我们的核心贡献的一部分，我们开发和开源一个软件工具，该工具将跨不同驱动数据集的数据格式统一，并将现有数据集将场景转换为交互式仿真环境。有关其他信息和视频，请参见https://sites.google.com/view/nvr-bits2022/home

translated by 谷歌翻译

HTML版本

Conditional Predictive Behavior Planning with Inverse Reinforcement Learning for Human-like Autonomous Driving

Zhiyu Huang , Haochen Liu , Jingda Wu , Chen Lv

分类：机器人

2022-12-17

Making safe and human-like decisions is an essential capability of autonomous driving systems and learning-based behavior planning is a promising pathway toward this objective. Distinguished from existing learning-based methods that directly output decisions, this work introduces a predictive behavior planning framework that learns to predict and evaluate from human driving data. Concretely, a behavior generation module first produces a diverse set of candidate behaviors in the form of trajectory proposals. Then the proposed conditional motion prediction network is employed to forecast other agents' future trajectories conditioned on each trajectory proposal. Given the candidate plans and associated prediction results, we learn a scoring module to evaluate the plans using maximum entropy inverse reinforcement learning (IRL). We conduct comprehensive experiments to validate the proposed framework on a large-scale real-world urban driving dataset. The results reveal that the conditional prediction model is able to forecast multiple possible future trajectories given a candidate behavior and the prediction results are reactive to different plans. Moreover, the IRL-based scoring module can properly evaluate the trajectory proposals and select close-to-human ones. The proposed framework outperforms other baseline methods in terms of similarity to human driving trajectories. Moreover, we find that the conditional prediction model can improve both prediction and planning performance compared to the non-conditional model, and learning the scoring module is critical to correctly evaluating the candidate plans to align with human drivers.

translated by 谷歌翻译

Beyond RMSE: Do machine-learned models of road user interaction produce human-like behavior?

Aravinda Ramakrishnan Srinivasan , Yi-Shin Lin , Morris Antonello , Anthony Knittel , Mohamed Hasan , Majd Hawasly , John Redford , Subramanian Ramamoorthy , Matteo Leonetti , Jac Billington

分类：机器学习

2022-06-22

自动驾驶汽车使用各种传感器和机器学习型号来预测周围道路使用者的行为。文献中的大多数机器学习模型都集中在定量误差指标上，例如均方根误差（RMSE），以学习和报告其模型的功能。对定量误差指标的关注倾向于忽略模型的更重要的行为方面，从而提出了这些模型是否真正预测类似人类行为的问题。因此，我们建议分析机器学习模型的输出，就像我们将在常规行为研究中分析人类数据一样。我们介绍定量指标，以证明在自然主义高速公路驾驶数据集中存在三种不同的行为现象：1）运动学依赖性谁通过合并点首次通过合并点2）巷道上的车道更改，可容纳坡道车辆3 ）车辆通过高速公路上的车辆变化，以避免铅车冲突。然后，我们使用相同的指标分析了三个机器学习模型的行为。即使模型的RMSE值有所不同，所有模型都捕获了运动学依赖性的合并行为，但在不同程度上挣扎着捕获更细微的典型礼貌车道变更和高速公路车道的变化行为。此外，车道变化期间的碰撞厌恶分析表明，模型努力捕获人类驾驶的物理方面：在车辆之间留下足够的差距。因此，我们的分析强调了简单的定量指标不足，并且在分析人类驾驶预测的机器学习模型时需要更广泛的行为观点。

translated by 谷歌翻译

B-GAP: Behavior-Rich Simulation and Navigation for Autonomous Driving

Angelos Mavrogiannis , Rohan Chandra , Dinesh Manocha

分类：机器人

2020-11-07

我们解决了由具有不同驱动程序行为的道路代理人填充的密集模拟交通环境中的自我车辆导航问题。由于其异构行为引起的代理人的不可预测性，这种环境中的导航是挑战。我们提出了一种新的仿真技术，包括丰富现有的交通模拟器，其具有与不同程度的侵略性程度相对应的行为丰富的轨迹。我们在驾驶员行为建模算法的帮助下生成这些轨迹。然后，我们使用丰富的模拟器培训深度加强学习（DRL）策略，包括一组高级车辆控制命令，并在测试时间使用此策略来执行密集流量的本地导航。我们的政策隐含地模拟了交通代理商之间的交互，并计算了自助式驾驶员机动，例如超速，超速，编织和突然道路变化的激进驾驶员演习的安全轨迹。我们增强的行为丰富的模拟器可用于生成由对应于不同驱动程序行为和流量密度的轨迹组成的数据集，我们的行为的导航方案可以与最先进的导航算法相结合。

translated by 谷歌翻译

Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior

Davis Rempe , Jonah Philion , Leonidas J. Guibas , Sanja Fidler , Or Litany

分类：计算机视觉 | 机器学习 | 机器人

2021-12-09

自治车辆的评估和改善规划需要可扩展的长尾交通方案。有用的是，这些情景必须是现实的和挑战性的，但不能安全地开车。在这项工作中，我们介绍努力，一种自动生成具有挑战性的场景的方法，导致给定的计划者产生不良行为，如冲突。为了维护情景合理性，关键的想法是利用基于图形的条件VAE的形式利用学习的交通运动模型。方案生成在该流量模型的潜在空间中制定了优化，通过扰乱初始的真实世界的场景来产生与给定计划者碰撞的轨迹。随后的优化用于找到“解决方案”的场景，确保改进给定的计划者是有用的。进一步的分析基于碰撞类型的群集生成的场景。我们攻击两名策划者并展示争取在这两种情况下成功地产生了现实，具有挑战性的情景。我们另外“关闭循环”并使用这些方案优化基于规则的策划器的超参数。

translated by 谷歌翻译

Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts

Stefano Pini , Christian S. Perone , Aayush Ahuja , Ana Sofia Rufino Ferreira , Moritz Niendorf , Sergey Zagoruyko

分类：机器人 | 机器学习

2022-11-03

The goal of autonomous vehicles is to navigate public roads safely and comfortably. To enforce safety, traditional planning approaches rely on handcrafted rules to generate trajectories. Machine learning-based systems, on the other hand, scale with data and are able to learn more complex behaviors. However, they often ignore that agents and self-driving vehicle trajectory distributions can be leveraged to improve safety. In this paper, we propose modeling a distribution over multiple future trajectories for both the self-driving vehicle and other road agents, using a unified neural network architecture for prediction and planning. During inference, we select the planning trajectory that minimizes a cost taking into account safety and the predicted probabilities. Our approach does not depend on any rule-based planners for trajectory generation or optimization, improves with more training data and is simple to implement. We extensively evaluate our method through a realistic simulator and show that the predicted trajectory distribution corresponds to different driving profiles. We also successfully deploy it on a self-driving vehicle on urban public roads, confirming that it drives safely without compromising comfort. The code for training and testing our model on a public prediction dataset and the video of the road test are available at https://woven.mobi/safepathnet

translated by 谷歌翻译

MixNet: Structured Deep Neural Motion Prediction for Autonomous Racing

Phillip Karle , Ferenc Török , Maximilian Geisslinger , Markus Lienkamp

分类：机器人

2022-08-03

可靠地预测围绕自动赛车的参赛者车辆的动议对于有效和表现计划至关重要。尽管高度表现力，但深度神经网络是黑盒模型，使其在安全至关重要的应用（例如自动驾驶）中具有挑战性。在本文中，我们介绍了一种结构化的方式，以预测具有深神网络的对立赛车的运动。最终可能的输出轨迹集受到限制。因此，可以给出有关预测的质量保证。我们通过将模型与基于LSTM的编码器架构一起评估模型来报告该模型的性能，这些架构是从高保真硬件中获取的数据中获得的。拟议的方法的表现优于预测准确性的基线，但仍能履行质量保证。因此，该模型的强大现实应用已被证明。介绍的模型被部署在慕尼黑技术大学的Indy Automous Challenge 2021中。本研究中使用的代码可作为开放源软件提供，网址为www.github.com/tumftm/mixnet。

translated by 谷歌翻译

Driving in Dense Traffic with Model-Free Reinforcement Learning

Dhruv Mauria Saxena , Sangjae Bae , Alireza Nakhaei , Kikuo Fujimura , Maxim Likhachev

分类：机器人 | 人工智能 | 机器学习

2019-09-15

Traditional planning and control methods could fail to find a feasible trajectory for an autonomous vehicle to execute amongst dense traffic on roads. This is because the obstacle-free volume in spacetime is very small in these scenarios for the vehicle to drive through. However, that does not mean the task is infeasible since human drivers are known to be able to drive amongst dense traffic by leveraging the cooperativeness of other drivers to open a gap. The traditional methods fail to take into account the fact that the actions taken by an agent affect the behaviour of other vehicles on the road. In this work, we rely on the ability of deep reinforcement learning to implicitly model such interactions and learn a continuous control policy over the action space of an autonomous vehicle. The application we consider requires our agent to negotiate and open a gap in the road in order to successfully merge or change lanes. Our policy learns to repeatedly probe into the target road lane while trying to find a safe spot to move in to. We compare against two model-predictive control-based algorithms and show that our policy outperforms them in simulation.

translated by 谷歌翻译

AdvDO: Realistic Adversarial Attacks for Trajectory Prediction

Yulong Cao , Chaowei Xiao , Anima Anandkumar , Danfei Xu , Marco Pavone

分类：机器学习 | 人工智能

2022-09-19

轨迹预测对于自动驾驶汽车（AV）是必不可少的，以计划正确且安全的驾驶行为。尽管许多先前的作品旨在达到更高的预测准确性，但很少有人研究其方法的对抗性鲁棒性。为了弥合这一差距，我们建议研究数据驱动的轨迹预测系统的对抗性鲁棒性。我们设计了一个基于优化的对抗攻击框架，该框架利用精心设计的可区分动态模型来生成逼真的对抗轨迹。从经验上讲，我们基于最先进的预测模型的对抗性鲁棒性，并表明我们的攻击使通用指标和计划感知指标的预测错误增加了50％以上和37％。我们还表明，我们的攻击可以导致AV在模拟中驶离道路或碰撞到其他车辆中。最后，我们演示了如何使用对抗训练计划来减轻对抗性攻击。

translated by 谷歌翻译

An Intelligent Self-driving Truck System For Highway Transportation

Dawei Wang , Lingping Gao , Ziquan Lan , Wei Li , Jiaping Ren , Jiahui Zhang , Peng Zhang , Pei Zhou , Shengao Wang , Jia Pan

分类：机器人 | 人工智能

2021-12-31

最近，自主驾驶社会上有许多进展，吸引了学术界和工业的很多关注。然而，现有的作品主要专注于汽车，自动驾驶卡车算法和模型仍然需要额外的开发。在本文中，我们介绍了智能自动驾驶卡车系统。我们所呈现的系统由三个主要组成部分组成，1）一个现实的交通仿真模块，用于在测试场景中产生现实的交通流量，2）设计和评估了在现实世界部署中模仿实际卡车响应的高保真卡车模型，3 ）具有基于学习的决策算法和多模轨迹策划仪的智能计划模块，考虑到卡车的约束，道路斜率变化和周围的交通流量。我们为每个组分单独提供定量评估，以证明每个部件的保真度和性能。我们还将我们的建议系统部署在真正的卡车上，并进行真实的世界实验，表明我们的系统能力缓解了SIM-TO-REAL差距。我们的代码可以在https://github.com/inceptioresearch/iits提供

translated by 谷歌翻译

Efficient Game-Theoretic Planning with Prediction Heuristic for Socially-Compliant Autonomous Driving

Chenran Li , Tu Trinh , Letian Wang , Changliu Liu , Masayoshi Tomizuka , Wei Zhan

分类：机器人

2022-07-08

在与其他代理商的社交互动下进行计划是自动驾驶的重要问题。随着自动驾驶汽车在相互作用中的作用会影响，并且也受到其他试剂的影响，因此自动驾驶汽车需要有效地推断其他试剂的反应。大多数现有方法将问题提出为广泛的NASH平衡问题，该问题通过基于优化的方法解决。但是，他们要求过多的计算资源，并且由于非凸度而容易落入本地最低限度。蒙特卡洛树搜索（MCTS）成功解决了游戏理论问题中的此类问题。但是，随着交互游戏树的成倍增长，一般的MCT仍然需要大量迭代才能达到Optima。在本文中，我们通过将预测算法作为启发式算法纳入了基于一般MCT的高效游戏理论轨迹计划算法。最重要的是，符合社会的奖励和贝叶斯推理算法旨在产生多样化的驾驶行为并确定其他驾驶员的驾驶偏好。结果证明了在高度交互式场景中包含自然主义驾驶行为的数据集的提议框架的有效性。

translated by 谷歌翻译

Vehicle Type Specific Waypoint Generation

Yunpeng Liu , Jonathan Wilder Lavington , Adam Scibior , Frank Wood

分类：人工智能

2022-08-09

我们开发了一种通用机制，用于从概率的驾驶行为基础模型中生成车辆型特定路线序列。许多基础行为模型都经过了不包括车辆信息的数据培训，这些数据限制了其在下游应用程序（例如计划）中的实用性。我们的新方法有条件地将这种行为预测模型专门为媒介物类型，通过利用用于生产特定车辆控制器的增强学习算法的副产品。我们展示了如何使用通用的概率行为模型组成车辆特定的价值函数估计，以生成车辆型特定的路线序列，而这些序列序列更可能在物理上是可行的，而不是其车辆敏捷的序列。

translated by 谷歌翻译

Safety-driven Interactive Planning for Neural Network-based Lane Changing

Xiangguo Liu , Ruochen Jiao , Bowen Zheng , Dave Liang , Qi Zhu

分类：机器人

2022-01-22

基于神经网络的驾驶规划师在改善自动驾驶的任务绩效方面表现出了巨大的承诺。但是，确保具有基于神经网络的组件的系统的安全性，尤其是在密集且高度交互式的交通环境中，这是至关重要的，但又具有挑战性。在这项工作中，我们为基于神经网络的车道更改提出了一个安全驱动的互动计划框架。为了防止过度保守计划，我们确定周围车辆的驾驶行为并评估其侵略性，然后以互动方式相应地适应了计划的轨迹。如果在预测的最坏情况下，即使存在安全的逃避轨迹，则自我车辆可以继续改变车道；否则，它可以停留在当前的横向位置附近或返回原始车道。我们通过广泛而全面的实验环境以及在自动驾驶汽车公司收集的现实情况下进行了广泛的模拟，定量证明了计划者设计的有效性及其优于基线方法的优势。

translated by 谷歌翻译

A human factors approach to validating driver models for interaction-aware automated vehicles

Olger Siebinga , Arkady Zgonnikov , David Abbink

分类：机器人

2021-09-27

自动驾驶汽车的一个主要挑战是安全，平稳地与其他交通参与者进行互动。处理此类交通交互的一种有希望的方法是为自动驾驶汽车配备与感知的控制器（IACS）。这些控制器预测，周围人类驾驶员将如何根据驾驶员模型对自动驾驶汽车的行为做出响应。但是，很少验证IACS中使用的驱动程序模型的预测有效性，这可能会限制IACS在简单的模拟环境之外的交互功能。在本文中，我们认为，除了评估IAC的互动能力外，还应在自然的人类驾驶行为上验证其潜在的驱动器模型。我们为此验证提出了一个工作流程，其中包括基于方案的数据提取和基于人为因素文献的两阶段（战术/操作）评估程序。我们在一项案例研究中证明了该工作流程，该案例研究对现有IAC复制的基于反向的基于学习的驱动程序模型。该模型仅在40％的预测中显示出正确的战术行为。该模型的操作行为与观察到的人类行为不一致。案例研究表明，有原则的评估工作流程是有用和需要的。我们认为，我们的工作流将支持为将来的自动化车辆开发适当的驾驶员模型。

translated by 谷歌翻译

Learning Based High-Level Decision Making for Abortable Overtaking in Autonomous Vehicles

Ehsan Malayjerdi , Gokhan Alcan , Eshagh Kargar , Hatem Darweesh , Raivo Sell , Ville Kyrki Senior Member

分类：机器人

2022-07-28

自动驾驶汽车是一项不断发展的技术，旨在通过自动操作从车道变更到超车来提高安全性，可访问性，效率和便利性。超车是自动驾驶汽车最具挑战性的操作之一，当前的自动超车技术仅限于简单情况。本文研究了如何通过允许动作流产来提高自主超车的安全性。我们提出了一个基于深层Q网络的决策过程，以确定是否以及何时需要中止超车的操作。拟议的算法在与交通情况不同的模拟中进行了经验评估，这表明所提出的方法可以改善超车手动过程中的安全性。此外，使用自动班车Iseauto在现实世界实验中证明了该方法。

translated by 谷歌翻译