智能论文笔记

MPC with Learned Residual Dynamics with Application on Omnidirectional MAVs

Maximilian Brunner , Weixuan Zhang , Ahmad Roumie , Marco Tognon , Roland Siegwart

分类：机器人

2022-07-04

空中操纵的生长场通常依赖于完全致动的或全向微型航空车（OMAV），它们可以在与环境接触时施加任意力和扭矩。控制方法通常基于无模型方法，将高级扳手控制器与执行器分配分开。如有必要，在线骚扰观察员拒绝干扰。但是，虽然是一般，但这种方法通常会产生次优控制命令，并且不能纳入平台设计给出的约束。我们提出了两种基于模型的方法来控制OMAV，以实现轨迹跟踪的任务，同时拒绝干扰。第一个通过从实验数据中学到的模型来优化扳手命令并补偿模型错误。第二个功能优化了低级执行器命令，允许利用分配无空格并考虑执行器硬件给出的约束。在现实世界实验中显示和评估两种方法的疗效和实时可行性。

translated by 谷歌翻译

Learning Variable Impedance Control for Aerial Sliding on Uneven Heterogeneous Surfaces by Proprioceptive and Tactile Sensing

Weixuan Zhang , Lionel Ott , Marco Tognon , Roland Siegwart

分类：机器人 | 机器学习

2022-06-28

能够与环境进行物理相互作用的新型航空车的最新发展导致了新的应用，例如基于接触的检查。这些任务要求机器人系统将力与部分知名的环境交换，这可能包含不确定性，包括未知的空间变化摩擦特性和表面几何形状的不连续变化。找到对这些环境不确定性的强大控制策略仍然是一个公开挑战。本文提出了一种基于学习的自适应控制策略，用于航空滑动任务。特别是，基于当前控制信号，本体感受测量和触觉感应的策略，实时调整了标准阻抗控制器的收益。在学生教师学习设置中，该策略通过简化执行器动力进行了模拟培训。使用倾斜臂全向飞行器验证了所提出方法的现实性能。所提出的控制器结构结合了数据驱动和基于模型的控制方法，使我们的方法能够直接转移并不从模拟转移到真实平台。与微调状态的相互作用控制方法相比，我们达到了减少的跟踪误差和改善的干扰排斥反应。

translated by 谷歌翻译

Control of over-redundant cooperative manipulation via sampled communication

Enrica Rossi , Marco Tognon , Ruggero Carli , Antonio Franchi , Luca Schenato

分类：机器人

2021-12-02

在这项工作中，我们考虑了需要通过电缆或机器人臂操纵/运输物体的移动机器人的问题。我们考虑一种操纵机器人的数量冗余的场景，即，可以通过机器人的不同配置获得所需的对象配置。这项工作的目的是表明，可以使用通信来实现机器人中的协同局部反馈控制器，以改善扰动抑制并降低对象中的结构应力。特别地，我们考虑采样测量并通过无线传输测量的现实场景，并且采样周期与系统动态时间常数相当。我们首先提出了一种运动模型，该模型与高增益控制下的整体系统动态一致，然后我们为不同规范下的配置误差提供了足够的指数稳定性和单调减少。最后，我们在完整的动态系统上测试所提出的控制器，显示出局部通信的益处。

translated by 谷歌翻译

Coordinated Multi-Robot Trajectory Tracking over Sampled Communication

Enrica Rossi , Marco Tognon , Luca Ballotta , Ruggero Carli , Juan Cortés , Antonio Franchi , Luca Schenato

分类：机器人

2021-11-30

在本文中，我们为采样通信场景中的一类多机器人系统提出了一种反向运动控制器。目标是使一组机器人执行轨迹跟踪{以协调的方式}当通信的采样时间是不可忽略的，破坏标准控制设计的理论收敛保证。鉴于配置空间中可行的期望轨迹，所提出的控制器从采样时间瞬间从系统接收测量，并计算由低级控制器跟踪的机器人的速度引用。我们提出了一个共同设计的反馈加上馈电控制器，具有可提供的稳定性和误差会聚保证，并进一步表明所获得的控制器是可分散的实现的可供选择。我们使用现实模拟器（飞行起重机）的电缆悬挂负荷的协同空中操纵方案中的数值模拟来测试所提出的控制策略。最后，我们将建议的分散控制器与集中式方法进行比较，可通过智能启发式调整反馈增益，并表明它实现了可比性。

translated by 谷歌翻译

Modeling and Control of an Omnidirectional Micro Aerial Vehicle Equipped with a Soft Robotic Arm

Róbert Szász , Mike Allenspach , Minghao Han , Marco Tognon , Robert. K. Katzschmann

分类：机器人

2021-11-04

飞行操纵器是带有附着的刚性机器人的空中无人机，属于机器人的最新和最积极开发的研究领域。这些臂的刚性性质往往缺乏遵守，灵活性和运动平滑。这项工作建议使用柔软的机器人臂连接到全向微空中飞行器（OMAV），以利用臂的柔顺和灵活的行为，同时留下可操纵和动态的，感谢全向无人机作为浮座。随机臂的统一在这种组合平台的建模和控制中造成挑战;这些挑战是通过这项工作解决的。我们基于三个建模原理提出了飞行机械手的统一模型：分段恒定曲率（PCC）和增强刚体模型（ABBM）假设用于建模软连续式机器人和传统刚体机器人借用的浮动基础方法文学。为了演示该参数化的有效性和有用性，实现了一种基于分层模型的反馈控制器。在各种动态任务的模拟中验证并评估控制器，其中检查并验证了该平台的无缺陷运动，干扰恢复和轨迹跟踪能力。软飞行机械手平台可以在空中建筑，货物交付，人力援助，维护和仓库自动化中打开新的应用领域。

translated by 谷歌翻译

Improving Performance in Neural Networks by Dendrites-Activated Connections

Carlo Metta , Marco Fantozzi , Andrea Papini , Gianluca Amato , Matteo Bergamaschi , Silvia Giulia Galfrè , Alessandro Marchetti , Michelangelo Vegliò , Maurizio Parton , Francesco Morandin

分类：神经与进化计算 | 机器学习

2023-01-03

Computational units in artificial neural networks follow a simplified model of biological neurons. In the biological model, the output signal of a neuron runs down the axon, splits following the many branches at its end, and passes identically to all the downward neurons of the network. Each of the downward neurons will use their copy of this signal as one of many inputs dendrites, integrate them all and fire an output, if above some threshold. In the artificial neural network, this translates to the fact that the nonlinear filtering of the signal is performed in the upward neuron, meaning that in practice the same activation is shared between all the downward neurons that use that signal as their input. Dendrites thus play a passive role. We propose a slightly more complex model for the biological neuron, where dendrites play an active role: the activation in the output of the upward neuron becomes optional, and instead the signals going through each dendrite undergo independent nonlinear filterings, before the linear combination. We implement this new model into a ReLU computational unit and discuss its biological plausibility. We compare this new computational unit with the standard one and describe it from a geometrical point of view. We provide a Keras implementation of this unit into fully connected and convolutional layers and estimate their FLOPs and weights change. We then use these layers in ResNet architectures on CIFAR-10, CIFAR-100, Imagenette, and Imagewoof, obtaining performance improvements over standard ResNets up to 1.73%. Finally, we prove a universal representation theorem for continuous functions on compact sets and show that this new unit has more representational power than its standard counterpart.

translated by 谷歌翻译

Fairness Guaranteed and Auction-based x-haul and Cloud Resource Allocation in Multi-tenant O-RANs

Sourav Mondal , Marco Ruffini

分类：人工智能

2023-01-02

The open-radio access network (O-RAN) embraces cloudification and network function virtualization for base-band function processing by dis-aggregated radio units (RUs), distributed units (DUs), and centralized units (CUs). These enable the cloud-RAN vision in full, where multiple mobile network operators (MNOs) can install their proprietary or open RUs, but lease on-demand computational resources for DU-CU functions from commonly available open-clouds via open x-haul interfaces. In this paper, we propose and compare the performances of min-max fairness and Vickrey-Clarke-Groves (VCG) auction-based x-haul and DU-CU resource allocation mechanisms to create a multi-tenant O-RAN ecosystem that is sustainable for small, medium, and large MNOs. The min-max fair approach minimizes the maximum OPEX of RUs through cost-sharing proportional to their demands, whereas the VCG auction-based approach minimizes the total OPEX for all resources utilized while extracting truthful demands from RUs. We consider time-wavelength division multiplexed (TWDM) passive optical network (PON)-based x-haul interfaces where PON virtualization technique is used to flexibly provide optical connections among RUs and edge-clouds at macro-cell RU locations as well as open-clouds at the central office locations. Moreover, we design efficient heuristics that yield significantly better economic efficiency and network resource utilization than conventional greedy resource allocation algorithms and reinforcement learning-based algorithms.

translated by 谷歌翻译

A System-Level View on Out-of-Distribution Data in Robotics

Rohan Sinha , Apoorva Sharma , Somrita Banerjee , Thomas Lew , Rachel Luo , Spencer M. Richards , Yixiao Sun , Edward Schmerling , Marco Pavone

分类：机器人 | 机器学习

2022-12-28

When testing conditions differ from those represented in training data, so-called out-of-distribution (OOD) inputs can mar the reliability of black-box learned components in the modern robot autonomy stack. Therefore, coping with OOD data is an important challenge on the path towards trustworthy learning-enabled open-world autonomy. In this paper, we aim to demystify the topic of OOD data and its associated challenges in the context of data-driven robotic systems, drawing connections to emerging paradigms in the ML community that study the effect of OOD data on learned models in isolation. We argue that as roboticists, we should reason about the overall system-level competence of a robot as it performs tasks in OOD conditions. We highlight key research questions around this system-level view of OOD problems to guide future research toward safe and reliable learning-enabled autonomy.

translated by 谷歌翻译

Fundamental Limits of Two-layer Autoencoders, and Achieving Them with Gradient Methods

Alexander Shevchenko , Kevin Kögler , Hamed Hassani , Marco Mondelli

分类：机器学习 | (统计)机器学习

2022-12-27

Autoencoders are a popular model in many branches of machine learning and lossy data compression. However, their fundamental limits, the performance of gradient methods and the features learnt during optimization remain poorly understood, even in the two-layer setting. In fact, earlier work has considered either linear autoencoders or specific training regimes (leading to vanishing or diverging compression rates). Our paper addresses this gap by focusing on non-linear two-layer autoencoders trained in the challenging proportional regime in which the input dimension scales linearly with the size of the representation. Our results characterize the minimizers of the population risk, and show that such minimizers are achieved by gradient methods; their structure is also unveiled, thus leading to a concise description of the features obtained via training. For the special case of a sign activation function, our analysis establishes the fundamental limits for the lossy compression of Gaussian sources via (shallow) autoencoders. Finally, while the results are proved for Gaussian data, numerical simulations on standard datasets display the universality of the theoretical predictions.

translated by 谷歌翻译

Investigation of reinforcement learning for shape optimization of profile extrusion dies

Clemens Fricke , Daniel Wolff , Marco Kemmerling , Stefanie Elgeti

分类：机器学习

2022-12-23

Profile extrusion is a continuous production process for manufacturing plastic profiles from molten polymer. Especially interesting is the design of the die, through which the melt is pressed to attain the desired shape. However, due to an inhomogeneous velocity distribution at the die exit or residual stresses inside the extrudate, the final shape of the manufactured part often deviates from the desired one. To avoid these deviations, the shape of the die can be computationally optimized, which has already been investigated in the literature using classical optimization approaches. A new approach in the field of shape optimization is the utilization of Reinforcement Learning (RL) as a learning-based optimization algorithm. RL is based on trial-and-error interactions of an agent with an environment. For each action, the agent is rewarded and informed about the subsequent state of the environment. While not necessarily superior to classical, e.g., gradient-based or evolutionary, optimization algorithms for one single problem, RL techniques are expected to perform especially well when similar optimization tasks are repeated since the agent learns a more general strategy for generating optimal shapes instead of concentrating on just one single problem. In this work, we investigate this approach by applying it to two 2D test cases. The flow-channel geometry can be modified by the RL agent using so-called Free-Form Deformation, a method where the computational mesh is embedded into a transformation spline, which is then manipulated based on the control-point positions. In particular, we investigate the impact of utilizing different agents on the training progress and the potential of wall time saving by utilizing multiple environments during training.

translated by 谷歌翻译