Vehicle routing problems and other combinatorial optimization problems have been approximately solved by reinforcement learning agents with policies based on encoder-decoder models with attention mechanisms. These techniques are of substantial interest but still cannot solve the complex routing problems that arise in realistic settings, which can involve many trucks and complex requirements. With the aim of making reinforcement learning a viable technique for supply chain optimization, we develop new extensions to encoder-decoder models for vehicle routing that allow for complex supply chains using classical computing today and quantum computing in the future. We make two major generalizations. First, our model allows for routing problems with multiple trucks. Second, we move away from the simple requirement of having a truck deliver items from nodes to one special depot node, and instead allow for a complex tensor demand structure. We show how our model, even if trained only for a small number of trucks, can be embedded into a large supply chain to yield viable solutions.
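A minimal sketch (PyTorch-style; the function name, the round-robin truck order, and the greedy selection are illustrative assumptions, not the paper's exact architecture) of how an attention decoder can be extended from one truck to several by keeping a state vector per truck and masking nodes that any truck has already served:

```python
import torch
import torch.nn.functional as F

def decode_multi_truck(node_emb, truck_state, visited, proj_q, proj_k):
    """One decoding step for a fleet of trucks (assumes more unvisited nodes than trucks).

    node_emb:    (n_nodes, d)   encoder output, one embedding per customer node
    truck_state: (n_trucks, d)  current context vector of each truck
    visited:     (n_nodes,)     bool mask of nodes already served by any truck
    proj_q/k:    nn.Linear(d, d) query / key projections
    """
    choices = []
    for t in range(truck_state.size(0)):          # simple round-robin over trucks
        q = proj_q(truck_state[t])                # (d,)
        k = proj_k(node_emb)                      # (n_nodes, d)
        scores = k @ q / (q.size(-1) ** 0.5)      # attention compatibilities
        scores = scores.masked_fill(visited, float("-inf"))
        probs = F.softmax(scores, dim=-1)         # policy over the remaining nodes
        node = torch.argmax(probs).item()         # greedy pick (sampled during RL training)
        visited[node] = True                      # no other truck may serve this node
        choices.append(node)
    return choices
```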
Problem instances of a size suitable for practical applications are not likely to be addressed during the noisy intermediate-scale quantum (NISQ) period with (almost) pure quantum algorithms. Hybrid classical-quantum algorithms, however, have the potential to achieve good performance on much larger problem instances. We investigate one such hybrid algorithm on a problem of substantial importance: vehicle routing for supply chain logistics with multiple trucks and a complex demand structure. We use reinforcement learning with neural networks that have embedded quantum circuits. In such neural networks, projecting high-dimensional feature vectors down to smaller vectors is necessary to accommodate restrictions on the number of qubits of NISQ hardware. We use a multi-head attention mechanism, however, in which such projections are natural and desirable even in classical machine learning. We consider data from the truck-routing logistics of a company in the automotive sector, apply our methodology by decomposing the problem into small teams of trucks, and find results comparable to human truck assignment.
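A minimal sketch (class and parameter names are hypothetical; the quantum layer is only a placeholder module standing in for a parametrized circuit on n_qubits wires) of the observation that the per-head projections of multi-head attention already shrink features to a width small enough to match a NISQ qubit budget:

```python
import torch
import torch.nn as nn

class HybridAttentionHead(nn.Module):
    """One attention head whose per-head width equals the qubit budget."""
    def __init__(self, d_model: int, n_qubits: int, quantum_layer: nn.Module):
        super().__init__()
        # The usual per-head projections already reduce d_model -> n_qubits,
        # which is exactly the dimensionality reduction NISQ hardware requires.
        self.q = nn.Linear(d_model, n_qubits)
        self.k = nn.Linear(d_model, n_qubits)
        self.v = nn.Linear(d_model, n_qubits)
        # Placeholder: in the hybrid model this would be a parametrized quantum
        # circuit (e.g. angle encoding + entangling layers) exposed as a module.
        self.quantum_layer = quantum_layer

    def forward(self, x):                       # x: (n_nodes, d_model)
        q, k, v = self.q(x), self.k(x), self.v(x)
        q = self.quantum_layer(q)               # quantum-processed low-dimensional features
        k = self.quantum_layer(k)
        attn = torch.softmax(q @ k.T / q.size(-1) ** 0.5, dim=-1)
        return attn @ v
```

With quantum_layer = nn.Identity() this reduces to ordinary scaled dot-product attention, which makes the classical/quantum substitution point explicit.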
Unsupervised representation learning aims at describing raw data efficiently to solve various downstream tasks. It has been approached with many techniques, such as manifold learning, diffusion maps, or more recently self-supervised learning. Those techniques are arguably all based on the underlying assumption that target functions, associated with future downstream tasks, have low variations in densely populated regions of the input space. Unveiling minimal variations as a guiding principle behind unsupervised representation learning paves the way to better practical guidelines for self-supervised learning algorithms.
Deep Neural Networks (DNNs) outshine alternative function approximators in many settings thanks to their modularity in composing any desired differentiable operator. The resulting parametrized functional is then tuned to solve the task at hand via simple gradient descent. This modularity comes at a cost: strict enforcement of constraints on DNNs, e.g. from a priori knowledge of the task or from desired physical properties, remains an open challenge. In this paper we propose the first provable affine constraint enforcement method for DNNs that requires minimal changes to a given DNN's forward pass, that is computationally friendly, and that leaves the optimization of the DNN's parameters unconstrained, i.e. standard gradient-based methods can be employed. Our method does not require any sampling and provably ensures that the DNN fulfills the affine constraint on a given region of the input space at any point during training and testing. We coin this method POLICE, standing for Provably Optimal LInear Constraint Enforcement.
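A minimal check (not the enforcement mechanism itself; the network, region, and tolerance below are illustrative) of the property that such a method guarantees, namely that the DNN acts as a single affine map on a chosen convex region, so it must commute with convex combinations of points in that region:

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
net = nn.Sequential(nn.Linear(2, 16), nn.ReLU(),
                    nn.Linear(16, 16), nn.ReLU(),
                    nn.Linear(16, 1))

# Vertices of the convex region of interest (here the unit square).
V = torch.tensor([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])

def is_affine_on_region(f, vertices, n_trials=100, tol=1e-5):
    """Necessary condition for f(x) = Ax + b on conv(vertices),
    tested at random convex combinations of the vertices."""
    for _ in range(n_trials):
        w = torch.rand(vertices.size(0))
        w = w / w.sum()                    # random convex weights
        x = w @ vertices                   # a point inside the region
        lhs = f(x)                         # f(sum_i w_i v_i)
        rhs = w @ f(vertices)              # sum_i w_i f(v_i)
        if (lhs - rhs).abs().max() > tol:
            return False
    return True

# False for a random ReLU net; True once the affine behavior is enforced on V.
print(is_affine_on_region(net, V))
```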
A critically important, ubiquitous, and yet poorly understood ingredient in modern deep networks (DNs) is batch normalization (BN), which centers and normalizes the feature maps. To date, only limited progress has been made in understanding why BN boosts DN learning and inference performance; most work has focused on showing that BN smooths the DN's loss landscape. In this paper, we study BN theoretically from the perspective of function approximation. We exploit the fact that today's state-of-the-art DNs are continuous piecewise affine (CPA) functions that predict from the training data via affine mappings defined over a partition of the input space (the so-called "linear regions"). We demonstrate that BN is an unsupervised learning technique that, independently of the DN's weights or gradient-based learning, adapts the geometry of the DN's spline partition to match the data. BN provides a "smart initialization" that boosts the performance of DN learning, because it adapts even a DN initialized with random weights to align its spline partition with the data. We also show that the variation of BN statistics between mini-batches introduces a dropout-like random perturbation to the partition boundaries, and hence to the decision boundary for classification problems. This per-mini-batch perturbation reduces overfitting and improves generalization by increasing the margin between the training samples and the decision boundary.
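A small numerical illustration (NumPy, toy data; a demonstration of the claimed effect, not the paper's formal spline argument): without any learning, centering and normalizing pre-activations moves each ReLU unit's boundary hyperplane into the data, so the units partition the samples instead of being saturated always on or always off.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(loc=5.0, scale=1.0, size=(1000, 32))    # data far from the origin
W = rng.normal(size=(32, 64))                           # random layer, no training
b = rng.normal(size=64)

pre = X @ W + b                                         # pre-activations w.x + b
bn = (pre - pre.mean(0)) / (pre.std(0) + 1e-5)          # BN: center + normalize per unit

frac_active = (pre > 0).mean(0)                         # fraction of samples on each unit's "+" side
frac_active_bn = (bn > 0).mean(0)

# Without BN most units fire on (almost) all or (almost) no samples: their boundaries
# miss the data. With BN each boundary cuts the batch roughly in half, i.e. the spline
# partition has adapted to the data without touching the weights.
print("no BN :", frac_active.min().round(2), "-", frac_active.max().round(2))
print("BN    :", frac_active_bn.min().round(2), "-", frac_active_bn.max().round(2))
```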
The causal effect of showing an ad to a potential customer versus not showing it, commonly called "incrementality", is the fundamental question of advertising effectiveness. In digital advertising, three major puzzle pieces are central to rigorously quantifying ad incrementality: ad buying/bidding/pricing, attribution, and experimentation. Building on the foundations of machine learning and causal econometrics, we propose a methodology that unifies these three concepts into a computationally viable model of bidding and attribution, covering randomization, training, cross-validation, scoring, and conversion attribution of ad effects. Implementing this approach is likely to secure a significant improvement in the return on ad spend.
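A back-of-the-envelope illustration (entirely synthetic numbers, not the paper's unified bidding/attribution model) of what "incrementality" measures: the lift in conversion rate caused by ad exposure, estimated from a randomized holdout rather than from naive last-click attribution.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
treated = rng.random(n) < 0.9               # 90% of eligible users see the ad, 10% randomized holdout
base_rate, ad_effect = 0.020, 0.004         # hypothetical organic conversion rate and true incremental effect
converted = rng.random(n) < (base_rate + ad_effect * treated)

# Incrementality estimate: difference in conversion rates between exposed and holdout users.
lift = converted[treated].mean() - converted[~treated].mean()
print(f"estimated incremental conversion rate: {lift:.4f}")   # close to the true 0.004
```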
Teams are central to human accomplishment. Over the past half-century, psychologists have identified five cross-culturally valid personality variables: Neuroticism, Extraversion, Openness, Conscientiousness, and Agreeableness. The first four show consistent relationships with team performance. Agreeableness (being harmonious, selfless, humble, and cooperative), however, has shown a non-significant and highly variable relationship with team performance. We resolve this inconsistency through computational modelling. An agent-based model (ABM) is used to predict the effects of personality traits on teamwork, and a genetic algorithm is then used to explore the limits of the ABM in order to discover which traits correlate with the best and worst team performance on a problem with different levels of uncertainty (noise). The new dependencies revealed by this exploration are corroborated by analyzing previously unobserved data from the largest team-performance dataset to date, comprising 3,698 individuals in 593 teams working on more than 5,000 group tasks with and without uncertainty, collected over a 10-year period. Our finding is that the dependency between team performance and Agreeableness is moderated by task uncertainty. Combining evolutionary computation with ABMs in this way offers a new methodology for the scientific study of teamwork, makes new predictions, and improves our understanding of human behavior. Our results confirm the potential usefulness of computational modelling for developing theory and shed light on the future of teams as work environments become increasingly fluid and uncertain.
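A compact sketch of the search loop described above (entirely toy: the fitness function below is a stand-in for the actual agent-based teamwork simulation, and the "ideal" trait profile is invented for illustration), showing a genetic algorithm evolving five-trait team profiles under different task-uncertainty (noise) levels:

```python
import numpy as np

rng = np.random.default_rng(2)

def team_performance(traits, noise):
    """Stand-in for the ABM: noisy score for a 5-trait team profile."""
    ideal = np.array([0.3, 0.7, 0.6, 0.8, 0.5])          # hypothetical optimum, not from the paper
    return -np.sum((traits - ideal) ** 2) + rng.normal(0.0, noise)

def evolve(noise, pop_size=40, gens=50, mut=0.05):
    pop = rng.random((pop_size, 5))                       # trait vectors in [0, 1]
    for _ in range(gens):
        fitness = np.array([team_performance(p, noise) for p in pop])
        parents = pop[np.argsort(fitness)[-pop_size // 2:]]        # keep the fitter half
        children = parents + rng.normal(0.0, mut, parents.shape)   # mutate
        pop = np.clip(np.vstack([parents, children]), 0.0, 1.0)
    return pop[np.argmax([team_performance(p, noise) for p in pop])]

for noise in (0.0, 0.5):                                  # explore low vs. high task uncertainty
    print(f"noise={noise}: best trait profile {np.round(evolve(noise), 2)}")
```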
In this paper, we study self-supervised learning methods, and VICReg in particular, to provide an information-theoretic understanding of their construction. As a first step, we demonstrate how information-theoretic quantities can be obtained for deterministic networks, offering a possible alternative to prior work that relies on stochastic models. This enables us to show how VICReg can be (re)derived from first principles and its assumptions about the data distribution. Furthermore, we empirically demonstrate the validity of our assumptions, confirming our new understanding of VICReg. Finally, we argue that the derivations and insights we obtain can be generalized to many other SSL methods, opening new avenues for the theoretical and practical understanding of SSL and transfer learning.
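For reference, a minimal PyTorch version of the VICReg objective that is being analyzed, with its invariance, variance, and covariance terms as in the original VICReg formulation (the loss weights below are illustrative defaults):

```python
import torch

def vicreg_loss(z1, z2, sim_w=25.0, var_w=25.0, cov_w=1.0, eps=1e-4):
    """z1, z2: (batch, dim) embeddings of two augmented views of the same inputs."""
    n, d = z1.shape

    sim = torch.nn.functional.mse_loss(z1, z2)                 # invariance term

    std1 = torch.sqrt(z1.var(dim=0) + eps)                     # variance term: keep each
    std2 = torch.sqrt(z2.var(dim=0) + eps)                     # dimension's std above 1
    var = torch.relu(1.0 - std1).mean() + torch.relu(1.0 - std2).mean()

    z1c, z2c = z1 - z1.mean(dim=0), z2 - z2.mean(dim=0)        # covariance term: decorrelate
    cov1 = (z1c.T @ z1c) / (n - 1)                             # the embedding dimensions
    cov2 = (z2c.T @ z2c) / (n - 1)
    off_diag = lambda c: c.flatten()[:-1].view(d - 1, d + 1)[:, 1:].flatten()
    cov = off_diag(cov1).pow(2).sum() / d + off_diag(cov2).pow(2).sum() / d

    return sim_w * sim + var_w * var + cov_w * cov
```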
AI-based recommender systems have been successfully applied in many domains (e.g., e-commerce, feed ranking). Medical experts believe that incorporating such methods into clinical decision support systems may help reduce medical team errors and improve patient outcomes during treatment processes (e.g., trauma resuscitation, surgical procedures). However, limited research has been conducted on developing automated, data-driven treatment decision support. We explored the feasibility of building a treatment recommender system that provides next-minute activity predictions. The system uses patient context (e.g., demographics and vital signs) and process context (e.g., activities) to continuously predict the activities that will be performed in the next minute. We evaluated our system on a pre-recorded trauma resuscitation dataset and conducted an ablation study on different model variants. The best model achieved an average F1 score of 0.67 over 61 activity types. We include feedback from the medical team and discuss future work.
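A minimal sketch of such a next-minute predictor (the GRU encoder, the fusion scheme, and all dimensions are assumptions for illustration, not the evaluated model): it fuses patient context with the recent activity sequence and outputs an independent probability for each of the 61 activity types.

```python
import torch
import torch.nn as nn

class NextMinutePredictor(nn.Module):
    def __init__(self, n_activities=61, patient_dim=16, hidden=64):
        super().__init__()
        self.activity_emb = nn.Embedding(n_activities, hidden)    # process context: past activities
        self.seq_encoder = nn.GRU(hidden, hidden, batch_first=True)
        self.patient_encoder = nn.Linear(patient_dim, hidden)     # patient context: demographics, vitals
        self.head = nn.Linear(2 * hidden, n_activities)           # multi-label output

    def forward(self, activity_ids, patient_feats):
        # activity_ids: (batch, seq_len) int64; patient_feats: (batch, patient_dim) float
        _, h = self.seq_encoder(self.activity_emb(activity_ids))
        fused = torch.cat([h[-1], self.patient_encoder(patient_feats)], dim=-1)
        return torch.sigmoid(self.head(fused))    # P(activity occurs in the next minute), per type

# Training would use a multi-label loss, e.g. nn.BCELoss(), against the activities
# actually observed in the following minute.
```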
近年来出现的一种意外技术包括使用自我监督学习(SSL)方法培训深网(DN),并在下游任务上使用此网络,但其最后几层已完全删除。这种通常的脱脂技巧实际上对于SSL方法显示竞争性表演至关重要。例如,在成像网分类上,可以以这种方式获得超过30个百分比。这有点令人烦恼,因为人们希望在训练期间SSL标准明确执行不变性的网络层(最后一层)应该是用于下游最佳概括性能的一种。但这似乎并非如此,这项研究阐明了原因。我们将这种技巧称为断头台正则化(GR),实际上是一种普遍适用的正则化形式,也已用于改善转移学习方案中的泛化性能。在这项工作中,通过理论和实验,我们将GR形式化并确定其在SSL方法中成功背后的根本原因。我们的研究表明,这种技巧对于SSL的性能至关重要,原因有两个:(i)确定训练过程中使用的正面对的数据启发不当,和/或(ii)次优选择了该训练的超参数。 SSL损失。
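A minimal sketch of the trick being formalized (the ResNet-50 backbone and the projector sizes are illustrative choices): train the backbone plus a projector with the SSL loss, then "guillotine" the projector and probe the earlier, backbone-level representation on the downstream task.

```python
import torch.nn as nn
from torchvision.models import resnet50

backbone = resnet50()
backbone.fc = nn.Identity()                      # backbone now outputs 2048-d features

projector = nn.Sequential(                       # layers at whose output the SSL loss is applied
    nn.Linear(2048, 8192), nn.ReLU(),
    nn.Linear(8192, 8192), nn.ReLU(),
    nn.Linear(8192, 8192),
)
ssl_model = nn.Sequential(backbone, projector)   # used only during SSL pre-training

# Guillotine Regularization: discard the projector and evaluate the backbone output,
# even though the SSL invariance was enforced at the projector's output.
downstream_head = nn.Linear(2048, 1000)          # e.g. a linear probe over ImageNet classes
probe = nn.Sequential(backbone, downstream_head)
```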