智能论文笔记

Probability flow solution of the Fokker-Planck equation

Nicholas M. Boffi , Eric Vanden-Eijnden

分类：机器学习

2022-06-09

在高维度中整合时间依赖性的fokker-planck方程的选择方法是通过集成相关的随机微分方程来生成溶液中的样品。在这里，我们介绍了基于整合描述概率流的普通微分方程的替代方案。与随机动力学不同，该方程式在以后的任何时候都会从初始密度将样品从溶液中的样品推到样品。该方法具有直接访问数量的优势，这些数量挑战仅估算仅给定解决方案的样品，例如概率电流，密度本身及其熵。概率流程方程取决于溶液对数的梯度（其“得分”），因此A-Priori未知也是如此。为了解决这种依赖性，我们用一个深神网络对分数进行建模，该网络通过根据瞬时概率电流传播一组粒子来实现，该网络可以在直接学习中学习。我们的方法是基于基于得分的生成建模的最新进展，其重要区别是训练程序是独立的，并且不需要来自目标密度的样本才能事先可用。为了证明该方法的有效性，我们考虑了相互作用粒子系统物理学的几个示例。我们发现该方法可以很好地缩放到高维系统，并准确匹配可用的分析解决方案和通过蒙特卡洛计算的力矩。

translated by 谷歌翻译

Adversarially Robust Stability Certificates can be Sample-Efficient

Thomas T. C. K. Zhang , Stephen Tu , Nicholas M. Boffi , Jean-Jacques E. Slotine , Nikolai Matni

分类：机器学习

2021-12-20

在安全关键系统的背景下将模拟缩小到现实差距的动机，我们考虑学习用于未知非线性动力系统的前列鲁棒稳定性证书。符合鲁棒控制的方法，我们考虑添加系统动态的添加剂和Lipschitz有界对手。我们表明，在基础系统上的增量稳定性的合适假设下，学习对抗稳定证明的统计成本相当于持续因素，以学习名义稳定证明。我们的结果铰接在新的导火颤机复杂性的新型界限，这可能是独立的兴趣。据我们所知，这是在对动态系统生成的数据进行对抗性学习时，对样本复杂性限制的第一次表征。我们还提供一种用于近似对抗训练算法的实用算法，并在阻尼摆锤示例上验证我们的发现。

translated by 谷歌翻译

Nonparametric adaptive control and prediction: theory and randomized algorithms

Nicholas M. Boffi , Stephen Tu , Jean-Jacques E. Slotine

分类：机器学习

2021-06-07

非线性自适应控制理论中的一个关键假设是系统的不确定性可以在一组已知基本函数的线性跨度中表示。虽然该假设导致有效的算法，但它将应用限制为非常特定的系统类别。我们介绍一种新的非参数自适应算法，其在参数上学习无限尺寸密度，以取消再现内核希尔伯特空间中的未知干扰。令人惊讶的是，所产生的控制输入承认，尽管其底层无限尺寸结构，但是尽管它的潜在无限尺寸结构实现了其实施的分析表达。虽然这种自适应输入具有丰富和富有敏感性的 - 例如，传统的线性参数化 - 其计算复杂性随时间线性增长，使其比其参数对应力相对较高。利用随机傅里叶特征的理论，我们提供了一种有效的随机实现，该实现恢复了经典参数方法的复杂性，同时可透明地保留非参数输入的表征性。特别地，我们的显式范围仅取决于系统的基础参数，允许我们所提出的算法有效地缩放到高维系统。作为该方法的说明，我们展示了随机近似算法学习由牛顿重力交互的十点批量组成的60维系统的预测模型的能力。

translated by 谷歌翻译

Denoising instrumented mouthguard measurements of head impact kinematics with a convolutional neural network

Xianghao Zhan , Yuzhe Liu , Nicholas J. Cecchi , Ashlyn A. Callan , Enora Le Flao , Olivier Gevaert , Michael M. Zeineh , Gerald A. Grant , David B. Camarillo

分类：机器学习

2022-12-19

Wearable sensors for measuring head kinematics can be noisy due to imperfect interfaces with the body. Mouthguards are used to measure head kinematics during impacts in traumatic brain injury (TBI) studies, but deviations from reference kinematics can still occur due to potential looseness. In this study, deep learning is used to compensate for the imperfect interface and improve measurement accuracy. A set of one-dimensional convolutional neural network (1D-CNN) models was developed to denoise mouthguard kinematics measurements along three spatial axes of linear acceleration and angular velocity. The denoised kinematics had significantly reduced errors compared to reference kinematics, and reduced errors in brain injury criteria and tissue strain and strain rate calculated via finite element modeling. The 1D-CNN models were also tested on an on-field dataset of college football impacts and a post-mortem human subject dataset, with similar denoising effects observed. The models can be used to improve detection of head impacts and TBI risk evaluation, and potentially extended to other sensors measuring kinematics.

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

Backward Reachability Analysis of Neural Feedback Loops: Techniques for Linear and Nonlinear Systems

Nicholas Rober , Sydney M. Katz , Chelsea Sidrane , Esen Yel , Michael Everett , Mykel J. Kochenderfer , Jonathan P. How

分类：机器学习 | 机器人

2022-09-28

安全至关重要的应用中神经网络（NNS）的患病率的增加，要求采用证明安全行为的方法。本文提出了一种向后的可及性方法，以安全验证神经反馈循环（NFLS），即具有NN控制策略的闭环系统。尽管最近的作品集中在远程达到NFL的安全认证策略上，但落后性能比远期策略具有优势，尤其是在避免障碍的情况下。先前的工作已经开发了用于无NNS系统的向后可及性分析的技术，但是由于其激活功能的非线性，反馈回路中的NNS存在唯一的问题，并且由于NN模型通常不可逆转。为了克服这些挑战，我们使用现有的NN分析工具有效地找到了对反射（BP）集的过度评估，即NN控制策略将将系统驱动到给定目标集的状态集。我们介绍了用于计算以馈电NN表示的控制策略的线性和非线性系统的BP过度评估的框架，并提出了计算有效的策略。我们使用各种模型的数值结果来展示所提出的算法，包括6D系统的安全认证。

translated by 谷歌翻译

Accelerated and interpretable oblique random survival forests

Byron C. Jaeger , Sawyer Welden , Kristin Lenoir , Jaime L. Speiser , Matthew Segar , Ambarish Pandey , Nicholas M. Pajewski

分类： (统计)机器学习

2022-08-01

倾斜的随机生存森林（RSF）是一种用于右翼结果的合奏监督学习方法。斜RSF中的树是使用预测变量的线性组合生长的，以创建分支，而在标准RSF中，使用单个预测变量。倾斜的RSF集合通常比标准RSF合奏具有更高的预测准确性。但是，评估预测变量的所有可能的线性组合会诱导大量的计算开销，从而将应用限制为大规模数据集。此外，几乎没有开发用于解释斜RSF合奏的方法，与基于轴的对应物相比，它们仍然难以解释。我们介绍了一种提高斜力RSF计算效率的方法，以及一种用斜RSF估计单个预测变量重要性的方法。我们减少计算开销的策略是利用牛顿 - 拉夫森评分（Newton-Raphson）评分，这是一种经典的优化技术，我们适用于决策树的每个非叶子节点内的COX部分似然函数。我们通过在线性组合中否定了用于给定预测指标的每个系数，然后计算出降低的降低准确性，从而估计单个预测因子对斜RSF的重要性。通常，在基准测试实验中，我们发现，与现有的斜RSF相比，与现有软件相比，我们对斜RSF的实现速度约为450倍，而较高的Brier得分则要高450倍。我们在模拟研究中发现，“否定重要性”比置换重要性，莎普利添加性解释和先前引入的技术更可靠地区分相关和无关的预测因子，以基于方差分析来衡量斜RSF的可变重要性。当前研究中引入的方法可在AORSF R软件包中获得。

translated by 谷歌翻译

Towards Highly Expressive Machine Learning Models of Non-Melanoma Skin Cancer

Simon M. Thomas , James G. Lefevre , Glenn Baxter , Nicholas A. Hamilton

分类：机器学习 | 人工智能 | 自然语言处理 | 计算机视觉

2022-07-09

病理学家拥有丰富的词汇，他们可以描述细胞形态的所有细微差别。在他们的世界中，图像和单词都有自然的配对。最近的进步表明，现在可以对机器学习模型进行培训，以学习高质量的图像功能并将其表示为离散信息。这使得自然语言（也是离散的语言）可以与成像旁边共同建模，从而描述了成像内容。在这里，我们介绍了将离散建模技术应用于非黑色素瘤皮肤癌的问题结构域，特别是eme骨内癌（IEC）的组织学图像。通过实施IEC图像的高分辨率（256x256）图像的VQ-GAN模型，我们训练了序列到序列变压器，以使用病理学家术语来生成自然语言描述。结合使用连续生成方法获得的交互式概念矢量的概念，我们展示了一个额外的解释性角度。结果是为高度表达的机器学习系统而努力的一种有希望的方法，不仅可以用作预测/分类工具，而且还意味着要进一步了解我们对疾病的科学理解。

translated by 谷歌翻译

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava , Abhinav Rastogi , Abhishek Rao , Abu Awal Md Shoeb , Abubakar Abid , Adam Fisch , Adam R. Brown , Adam Santoro , Aditya Gupta , Adrià Garriga-Alonso

分类：自然语言处理 | 人工智能 | 机器学习 | (统计)机器学习

2022-06-09

语言模型既展示了定量的改进，又展示了新的定性功能，随着规模的增加。尽管它们具有潜在的变革性影响，但这些新能力的特征却很差。为了为未来的研究提供信息，为破坏性的新模型能力做准备，并改善社会有害的效果，至关重要的是，我们必须了解目前和近乎未来的能力和语言模型的局限性。为了应对这一挑战，我们介绍了超越模仿游戏基准（Big Bench）。 Big Bench目前由204个任务组成，由132家机构的442位作者贡献。任务主题是多样的，从语言学，儿童发展，数学，常识性推理，生物学，物理学，社会偏见，软件开发等等。 Big-Bench专注于被认为超出当前语言模型的功能的任务。我们评估了OpenAI的GPT型号，Google内部密集变压器体系结构和大型基础上的开关稀疏变压器的行为，跨越了数百万到数十亿个参数。此外，一个人类专家评估者团队执行了所有任务，以提供强大的基准。研究结果包括：模型性能和校准都随规模改善，但绝对的术语（以及与评估者的性能相比）；在模型类中的性能非常相似，尽管带有稀疏性。逐渐和预测的任务通常涉及大量知识或记忆成分，而在临界规模上表现出“突破性”行为的任务通常涉及多个步骤或组成部分或脆性指标；社交偏见通常会随着含糊不清的环境而随着规模而增加，但这可以通过提示来改善。

translated by 谷歌翻译