智能论文笔记

Comparing Bayesian Models for Organ Contouring in Headand Neck Radiotherapy

Prerak Mody , Nicolas Chaves-de-Plaza , Klaus Hildebrandt , Rene van Egmond , Huib de Ridder , Marius Staring

分类：计算机视觉 | 机器学习

2021-11-01

无放射治疗器官轮廓的深度学习模型是临床用途，但目前，预测轮廓的自动化质量评估（QA）有很多工具。使用贝叶斯模型及其相关的不确定性，可以自动化检测不准确预测的过程。我们使用定量测量 - 预期的校准误差（ECE）和基于定性的测量区域的精确度（R-AVU）图来调查两个贝叶斯模型进行自动轮廓众所周知，模型应该具有低欧洲欧洲经委会被认为是值得信赖的。然而，在QA语境中，模型也应该在不准确的区域中具有高不确定性，并且在准确的区域中的不确定性低。此类行为可以直接对专家用户的视觉关注潜在地不准确的地区，导致QA过程中的加速。使用R-AVU图表，我们定性地比较了不同模型的行为准确和不准确的地区。使用三种型号在Miccai2015头和颈部分割挑战和DeepMindtcia CT数据集上进行实验：丢弃骰子，辍学-CE（交叉熵）和Flipout-Ce。定量结果表明，丢弃骰子具有最高的ECE，而辍学-CE和FLIPOUT-CE具有最低的ECE。为了更好地了解辍学-CE和Flipout-CE之间的差异，我们使用R-AVU图表，显示Flipout-CE在不准确的地区具有比Dropout-Ce更好的不确定性覆盖率。定量和定性度量的这种组合探讨了一种新方法，有助于选择哪种模型可以在临床环境中作为QA工具部署。

translated by 谷歌翻译

On a Uniform Causality Model for Industrial Automation

Maria Krantz , Alexander Windmann , Rene Heesch , Lukas Moddemann , Oliver Niggemann

分类：人工智能

2022-09-20

网络物理系统（CPS）的复杂性日益增加，使工业自动化具有挑战性。需要处理大量传感器记录的数据，以充分执行诸如故障的诊断之类的任务。解决这种复杂性的一种有希望的方法是因果关系的概念。但是，大多数有关因果关系的研究都集中在推断未知系统部分之间的因果关系。工程以根本不同的方式使用因果关系：复杂的系统是通过将组件与已知可控行为相结合的。由于CP是通过第二种方法构建的，因此大多数基于数据的因果模型不适合工业自动化。为了弥合这一差距，提出了针对工业自动化各种应用程序领域的统一因果模型，这将允许更好地沟通和跨学科的更好的数据使用。最终的模型在数学上描述了CPS的行为，并且由于对应用领域的独特要求评估了该模型，因此证明统一的因果关系模型可以作为在工业自动化中应用新方法的基础，该方法侧重于机器学习。

translated by 谷歌翻译

Interpretable by Design: Learning Predictors by Composing Interpretable Queries

Aditya Chattopadhyay , Stewart Slocum , Benjamin D. Haeffele , Rene Vidal , Donald Geman

分类：计算机视觉 | 机器学习

2022-07-03

对于使用高性能机器学习算法通常不透明的决策，人们越来越担心。用特定于领域的术语对推理过程的解释对于在医疗保健等风险敏感领域中采用至关重要。我们认为，机器学习算法应该可以通过设计来解释，并且表达这些解释的语言应与域和任务有关。因此，我们将模型的预测基于数据的用户定义和特定于任务的二进制函数，每个都对最终用户有明确的解释。然后，我们最大程度地减少了在任何给定输入上准确预测所需的预期查询数。由于解决方案通常是棘手的，因此在事先工作之后，我们根据信息增益顺序选择查询。但是，与以前的工作相反，我们不必假设查询在有条件地独立。取而代之的是，我们利用随机生成模型（VAE）和MCMC算法（未经调整的Langevin）来选择基于先前的查询 - 答案的输入的最有用的查询。这使得在线确定要解决预测歧义所需的任何深度的查询链。最后，关于视觉和NLP任务的实验证明了我们的方法的功效及其优越性比事后解释的优势。

translated by 谷歌翻译

Transferable End-to-end Room Layout Estimation via Implicit Encoding

Hao Zhao , Rene Ranftl , Yurong Chen , Hongbin Zha

分类：计算机视觉

2021-12-21

我们研究了从单个全景图像估算房间布局的问题。大多数前工程都有两个阶段：特征提取和参数模型配件。在这里，我们提出了一种端到端的方法，其直接从输入全景图像预测参数布局。它利用隐式编码过程将参数布局嵌入到潜像。然后学习从图像到此潜在空间的映射使端到端的房间布局估计成为可能。然而，尽管许多有趣的性质，但端到端的方法具有几个臭名昭着的缺点。广泛提出的批评是他们与数据集偏见令人困扰，并没有转移到陌生的域名。我们的研究回应了这种共同的信念。为此，我们建议使用语义边界预测映射作为中间域。它在四个基准（StructureD3D，Panocontext，S3DIS和Matterport3D）上带来了显着的性能提升，特别是在零拍摄传输设置中。代码，数据和模型将被释放。

translated by 谷歌翻译

Damage Estimation and Localization from Sparse Aerial Imagery

Rene Garcia Franceschini , Jeffrey Liu , Saurabh Amin

分类：计算机视觉 | 机器学习

2021-11-05

空中图像为应对飓风等自然灾害提供了重要的情境意识。它们非常适合提供损坏估算和本地化的信息（Del）;即，表征灾难后损坏的类型和空间程度。尽管最近进行了传感和无人空中系统技术的进步，但大部分灾后的空中图像仍然由手持式DSLR摄像机，从小，载人的固定翼飞机。但是，这些手持式摄像机缺乏IMU信息，并且通过运营商机会拍摄的图像。因此，来自此图像的DEL仍然是一个高度手动和耗时的过程。我们提出了一种方法来检测航空图像中的损坏，并在世界坐标中本地化，专注于检测和定位洪水。该方法是基于使用运动的结构通过投影转换将图像坐标与世界坐标联系起来，使用类激活映射来检测图像中损坏的程度，并将投射转换应用于本地化世界坐标损坏。我们评估了我们在2016年路易斯安那州洪水的事件后数据上的绩效，并发现我们的方法达到了88％的精确度。鉴于使用有限数据的这种高精度，我们认为这种方法目前是可行的，用于从手持空中图像进行灾难反应的快速和有效的德。

translated by 谷歌翻译

AI Governance for Businesses

Johannes Schneider , Rene Abraham , Christian Meske , Jan vom Brocke

分类：人工智能

2020-11-20

人工智能（AI）治理调节行使权威和控制AI的管理。它旨在通过有效利用数据并最大程度地减少与AI相关的成本和风险来利用AI。尽管AI治理和AI伦理等主题在理论，哲学，社会和监管层面上进行了详尽的讨论，但针对公司和公司的AI治理工作有限。这项工作将AI产品视为系统，在该系统中，通过机器学习（ML）模型（培训）数据传递关键功能。我们通过在AI和相关领域（例如ML）合成文献来得出一个概念框架。我们的框架将AI治理分解为数据的治理，（ML）模型和（AI）系统沿着四个维度。它与现有的IT和数据治理框架和实践有关。它可以由从业者和学者都采用。对于从业者来说，主要是研究论文的综合，但从业者的出版物和监管机构的出版物也为实施AI治理提供了宝贵的起点，而对于学者来说，该论文强调了许多AI治理领域，值得更多关注。

translated by 谷歌翻译

Temporal Convolutional Networks for Action Segmentation and Detection

Colin Lea , Michael D. Flynn , Rene Vidal , Austin Reiter , Gregory D. Hager

分类：

2016-11-16

The ability to identify and temporally segment finegrained human actions throughout a video is crucial for robotics, surveillance, education, and beyond. Typical approaches decouple this problem by first extracting local spatiotemporal features from video frames and then feeding them into a temporal classifier that captures high-level temporal patterns. We introduce a new class of temporal models, which we call Temporal Convolutional Networks (TCNs), that use a hierarchy of temporal convolutions to perform fine-grained action segmentation or detection. Our Encoder-Decoder TCN uses pooling and upsampling to efficiently capture long-range temporal patterns whereas our Dilated TCN uses dilated convolutions. We show that TCNs are capable of capturing action compositions, segment durations, and long-range dependencies, and are over a magnitude faster to train than competing LSTM-based Recurrent Neural Networks. We apply these models to three challenging fine-grained datasets and show large improvements over the state of the art.

translated by 谷歌翻译

On the causality-preservation capabilities of generative modelling

Yves-Cédric Bauwelinckx , Jan Dhaene , Tim Verdonck , Milan van den Heuvel

分类：机器学习

2023-01-03

Modeling lies at the core of both the financial and the insurance industry for a wide variety of tasks. The rise and development of machine learning and deep learning models have created many opportunities to improve our modeling toolbox. Breakthroughs in these fields often come with the requirement of large amounts of data. Such large datasets are often not publicly available in finance and insurance, mainly due to privacy and ethics concerns. This lack of data is currently one of the main hurdles in developing better models. One possible option to alleviating this issue is generative modeling. Generative models are capable of simulating fake but realistic-looking data, also referred to as synthetic data, that can be shared more freely. Generative Adversarial Networks (GANs) is such a model that increases our capacity to fit very high-dimensional distributions of data. While research on GANs is an active topic in fields like computer vision, they have found limited adoption within the human sciences, like economics and insurance. Reason for this is that in these fields, most questions are inherently about identification of causal effects, while to this day neural networks, which are at the center of the GAN framework, focus mostly on high-dimensional correlations. In this paper we study the causal preservation capabilities of GANs and whether the produced synthetic data can reliably be used to answer causal questions. This is done by performing causal analyses on the synthetic data, produced by a GAN, with increasingly more lenient assumptions. We consider the cross-sectional case, the time series case and the case with a complete structural model. It is shown that in the simple cross-sectional scenario where correlation equals causation the GAN preserves causality, but that challenges arise for more advanced analyses.

translated by 谷歌翻译

Meta-learning generalizable dynamics from trajectories

Qiaofeng Li , Tianyi Wang , Vwani Roychowdhury , M. Khalid Jawed

分类：机器学习

2023-01-03

We present the interpretable meta neural ordinary differential equation (iMODE) method to rapidly learn generalizable (i.e., not parameter-specific) dynamics from trajectories of multiple dynamical systems that vary in their physical parameters. The iMODE method learns meta-knowledge, the functional variations of the force field of dynamical system instances without knowing the physical parameters, by adopting a bi-level optimization framework: an outer level capturing the common force field form among studied dynamical system instances and an inner level adapting to individual system instances. A priori physical knowledge can be conveniently embedded in the neural network architecture as inductive bias, such as conservative force field and Euclidean symmetry. With the learned meta-knowledge, iMODE can model an unseen system within seconds, and inversely reveal knowledge on the physical parameters of a system, or as a Neural Gauge to "measure" the physical parameters of an unseen system with observed trajectories. We test the validity of the iMODE method on bistable, double pendulum, Van der Pol, Slinky, and reaction-diffusion systems.

translated by 谷歌翻译

Hierarchical Explanations for Video Action Recognition

Sadaf Gulshad , Teng Long , Nanne van Noord

分类：计算机视觉 | 人工智能 | 机器学习

2023-01-01

We propose Hierarchical ProtoPNet: an interpretable network that explains its reasoning process by considering the hierarchical relationship between classes. Different from previous methods that explain their reasoning process by dissecting the input image and finding the prototypical parts responsible for the classification, we propose to explain the reasoning process for video action classification by dissecting the input video frames on multiple levels of the class hierarchy. The explanations leverage the hierarchy to deal with uncertainty, akin to human reasoning: When we observe water and human activity, but no definitive action it can be recognized as the water sports parent class. Only after observing a person swimming can we definitively refine it to the swimming action. Experiments on ActivityNet and UCF-101 show performance improvements while providing multi-level explanations.

translated by 谷歌翻译