Smart Paper Notes

Conjunction Data Messages behave as a Poisson Process

Francisco Caldas , Claudia Soares , Cláudia Nunes , Marta Guimarães , Mariana Filipe , Rodrigo Ventura

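Category: Machine Learning

2021-05-14

Space debris is a major problem in space exploration. International bodies continuously monitor a large database of orbiting objects and issue warnings in the form of conjunction data messages. An important question for satellite operators is estimating when new information will arrive, so that they can react in time while keeping satellite maneuvers to a minimum. We propose a statistical learning model of the message arrival process that lets us answer two important questions: (1) Will there be any new message in the next specified time interval? (2) When, and with what uncertainty, will the next message arrive? On a test set of 50k close-encounter events, the average prediction error of our Bayesian Poisson process model for question (2) is smaller than that of the baseline by more than 4 hours.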
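
The two questions in this abstract can be illustrated with a textbook conjugate model. The sketch below is not the authors' model: it assumes a homogeneous Poisson process whose rate has a Gamma prior, and all function names and numbers are hypothetical. Under that assumption, question (1) has a closed-form posterior predictive answer and question (2) reduces to the posterior mean of the waiting time.

```python
def gamma_poisson_posterior(prior_shape, prior_rate, n_events, observed_time):
    """Posterior Gamma(shape, rate) over the arrival rate after seeing
    n_events messages in observed_time (conjugate Gamma-Poisson update)."""
    return prior_shape + n_events, prior_rate + observed_time


def prob_any_arrival(shape, rate, horizon):
    """Posterior predictive probability of at least one arrival within
    `horizon`: 1 - E[exp(-lambda * horizon)] = 1 - (rate / (rate + horizon)) ** shape."""
    return 1.0 - (rate / (rate + horizon)) ** shape


def expected_wait(shape, rate):
    """Posterior mean of the waiting time 1/lambda; finite only when shape > 1."""
    if shape <= 1.0:
        raise ValueError("expected wait is infinite unless shape > 1")
    return rate / (shape - 1.0)


# Hypothetical numbers: Gamma(1, 1) prior, 4 messages seen over 9 time units.
shape, rate = gamma_poisson_posterior(1.0, 1.0, 4, 9.0)
print(prob_any_arrival(shape, rate, horizon=10.0))
print(expected_wait(shape, rate))
```

A constant rate is the simplest possible choice; a time-varying rate would need a different likelihood.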


Bayesian Neural Hawkes Process for Event Uncertainty Prediction

Manisha Dubey , Ragja Palakkadavath , P. K. Srijith

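Category: Machine Learning

2021-12-29

Many applications involve sequences of event data together with the times at which the events occur. Models for predicting the time of occurrence play a significant role in diverse applications such as social networks, financial transactions, healthcare, and human mobility. Recent works have introduced neural-network-based point processes for modeling event times and have shown them to provide state-of-the-art performance in predicting event times. However, neural networks are poor at quantifying predictive uncertainty and tend to produce overconfident predictions during extrapolation. Proper uncertainty quantification is crucial for many practical applications. We therefore propose a novel point process model, the Bayesian Neural Hawkes process, which leverages the uncertainty modeling capability of Bayesian models and the generalization capability of neural networks. The model is capable of predicting epistemic uncertainty over the event occurrence time, and its effectiveness is demonstrated on simulated and real-world datasets.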


Spatiotemporal Clustering with Neyman-Scott Processes via Connections to Bayesian Nonparametric Mixture Models

Yixin Wang , Anthony Degleris , Alex H. Williams , Scott W. Linderman

Categories: Machine Learning (Statistics) | Machine Learning

2022-01-13

Neyman-Scott processes (NSPs) are point process models that generate clusters of points in time or space. They are natural models for a wide range of phenomena, ranging from neural spike trains to document streams. The clustering property is achieved via a doubly stochastic formulation: first, a set of latent events is drawn from a Poisson process; then, each latent event generates a set of observed data points according to another Poisson process. This construction is similar to Bayesian nonparametric mixture models like the Dirichlet process mixture model (DPMM) in that the number of latent events (i.e. clusters) is a random variable, but the point process formulation makes the NSP especially well suited to modeling spatiotemporal data. While many specialized algorithms have been developed for DPMMs, comparatively fewer works have focused on inference in NSPs. Here, we present novel connections between NSPs and DPMMs, with the key link being a third class of Bayesian mixture models called mixture of finite mixture models (MFMMs). Leveraging this connection, we adapt the standard collapsed Gibbs sampling algorithm for DPMMs to enable scalable Bayesian inference on NSP models. We demonstrate the potential of Neyman-Scott processes on a variety of applications including sequence detection in neural spike trains and event detection in document streams.
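
The doubly stochastic construction described in this abstract can be sketched as a short simulation. This is a minimal one-dimensional illustration, not the authors' code: the Gaussian spread of observed points around each latent event, the function names, and the parameter values are all assumptions made for the example.

```python
import random


def sample_poisson(rng, mean):
    """Draw a Poisson(mean) count by counting unit-rate exponential
    inter-arrival times that fit inside the interval [0, mean]."""
    count = 0
    t = rng.expovariate(1.0)
    while t < mean:
        count += 1
        t += rng.expovariate(1.0)
    return count


def simulate_neyman_scott(duration, latent_rate, mean_children, spread, seed=0):
    """Simulate a 1-D Neyman-Scott process on [0, duration].

    Step 1: latent events come from a homogeneous Poisson process.
    Step 2: each latent event emits a Poisson number of observed points,
            placed here with Gaussian jitter around the latent event
            (an assumed kernel; points may fall slightly outside the window).
    """
    rng = random.Random(seed)
    n_latent = sample_poisson(rng, latent_rate * duration)
    centers = [rng.uniform(0.0, duration) for _ in range(n_latent)]
    points, labels = [], []
    for k, center in enumerate(centers):
        for _ in range(sample_poisson(rng, mean_children)):
            points.append(rng.gauss(center, spread))
            labels.append(k)  # index of the latent event (cluster) that produced it
    return centers, points, labels


centers, points, labels = simulate_neyman_scott(
    duration=10.0, latent_rate=0.5, mean_children=4.0, spread=0.2, seed=1
)
print(len(centers), "latent events,", len(points), "observed points")
```

Because the number of latent events is itself random, the number of clusters is not fixed in advance, which is the property the abstract compares to Bayesian nonparametric mixture models.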


Functional Model of Residential Consumption Elasticity under Dynamic Tariffs

Kamalanathan Ganesan , João Tomé Saraiva , Ricardo J. Bessa

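Category: Machine Learning

2021-11-22

One of the major barriers for retailers is understanding the consumption elasticity they can expect from their contracted demand response (DR) clients. The DR products currently offered by retailers are not consumer-specific, which poses an additional barrier to the active engagement of consumers in these programs. The elasticity of consumers' demand behavior varies from individual to individual. A utility will benefit from knowing more accurately how changes in its prices will modify the consumption pattern of its clients. This work proposes a functional model for the consumption elasticity of DR-contracted consumers. The model aims to determine the load adjustment that DR consumers can provide to retailers or utilities at different price levels. The proposed model uses a Bayesian probabilistic approach to identify the actual load adjustment an individual contracted client can provide for the different price levels it can experience. The developed framework gives retailers or utilities a tool to obtain crucial information on how an individual consumer will respond to different price levels. The approach can quantify the likelihood with which a consumer reacts to a DR signal and identify the actual load adjustment an individual DR-contracted client provides at the price levels they can experience. This information can be used to maximize the control and reliability of the services the retailer or utility can offer to the system operators.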


Probability Paths and the Structure of Predictions over Time

Zhiyuan Jerry Lin , Hao Sheng , Sharad Goel

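Categories: Machine Learning | Machine Learning (Statistics)

2021-06-11

In settings ranging from weather forecasts to political prognostications to financial projections, probability estimates of future binary outcomes often evolve over time. For example, the estimated likelihood of rain on a specific day changes by the hour as new information becomes available. Given a collection of such probability paths, we introduce a Bayesian framework, which we call the Gaussian latent information martingale, or GLIM, for modeling the structure of dynamic predictions over time. Suppose, for example, that the likelihood of rain in a week is 50%, and consider two hypothetical scenarios. In the first, one expects the forecast to be equally likely to become either 25% or 75% tomorrow; in the second, one expects the forecast to stay constant for the next several days. A time-sensitive decision-maker might select a course of action immediately in the latter scenario, but may postpone their decision in the former, knowing that new information is imminent. We model these trajectories by assuming that predictions update according to a latent process of information flow, which is inferred from historical data. In contrast to general methods for time series analysis, this approach preserves important properties of probability paths, such as the martingale structure and an appropriate amount of volatility, and better quantifies future uncertainty around probability paths. We show that GLIM outperforms three popular baseline methods, producing better estimated posterior probability path distributions as measured by three different metrics. By elucidating the dynamic structure of predictions over time, we hope to help individuals make more informed choices.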


Can a latent Hawkes process be used for epidemiological modelling?

Stamatina Lamprinakou , Axel Gandy , Emma McCoy

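Category: Machine Learning

2022-08-15

Understanding the spread of COVID-19 has been the subject of numerous studies, highlighting the importance of reliable epidemic models. Here, we introduce a novel epidemic model that uses a latent Hawkes process with temporal covariates to model infections. Unlike other models, we model the reported cases via a probability distribution driven by the underlying Hawkes process. Modeling infections via a Hawkes process allows us to estimate by whom an infected individual was infected. We propose a Kernel Density Particle Filter (KDPF) for inferring both the latent cases and the reproduction number, and for predicting new cases in the near future. The computational effort is proportional to the number of infections, which makes it feasible to use particle-filter-type algorithms such as the KDPF. We demonstrate the performance of the proposed algorithm on synthetic data sets and on COVID-19 reported cases in various local authorities in the UK, and we benchmark our model against alternative approaches.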


Deep Neyman-Scott Processes

Chengkuan Hong , Christian R. Shelton

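Categories: Machine Learning (Statistics) | Machine Learning

2021-11-06

A Neyman-Scott process is a special case of a Cox process. The latent and observable stochastic processes are both Poisson processes. In this paper we consider a deep Neyman-Scott process, in which the building components of the network are all Poisson processes. We develop an efficient posterior sampling scheme via Markov chain Monte Carlo and use it for likelihood-based inference. Our method opens up room for inference in sophisticated hierarchical point processes. We show in experiments that more hidden Poisson processes bring better performance for likelihood fitting and event type prediction. We also compare our method with state-of-the-art models on temporal real-world datasets and demonstrate competitive abilities for both data fitting and prediction while using fewer parameters.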


Machine Learning in Orbit Estimation: a Survey

Francisco Caldas , Cláudia Soares

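Category: Machine Learning

2022-07-19

Since the late 1950s, when the first artificial satellite was launched, the number of resident space objects (RSOs) has steadily increased. It is estimated that around 1 million objects larger than 1 cm are currently orbiting the Earth, of which only about 30,000, those larger than 10 cm, are currently being tracked. To avert a chain reaction of collisions, known as Kessler syndrome, it is essential to accurately track and predict the orbits of space debris and satellites. Current physics-based methods have errors in 7-day predictions that are too large when considering space debris, most of which is smaller than 1 meter. This failure is usually due to uncertainty around the state of the space object at the beginning of the trajectory, forecasting errors in environmental conditions such as atmospheric drag, and specific unknown characteristics such as the mass or geometry of the RSO. Leveraging data-driven techniques, namely machine learning, can improve orbit prediction accuracy by deriving unmeasured object characteristics, by improving the modeling of non-conservative forces, and by exploiting the superior abstraction capacity of deep learning models for highly complex non-linear systems. In this survey, we provide an overview of the current work being done in this field.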


Neural Superstatistics: A Bayesian Method for Estimating Dynamic Models of Cognition

Lukas Schumacher , Paul-Christian Bürkner , Andreas Voss , Ullrich Köthe , Stefan T. Radev

Category: Machine Learning (Statistics)

2022-11-23

Mathematical models of cognition are often memoryless and ignore potential fluctuations of their parameters. However, human cognition is inherently dynamic, regardless of the reference time scale. Thus, we propose to augment mechanistic cognitive models with a temporal dimension and estimate the resulting dynamics from a superstatistics perspective. In its simplest form, such a model entails a hierarchy between a low-level observation model and a high-level transition model. The observation model describes the local behavior of a system, and the transition model specifies how the parameters of the observation model evolve over time. To overcome the estimation challenges resulting from the complexity of superstatistical models, we develop and validate a simulation-based deep learning method for Bayesian inference, which can recover both time-varying and time-invariant parameters. We first benchmark our method against two existing frameworks capable of estimating time-varying parameters. We then apply our method to fit a dynamic version of the diffusion decision model to long time series of human response times data. Our results show that the deep learning approach is very efficient in capturing the temporal dynamics of the model. Furthermore, we show that the erroneous assumption of static or homogeneous parameters will hide important temporal information.
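
The two-level hierarchy described in this abstract (a transition model that moves the parameters, and an observation model that generates data from them) can be sketched with a toy simulator. This is only an illustration under strong assumptions: a Gaussian random walk stands in for the transition model and a Gaussian likelihood for the observation model, whereas the paper works with a diffusion decision model. All names and values are hypothetical.

```python
import random


def simulate_superstatistical_series(n_steps, drift_scale, noise_scale, seed=0):
    """Simulate a toy superstatistical model.

    High level (transition model): theta_t = theta_{t-1} + N(0, drift_scale).
    Low level (observation model): x_t ~ N(theta_t, noise_scale).
    """
    rng = random.Random(seed)
    theta = 0.0
    thetas, observations = [], []
    for _ in range(n_steps):
        theta += rng.gauss(0.0, drift_scale)                # parameter drifts over time
        thetas.append(theta)
        observations.append(rng.gauss(theta, noise_scale))  # data given current parameter
    return thetas, observations


thetas, xs = simulate_superstatistical_series(
    n_steps=200, drift_scale=0.05, noise_scale=0.5, seed=3
)
print(len(thetas), len(xs))
```

Setting `drift_scale` to zero recovers a static-parameter model, the assumption that the abstract argues can hide temporal information.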


Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods

Eyke Hüllermeier , Willem Waegeman

Category:

2019-10-21

The notion of uncertainty is of major importance in machine learning and constitutes a key element of machine learning methodology. In line with the statistical tradition, uncertainty has long been perceived as almost synonymous with standard probability and probabilistic predictions. Yet, due to the steadily increasing relevance of machine learning for practical applications and related issues such as safety requirements, new problems and challenges have recently been identified by machine learning scholars, and these problems may call for new methodological developments. In particular, this includes the importance of distinguishing between (at least) two different types of uncertainty, often referred to as aleatoric and epistemic. In this paper, we provide an introduction to the topic of uncertainty in machine learning as well as an overview of attempts so far at handling uncertainty in general and formalizing this distinction in particular.


A Survey on Uncertainty Reasoning and Quantification for Decision Making: Belief Theory Meets Deep Learning

Zhen Guo , Zelin Wan , Qisheng Zhang , Xujiang Zhao , Feng Chen , Jin-Hee Cho , Qi Zhang , Lance M. Kaplan , Dong H. Jeong , Audun Jøsang

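Categories: Artificial Intelligence | Machine Learning

2022-06-12

An in-depth understanding of uncertainty is the first step toward making effective decisions under uncertainty. Deep/machine learning (ML/DL) has been heavily leveraged to solve complex problems involving the processing of high-dimensional data. However, reasoning about and quantifying different types of uncertainty have been much less explored in ML/DL than in other artificial intelligence (AI) domains. In particular, belief/evidence theories have been studied in KRR since the 1960s to reason about and measure uncertainty in order to enhance decision-making effectiveness. We found that only a few studies have leveraged the mature uncertainty research in belief/evidence theories within ML/DL to tackle complex problems under different types of uncertainty. In this survey paper, we discuss several popular belief theories and their core ideas for dealing with uncertainty causes and types and for quantifying them, along with their applicability in ML/DL. In addition, we discuss three main approaches that leverage belief theories in deep neural networks (DNNs), namely evidential DNNs, fuzzy DNNs, and rough DNNs, in terms of their uncertainty causes, types, and quantification methods, along with their applicability in diverse problem domains. Based on our in-depth survey, we discuss insights, lessons learned, the limitations of the current state of the art in bridging belief theories and ML/DL, and finally, future research directions.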


Light curve completion and forecasting using fast and scalable Gaussian processes (MuyGPs)

Imène R. Goumiri , Alec M. Dunton , Amanda L. Muyskens , Benjamin W. Priest , Robert E. Armstrong

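Category: Machine Learning (Statistics)

2022-08-31

Temporal variations of apparent magnitude, called light curves, are observational statistics of interest captured by telescopes over long periods of time. Light curves make it possible to explore Space Domain Awareness (SDA) objectives, such as object identification or pose estimation, as latent variable inference problems. Ground-based observations from commercial off-the-shelf (COTS) cameras remain inexpensive compared to higher-precision instruments; however, limited sensor availability combined with noisier observations can produce gappy time-series data that can be difficult to model. These external factors confound the automated exploitation of light curves, which makes light curve prediction and extrapolation a crucial problem for applications. Traditionally, image or time-series completion problems have been approached with diffusion-based or exemplar-based methods. More recently, deep neural networks (DNNs) have become the tool of choice because of their empirical success at learning complex nonlinear embeddings. However, DNNs often require large amounts of training data, which are not necessarily available when looking at the unique features of the light curve of a single satellite. In this paper, we present a novel approach to predicting missing and future data points of light curves using Gaussian processes (GPs). GPs are nonlinear probabilistic models that infer posterior distributions over functions and naturally quantify uncertainty. However, the cubic scaling of GP inference and training is a major barrier to their adoption in applications. In particular, a single light curve can feature hundreds of thousands of observations, which is well beyond the practical limits of a conventional GP on a single machine. Consequently, we employ MuyGPs, a scalable framework for hyperparameter estimation of GP models that uses nearest-neighbor sparsification and local cross-validation. MuyGPs ...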



A Review of Incident Prediction, Resource Allocation, and Dispatch Models for Emergency Management

Ayan Mukhopadhyay , Geoffrey Pettet , Sayyed Vazirizade , Di Lu , Said El Said , Alex Jaimes , Hiba Baroud , Yevgeniy Vorobeychik , Mykel Kochenderfer , Abhishek Dubey

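Category: Artificial Intelligence

2020-06-07

Over the last fifty years, researchers have developed statistical, data-driven, analytical, and algorithmic approaches for designing and improving emergency response management (ERM) systems. The problem has been recognized as inherently difficult and constitutes spatio-temporal decision making under uncertainty, which the literature has addressed with varying assumptions and approaches. This survey provides a detailed review of these approaches, focusing on the key challenges and issues regarding four sub-processes: (a) incident prediction, (b) incident detection, (c) resource allocation, and (d) computer-aided dispatch for emergency response. We highlight the strengths and weaknesses of prior work in this domain and explore the similarities and differences between different modeling paradigms. We conclude by illustrating open challenges and opportunities for future research in this complex domain.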


Bayesian Semiparametric Model for Sequential Treatment Decisions with Informative Timing

Arman Oganisian , Kelly D. Getz , Todd A. Alonzo , Richard Aplenc , Jason A. Roy

Categories: Machine Learning | Machine Learning (Statistics)

2022-11-29

We develop a Bayesian semi-parametric model for the estimating the impact of dynamic treatment rules on survival among patients diagnosed with pediatric acute myeloid leukemia (AML). The data consist of a subset of patients enrolled in the phase III AAML1031 clinical trial in which patients move through a sequence of four treatment courses. At each course, they undergo treatment that may or may not include anthracyclines (ACT). While ACT is known to be effective at treating AML, it is also cardiotoxic and can lead to early death for some patients. Our task is to estimate the potential survival probability under hypothetical dynamic ACT treatment strategies, but there are several impediments. First, since ACT was not randomized in the trial, its effect on survival is confounded over time. Second, subjects initiate the next course depending on when they recover from the previous course, making timing potentially informative of subsequent treatment and survival. Third, patients may die or drop out before ever completing the full treatment sequence. We develop a generative Bayesian semi-parametric model based on Gamma Process priors to address these complexities. At each treatment course, the model captures subjects' transition to subsequent treatment or death in continuous time under a given rule. A g-computation procedure is used to compute a posterior over potential survival probability that is adjusted for time-varying confounding. Using this approach, we conduct posterior inference for the efficacy of hypothetical treatment rules that dynamically modify ACT based on evolving cardiac function.


OutbreakFlow: Model-based Bayesian inference of disease outbreak dynamics with invertible neural networks and its application to the COVID-19 pandemics in Germany

Stefan T. Radev , Frederik Graw , Simiao Chen , Nico T. Mutters , Vanessa M. Eichel , Till Bärnighausen , Ullrich Köthe

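Category: Machine Learning

2020-10-01

Mathematical models in epidemiology are an indispensable tool for determining the dynamics and important characteristics of infectious diseases. Apart from their scientific merit, these models are often used to inform political decisions and intervention measures during an ongoing outbreak. However, reliably inferring the dynamics of ongoing outbreaks by connecting complex models to real data is still hard, and it requires either laborious manual parameter fitting or expensive optimization methods that have to be repeated from scratch for every application of a given model. In this work, we address this problem with a novel combination of epidemiological modeling and specialized neural networks. Our approach entails two computational phases. In an initial training phase, a mathematical model describing the epidemic is used as a coach for a neural network, which acquires global knowledge about the full range of possible disease dynamics. In the subsequent inference phase, the trained neural network processes the observed data of an actual outbreak and infers the parameters of the model so as to realistically reproduce the observed dynamics and reliably predict future progression. With its flexible framework, our simulation-based approach is applicable to a variety of epidemiological models. Moreover, since our method is fully Bayesian, it is designed to incorporate all available prior knowledge about plausible parameter values, and it returns complete joint posterior distributions over these parameters. Applying our method to the early COVID-19 outbreak phase in Germany demonstrates that we are able to obtain reliable probabilistic estimates of important disease characteristics, such as generation time, fraction of undetected infections, likelihood of transmission before symptom onset, and reporting delays, using a very moderate amount of real-world observations.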


Dr. Neurosymbolic, or: How I Learned to Stop Worrying and Accept Statistics

Masataro Asai

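Categories: Artificial Intelligence | Machine Learning

2022-09-08

The symbolic AI community is increasingly trying to embrace machine learning in neuro-symbolic architectures, yet it is still struggling because of cultural barriers. To break the barrier, this rather opinionated personal memo attempts to explain and rectify the conventions of statistics, machine learning, and deep learning from the viewpoint of outsiders. It provides a step-by-step protocol for designing a machine learning system that satisfies the minimum theoretical guarantees necessary for being taken seriously by the symbolic AI community; that is, it discusses "under what conditions we can stop worrying and accept statistical machine learning." Some highlights: Most textbooks are written for those who plan to specialize in Stat/ML/DL and are expected to accept the jargon. This memo is for experienced symbolic researchers who hear a lot of buzz but are still uncertain and skeptical. Information on Stat/ML/DL is currently too scattered or too noisy to invest in. This memo prioritizes compactness and pays special attention to concepts that resonate well with symbolic paradigms. I hope this memo saves time. It prioritizes general mathematical modeling and does not discuss any specific function approximator, such as neural networks (NNs), SVMs, decision trees, etc. It is open to corrections. Consider this memo as something similar to a blog post that takes the form of a paper on arXiv.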


Joint Non-parametric Point Process model for Treatments and Outcomes: Counterfactual Time-series Prediction Under Policy Interventions

Çağlar Hızlı , ST John , Anne Juuti , Tuure Saarinen , Kirsi Pietiläinen , Pekka Marttinen

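Category: Machine Learning

2022-09-09

Policy makers need to predict the progression of an outcome before adopting a new treatment policy, which defines when and how a sequence of treatments affecting the outcome occurs in continuous time. Commonly, algorithms that predict interventional future outcome trajectories take a fixed sequence of future treatments as input. This either neglects the dependence of future treatments on the outcomes preceding them or implicitly assumes that the treatment policy is known, and hence excludes scenarios where the policy is unknown or a counterfactual analysis is needed. To handle these limitations, we develop a joint model for treatments and outcomes, which allows the estimation of treatment policies and effects from sequential treatment-outcome data. It can answer interventional and counterfactual queries about interventions on treatment policies, as we show with real-world data on blood glucose progression and a simulation study building on it.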


Non-Gaussian Process Regression

Yaman Kındap , Simon Godsill

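Categories: Machine Learning (Statistics) | Machine Learning

2022-09-07

Standard GPs offer a flexible modeling tool for well-behaved processes. However, deviations from Gaussianity are expected to appear in real-world datasets, where structural outliers and shocks are routinely observed. In these cases GPs can fail to model uncertainty adequately and may over-smooth inferences. Here we extend the GP framework to a new class of time-changed GPs that allow straightforward modeling of heavy-tailed non-Gaussian behavior, while retaining a tractable conditional GP structure through a representation as an infinite mixture of non-homogeneous GPs. The conditional GP structure is obtained by conditioning the observations on a latent transformed input space, and the random evolution of the latent transformation is modeled using a Lévy process, which allows Bayesian inference in both the posterior predictive density and the latent transformation function. We present Markov chain Monte Carlo inference procedures for this model and demonstrate the potential benefits compared to a standard GP.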


Evaluating vaccine allocation strategies using simulation-assisted causal modelling

Armin Kekić , Jonas Dehning , Luigi Gresele , Julius von Kügelgen , Viola Priesemann , Bernhard Schölkopf

Category: Artificial Intelligence

2022-12-14

Early on during a pandemic, vaccine availability is limited, requiring prioritisation of different population groups. Evaluating vaccine allocation is therefore a crucial element of pandemics response. In the present work, we develop a model to retrospectively evaluate age-dependent counterfactual vaccine allocation strategies against the COVID-19 pandemic. To estimate the effect of allocation on the expected severe-case incidence, we employ a simulation-assisted causal modelling approach which combines a compartmental infection-dynamics simulation, a coarse-grained, data-driven causal model and literature estimates for immunity waning. We compare Israel's implemented vaccine allocation strategy in 2021 to counterfactual strategies such as no prioritisation, prioritisation of younger age groups or a strict risk-ranked approach; we find that Israel's implemented strategy was indeed highly effective. We also study the marginal impact of increasing vaccine uptake for a given age group and find that increasing vaccinations in the elderly is most effective at preventing severe cases, whereas additional vaccinations for middle-aged groups reduce infections most effectively. Due to its modular structure, our model can easily be adapted to study future pandemics. We demonstrate this flexibility by investigating vaccine allocation strategies for a pandemic with characteristics of the Spanish Flu. Our approach thus helps evaluate vaccination strategies under the complex interplay of core epidemic factors, including age-dependent risk profiles, immunity waning, vaccine availability and spreading rates.


A Comprehensive Review of Digital Twin -- Part 2: Roles of Uncertainty Quantification and Optimization, a Battery Digital Twin, and Perspectives

Adam Thelen , Xiaoge Zhang , Olga Fink , Yan Lu , Sayan Ghosh , Byeng D. Youn , Michael D. Todd , Sankaran Mahadevan , Chao Hu , Zhen Hu

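Category: Machine Learning

2022-08-27

As an emerging technology in the era of Industry 4.0, the digital twin is gaining unprecedented attention because of its promise to further optimize process design, quality control, health monitoring, decision and policy making, and more, by comprehensively modeling the physical world as a group of interconnected digital models. In a two-part series of papers, we examine the fundamental role of different modeling techniques, twinning enabling technologies, and the uncertainty quantification and optimization methods commonly used in digital twins. This second paper presents a literature review of key enabling technologies of digital twins, with an emphasis on uncertainty quantification, optimization methods, open-source datasets and tools, major findings, challenges, and future directions. Discussions focus on current methods of uncertainty quantification and optimization and on how they are applied in different dimensions of a digital twin. Additionally, this paper presents a case study in which a battery digital twin is constructed and tested to illustrate some of the modeling and twinning methods reviewed in this two-part review. The code and preprocessed data for generating all the results and figures in the case study are available on GitHub.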
