智能论文笔记

Greykite: Deploying Flexible Forecasting at Scale at LinkedIn

Reza Hosseini , Albert Chen , Kaixu Yang , Sayan Patra , Yi Su , Saad Eddin Al Orjany , Sishi Tang , Parvez Ahammad

分类：机器学习

2022-07-15

预测可帮助企业分配资源并实现目标。在LinkedIn，产品所有者使用预测来设定业务目标，跟踪前景和监视健康。工程师使用预测有效地提供硬件。开发一种预测解决方案来满足这些需求，需要对各种时间序列进行准确，可解释的预测，并以次数至季度的频率。我们提出了Greykite，这是一个用于预测的开源Python库，已在LinkedIn上部署了二十多种用例。它的旗舰算法Silverkite提供了可解释的，快速且高度灵活的单变量预测，可捕获诸如时期增长和季节性，自相关，假期和回归剂等效果。该库通过促进数据探索，模型配置，执行和解释来实现自我服务的准确性和信任。我们的基准结果显示了来自各个域的数据集的现成速度和准确性。在过去的两年中，金融，工程和产品团队的资源计划和分配，目标设置和进度跟踪，异常检测和根本原因分析的资源团队一直信任灰金矿的预测。我们希望灰金矿对具有类似应用的预测从业者有用，这些应用需要准确，可解释的预测，这些预测捕获了与人类活动相关的时间序列共有的复杂动力学。

translated by 谷歌翻译

NeuralProphet: Explainable Forecasting at Scale

Oskar Triebe , Hansika Hewamalage , Polina Pilyugina , Nikolay Laptev , Christoph Bergmeir , Ram Rajagopal

分类：机器学习 | (统计)机器学习

2021-11-29

我们向Facebook先知推出了一位继任者，为可解释，可扩展和用户友好的预测框架制定了一个行业标准。随着时间序列数据的扩散，可说明的预测仍然是企业和运营决策的具有挑战性的任务。需要混合解决方案来弥合可解释的古典方法与可扩展深层学习模型之间的差距。我们将先知视为这样一个解决方案的前兆。然而，先知缺乏本地背景，这对于预测近期未来至关重要，并且由于其斯坦坦后代而挑战。 NeultProphet是一种基于Pytorch的混合预测框架，并用标准的深度学习方法培训，开发人员可以轻松扩展框架。本地上下文使用自动回归和协变量模块引入，可以配置为经典线性回归或作为神经网络。否则，NeultProphet保留了先知的设计理念，提供了相同的基本模型组件。我们的结果表明，NeultProcrophet在一组生成的时间序列上产生了相当或优质的质量的可解释的预测组件。 NeultProphet在各种各样的现实数据集合中占先知。对于中期预测，NeultProclecrophet将预测精度提高55％至92％。

translated by 谷歌翻译

Demand Forecasting for Platelet Usage: from Univariate Time Series to Multivariate Models

Maryam Motamedi , Jessica Dawson , Na Li , Douglas G. Down , Nancy M. Heddle

分类：机器学习 | (统计)机器学习

2021-01-06

Platelet products are both expensive and have very short shelf lives. As usage rates for platelets are highly variable, the effective management of platelet demand and supply is very important yet challenging. The primary goal of this paper is to present an efficient forecasting model for platelet demand at Canadian Blood Services (CBS). To accomplish this goal, four different demand forecasting methods, ARIMA (Auto Regressive Moving Average), Prophet, lasso regression (least absolute shrinkage and selection operator) and LSTM (Long Short-Term Memory) networks are utilized and evaluated. We use a large clinical dataset for a centralized blood distribution centre for four hospitals in Hamilton, Ontario, spanning from 2010 to 2018 and consisting of daily platelet transfusions along with information such as the product specifications, the recipients' characteristics, and the recipients' laboratory test results. This study is the first to utilize different methods from statistical time series models to data-driven regression and a machine learning technique for platelet transfusion using clinical predictors and with different amounts of data. We find that the multivariate approaches have the highest accuracy in general, however, if sufficient data are available, a simpler time series approach such as ARIMA appears to be sufficient. We also comment on the approach to choose clinical indicators (inputs) for the multivariate models.

translated by 谷歌翻译

Comparison and Evaluation of Methods for a Predict+Optimize Problem in Renewable Energy

Christoph Bergmeir , Frits de Nijs , Abishek Sriramulu , Mahdi Abolghasemi , Richard Bean , John Betts , Quang Bui , Nam Trong Dinh , Nils Einecke , Rasul Esmaeilbeigi

分类：人工智能

2022-12-21

Algorithms that involve both forecasting and optimization are at the core of solutions to many difficult real-world problems, such as in supply chains (inventory optimization), traffic, and in the transition towards carbon-free energy generation in battery/load/production scheduling in sustainable energy systems. Typically, in these scenarios we want to solve an optimization problem that depends on unknown future values, which therefore need to be forecast. As both forecasting and optimization are difficult problems in their own right, relatively few research has been done in this area. This paper presents the findings of the ``IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling," held in 2021. We present a comparison and evaluation of the seven highest-ranked solutions in the competition, to provide researchers with a benchmark problem and to establish the state of the art for this benchmark, with the aim to foster and facilitate research in this area. The competition used data from the Monash Microgrid, as well as weather data and energy market data. It then focused on two main challenges: forecasting renewable energy production and demand, and obtaining an optimal schedule for the activities (lectures) and on-site batteries that lead to the lowest cost of energy. The most accurate forecasts were obtained by gradient-boosted tree and random forest models, and optimization was mostly performed using mixed integer linear and quadratic programming. The winning method predicted different scenarios and optimized over all scenarios jointly using a sample average approximation method.

translated by 谷歌翻译

LEAF: Navigating Concept Drift in Cellular Networks

Shinan Liu , Francesco Bronzino , Paul Schmitt , Arjun Nitin Bhagoji , Nick Feamster , Hector Garcia Crespo , Timothy Coyle , Brian Ward

分类：机器学习

2021-09-07

操作网络通常依靠机器学习模型来进行许多任务，包括检测异常，推断应用程序性能和预测需求。然而，不幸的是，模型精度会因概念漂移而降低，从而，由于从软件升级到季节性到用户行为的变化，功能和目标预测之间的关系会发生变化。因此，缓解概念漂移是操作机器学习模型的重要组成部分，尽管它很重要，但在网络或一般的回归模型的背景下，概念漂移并未得到广泛的探索。因此，对于当前依赖机器学习模型的许多常见网络管理任务，如何检测或减轻它并不是一件好事。不幸的是，正如我们所展示的那样，通过使用新可用的数据经常重新培训模型可以充分缓解概念漂移，甚至可以进一步降低模型的准确性。在本文中，我们表征了美国主要大都市地区的大型蜂窝网络中的概念漂移。我们发现，概念漂移发生在许多重要的关键性能指标（KPI）上，独立于模型，训练集大小和时间间隔，因此需要采用实用方法来检测，解释和减轻它。为此，我们开发了特征（叶）的局部误差近似。叶检测到漂移；解释最有助于漂移的功能和时间间隔；并使用遗忘和过度采样来减轻漂移。我们使用超过四年的蜂窝KPI数据来评估叶子与行业标准的缓解方法。在美国，我们对主要的细胞提供商进行的初步测试表明，LEAF在各种KPI和模型上都是有效的。叶子始终优于周期性，并触发重新培训，同时还要降低昂贵的重新经营操作。

translated by 谷歌翻译

Online Advertising Revenue Forecasting: An Interpretable Deep Learning Approach

Max Würfel , Qiwei Han , Maximilian Kaiser

分类：机器学习 | 人工智能

2021-11-16

在线广告收入占发布者的收入流越来越多的份额，特别是对于依赖谷歌和Facebook等技术公司广告网络的中小型出版商而言。因此，出版商可能会从准确的在线广告收入预测中获益，以更好地管理其网站货币化战略。但是，只能获得自己的收入数据的出版商缺乏出版商广告总市场的整体视图，这反过来限制了他们在他们未来的在线广告收入中产生见解的能力。为了解决这一业务问题，我们利用了一个专有的数据库，包括来自各种各样的地区的大量出版商的Google Adsense收入。我们采用时间融合变压器（TFT）模型，这是一种新的基于关注的架构，以预测出版商的广告收入。我们利用多个协变量，不仅包括出版商自己的特征，还包括其他出版商的广告收入。我们的预测结果优于多个时间范围的几个基准深度学习时间系列预测模型。此外，我们通过分析可变重要性重量来识别显着的特征和自我注意重量来解释结果，以揭示持久的时间模式。

translated by 谷歌翻译

A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the United States

Benjamin Lucas , Behzad Vahedi , Morteza Karimzadeh

分类： (统计)机器学习 | 机器学习

2021-09-24

随着Covid-19影响每个国家的全球和改变日常生活，预测疾病的传播的能力比任何先前的流行病更重要。常规的疾病 - 展开建模方法，隔间模型，基于对病毒的扩散的时空均匀性的假设，这可能导致预测到欠低，特别是在高空间分辨率下。本文采用替代技术 - 时空机器学习方法。我们提出了Covid-LSTM，一种基于长期短期内存深度学习架构的数据驱动模型，用于预测Covid-19在美国县级的发病率。我们使用每周数量的新阳性案例作为时间输入，以及来自Facebook运动和连通数据集的手工工程空间特征，以捕捉时间和空间的疾病的传播。 Covid-LSTM在我们的17周的评估期间优于Covid-19预测集线器集合模型（CovidHub-Ensemble），使其首先比一个或多个预测期更准确的模型。在4周的预测地平线上，我们的型号平均每县平均50例比CovidHub-Ensemble更准确。我们强调，在Covid-19之前，在Covid-19之前的数据驱动预测的未充分利用疾病传播的预测可能是由于以前疾病缺乏足够的数据，除了最近的时尚预测方法的机器学习方法的进步。我们讨论了更广泛的数据驱动预测的障碍，以及将来将使用更多的基于学习的模型。

translated by 谷歌翻译

Data-Centric Epidemic Forecasting: A Survey

Alexander Rodríguez , Harshavardhan Kamarthi , Pulak Agarwal , Javen Ho , Mira Patel , Suchet Sapre , B. Aditya Prakash

分类：机器学习

2022-07-19

COVID-19的大流行提出了对多个领域决策者的流行预测的重要性，从公共卫生到整个经济。虽然预测流行进展经常被概念化为类似于天气预测，但是它具有一些关键的差异，并且仍然是一项非平凡的任务。疾病的传播受到人类行为，病原体动态，天气和环境条件的多种混杂因素的影响。由于政府公共卫生和资助机构的倡议，捕获以前无法观察到的方面的丰富数据来源的可用性增加了研究的兴趣。这尤其是在“以数据为中心”的解决方案上进行的一系列工作，这些解决方案通过利用非传统数据源以及AI和机器学习的最新创新来增强我们的预测能力的潜力。这项调查研究了各种数据驱动的方法论和实践进步，并介绍了一个概念框架来导航它们。首先，我们列举了与流行病预测相关的大量流行病学数据集和新的数据流，捕获了各种因素，例如有症状的在线调查，零售和商业，流动性，基因组学数据等。接下来，我们将讨论关注最近基于数据驱动的统计和深度学习方法的方法和建模范式，以及将机械模型知识域知识与统计方法的有效性和灵活性相结合的新型混合模型类别。我们还讨论了这些预测系统的现实部署中出现的经验和挑战，包括预测信息。最后，我们重点介绍了整个预测管道中发现的一些挑战和开放问题。

translated by 谷歌翻译

Contextually Enhanced ES-dRNN with Dynamic Attention for Short-Term Load Forecasting

Slawek Smyl , Grzegorz Dudek , Paweł Pełka

分类：机器学习 | 人工智能 | 神经与进化计算

2022-12-18

In this paper, we propose a new short-term load forecasting (STLF) model based on contextually enhanced hybrid and hierarchical architecture combining exponential smoothing (ES) and a recurrent neural network (RNN). The model is composed of two simultaneously trained tracks: the context track and the main track. The context track introduces additional information to the main track. It is extracted from representative series and dynamically modulated to adjust to the individual series forecasted by the main track. The RNN architecture consists of multiple recurrent layers stacked with hierarchical dilations and equipped with recently proposed attentive dilated recurrent cells. These cells enable the model to capture short-term, long-term and seasonal dependencies across time series as well as to weight dynamically the input information. The model produces both point forecasts and predictive intervals. The experimental part of the work performed on 35 forecasting problems shows that the proposed model outperforms in terms of accuracy its predecessor as well as standard statistical models and state-of-the-art machine learning models.

translated by 谷歌翻译

Causal Effect Estimation with Global Probabilistic Forecasting: A Case Study of the Impact of Covid-19 Lockdowns on Energy Demand

Ankitha Nandipura Prasanna , Priscila Grecov , Angela Dieyu Weng , Christoph Bergmeir

分类：机器学习 | 人工智能

2022-09-19

电力行业正在大力实施智能网格技术，以提高可靠性，可用性，安全性和效率。该实施需要技术进步，标准和法规的发展以及测试和计划。智能电网载荷预测和管理对于降低需求波动和改善连接发电机，分销商和零售商的市场机制至关重要。在政策实施或外部干预措施中，有必要分析其对电力需求的影响的不确定性，以使系统对需求的波动更加准确。本文分析了外部干预的不确定性对电力需求的影响。它实现了一种结合概率和全局预测模型的框架，使用深度学习方法来估计干预措施的因果影响分布。通过预测受影响实例的反事实分布结果，然后将其与实际结果进行对比来评估因果效应。我们将COVID-19锁定对能源使用的影响视为评估这种干预对电力需求分布的不均匀影响的案例研究。我们可以证明，在澳大利亚和某些欧洲国家的最初封锁期间，槽通常比峰值更大的下降，而平均值几乎不受影响。

translated by 谷歌翻译

DeepAR: Probabilistic Forecasting with Autoregressive Recurrent Networks

David Salinas , Valentin Flunkert , Jan Gasthaus

分类：

2017-04-13

Probabilistic forecasting, i.e. estimating the probability distribution of a time series' future given its past, is a key enabler for optimizing business processes. In retail businesses, for example, forecasting demand is crucial for having the right inventory available at the right time at the right place. In this paper we propose DeepAR, a methodology for producing accurate probabilistic forecasts, based on training an auto-regressive recurrent network model on a large number of related time series. We demonstrate how by applying deep learning techniques to forecasting, one can overcome many of the challenges faced by widely-used classical approaches to the problem. We show through extensive empirical evaluation on several real-world forecasting data sets accuracy improvements of around 15% compared to state-of-the-art methods.

translated by 谷歌翻译

A Temporal Fusion Transformer for Long-term Explainable Prediction of Emergency Department Overcrowding

Francisco M. Caldas , Cláudia Soares

分类：机器学习

2022-07-01

急诊部门（EDS）是葡萄牙国家卫生服务局的基本要素，可作为具有多样化和非常严重医疗问题的用户的切入点。由于ED的固有特征；预测使用服务的患者数量特别具有挑战性。富裕和医疗专业人员人数之间的不匹配可能会导致提供的服务质量下降，并造成对整个医院产生影响的问题，并从其他部门征用医疗保健工作者以及推迟手术。。 ED人满为患的部分是由非紧急患者驱动的，尽管没有医疗紧急情况，但诉诸于紧急服务，几乎占每日患者总数的一半。本文描述了一种新颖的深度学习体系结构，即时间融合变压器，该结构使用日历和时间序列协变量来预测预测间隔和4周期间的点预测。我们得出的结论是，可以预测葡萄牙健康区域（HRA）（HRA）的平均绝对百分比误差（MAPE）和均方根误差（RMSE）为84.4102人/天的平均绝对百分比误差（MAPE）。本文显示了支持使用静态和时间序列协变量的多元方法的经验证据，同时超越了文献中常见的其他模型。

translated by 谷歌翻译

Deep Learning for Time Series Forecasting: Tutorial and Literature Survey

Konstantinos Benidis , Syama Sundar Rangapuram , Valentin Flunkert , Yuyang Wang , Danielle Maddix , Caner Turkmen , Jan Gasthaus , Michael Bohlke-Schneider , David Salinas , Lorenzo Stella

分类：机器学习 | (统计)机器学习

2020-04-21

基于预测方法的深度学习已成为时间序列预测或预测的许多应用中的首选方法，通常通常优于其他方法。因此，在过去的几年中，这些方法现在在大规模的工业预测应用中无处不在，并且一直在预测竞赛（例如M4和M5）中排名最佳。这种实践上的成功进一步提高了学术兴趣，以理解和改善深厚的预测方法。在本文中，我们提供了该领域的介绍和概述：我们为深入预测的重要构建块提出了一定深度的深入预测；随后，我们使用这些构建块，调查了最近的深度预测文献的广度。

translated by 谷歌翻译

Daily peak electrical load forecasting with a multi-resolution approach

Yvenn Amara-Ouali , Matteo Fasiolo , Yannig Goude , Hui Yan

分类：机器学习

2021-12-08

在智能电网和负载平衡的背景下，每日峰值负荷预测已成为能源行业利益相关者的关键活动。对峰值幅度和时序的理解对于实现峰值剃须等智能电网策略至关重要。本文提出的建模方法利用了高分辨率和低分辨率信息来预测每日峰值需求规模和时序。由此产生的多分辨率建模框架可以适应不同的模型类。本文的主要贡献是一般性和正式介绍多分辨率建模方法，b）关于通过广义添加剂模型和神经网络和C）实验结果的不同决议的建模方法的讨论英国电力市场。结果证实，建议的建模方法的预测性能与低分辨率和高分辨率替代品具有竞争力。

translated by 谷歌翻译

Multistep Multiappliance Load Prediction

Alona Zharova , Antonia Scherz

分类：机器学习

2022-12-19

A well-performing prediction model is vital for a recommendation system suggesting actions for energy-efficient consumer behavior. However, reliable and accurate predictions depend on informative features and a suitable model design to perform well and robustly across different households and appliances. Moreover, customers' unjustifiably high expectations of accurate predictions may discourage them from using the system in the long term. In this paper, we design a three-step forecasting framework to assess predictability, engineering features, and deep learning architectures to forecast 24 hourly load values. First, our predictability analysis provides a tool for expectation management to cushion customers' anticipations. Second, we design several new weather-, time- and appliance-related parameters for the modeling procedure and test their contribution to the model's prediction performance. Third, we examine six deep learning techniques and compare them to tree- and support vector regression benchmarks. We develop a robust and accurate model for the appliance-level load prediction based on four datasets from four different regions (US, UK, Austria, and Canada) with an equal set of appliances. The empirical results show that cyclical encoding of time features and weather indicators alongside a long-short term memory (LSTM) model offer the optimal performance.

translated by 谷歌翻译

TACTiS: Transformer-Attentional Copulas for Time Series

Alexandre Drouin , Étienne Marcotte , Nicolas Chapados

分类：机器学习 | (统计)机器学习

2022-02-07

时间变化数量的估计是医疗保健和金融等领域决策的基本组成部分。但是，此类估计值的实际实用性受到它们量化预测不确定性的准确程度的限制。在这项工作中，我们解决了估计高维多元时间序列的联合预测分布的问题。我们提出了一种基于变压器体系结构的多功能方法，该方法使用基于注意力的解码器估算关节分布，该解码器可被学会模仿非参数Copulas的性质。最终的模型具有多种理想的属性：它可以扩展到数百个时间序列，支持预测和插值，可以处理不规则和不均匀的采样数据，并且可以在训练过程中无缝地适应丢失的数据。我们从经验上证明了这些属性，并表明我们的模型在多个现实世界数据集上产生了最新的预测。

translated by 谷歌翻译

A Survey of Open Source Automation Tools for Data Science Predictions

Nicholas Hoell

分类：机器学习

2022-08-24

我们介绍了数据科学预测生命周期中各个阶段开发和采用自动化的技术和文化挑战的说明概述，从而将重点限制为使用结构化数据集的监督学习。此外，我们回顾了流行的开源Python工具，这些工具实施了针对自动化挑战的通用解决方案模式，并突出了我们认为进步仍然需要的差距。

translated by 谷歌翻译

HTML版本

Deep Learning for Time Series Anomaly Detection: A Survey

Zahra Zamanzadeh Darban , Geoffrey I. Webb , Shirui Pan , Charu C. Aggarwal , Mahsa Salehi

分类：机器学习 | 人工智能

2022-11-09

Time series anomaly detection has applications in a wide range of research fields and applications, including manufacturing and healthcare. The presence of anomalies can indicate novel or unexpected events, such as production faults, system defects, or heart fluttering, and is therefore of particular interest. The large size and complex patterns of time series have led researchers to develop specialised deep learning models for detecting anomalous patterns. This survey focuses on providing structured and comprehensive state-of-the-art time series anomaly detection models through the use of deep learning. It providing a taxonomy based on the factors that divide anomaly detection models into different categories. Aside from describing the basic anomaly detection technique for each category, the advantages and limitations are also discussed. Furthermore, this study includes examples of deep anomaly detection in time series across various application domains in recent years. It finally summarises open issues in research and challenges faced while adopting deep anomaly detection models.

translated by 谷歌翻译

Agnostic Learning for Packing Machine Stoppage Prediction in Smart Factories

Gabriel Filios , Ioannis Katsidimas , Sotiris Nikoletseas , Stefanos H. Panagiotou , Theofanis P. Raptis

分类：机器学习

2022-12-12

The cyber-physical convergence is opening up new business opportunities for industrial operators. The need for deep integration of the cyber and the physical worlds establishes a rich business agenda towards consolidating new system and network engineering approaches. This revolution would not be possible without the rich and heterogeneous sources of data, as well as the ability of their intelligent exploitation, mainly due to the fact that data will serve as a fundamental resource to promote Industry 4.0. One of the most fruitful research and practice areas emerging from this data-rich, cyber-physical, smart factory environment is the data-driven process monitoring field, which applies machine learning methodologies to enable predictive maintenance applications. In this paper, we examine popular time series forecasting techniques as well as supervised machine learning algorithms in the applied context of Industry 4.0, by transforming and preprocessing the historical industrial dataset of a packing machine's operational state recordings (real data coming from the production line of a manufacturing plant from the food and beverage domain). In our methodology, we use only a single signal concerning the machine's operational status to make our predictions, without considering other operational variables or fault and warning signals, hence its characterization as ``agnostic''. In this respect, the results demonstrate that the adopted methods achieve a quite promising performance on three targeted use cases.

translated by 谷歌翻译

LoMEF: A Framework to Produce Local Explanations for Global Model Time Series Forecasts

Dilini Rajapaksha , Christoph Bergmeir , Rob J Hyndman

分类：机器学习 | 人工智能 | (统计)机器学习

2021-11-13

与单变量预测方法相比，在一组多个时间序列中培训的全球预测模型（GFM）在许多预测竞赛和现实世界应用方面表现出优越的结果。 ETS和Arima等统计预测模型的普及的一个方面是它们相对简单和可解释性（就相关的滞后，趋势，季节性等），而GFM通常缺乏可解释性，特别是对特定时间序列。这减少了基于预测的决策时对利益相关者的信任和信心，而不是能够理解预测。为了减轻这个问题，在这项工作中，我们提出了一种新颖的本地模型 - 不可知论解释方法来解释GFM的预测。我们培训更简单的单变量代理模型，这些模型被认为是通过自动启动或直截了当地作为时间序列的一步的全局黑匣子模型预测所获得的邻域内的邻域内的样本的可解释（例如，ETS）。需要解释哪些。之后，我们评估了对全球模型在定性和定量方面的预测的解释，例如准确性，保真度，稳定性和可理性，并且能够展示我们方法的好处。

translated by 谷歌翻译