智能论文笔记

Addressing Deep Learning Model Uncertainty in Long-Range Climate Forecasting with Late Fusion

Ken C. L. Wong , Hongzhi Wang , Etienne E. Vos , Bianca Zadrozny , Campbell D. Watson , Tanveer Syeda-Mahmood

分类：机器学习

2021-12-10

全球变暖导致气候极端频率和强度的增加，导致生活巨大损失。准确的远程气候预测允许更多时间进行准备和灾害风险管理，以获得此类极端事件。虽然机器学习方法已经表明了远程气候预测结果，但相关的模型不确定性可能会降低其可靠性。为了解决这个问题，我们提出了一种后期的融合方法，系统地将预测从多种模型中组合以减少融合结果的预期误差。我们还提出了一种具有新型Denormalization层的网络架构，以获得数据标准化的好处，而无需实际归一化数据。远程2M温度预测的实验结果表明，该框架优于30年气候法线，通过增加模型数量可以提高准确性。

translated by 谷歌翻译

GraphCast: Learning skillful medium-range global weather forecasting

Remi Lam , Alvaro Sanchez-Gonzalez , Matthew Willson , Peter Wirnsberger , Meire Fortunato , Alexander Pritzel , Suman Ravuri , Timo Ewalds , Ferran Alet , Zach Eaton-Rosen

分类：机器学习

2022-12-24

We introduce a machine-learning (ML)-based weather simulator--called "GraphCast"--which outperforms the most accurate deterministic operational medium-range weather forecasting system in the world, as well as all previous ML baselines. GraphCast is an autoregressive model, based on graph neural networks and a novel high-resolution multi-scale mesh representation, which we trained on historical weather data from the European Centre for Medium-Range Weather Forecasts (ECMWF)'s ERA5 reanalysis archive. It can make 10-day forecasts, at 6-hour time intervals, of five surface variables and six atmospheric variables, each at 37 vertical pressure levels, on a 0.25-degree latitude-longitude grid, which corresponds to roughly 25 x 25 kilometer resolution at the equator. Our results show GraphCast is more accurate than ECMWF's deterministic operational forecasting system, HRES, on 90.0% of the 2760 variable and lead time combinations we evaluated. GraphCast also outperforms the most accurate previous ML-based weather forecasting model on 99.2% of the 252 targets it reported. GraphCast can generate a 10-day forecast (35 gigabytes of data) in under 60 seconds on Cloud TPU v4 hardware. Unlike traditional forecasting methods, ML-based forecasting scales well with data: by training on bigger, higher quality, and more recent data, the skill of the forecasts can improve. Together these results represent a key step forward in complementing and improving weather modeling with ML, open new opportunities for fast, accurate forecasting, and help realize the promise of ML-based simulation in the physical sciences.

translated by 谷歌翻译

Learned Benchmarks for Subseasonal Forecasting

Soukayna Mouatadid , Paulo Orenstein , Genevieve Flaspohler , Miruna Oprescu , Judah Cohen , Franklyn Wang , Sean Knight , Maria Geogdzhayeva , Sam Levang , Ernest Fraenkel

分类：机器学习 | (统计)机器学习

2021-09-21

我们基准了一个简单学习模型的亚季节预测工具包，该工具包优于操作实践和最先进的机器学习和深度学习方法。这些模型，由Mouatadid等人引入。（2022），包括（a）气候++，这是气候学的一种适应性替代品，对于降水而言，准确性9％，比美国运营气候预测系统（CFSV2）高9％，熟练250％；（b）CFSV2 ++，一种学习的CFSV2校正，可将温度和降水精度提高7-8％，技能提高50-275％；（c）持久性++是一种增强的持久性模型，将CFSV2预测与滞后测量相结合，以将温度和降水精度提高6-9％，技能提高40-130％。在整个美国，气候++，CFSV2 ++和持久性++工具包始终优于标准气象基准，最先进的机器和深度学习方法，以及欧洲中等范围的天气预报集合中心。

translated by 谷歌翻译

Beyond Ensemble Averages: Leveraging Climate Model Ensembles for Subseasonal Forecasting

Elena Orlova , Haokun Liu , Raphael Rossellini , Benjamin Cash , Rebecca Willett

分类：机器学习

2022-11-29

Producing high-quality forecasts of key climate variables such as temperature and precipitation on subseasonal time scales has long been a gap in operational forecasting. Recent studies have shown promising results using machine learning (ML) models to advance subseasonal forecasting (SSF), but several open questions remain. First, several past approaches use the average of an ensemble of physics-based forecasts as an input feature of these models. However, ensemble forecasts contain information that can aid prediction beyond only the ensemble mean. Second, past methods have focused on average performance, whereas forecasts of extreme events are far more important for planning and mitigation purposes. Third, climate forecasts correspond to a spatially-varying collection of forecasts, and different methods account for spatial variability in the response differently. Trade-offs between different approaches may be mitigated with model stacking. This paper describes the application of a variety of ML methods used to predict monthly average precipitation and two meter temperature using physics-based predictions (ensemble forecasts) and observational data such as relative humidity, pressure at sea level, or geopotential height, two weeks in advance for the whole continental United States. Regression, quantile regression, and tercile classification tasks using linear models, random forests, convolutional neural networks, and stacked models are considered. The proposed models outperform common baselines such as historical averages (or quantiles) and ensemble averages (or quantiles). This paper further includes an investigation of feature importance, trade-offs between using the full ensemble or only the ensemble average, and different modes of accounting for spatial variability.

translated by 谷歌翻译

Sky-image-based solar forecasting using deep learning with multi-location data: training models locally, globally or via transfer learning?

Yuhao Nie , Quentin Paletta , Andea Scotta , Luis Martin Pomares , Guillaume Arbod , Sgouris Sgouridis , Joan Lasenby , Adam Brandt

分类：计算机视觉 | 人工智能

2022-11-03

Solar forecasting from ground-based sky images using deep learning models has shown great promise in reducing the uncertainty in solar power generation. One of the biggest challenges for training deep learning models is the availability of labeled datasets. With more and more sky image datasets open sourced in recent years, the development of accurate and reliable solar forecasting methods has seen a huge growth in potential. In this study, we explore three different training strategies for deep-learning-based solar forecasting models by leveraging three heterogeneous datasets collected around the world with drastically different climate patterns. Specifically, we compare the performance of models trained individually based on local datasets (local models) and models trained jointly based on the fusion of multiple datasets from different locations (global models), and we further examine the knowledge transfer from pre-trained solar forecasting models to a new dataset of interest (transfer learning models). The results suggest that the local models work well when deployed locally, but significant errors are observed for the scale of the prediction when applied offsite. The global model can adapt well to individual locations, while the possible increase in training efforts need to be taken into account. Pre-training models on a large and diversified source dataset and transferring to a local target dataset generally achieves superior performance over the other two training strategies. Transfer learning brings the most benefits when there are limited local data. With 80% less training data, it can achieve 1% improvement over the local baseline model trained using the entire dataset. Therefore, we call on the efforts from the solar forecasting community to contribute to a global dataset containing a massive amount of imagery and displaying diversified samples with a range of sky conditions.

translated by 谷歌翻译

Deep Learning-based Extreme Heatwave Forecast

Valérian Jacques-Dumas , Francesco Ragone , Pierre Borgnat , Patrice Abry , Freddy Bouchet

分类：机器学习

2021-03-17

由于极端热波和热圆顶对社会和生物多样性的影响，他们的研究是一个关键挑战。我们专门研究了持久的极端热浪，这是气候影响最重要的热潮。物理驱动天气预报系统或气候模型可用于预测其发生或预测其概率。目前的工作探讨了使用深度学习架构的使用，使用气候模型的输出训练，作为预测极端持久热浪的发生的替代策略。这种新方法将对包括气候模型统计数据研究的几个关键科学目标，建立了对气候模型中罕见事件的定量代理，研究了气候变化的影响，并最终应对预测有用。履行这些重要目标意味着解决与罕见事件预测有本质相关的类大小不平衡的问题，评估转移学习的潜在好处，以解决极端事件的嵌套性质（自然包含在不太极端的情况下）。我们训练一个卷积神经网络，使用1000年的气候模型产出，具有大级欠采样和转移学习。从观察到的表面温度和500 HPA地球态高度场的快照，训练有素的网络在预测持久的极端热浪的发生时实现了显着性能。我们能够以三种不同的强度预测它们，早在活动开始前15天（事件结束前30天）。

translated by 谷歌翻译

ECLIPSE : Envisioning Cloud Induced Perturbations in Solar Energy

Quentin Paletta , Anthony Hu , Guillaume Arbod , Joan Lasenby

分类：计算机视觉 | 人工智能 | 机器学习

2021-04-26

太阳能的高效整合到电力组合中取决于其间歇性的可靠预期。预测由云覆盖动态产生的太阳辐照度的时间变异的有希望的方法是基于地面天空图像或卫星图像序列的分析。尽管结果令人鼓舞，但现有深度学习方法的经常性限制在于对过去观察的反应而不是积极预期未来事件的无处不在的趋势。这导致频繁的时间滞后和有限的预测突发事件的能力。为了解决这一挑战，我们介绍了Eclipse，一种时空神经网络架构，即模型从天空图像模拟云运动，不仅预测未来的辐照水平，而且还可以在本地辐照度图上提供更丰富的信息。我们表明Eclipse预期关键事件，并在产生视觉上现实期货的同时降低时间延误。

translated by 谷歌翻译

Evaluation of Machine Learning Techniques for Forecast Uncertainty Quantification

Maximiliano A. Sacco , Juan J. Ruiz , Manuel Pulido , Pierre Tandeo

分类：机器学习 | 人工智能

2021-11-29

生产精确的天气预报和不确定的不确定性的可靠量化是一个开放的科学挑战。到目前为止，集团预测是最成功的方法，以产生相关预测的方法以及估计其不确定性。集合预测的主要局限性是高计算成本，难以捕获和量化不同的不确定性来源，特别是与模型误差相关的源。在这项工作中，进行概念证据模型实验，以检查培训的ANN的性能，以预测系统的校正状态和使用单个确定性预测作为输入的状态不确定性。我们比较不同的培训策略：一个基于使用集合预测的平均值和传播作为目标的直接培训，另一个依赖于使用确定性预测作为目标的决定性预测，其中来自数据隐含地学习不确定性。对于最后一种方法，提出和评估了两个替代损失函数，基于数据观察似然和基于误差的本地估计来评估另一个丢失功能。在不同的交货时间和方案中检查网络的性能，在没有模型错误的情况下。使用Lorenz'96模型的实验表明，ANNS能够模拟集合预测的一些属性，如最不可预测模式的过滤和预测不确定性的状态相关量化。此外，ANNS提供了在模型误差存在下的预测不确定性的可靠估计。

translated by 谷歌翻译

Machine Learning for Postprocessing Medium-range Ensemble Streamflow Forecasts

Sanjib Sharma , Ganesh Raj Ghimire , Ridwan Siddique

分类：机器学习

2021-06-15

熟练的水流预测可以为水政策和管理各个领域的决策提供信息。我们集成了数值天气预测集合和分布式水文模型，以在中范围的交货时间（1-7天）下生成集合流量预测。我们展示了一项用于在美国东部的Susquehanna河流盆地的后处理过程中进行机器学习应用的案例研究。为了进行预测验证，我们使用不同的指标，例如技能得分和可靠性图，以提前时间，流量阈值和季节为条件。验证结果表明，机器学习后处理器可以改善相对于低复杂性预测（例如气候和时间持久性）以及确定性和原始集合预测的水流预测。与原始合奏相比，与较短的交货时间相比，在中等时间表的相对增益在后期时间表通常更高。与低压流相比，高流量和与凉爽的流量相比。总体而言，我们的结果突出了机器学习在许多方面的好处，以提高流量预测的技能和可靠性。

translated by 谷歌翻译

Probabilistic forecasts of extreme heatwaves using convolutional neural networks in a regime of lack of data

George Miloshevich , Bastien Cozian , Patrice Abry , Pierre Borgnat , Freddy Bouchet

分类：机器学习

2022-08-01

了解极端事件及其可能性是研究气候变化影响，风险评估，适应和保护生物的关键。在这项工作中，我们开发了一种方法来构建极端热浪的预测模型。这些模型基于卷积神经网络，对极长的8，000年气候模型输出进行了培训。由于极端事件之间的关系本质上是概率的，因此我们强调概率预测和验证。我们证明，深度神经网络适用于法国持续持续14天的热浪，快速动态驱动器提前15天（500 hpa地球电位高度场），并且在慢速较长的交货时间内，慢速物理时间驱动器（土壤水分）。该方法很容易实现和通用。我们发现，深神经网络选择了与北半球波数字3模式相关的极端热浪。我们发现，当将2米温度场添加到500 HPA地球电位高度和土壤水分场中时，2米温度场不包含任何新的有用统计信息。主要的科学信息是，训练深层神经网络预测极端热浪的发生是在严重缺乏数据的情况下发生的。我们建议大多数其他应用在大规模的大气和气候现象中都是如此。我们讨论了处理缺乏数据制度的观点，例如罕见的事件模拟，以及转移学习如何在后一种任务中发挥作用。

translated by 谷歌翻译

Simultaneous Multivariate Forecast of Space Weather Indices using Deep Neural Network Ensembles

Bernard Benson , Edward Brown , Stefano Bonasera , Giacomo Acciarini , Jorge A. Pérez-Hernández , Eric Sutton , Moriba K. Jah , Christopher Bridges , Meng Jin , Atılım Güneş Baydin

分类：机器学习

2021-12-16

太阳能无线电通量以及地磁指数是太阳能活动的重要指标及其效果。耀斑和地磁风暴等极端太阳能事件可能对低地轨道中的卫星的空间环境产生负面影响。因此，预测这些空间天气指数在太空运营和科学方面具有重要意义。在这项研究中，我们提出了一种基于长期短期内存神经网络的模型，以了解时间序列数据的分布，以便使用时间序列以及太阳能图像提供空间天气指标的同时多元27天预测数据。我们展示了30-40 \％的根均方误差改进了，而仅包括使用时间序列数据的太阳能图像数据，而单独使用时间序列数据。与训练有素的深神经网络模型相比，诸如持久性和运行平均预测之类的简单基线也将与训练有素的深神经网络模型进行比较。我们还使用模型集合量化我们预测中的不确定性。

translated by 谷歌翻译

A Concurrent CNN-RNN Approach for Multi-Step Wind Power Forecasting

Syed Kazmi , Berk Gorgulu , Mucahit Cevik , Mustafa Gokce Baydogan

分类：机器学习

2023-01-02

Wind power forecasting helps with the planning for the power systems by contributing to having a higher level of certainty in decision-making. Due to the randomness inherent to meteorological events (e.g., wind speeds), making highly accurate long-term predictions for wind power can be extremely difficult. One approach to remedy this challenge is to utilize weather information from multiple points across a geographical grid to obtain a holistic view of the wind patterns, along with temporal information from the previous power outputs of the wind farms. Our proposed CNN-RNN architecture combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to extract spatial and temporal information from multi-dimensional input data to make day-ahead predictions. In this regard, our method incorporates an ultra-wide learning view, combining data from multiple numerical weather prediction models, wind farms, and geographical locations. Additionally, we experiment with global forecasting approaches to understand the impact of training the same model over the datasets obtained from multiple different wind farms, and we employ a method where spatial information extracted from convolutional layers is passed to a tree ensemble (e.g., Light Gradient Boosting Machine (LGBM)) instead of fully connected layers. The results show that our proposed CNN-RNN architecture outperforms other models such as LGBM, Extra Tree regressor and linear regression when trained globally, but fails to replicate such performance when trained individually on each farm. We also observe that passing the spatial information from CNN to LGBM improves its performance, providing further evidence of CNN's spatial feature extraction capabilities.

translated by 谷歌翻译

A spatiotemporal machine learning approach to forecasting COVID-19 incidence at the county level in the United States

Benjamin Lucas , Behzad Vahedi , Morteza Karimzadeh

分类： (统计)机器学习 | 机器学习

2021-09-24

随着Covid-19影响每个国家的全球和改变日常生活，预测疾病的传播的能力比任何先前的流行病更重要。常规的疾病 - 展开建模方法，隔间模型，基于对病毒的扩散的时空均匀性的假设，这可能导致预测到欠低，特别是在高空间分辨率下。本文采用替代技术 - 时空机器学习方法。我们提出了Covid-LSTM，一种基于长期短期内存深度学习架构的数据驱动模型，用于预测Covid-19在美国县级的发病率。我们使用每周数量的新阳性案例作为时间输入，以及来自Facebook运动和连通数据集的手工工程空间特征，以捕捉时间和空间的疾病的传播。 Covid-LSTM在我们的17周的评估期间优于Covid-19预测集线器集合模型（CovidHub-Ensemble），使其首先比一个或多个预测期更准确的模型。在4周的预测地平线上，我们的型号平均每县平均50例比CovidHub-Ensemble更准确。我们强调，在Covid-19之前，在Covid-19之前的数据驱动预测的未充分利用疾病传播的预测可能是由于以前疾病缺乏足够的数据，除了最近的时尚预测方法的机器学习方法的进步。我们讨论了更广泛的数据驱动预测的障碍，以及将来将使用更多的基于学习的模型。

translated by 谷歌翻译

An ensemble neural network approach to forecast Dengue outbreak based on climatic condition

Madhurima Panja , Tanujit Chakraborty , Sk Shahid Nadim , Indrajit Ghosh , Uttam Kumar , Nan Liu

分类：机器学习

2022-12-16

Dengue fever is a virulent disease spreading over 100 tropical and subtropical countries in Africa, the Americas, and Asia. This arboviral disease affects around 400 million people globally, severely distressing the healthcare systems. The unavailability of a specific drug and ready-to-use vaccine makes the situation worse. Hence, policymakers must rely on early warning systems to control intervention-related decisions. Forecasts routinely provide critical information for dangerous epidemic events. However, the available forecasting models (e.g., weather-driven mechanistic, statistical time series, and machine learning models) lack a clear understanding of different components to improve prediction accuracy and often provide unstable and unreliable forecasts. This study proposes an ensemble wavelet neural network with exogenous factor(s) (XEWNet) model that can produce reliable estimates for dengue outbreak prediction for three geographical regions, namely San Juan, Iquitos, and Ahmedabad. The proposed XEWNet model is flexible and can easily incorporate exogenous climate variable(s) confirmed by statistical causality tests in its scalable framework. The proposed model is an integrated approach that uses wavelet transformation into an ensemble neural network framework that helps in generating more reliable long-term forecasts. The proposed XEWNet allows complex non-linear relationships between the dengue incidence cases and rainfall; however, mathematically interpretable, fast in execution, and easily comprehensible. The proposal's competitiveness is measured using computational experiments based on various statistical metrics and several statistical comparison tests. In comparison with statistical, machine learning, and deep learning methods, our proposed XEWNet performs better in 75% of the cases for short-term and long-term forecasting of dengue incidence.

translated by 谷歌翻译

Omnivision forecasting: combining satellite observations with sky images for improved intra-hour solar energy predictions

Quentin Paletta , Guillaume Arbod , Joan Lasenby

分类：计算机视觉 | 人工智能

2022-06-07

将间歇性可再生能源集成到大量的电网中是具有挑战性的。旨在解决这一困难的建立良好的方法涉及即将到来的能源供应可变性以适应电网的响应。在太阳能中，可以在全天空摄像机（前方30分钟）和卫星观测（提前6小时）的不同时间尺度上预测由遮挡云引起的短期变化。在这项研究中，我们将这两种互补的观点集成到单个机器学习框架中的云覆盖物上，以改善时间内（最高60分钟）的辐照度预测。确定性和概率预测均在不同的天气条件（晴朗，多云，阴天）以及不同的输入配置（天空图像，卫星观测和/或过去的辐照度值）中进行评估。我们的结果表明，混合模型在晴朗的条件下有益于预测，并改善了长期预测。这项研究为将来的新颖方法奠定了基础，即在单个学习框架中将天空图像和卫星观测结合起来，以推动太阳现象。

translated by 谷歌翻译

Flood forecasting with machine learning models in an operational framework

Sella Nevo , Efrat Morin , Adi Gerzi Rosenthal , Asher Metzger , Chen Barshai , Dana Weitzner , Dafi Voloshin , Frederik Kratzert , Gal Elidan , Gideon Dror

分类：机器学习

2021-11-04

谷歌的运营洪水预测系统是制定的，为机构和公众提供准确的实时洪水警告，重点是河流洪水在大型潮流的河流中。它在2018年开始运作，自从地理位置扩展以来。该预测系统由四个子系统组成：数据验证，阶段预测，淹没建模和警报分配。机器学习用于两个子系统。阶段预测采用长短期内存（LSTM）网络和线性模型进行建模。使用阈值和歧管模型计算洪水淹没，前者计算淹没程度，后者计算淹没程度和深度。本文首次提供的歧管模型提供了一种机器学习替代洪水淹没的液压建模。在评估历史数据时，所有型号都可以实现可操作使用的足够高的度量指标。 LSTM表现出比线性模型更高的技能，而阈值和歧管模型达到了类似的性能度量，以便在淹没程度上进行建模。在2021年的季风季节期间，洪水预警系统在印度和孟加拉国运营，覆盖河流的洪水区，总面积287,000平方公里，拥有350多万人。超过100米的洪水警报被发送给受影响的人口，相关当局以及紧急组织。系统上的当前和未来的工作包括将覆盖范围扩展到额外的洪水易发位置，以及提高建模能力和准确性。

translated by 谷歌翻译

An Interpretable Probabilistic Model for Short-Term Solar Power Forecasting Using Natural Gradient Boosting

Georgios Mitrentsis , Hendrik Lens

分类：机器学习

2021-08-05

PV power forecasting models are predominantly based on machine learning algorithms which do not provide any insight into or explanation about their predictions (black boxes). Therefore, their direct implementation in environments where transparency is required, and the trust associated with their predictions may be questioned. To this end, we propose a two stage probabilistic forecasting framework able to generate highly accurate, reliable, and sharp forecasts yet offering full transparency on both the point forecasts and the prediction intervals (PIs). In the first stage, we exploit natural gradient boosting (NGBoost) for yielding probabilistic forecasts, while in the second stage, we calculate the Shapley additive explanation (SHAP) values in order to fully comprehend why a prediction was made. To highlight the performance and the applicability of the proposed framework, real data from two PV parks located in Southern Germany are employed. Comparative results with two state-of-the-art algorithms, namely Gaussian process and lower upper bound estimation, manifest a significant increase in the point forecast accuracy and in the overall probabilistic performance. Most importantly, a detailed analysis of the model's complex nonlinear relationships and interaction effects between the various features is presented. This allows interpreting the model, identifying some learned physical properties, explaining individual predictions, reducing the computational requirements for the training without jeopardizing the model accuracy, detecting possible bugs, and gaining trust in the model. Finally, we conclude that the model was able to develop complex nonlinear relationships which follow known physical properties as well as human logic and intuition.

translated by 谷歌翻译

Statistical Post-Processing for Gridded Temperature Prediction Using Encoder-Decoder-Based Deep Convolutional Neural Networks

Atsushi Kudo

分类：机器学习

2021-03-02

日本气象学机构经营着网格的温度指导，以预测二维降雪量和降水类型，例如雨雪，因为表面温度是预测它们的关键元素之一。操作温度引导基于卡尔曼滤波器，该滤波器使用温度观察和数值天气预报（NWP）仅在观察部位周围输出。当NWP模型错误地预测前部的位置或观察到的温度非常冷或热时，纠正温度场。在这项研究中，已经提出了一种基于编码器的解码器的卷积神经网络，以预测日本的康托地区周围的表面上的包装温度。验证结果表明，该模型大大提高了运营指导，可以纠正NWP模型偏差，如前方和极端温度的位置误差。

translated by 谷歌翻译

Hurricane Forecasting: A Novel Multimodal Machine Learning Framework

Léonard Boussioux , Cynthia Zeng , Théo Guénais , Dimitris Bertsimas

分类：机器学习 | 人工智能

2020-11-11

本文描述了一个新颖的机器学习（ML）框架，用于热带气旋强度和轨道预测，结合了多种ML技术并利用了多种数据源。我们的多模式框架（称为Hurricast）有效地结合了时空数据和统计数据，通过提取具有深度学习的编码器编码器体系结构的特征，并通过梯度增强的树进行预测。我们在2016 - 2019年在北大西洋和东太平洋盆地进行了24小时的提前时间和强度预测，评估我们的模型，并表明它们在秒内计算时达到了当前操作预测模型的可比平均绝对误差和技能。此外，将飓风纳入运营预测的共识模型可以改善国家飓风中心的官方预测，从而通过现有方法突出显示互补物业。总而言之，我们的工作表明，利用机器学习技术结合不同的数据源可以带来热带气旋预测的新机会。

translated by 谷歌翻译

An Extreme-Adaptive Time Series Prediction Model Based on Probability-Enhanced LSTM Neural Networks

Yanhong Li , Jack Xu , David C. Anastasiu

分类：机器学习 | 人工智能

2022-11-29

Forecasting time series with extreme events has been a challenging and prevalent research topic, especially when the time series data are affected by complicated uncertain factors, such as is the case in hydrologic prediction. Diverse traditional and deep learning models have been applied to discover the nonlinear relationships and recognize the complex patterns in these types of data. However, existing methods usually ignore the negative influence of imbalanced data, or severe events, on model training. Moreover, methods are usually evaluated on a small number of generally well-behaved time series, which does not show their ability to generalize. To tackle these issues, we propose a novel probability-enhanced neural network model, called NEC+, which concurrently learns extreme and normal prediction functions and a way to choose among them via selective back propagation. We evaluate the proposed model on the difficult 3-day ahead hourly water level prediction task applied to 9 reservoirs in California. Experimental results demonstrate that the proposed model significantly outperforms state-of-the-art baselines and exhibits superior generalization ability on data with diverse distributions.

translated by 谷歌翻译