智能论文笔记

Long-term hail risk assessment with deep neural networks

Ivan Lukyanenko , Mikhail Mozikov , Yury Maximov , Ilya Makarov

分类：机器学习

2022-08-31

冰雹风险评估对于估计和减少对农作物，果园和基础设施的破坏是必要的。此外，它有助于估计和减少企业，尤其是保险公司的损失。但是冰雹预测具有挑战性。用于此目的的设计模型的数据是树维的地理空间时间序列。关于可用数据集的分辨率，冰雹是一个非常本地的事件。同样，冰雹事件很少见 - 观测中只有1％的目标标记为“冰雹”。现象和短期冰雹预测的模型正在改善。将机器学习模型引入气象学领域并不是什么新鲜事。还有各种气候模型反映了未来气候变化的可能情况。但是，没有用于数据驱动的机器学习模型来预测给定区域的冰雹频率变化。后一项任务的第一种可能方法是忽略空间和时间结构，并开发一种能够将气象变量的给定垂直轮廓分类为有利于冰雹形成的模型。尽管这种方法肯定忽略了重要的信息，但它的加权非常轻，很容易扩展，因为它将观察值视为彼此独立的。更高级的方法是设计能够处理地理空间数据的神经网络。我们在这里的想法是将负责处理空间数据处理的卷积层与能够使用时间结构工作的复发神经网络块相结合。这项研究比较了两种方法，并引入了一个适合预测冰雹频率变化的任务的模型。

translated by 谷歌翻译

Benchmark Dataset for Precipitation Forecasting by Post-Processing the Numerical Weather Prediction

Taehyeon Kim , Namgyu Ho , Donggyu Kim , Se-Young Yun

分类：机器学习

2022-06-30

降水预测是一项重要的科学挑战，对社会产生广泛影响。从历史上看，这项挑战是使用数值天气预测（NWP）模型解决的，该模型基于基于物理的模拟。最近，许多作品提出了一种替代方法，使用端到端深度学习（DL）模型来替代基于物理的NWP。尽管这些DL方法显示出提高的性能和计算效率，但它们在长期预测中表现出局限性，并且缺乏NWP模型的解释性。在这项工作中，我们提出了一个混合NWP-DL工作流程，以填补独立NWP和DL方法之间的空白。在此工作流程下，NWP输出被馈入深层模型，该模型后处理数据以产生精致的降水预测。使用自动气象站（AWS）观测值作为地面真相标签，对深层模型进行了监督训练。这可以实现两全其美，甚至可以从NWP技术的未来改进中受益。为了促进朝这个方向进行研究，我们提出了一个专注于朝鲜半岛的新型数据集，该数据集称为KOMET（KOMEN（KOREA气象数据集），由NWP预测和AWS观察组成。对于NWP，我们使用全局数据同化和预测系统-KOREA集成模型（GDAPS-KIM）。

translated by 谷歌翻译

Agnostic Learning for Packing Machine Stoppage Prediction in Smart Factories

Gabriel Filios , Ioannis Katsidimas , Sotiris Nikoletseas , Stefanos H. Panagiotou , Theofanis P. Raptis

分类：机器学习

2022-12-12

The cyber-physical convergence is opening up new business opportunities for industrial operators. The need for deep integration of the cyber and the physical worlds establishes a rich business agenda towards consolidating new system and network engineering approaches. This revolution would not be possible without the rich and heterogeneous sources of data, as well as the ability of their intelligent exploitation, mainly due to the fact that data will serve as a fundamental resource to promote Industry 4.0. One of the most fruitful research and practice areas emerging from this data-rich, cyber-physical, smart factory environment is the data-driven process monitoring field, which applies machine learning methodologies to enable predictive maintenance applications. In this paper, we examine popular time series forecasting techniques as well as supervised machine learning algorithms in the applied context of Industry 4.0, by transforming and preprocessing the historical industrial dataset of a packing machine's operational state recordings (real data coming from the production line of a manufacturing plant from the food and beverage domain). In our methodology, we use only a single signal concerning the machine's operational status to make our predictions, without considering other operational variables or fault and warning signals, hence its characterization as ``agnostic''. In this respect, the results demonstrate that the adopted methods achieve a quite promising performance on three targeted use cases.

translated by 谷歌翻译

Skillful Twelve Hour Precipitation Forecasts using Large Context Neural Networks

Lasse Espeholt , Shreya Agrawal , Casper Sønderby , Manoj Kumar , Jonathan Heek , Carla Bromberg , Cenk Gazen , Jason Hickey , Aaron Bell , Nal Kalchbrenner

分类：机器学习

2021-11-14

由于其对人类生命，运输，粮食生产和能源管理的高度影响，因此在科学上研究了预测天气的问题。目前的运营预测模型基于物理学，并使用超级计算机来模拟大气预测，提前预测数小时和日期。更好的基于物理的预测需要改进模型本身，这可能是一个实质性的科学挑战，以及潜在的分辨率的改进，可以计算令人望而却步。基于神经网络的新出现的天气模型代表天气预报的范式转变：模型学习来自数据的所需变换，而不是依赖于手工编码的物理，并计算效率。然而，对于神经模型，每个额外的辐射时间都会构成大量挑战，因为它需要捕获更大的空间环境并增加预测的不确定性。在这项工作中，我们提出了一个神经网络，能够提前十二小时的大规模降水预测，并且从相同的大气状态开始，该模型能够比最先进的基于物理的模型更高的技能HRRR和HREF目前在美国大陆运营。可解释性分析加强了模型学会模拟先进物理原则的观察。这些结果代表了建立与神经网络有效预测的新范式的实质性步骤。

translated by 谷歌翻译

Seamless lightning nowcasting with recurrent-convolutional deep learning

Jussi Leinonen , Ulrich Hamann , Urs Germann

分类：机器学习

2022-03-15

提出了一个深度学习模型，以便在未来60分钟的五分钟时间分辨率下以闪电的形式出现。该模型基于反复横向的结构，该结构使其能够识别并预测对流的时空发展，包括雷暴细胞的运动，生长和衰变。预测是在固定网格上执行的，而无需使用风暴对象检测和跟踪。从瑞士和周围的区域收集的输入数据包括地面雷达数据，可见/红外卫星数据以及衍生的云产品，闪电检测，数值天气预测和数字高程模型数据。我们分析了不同的替代损失功能，班级加权策略和模型特征，为将来的研究提供了指南，以最佳地选择损失功能，并正确校准其模型的概率预测。基于这些分析，我们在这项研究中使用焦点损失，但得出结论，它仅在交叉熵方面提供了较小的好处，如果模型的重新校准不实用，这是一个可行的选择。该模型在60分钟的现有周期内实现了0.45的像素临界成功指数（CSI）为0.45，以预测8 km的闪电发生，范围从5分钟的CSI到5分钟的提前时间到CSI到CSI的0.32在A处。收货时间60分钟。

translated by 谷歌翻译

A Generative Deep Learning Approach to Stochastic Downscaling of Precipitation Forecasts

Lucy Harris , Andrew T. T. McRae , Matthew Chantry , Peter D. Dueben , Tim N. Palmer

分类：人工智能 | 计算机视觉 | 机器学习 | (统计)机器学习

2022-04-05

尽管有持续的改进，但降水预测仍然没有其他气象变量的准确和可靠。造成这种情况的一个主要因素是，几个影响降水分布和强度的关键过程出现在全球天气模型的解决规模以下。计算机视觉社区已经证明了生成的对抗网络（GAN）在超分辨率问题上取得了成功，即学习为粗图像添加精细的结构。 Leinonen等。（2020年）先前使用GAN来产生重建的高分辨率大气场的集合，并给定较粗糙的输入数据。在本文中，我们证明了这种方法可以扩展到更具挑战性的问题，即通过使用高分辨率雷达测量值作为“地面真相”来提高天气预报模型中相对低分辨率输入的准确性和分辨率。神经网络必须学会添加分辨率和结构，同时考虑不可忽略的预测错误。我们表明，甘斯和vae-gan可以在创建高分辨率的空间相干降水图的同时，可以匹配最新的后处理方法的统计特性。我们的模型比较比较与像素和合并的CRP分数，功率谱信息和等级直方图（用于评估校准）的最佳现有缩减方法。我们测试了我们的模型，并表明它们在各种场景中的表现，包括大雨。

translated by 谷歌翻译

Learning to forecast vegetation greenness at fine resolution over Africa with ConvLSTMs

Claire Robin , Christian Requena-Mesa , Vitus Benson , Lazaro Alonso , Jeran Poehls , Nuno Carvalhais , Markus Reichstein

分类：机器学习 | 计算机视觉

2022-10-24

Forecasting the state of vegetation in response to climate and weather events is a major challenge. Its implementation will prove crucial in predicting crop yield, forest damage, or more generally the impact on ecosystems services relevant for socio-economic functioning, which if absent can lead to humanitarian disasters. Vegetation status depends on weather and environmental conditions that modulate complex ecological processes taking place at several timescales. Interactions between vegetation and different environmental drivers express responses at instantaneous but also time-lagged effects, often showing an emerging spatial context at landscape and regional scales. We formulate the land surface forecasting task as a strongly guided video prediction task where the objective is to forecast the vegetation developing at very fine resolution using topography and weather variables to guide the prediction. We use a Convolutional LSTM (ConvLSTM) architecture to address this task and predict changes in the vegetation state in Africa using Sentinel-2 satellite NDVI, having ERA5 weather reanalysis, SMAP satellite measurements, and topography (DEM of SRTMv4.1) as variables to guide the prediction. Ours results highlight how ConvLSTM models can not only forecast the seasonal evolution of NDVI at high resolution, but also the differential impacts of weather anomalies over the baselines. The model is able to predict different vegetation types, even those with very high NDVI variability during target length, which is promising to support anticipatory actions in the context of drought-related disasters.

translated by 谷歌翻译

A Concurrent CNN-RNN Approach for Multi-Step Wind Power Forecasting

Syed Kazmi , Berk Gorgulu , Mucahit Cevik , Mustafa Gokce Baydogan

分类：机器学习

2023-01-02

Wind power forecasting helps with the planning for the power systems by contributing to having a higher level of certainty in decision-making. Due to the randomness inherent to meteorological events (e.g., wind speeds), making highly accurate long-term predictions for wind power can be extremely difficult. One approach to remedy this challenge is to utilize weather information from multiple points across a geographical grid to obtain a holistic view of the wind patterns, along with temporal information from the previous power outputs of the wind farms. Our proposed CNN-RNN architecture combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to extract spatial and temporal information from multi-dimensional input data to make day-ahead predictions. In this regard, our method incorporates an ultra-wide learning view, combining data from multiple numerical weather prediction models, wind farms, and geographical locations. Additionally, we experiment with global forecasting approaches to understand the impact of training the same model over the datasets obtained from multiple different wind farms, and we employ a method where spatial information extracted from convolutional layers is passed to a tree ensemble (e.g., Light Gradient Boosting Machine (LGBM)) instead of fully connected layers. The results show that our proposed CNN-RNN architecture outperforms other models such as LGBM, Extra Tree regressor and linear regression when trained globally, but fails to replicate such performance when trained individually on each farm. We also observe that passing the spatial information from CNN to LGBM improves its performance, providing further evidence of CNN's spatial feature extraction capabilities.

translated by 谷歌翻译

Flood forecasting with machine learning models in an operational framework

Sella Nevo , Efrat Morin , Adi Gerzi Rosenthal , Asher Metzger , Chen Barshai , Dana Weitzner , Dafi Voloshin , Frederik Kratzert , Gal Elidan , Gideon Dror

分类：机器学习

2021-11-04

谷歌的运营洪水预测系统是制定的，为机构和公众提供准确的实时洪水警告，重点是河流洪水在大型潮流的河流中。它在2018年开始运作，自从地理位置扩展以来。该预测系统由四个子系统组成：数据验证，阶段预测，淹没建模和警报分配。机器学习用于两个子系统。阶段预测采用长短期内存（LSTM）网络和线性模型进行建模。使用阈值和歧管模型计算洪水淹没，前者计算淹没程度，后者计算淹没程度和深度。本文首次提供的歧管模型提供了一种机器学习替代洪水淹没的液压建模。在评估历史数据时，所有型号都可以实现可操作使用的足够高的度量指标。 LSTM表现出比线性模型更高的技能，而阈值和歧管模型达到了类似的性能度量，以便在淹没程度上进行建模。在2021年的季风季节期间，洪水预警系统在印度和孟加拉国运营，覆盖河流的洪水区，总面积287,000平方公里，拥有350多万人。超过100米的洪水警报被发送给受影响的人口，相关当局以及紧急组织。系统上的当前和未来的工作包括将覆盖范围扩展到额外的洪水易发位置，以及提高建模能力和准确性。

translated by 谷歌翻译

Probabilistic forecasts of extreme heatwaves using convolutional neural networks in a regime of lack of data

George Miloshevich , Bastien Cozian , Patrice Abry , Pierre Borgnat , Freddy Bouchet

分类：机器学习

2022-08-01

了解极端事件及其可能性是研究气候变化影响，风险评估，适应和保护生物的关键。在这项工作中，我们开发了一种方法来构建极端热浪的预测模型。这些模型基于卷积神经网络，对极长的8，000年气候模型输出进行了培训。由于极端事件之间的关系本质上是概率的，因此我们强调概率预测和验证。我们证明，深度神经网络适用于法国持续持续14天的热浪，快速动态驱动器提前15天（500 hpa地球电位高度场），并且在慢速较长的交货时间内，慢速物理时间驱动器（土壤水分）。该方法很容易实现和通用。我们发现，深神经网络选择了与北半球波数字3模式相关的极端热浪。我们发现，当将2米温度场添加到500 HPA地球电位高度和土壤水分场中时，2米温度场不包含任何新的有用统计信息。主要的科学信息是，训练深层神经网络预测极端热浪的发生是在严重缺乏数据的情况下发生的。我们建议大多数其他应用在大规模的大气和气候现象中都是如此。我们讨论了处理缺乏数据制度的观点，例如罕见的事件模拟，以及转移学习如何在后一种任务中发挥作用。

translated by 谷歌翻译

Deep Learning-based Extreme Heatwave Forecast

Valérian Jacques-Dumas , Francesco Ragone , Pierre Borgnat , Patrice Abry , Freddy Bouchet

分类：机器学习

2021-03-17

由于极端热波和热圆顶对社会和生物多样性的影响，他们的研究是一个关键挑战。我们专门研究了持久的极端热浪，这是气候影响最重要的热潮。物理驱动天气预报系统或气候模型可用于预测其发生或预测其概率。目前的工作探讨了使用深度学习架构的使用，使用气候模型的输出训练，作为预测极端持久热浪的发生的替代策略。这种新方法将对包括气候模型统计数据研究的几个关键科学目标，建立了对气候模型中罕见事件的定量代理，研究了气候变化的影响，并最终应对预测有用。履行这些重要目标意味着解决与罕见事件预测有本质相关的类大小不平衡的问题，评估转移学习的潜在好处，以解决极端事件的嵌套性质（自然包含在不太极端的情况下）。我们训练一个卷积神经网络，使用1000年的气候模型产出，具有大级欠采样和转移学习。从观察到的表面温度和500 HPA地球态高度场的快照，训练有素的网络在预测持久的极端热浪的发生时实现了显着性能。我们能够以三种不同的强度预测它们，早在活动开始前15天（事件结束前30天）。

translated by 谷歌翻译

Deep Learning Methods for Daily Wildfire Danger Forecasting

Ioannis Prapas , Spyros Kondylatos , Ioannis Papoutsis , Gustau Camps-Valls , Michele Ronco , Miguel-Ángel Fernández-Torres , Maria Piles Guillem , Nuno Carvalhais

分类：机器学习 | 人工智能 | 计算机视觉

2021-11-04

野火预测对于减少灾害风险和环境可持续性至关重要。我们将每日火灾危险预测作为机器学习任务，使用过去十年来预测下一天的火灾危险。为此，我们收集，预先处理和协调开放式DataCube，其中包括一组协变量，共同影响火灾发生和传播，例如天气条件，卫星衍生的产品，与人类活动相关的地形特征和变量。我们实施各种深度学习（DL）模型，以捕获空间，时间或时空上下文，并将它们与随机林（RF）基线进行比较。我们发现空间或时间上下文足以超越RF，而利用时空上下文的Convlstm在接收器的操作特性为0.926的接收器下的测试区域最佳地执行。我们基于DL的概念证明提供了全国范围的日常火灾危险地图，其空间分辨率高于现有的运营解决方案。

translated by 谷歌翻译

Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

Kaifeng Bi , Lingxi Xie , Hengheng Zhang , Xin Chen , Xiaotao Gu , Qi Tian

分类：人工智能 | 计算机视觉 | 机器学习

2022-11-03

In this paper, we present Pangu-Weather, a deep learning based system for fast and accurate global weather forecast. For this purpose, we establish a data-driven environment by downloading $43$ years of hourly global weather data from the 5th generation of ECMWF reanalysis (ERA5) data and train a few deep neural networks with about $256$ million parameters in total. The spatial resolution of forecast is $0.25^\circ\times0.25^\circ$, comparable to the ECMWF Integrated Forecast Systems (IFS). More importantly, for the first time, an AI-based method outperforms state-of-the-art numerical weather prediction (NWP) methods in terms of accuracy (latitude-weighted RMSE and ACC) of all factors (e.g., geopotential, specific humidity, wind speed, temperature, etc.) and in all time ranges (from one hour to one week). There are two key strategies to improve the prediction accuracy: (i) designing a 3D Earth Specific Transformer (3DEST) architecture that formulates the height (pressure level) information into cubic data, and (ii) applying a hierarchical temporal aggregation algorithm to alleviate cumulative forecast errors. In deterministic forecast, Pangu-Weather shows great advantages for short to medium-range forecast (i.e., forecast time ranges from one hour to one week). Pangu-Weather supports a wide range of downstream forecast scenarios, including extreme weather forecast (e.g., tropical cyclone tracking) and large-member ensemble forecast in real-time. Pangu-Weather not only ends the debate on whether AI-based methods can surpass conventional NWP methods, but also reveals novel directions for improving deep learning weather forecast systems.

translated by 谷歌翻译

Fuzzy clustering for the within-season estimation of cotton phenology

Vasileios Sitokonstantinou , Alkiviadis Koukos , Ilias Tsoumas , Nikolaos S. Bartsotas , Charalampos Kontoes , Vassilia Karathanassi

分类：机器学习

2022-11-25

Crop phenology is crucial information for crop yield estimation and agricultural management. Traditionally, phenology has been observed from the ground; however Earth observation, weather and soil data have been used to capture the physiological growth of crops. In this work, we propose a new approach for the within-season phenology estimation for cotton at the field level. For this, we exploit a variety of Earth observation vegetation indices (derived from Sentinel-2) and numerical simulations of atmospheric and soil parameters. Our method is unsupervised to address the ever-present problem of sparse and scarce ground truth data that makes most supervised alternatives impractical in real-world scenarios. We applied fuzzy c-means clustering to identify the principal phenological stages of cotton and then used the cluster membership weights to further predict the transitional phases between adjacent stages. In order to evaluate our models, we collected 1,285 crop growth ground observations in Orchomenos, Greece. We introduced a new collection protocol, assigning up to two phenology labels that represent the primary and secondary growth stage in the field and thus indicate when stages are transitioning. Our model was tested against a baseline model that allowed to isolate the random agreement and evaluate its true competence. The results showed that our model considerably outperforms the baseline one, which is promising considering the unsupervised nature of the approach. The limitations and the relevant future work are thoroughly discussed. The ground observations are formatted in an ready-to-use dataset and will be available at https://github.com/Agri-Hub/cotton-phenology-dataset upon publication.

translated by 谷歌翻译

Location-aware Adaptive Denormalization: A Deep Learning Approach For Wildfire Danger Forecasting

Mohamad Hakam Shams Eddin , Ribana Roscher , Juergen Gall

分类：计算机视觉

2022-12-16

Climate change is expected to intensify and increase extreme events in the weather cycle. Since this has a significant impact on various sectors of our life, recent works are concerned with identifying and predicting such extreme events from Earth observations. This paper proposes a 2D/3D two-branch convolutional neural network (CNN) for wildfire danger forecasting. To use a unified framework, previous approaches duplicate static variables along the time dimension and neglect the intrinsic differences between static and dynamic variables. Furthermore, most existing multi-branch architectures lose the interconnections between the branches during the feature learning stage. To address these issues, we propose a two-branch architecture with a Location-aware Adaptive Denormalization layer (LOADE). Using LOADE as a building block, we can modulate the dynamic features conditional on their geographical location. Thus, our approach considers feature properties as a unified yet compound 2D/3D model. Besides, we propose using an absolute temporal encoding for time-related forecasting problems. Our experimental results show a better performance of our approach than other baselines on the challenging FireCube dataset.

translated by 谷歌翻译

An Extreme-Adaptive Time Series Prediction Model Based on Probability-Enhanced LSTM Neural Networks

Yanhong Li , Jack Xu , David C. Anastasiu

分类：机器学习 | 人工智能

2022-11-29

Forecasting time series with extreme events has been a challenging and prevalent research topic, especially when the time series data are affected by complicated uncertain factors, such as is the case in hydrologic prediction. Diverse traditional and deep learning models have been applied to discover the nonlinear relationships and recognize the complex patterns in these types of data. However, existing methods usually ignore the negative influence of imbalanced data, or severe events, on model training. Moreover, methods are usually evaluated on a small number of generally well-behaved time series, which does not show their ability to generalize. To tackle these issues, we propose a novel probability-enhanced neural network model, called NEC+, which concurrently learns extreme and normal prediction functions and a way to choose among them via selective back propagation. We evaluate the proposed model on the difficult 3-day ahead hourly water level prediction task applied to 9 reservoirs in California. Experimental results demonstrate that the proposed model significantly outperforms state-of-the-art baselines and exhibits superior generalization ability on data with diverse distributions.

translated by 谷歌翻译

A Moment in the Sun: Solar Nowcasting from Multispectral Satellite Data using Self-Supervised Learning

Akansha Singh Bansal , Trapit Bansal , David Irwin

分类：机器学习 | 人工智能 | 计算机视觉

2021-12-28

太阳能现在是历史上最便宜的电力形式。不幸的是，由于其变异性，显着提高栅格的太阳能的一部分仍然具有挑战性，这使得电力的供需平衡更加困难。虽然热发电机坡度 - 它们可以改变输出的最高速率 - 是有限的，太阳能的坡度基本上是无限的。因此，准确的近期太阳能预测或垂圈，对于提供预警来调整热发电机输出，以响应于太阳能变化来调整热发电机，以确保平衡供需。为了解决问题，本文开发了使用自我监督学习的丰富和易于使用的多光谱卫星数据的太阳能垂圈的一般模型。具体而言，我们使用卷积神经网络（CNN）和长短期内存网络（LSTM）开发深度自动回归模型，这些模型在多个位置训练全球培训，以预测最近推出的最近收集的时空数据的未来观察-R系列卫星。我们的模型估计了基于卫星观测的未来的太阳辐照度，我们向较小的场地特定的太阳能数据培训的回归模型提供，以提供近期太阳能光伏（PV）预测，其考虑了现场特征的特征。我们评估了我们在25个太阳能场所的不同覆盖区域和预测视野的方法，并表明我们的方法利用地面真理观察结果产生靠近模型的错误。

translated by 谷歌翻译

Prediction of Solar Radiation Based on Spatial and Temporal Embeddings for Solar Generation Forecast

Mohammad Alqudah , Tatjana Dokic , Mladen Kezunovic , Zoran Obradovic

分类：机器学习

2022-06-17

提出了一种使用天气数据实时太阳生成预测的新方法，同时提出了既有空间结构依赖性的依赖。随着时间的推移，观察到的网络被预测到较低维度的表示，在该表示的情况下，在推理阶段使用天气预报时，使用各种天气测量来训练结构化回归模型。从国家太阳辐射数据库获得的德克萨斯州圣安东尼奥地区的288个地点进行了实验。该模型预测具有良好精度的太阳辐照度（夏季R2 0.91，冬季为0.85，全球模型为0.89）。随机森林回归者获得了最佳准确性。进行了多个实验来表征缺失数据的影响和不同的时间范围的影响，这些范围提供了证据表明，新算法不仅在随机的情况下，而且在机制是空间和时间上都丢失的数据是可靠的。

translated by 谷歌翻译

ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast

Saleh Ashkboos , Langwen Huang , Nikoli Dryden , Tal Ben-Nun , Peter Dueben , Lukas Gianinazzi , Luca Kummer , Torsten Hoefler

分类：机器学习

2022-06-29

后处理整体预测系统可以改善天气预报，尤其是对于极端事件预测。近年来，已经开发出不同的机器学习模型来提高后处理步骤的质量。但是，这些模型在很大程度上依赖数据并生成此类合奏成员需要以高计算成本的数值天气预测模型进行多次运行。本文介绍了ENS-10数据集，由十个合奏成员组成，分布在20年中（1998-2017）。合奏成员是通过扰动数值天气模拟来捕获地球的混乱行为而产生的。为了代表大气的三维状态，ENS-10在11个不同的压力水平以及0.5度分辨率的表面中提供了最相关的大气变量。该数据集以48小时的交货时间针对预测校正任务，这实质上是通过消除合奏成员的偏见来改善预测质量。为此，ENS-10为预测交货时间t = 0、24和48小时（每周两个数据点）提供了天气变量。我们在ENS-10上为此任务提供了一组基线，并比较了它们在纠正不同天气变量预测时的性能。我们还评估了使用数据集预测极端事件的基准。 ENS-10数据集可在创意共享归因4.0国际（CC By 4.0）许可下获得。

translated by 谷歌翻译

Improving debris flow evacuation alerts in Taiwan using machine learning

Yi-Lin Tsai , Jeremy Irvin , Suhas Chundi , João Estacio Gaspar Araujo , Andrew Y. Ng , Christopher B. Field , Peter K. Kitanidis

分类：机器学习 | 人工智能

2022-08-27

台湾对全球碎片流的敏感性和死亡人数最高。台湾现有的碎屑流警告系统，该系统使用降雨量的时间加权度量，当该措施超过预定义的阈值时，会导致警报。但是，该系统会产生许多错误的警报，并错过了实际碎屑流的很大一部分。为了改善该系统，我们实施了五个机器学习模型，以输入历史降雨数据并预测是否会在选定的时间内发生碎屑流。我们发现，随机的森林模型在五个模型中表现最好，并优于台湾现有系统。此外，我们确定了与碎屑流的发生密切相关的降雨轨迹，并探索了缺失碎屑流的风险与频繁的虚假警报之间的权衡。这些结果表明，仅在小时降雨数据中训练的机器学习模型的潜力可以挽救生命，同时减少虚假警报。

translated by 谷歌翻译