智能论文笔记

Multi-Dimensional Self Attention based Approach for Remaining Useful Life Estimation

Zhi Lai , Mengjuan Liu , Yunzhu Pan , Dajiang Chen

分类：机器学习 | (统计)机器学习

2022-12-12

Remaining Useful Life (RUL) estimation plays a critical role in Prognostics and Health Management (PHM). Traditional machine health maintenance systems are often costly, requiring sufficient prior expertise, and are difficult to fit into highly complex and changing industrial scenarios. With the widespread deployment of sensors on industrial equipment, building the Industrial Internet of Things (IIoT) to interconnect these devices has become an inexorable trend in the development of the digital factory. Using the device's real-time operational data collected by IIoT to get the estimated RUL through the RUL prediction algorithm, the PHM system can develop proactive maintenance measures for the device, thus, reducing maintenance costs and decreasing failure times during operation. This paper carries out research into the remaining useful life prediction model for multi-sensor devices in the IIoT scenario. We investigated the mainstream RUL prediction models and summarized the basic steps of RUL prediction modeling in this scenario. On this basis, a data-driven approach for RUL estimation is proposed in this paper. It employs a Multi-Head Attention Mechanism to fuse the multi-dimensional time-series data output from multiple sensors, in which the attention on features is used to capture the interactions between features and attention on sequences is used to learn the weights of time steps. Then, the Long Short-Term Memory Network is applied to learn the features of time series. We evaluate the proposed model on two benchmark datasets (C-MAPSS and PHM08), and the results demonstrate that it outperforms the state-of-art models. Moreover, through the interpretability of the multi-head attention mechanism, the proposed model can provide a preliminary explanation of engine degradation. Therefore, this approach is promising for predictive maintenance in IIoT scenarios.

translated by 谷歌翻译

A novel time-frequency Transformer based on self-attention mechanism and its application in fault diagnosis of rolling bearings

Yifei Ding , Minping Jia , Qiuhua Miao , Yudong Cao

分类：人工智能 | 机器学习

2021-04-19

通过深度学习（DL）大大扩展了数据驱动故障诊断模型的范围。然而，经典卷积和反复化结构具有计算效率和特征表示的缺陷，而基于注意机制的最新变压器架构尚未应用于该字段。为了解决这些问题，我们提出了一种新颖的时变电片（TFT）模型，其灵感来自序列加工的香草变压器大规模成功。特别是，我们设计了一个新的笨蛋和编码器模块，以从振动信号的时频表示（TFR）中提取有效抽象。在此基础上，本文提出了一种基于时变电片的新的端到端故障诊断框架。通过轴承实验数据集的案例研究，我们构建了最佳变压器结构并验证了其故障诊断性能。与基准模型和其他最先进的方法相比，证明了所提出的方法的优越性。

translated by 谷歌翻译

Deep Learning for Time Series Anomaly Detection: A Survey

Zahra Zamanzadeh Darban , Geoffrey I. Webb , Shirui Pan , Charu C. Aggarwal , Mahsa Salehi

分类：机器学习 | 人工智能

2022-11-09

Time series anomaly detection has applications in a wide range of research fields and applications, including manufacturing and healthcare. The presence of anomalies can indicate novel or unexpected events, such as production faults, system defects, or heart fluttering, and is therefore of particular interest. The large size and complex patterns of time series have led researchers to develop specialised deep learning models for detecting anomalous patterns. This survey focuses on providing structured and comprehensive state-of-the-art time series anomaly detection models through the use of deep learning. It providing a taxonomy based on the factors that divide anomaly detection models into different categories. Aside from describing the basic anomaly detection technique for each category, the advantages and limitations are also discussed. Furthermore, this study includes examples of deep anomaly detection in time series across various application domains in recent years. It finally summarises open issues in research and challenges faced while adopting deep anomaly detection models.

translated by 谷歌翻译

A CNN-BiLSTM Model with Attention Mechanism for Earthquake Prediction

Parisa Kavianpour , Mohammadreza Kavianpour , Ehsan Jahani , Amin Ramezani

分类：机器学习

2021-12-26

作为自然现象的地震，历史上不断造成伤害和人类生活的损失。地震预测是任何社会计划的重要方面，可以增加公共准备，并在很大程度上减少损坏。然而，由于地震的随机特征以及实现了地震预测的有效和可靠模型的挑战，迄今为止努力一直不足，需要新的方法来解决这个问题。本文意识到这些问题，提出了一种基于注意机制（AM），卷积神经网络（CNN）和双向长短期存储器（BILSTM）模型的新型预测方法，其可以预测数量和最大幅度中国大陆各地区的地震为基于该地区的地震目录。该模型利用LSTM和CNN具有注意机制，以更好地关注有效的地震特性并产生更准确的预测。首先，将零阶保持技术应用于地震数据上的预处理，使得模型的输入数据更适当。其次，为了有效地使用空间信息并减少输入数据的维度，CNN用于捕获地震数据之间的空间依赖性。第三，使用Bi-LSTM层来捕获时间依赖性。第四，引入了AM层以突出其重要的特征来实现更好的预测性能。结果表明，该方法具有比其他预测方法更好的性能和概括能力。

translated by 谷歌翻译

A Hybrid Deep Learning Model-based Remaining Useful Life Estimation for Reed Relay with Degradation Pattern Clustering

Chinthaka Gamanayake , Yan Qin , Chau Yuen , Lahiru Jayasinghe , Dominique-Ea Tan , Jenny Low

分类：机器学习

2022-09-14

REED继电器是功能测试的基本组成部分，与电子产品的成功质量检查密切相关。为了为REED继电器提供准确的剩余使用寿命（RUL）估计，根据以下三个考虑，提出了具有降解模式聚类的混合深度学习网络。首先，对于REED继电器，观察到多种降解行为，因此提供了基于动态的$ K $ -MEANS聚类，以区分彼此的退化模式。其次，尽管适当的功能选择具有重要意义，但很少有研究可以指导选择。提出的方法建议进行操作规则，以实施轻松实施。第三，提出了用于剩余使用寿命估计的神经网络（RULNET），以解决卷积神经网络（CNN）在捕获顺序数据的时间信息中的弱点，该信息在卷积操作的高级特征表示后结合了时间相关能力。通过这种方式，lulnet的三种变体由健康指标，具有自组织地图的功能或具有曲线拟合的功能构建。最终，将提出的混合模型与典型的基线模型（包括CNN和长期记忆网络（LSTM））进行了比较，该模型通过具有两个不同不同降级方式的实用REED继电器数据集进行了比较。两种降解案例的结果表明，所提出的方法在索引均方根误差方面优于CNN和LSTM。

translated by 谷歌翻译

Traffic Flow Prediction via Variational Bayesian Inference-based Encoder-Decoder Framework

Jianlei Kong , Xiaomeng Fan , Xue-Bo Jin , Min Zuo

分类：机器学习

2022-12-14

Accurate traffic flow prediction, a hotspot for intelligent transportation research, is the prerequisite for mastering traffic and making travel plans. The speed of traffic flow can be affected by roads condition, weather, holidays, etc. Furthermore, the sensors to catch the information about traffic flow will be interfered with by environmental factors such as illumination, collection time, occlusion, etc. Therefore, the traffic flow in the practical transportation system is complicated, uncertain, and challenging to predict accurately. This paper proposes a deep encoder-decoder prediction framework based on variational Bayesian inference. A Bayesian neural network is constructed by combining variational inference with gated recurrent units (GRU) and used as the deep neural network unit of the encoder-decoder framework to mine the intrinsic dynamics of traffic flow. Then, the variational inference is introduced into the multi-head attention mechanism to avoid noise-induced deterioration of prediction accuracy. The proposed model achieves superior prediction performance on the Guangzhou urban traffic flow dataset over the benchmarks, particularly when the long-term prediction.

translated by 谷歌翻译

Stock Market Prediction via Deep Learning Techniques: A Survey

Jinan Zou , Qingying Zhao , Yang Jiao , Haiyao Cao , Yanxi Liu , Qingsen Yan , Ehsan Abbasnejad , Lingqiao Liu , Javen Qinfeng Shi

分类：人工智能

2022-12-24

The stock market prediction has been a traditional yet complex problem researched within diverse research areas and application domains due to its non-linear, highly volatile and complex nature. Existing surveys on stock market prediction often focus on traditional machine learning methods instead of deep learning methods. Deep learning has dominated many domains, gained much success and popularity in recent years in stock market prediction. This motivates us to provide a structured and comprehensive overview of the research on stock market prediction focusing on deep learning techniques. We present four elaborated subtasks of stock market prediction and propose a novel taxonomy to summarize the state-of-the-art models based on deep neural networks from 2011 to 2022. In addition, we also provide detailed statistics on the datasets and evaluation metrics commonly used in the stock market. Finally, we highlight some open issues and point out several future directions by sharing some new perspectives on stock market prediction.

translated by 谷歌翻译

Latent Variable Models in the Era of Industrial Big Data: Extension and Beyond

Xiangyin Kong , Xiaoyu Jiang , Bingxin Zhang , Jinsong Yuan , Zhiqiang Ge

分类：机器学习

2022-08-23

大量的数据和创新算法使数据驱动的建模成为现代行业的流行技术。在各种数据驱动方法中，潜在变量模型（LVM）及其对应物占主要份额，并在许多工业建模领域中起着至关重要的作用。 LVM通常可以分为基于统计学习的经典LVM和基于神经网络的深层LVM（DLVM）。我们首先讨论经典LVM的定义，理论和应用，该定义和应用既是综合教程，又是对经典LVM的简短申请调查。然后，我们对当前主流DLVM进行了彻底的介绍，重点是其理论和模型体系结构，此后不久就提供了有关DLVM的工业应用的详细调查。上述两种类型的LVM具有明显的优势和缺点。具体而言，经典的LVM具有简洁的原理和良好的解释性，但是它们的模型能力无法解决复杂的任务。基于神经网络的DLVM具有足够的模型能力，可以在复杂的场景中实现令人满意的性能，但它以模型的解释性和效率为例。旨在结合美德并减轻这两种类型的LVM的缺点，并探索非神经网络的举止以建立深层模型，我们提出了一个新颖的概念，称为“轻量级Deep LVM（LDLVM）”。在提出了这个新想法之后，该文章首先阐述了LDLVM的动机和内涵，然后提供了两个新颖的LDLVM，并详尽地描述了其原理，建筑和优点。最后，讨论了前景和机会，包括重要的开放问题和可能的研究方向。

translated by 谷歌翻译

Transfer Learning and Vision Transformer based State-of-Health prediction of Lithium-Ion Batteries

Pengyu Fu , Liang Chu , Zhuoran Hou , Jincheng Hu , Yanjun Huang , Yuanjian Zhang

分类：计算机视觉 | 人工智能

2022-09-07

近年来，在运输电气化方面取得了重大进展。作为主要的储能设备，锂离子电池（LIB）已受到广泛关注。准确地预测健康状况（SOH）不仅可以缓解用户对电池寿命的焦虑，而且还可以为电池管理提供重要信息。本文提出了一种基于视觉变压器（VIT）模型的SOH的预测方法。首先，预定义电压范围的离散充电数据用作输入数据矩阵。然后，电池的循环特征是由VIT捕获的，可以获得可以获得全局特征，并且通过将循环特征与完整连接（FC）层相结合来获得SOH。同时，引入了转移学习（TL），并根据目标任务电池的早期周期数据进一步微调基于源任务电池训练的预测模型，以提供准确的预测。实验表明，与现有的深度学习方法相比，我们的方法可以获得更好的特征表达，从而可以实现更好的预测效果和传递效果。

translated by 谷歌翻译

Spatial-Temporal Feature Extraction and Evaluation Network for Citywide Traffic Condition Prediction

Shilin Pu , Liang Chu , Zhuoran Hou , Jincheng Hu , Yanjun Huang , Yuanjian Zhang

分类：机器学习

2022-07-22

流量预测在智能运输系统中交通控制和调度任务的实现中起着重要作用。随着数据源的多元化，合理地使用丰富的流量数据来对流量流中复杂的时空依赖性和非线性特征进行建模是智能运输系统的关键挑战。此外，清楚地评估从不同数据中提取的时空特征的重要性成为一个挑战。提出了双层 - 空间时间特征提取和评估（DL -STFEE）模型。 DL-STFEE的下层是时空特征提取层。流量数据中的空间和时间特征是通过多画图卷积和注意机制提取的，并生成了空间和时间特征的不同组合。 DL-STFEE的上层是时空特征评估层。通过高维自我注意力发项机制产生的注意力评分矩阵，空间特征组合被融合和评估，以便获得不同组合对预测效应的影响。在实际的流量数据集上进行了三组实验，以表明DL-STFEE可以有效地捕获时空特征并评估不同时空特征组合的重要性。

translated by 谷歌翻译

A Survey on Societal Event Forecasting with Deep Learning

Songgaojun Deng , Yue Ning

分类：机器学习 | 人工智能

2021-12-12

人口级社会事件，如民事骚乱和犯罪，往往对我们的日常生活产生重大影响。预测此类事件对于决策和资源分配非常重要。由于缺乏关于事件发生的真实原因和潜在机制的知识，事件预测传统上具有挑战性。近年来，由于两个主要原因，研究事件预测研究取得了重大进展：（1）机器学习和深度学习算法的开发和（2）社交媒体，新闻来源，博客，经济等公共数据的可访问性指标和其他元数据源。软件/硬件技术中的数据的爆炸性增长导致了社会事件研究中的深度学习技巧的应用。本文致力于提供社会事件预测的深层学习技术的系统和全面概述。我们专注于两个社会事件的域名：\ Texit {Civil unrest}和\ texit {犯罪}。我们首先介绍事件预测问题如何作为机器学习预测任务制定。然后，我们总结了这些问题的数据资源，传统方法和最近的深度学习模型的发展。最后，我们讨论了社会事件预测中的挑战，并提出了一些有希望的未来研究方向。

translated by 谷歌翻译

A Transferable Intersection Reconstruction Network for Traffic Speed Prediction

Pengyu Fu , Liang Chu , Zhuoran Hou , Jincheng Hu , Yanjun Huang , Yuanjian Zhang

分类：机器学习

2022-07-22

交通速度预测是许多有价值应用程序的关键，由于其各种影响因素，它也是一项具有挑战性的任务。最近的工作试图通过各种混合模型获得更多信息，从而提高了预测准确性。但是，这些方法的空间信息采集方案存在两级分化问题。建模很简单，但包含很少的空间信息，或者建模是完整的，但缺乏灵活性。为了基于确保灵活性引入更多空间信息，本文提出了IRNET（可转让的交叉点重建网络）。首先，本文将相交重建为与相同结构的虚拟交集，从而简化了道路网络的拓扑结构。然后，将空间信息细分为交叉信息和交通流向的序列信息，并通过各种模型获得时空特征。第三，一种自我发项机制用于融合时空特征以进行预测。在与基线的比较实验中，不仅预测效应，而且转移性能具有明显的优势。

translated by 谷歌翻译

Spatial-Temporal Interactive Dynamic Graph Convolution Network for Traffic Forecasting

Aoyu Liu , Yaying Zhang

分类：机器学习 | 人工智能

2022-05-18

准确的交通预测对于智能城市实现交通控制，路线计划和流动检测至关重要。尽管目前提出了许多时空方法，但这些方法在同步捕获流量数据的时空依赖性方面缺陷。此外，大多数方法忽略了随着流量数据的变化而产生的道路网络节点之间的动态变化相关性。我们建议基于神经网络的时空交互式动态图卷积网络（STIDGCN），以应对上述流量预测的挑战。具体而言，我们提出了一个交互式动态图卷积结构，该结构将序列划分为间隔，并通过交互式学习策略同步捕获流量数据的时空依赖性。交互式学习策略使StidGCN有效地预测。我们还提出了一个新颖的动态图卷积模块，以捕获由图生成器和融合图卷积组成的流量网络中动态变化的相关性。动态图卷积模块可以使用输入流量数据和预定义的图形结构来生成图形结构。然后将其与定义的自适应邻接矩阵融合，以生成动态邻接矩阵，该矩阵填充了预定义的图形结构，并模拟了道路网络中节点之间的动态关联的产生。在四个现实世界流量流数据集上进行的广泛实验表明，StidGCN的表现优于最先进的基线。

translated by 谷歌翻译

Deep Learning for Time Series Forecasting: Tutorial and Literature Survey

Konstantinos Benidis , Syama Sundar Rangapuram , Valentin Flunkert , Yuyang Wang , Danielle Maddix , Caner Turkmen , Jan Gasthaus , Michael Bohlke-Schneider , David Salinas , Lorenzo Stella

分类：机器学习 | (统计)机器学习

2020-04-21

基于预测方法的深度学习已成为时间序列预测或预测的许多应用中的首选方法，通常通常优于其他方法。因此，在过去的几年中，这些方法现在在大规模的工业预测应用中无处不在，并且一直在预测竞赛（例如M4和M5）中排名最佳。这种实践上的成功进一步提高了学术兴趣，以理解和改善深厚的预测方法。在本文中，我们提供了该领域的介绍和概述：我们为深入预测的重要构建块提出了一定深度的深入预测；随后，我们使用这些构建块，调查了最近的深度预测文献的广度。

translated by 谷歌翻译

A Novel Deep Parallel Time-series Relation Network for Fault Diagnosis

Chun Yang

分类：机器学习 | 人工智能

2021-12-03

考虑到应用时间序列数据的上下文信息的模型可以改善故障诊断性能，提出了一些神经网络结构（例如RNN，LSTM和GRU）有效地对故障诊断进行建模。但是，这些模型受其串行计算的限制，因此无法实现高诊断效率。同样，平行CNN很难以有效的方式实施故障诊断，因为它需要更大的卷积内核或深层结构才能实现长期特征提取能力。此外，BERT模型还采用绝对位置嵌入以将上下文信息引入模型，这将为原始数据带来噪声，因此不能直接应用于故障诊断。为了解决上述问题，本文提出了一个名为“深层平行时间序列关系网络”（DPTRN）的故障诊断模型。 DPTRN有三个优点：（1）我们提出的时间关系单元基于完整的多层感知器（MLP）结构，因此，DPTRN以并行方式执行故障诊断，并显着提高计算效率。（2）通过改善绝对位置的嵌入，我们的新型解耦位置嵌入单元可以直接应用于故障诊断并学习上下文信息。（3）我们提出的DPTRN在功能解释性方面具有明显的优势。我们确认了所提出的方法对四个数据集的影响，结果显示了所提出的DPTRN模型的有效性，效率和解释性。

translated by 谷歌翻译

Deep learning for laboratory earthquake prediction and autoregressive forecasting of fault zone stress

Laura Laurenti , Elisa Tinti , Fabio Galasso , Luca Franco , Chris Marone

分类：计算机视觉

2022-03-24

地震的预测和预测有很长的时间，在某些情况下有肮脏的历史，但是最近的工作重新点燃了基于预警的进步，诱发地震性的危害评估以及对实验室地震的成功预测。在实验室中，摩擦滑移事件为地震和地震周期提供了类似物。 Labquakes是机器学习（ML）的理想目标，因为它们可以在受控条件下以长序列生产。最近的作品表明，ML可以使用断层区的声学排放来预测实验室的几个方面。在这里，我们概括了这些结果，并探索了Labquake预测和自动回归（AR）预测的深度学习（DL）方法。 DL改善了现有的Labquake预测方法。 AR方法允许通过迭代预测在未来的视野中进行预测。我们证明，基于长期任期内存（LSTM）和卷积神经网络的DL模型可以预测在几种条件下实验室，并且可以以忠诚度预测断层区应力，证实声能是断层区应力的指纹。我们还预测了实验室的失败开始（TTSF）和失败结束（TTEF）的时间。有趣的是，在所有地震循环中都可以成功预测TTEF，而TTSF的预测随preseismisic断层蠕变的数量而变化。我们报告了使用三个序列建模框架：LSTM，时间卷积网络和变压器网络预测故障应力演变的AR方法。 AR预测与现有的预测模型不同，该模型仅在特定时间预测目标变量。超出单个地震周期的预测结果有限，但令人鼓舞。我们的ML/DL模型优于最先进的模型，我们的自回归模型代表了一个新颖的框架，可以增强当前的地震预测方法。

translated by 谷歌翻译

AIST: An Interpretable Attention-based Deep Learning Model for Crime Prediction

Yeasir Rayhan , Tanzima Hashem

分类：机器学习

2020-12-16

准确性和可解释性是犯罪预测模型的两个基本属性。由于犯罪可能对人类生命，经济和安全的不利影响，我们需要一个可以尽可能准确地预测未来犯罪的模型，以便可以采取早期步骤来避免犯罪。另一方面，可解释的模型揭示了模型预测背后的原因，确保其透明度并允许我们相应地规划预防犯罪步骤。开发模型的关键挑战是捕获特定犯罪类别的非线性空间依赖和时间模式，同时保持模型的底层结构可解释。在本文中，我们开发AIST，一种用于犯罪预测的注意力的可解释的时空时间网络。基于过去的犯罪发生，外部特征（例如，流量流量和兴趣点（POI）信息）和犯罪趋势，AICT模拟了犯罪类别的动态时空相关性。广泛的实验在使用真实数据集的准确性和解释性方面表现出我们模型的优越性。

translated by 谷歌翻译

Paying Attention to Astronomical Transients: Introducing the Time-series Transformer for Photometric Classification

Tarek Allam Jr. , Jason D. McEwen

分类：机器学习

2021-05-13

Future surveys such as the Legacy Survey of Space and Time (LSST) of the Vera C. Rubin Observatory will observe an order of magnitude more astrophysical transient events than any previous survey before. With this deluge of photometric data, it will be impossible for all such events to be classified by humans alone. Recent efforts have sought to leverage machine learning methods to tackle the challenge of astronomical transient classification, with ever improving success. Transformers are a recently developed deep learning architecture, first proposed for natural language processing, that have shown a great deal of recent success. In this work we develop a new transformer architecture, which uses multi-head self attention at its core, for general multi-variate time-series data. Furthermore, the proposed time-series transformer architecture supports the inclusion of an arbitrary number of additional features, while also offering interpretability. We apply the time-series transformer to the task of photometric classification, minimising the reliance of expert domain knowledge for feature selection, while achieving results comparable to state-of-the-art photometric classification methods. We achieve a logarithmic-loss of 0.507 on imbalanced data in a representative setting using data from the Photometric LSST Astronomical Time-Series Classification Challenge (PLAsTiCC). Moreover, we achieve a micro-averaged receiver operating characteristic area under curve of 0.98 and micro-averaged precision-recall area under curve of 0.87.

translated by 谷歌翻译

Modeling Long-term Dependencies and Short-term Correlations in Patient Journey Data with Temporal Attention Networks for Health Prediction

Yuxi Liu , Zhenhao Zhang , Antonio Jimeno Yepes , Flora D. Salim

分类：机器学习 | 人工智能 | 自然语言处理

2022-07-13

基于电子健康记录（EHR）的健康预测建筑模型已成为一个活跃的研究领域。 EHR患者旅程数据由患者定期的临床事件/患者访问组成。大多数现有研究的重点是建模访问之间的长期依赖性，而无需明确考虑连续访问之间的短期相关性，在这种情况下，将不规则的时间间隔（并入为辅助信息）被送入健康预测模型中以捕获患者期间的潜在渐进模式。。我们提出了一个具有四个模块的新型深神经网络，以考虑各种变量对健康预测的贡献：i）堆叠的注意力模块在每个患者旅程中加强了临床事件中的深层语义，并产生访问嵌入，ii）短 - 术语时间关注模块模型在连续访问嵌入之间的短期相关性，同时捕获这些访问嵌入中时间间隔的影响，iii）长期时间关注模块模型的长期依赖模型，同时捕获时间间隔内的时间间隔的影响这些访问嵌入，iv），最后，耦合的注意模块适应了短期时间关注和长期时间注意模块的输出，以做出健康预测。对模拟III的实验结果表明，与现有的最新方法相比，我们的模型的预测准确性以及该方法的可解释性和鲁棒性。此外，我们发现建模短期相关性有助于局部先验的产生，从而改善了患者旅行的预测性建模。

translated by 谷歌翻译

A Concurrent CNN-RNN Approach for Multi-Step Wind Power Forecasting

Syed Kazmi , Berk Gorgulu , Mucahit Cevik , Mustafa Gokce Baydogan

分类：机器学习

2023-01-02

Wind power forecasting helps with the planning for the power systems by contributing to having a higher level of certainty in decision-making. Due to the randomness inherent to meteorological events (e.g., wind speeds), making highly accurate long-term predictions for wind power can be extremely difficult. One approach to remedy this challenge is to utilize weather information from multiple points across a geographical grid to obtain a holistic view of the wind patterns, along with temporal information from the previous power outputs of the wind farms. Our proposed CNN-RNN architecture combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to extract spatial and temporal information from multi-dimensional input data to make day-ahead predictions. In this regard, our method incorporates an ultra-wide learning view, combining data from multiple numerical weather prediction models, wind farms, and geographical locations. Additionally, we experiment with global forecasting approaches to understand the impact of training the same model over the datasets obtained from multiple different wind farms, and we employ a method where spatial information extracted from convolutional layers is passed to a tree ensemble (e.g., Light Gradient Boosting Machine (LGBM)) instead of fully connected layers. The results show that our proposed CNN-RNN architecture outperforms other models such as LGBM, Extra Tree regressor and linear regression when trained globally, but fails to replicate such performance when trained individually on each farm. We also observe that passing the spatial information from CNN to LGBM improves its performance, providing further evidence of CNN's spatial feature extraction capabilities.

translated by 谷歌翻译