Accurate PhotoVoltaic (PV) power generation forecasting is vital for the efficient operation of Smart Grids. The automated design of such accurate forecasting models for individual PV plants includes two challenges: First, information about the PV mounting configuration (i.e. inclination and azimuth angles) is often missing. Second, for new PV plants, the amount of historical data available to train a forecasting model is limited (cold-start problem). We address these two challenges by proposing a new method for day-ahead PV power generation forecasts called AutoPV. AutoPV is a weighted ensemble of forecasting models that represent different PV mounting configurations. This representation is achieved by pre-training each forecasting model on a separate PV plant and by scaling the model's output with the peak power rating of the corresponding PV plant. To tackle the cold-start problem, we initially weight each forecasting model in the ensemble equally. To tackle the problem of missing information about the PV mounting configuration, we use new data that become available during operation to adapt the ensemble weights to minimize the forecasting error. AutoPV is advantageous as the unknown PV mounting configuration is implicitly reflected in the ensemble weights, and only the PV plant's peak power rating is required to re-scale the ensemble's output. AutoPV also allows to represent PV plants with panels distributed on different roofs with varying alignments, as these mounting configurations can be reflected proportionally in the weighting. Additionally, the required computing memory is decoupled when scaling AutoPV to hundreds of PV plants, which is beneficial in Smart Grids with limited computing capabilities. For a real-world data set with 11 PV plants, the accuracy of AutoPV is comparable to a model trained on two years of data and outperforms an incrementally trained model.
translated by 谷歌翻译
分布式的小型太阳能光伏(PV)系统正在以快速增加的速度安装。这可能会对分销网络和能源市场产生重大影响。结果,在不同时间分辨率和视野中,非常需要改善对这些系统发电的预测。但是,预测模型的性能取决于分辨率和地平线。在这种情况下,将多个模型的预测结合到单个预测中的预测组合(合奏)可能是鲁棒的。因此,在本文中,我们提供了对五个最先进的预测模型的性能以及在多个分辨率和视野下的现有预测组合的比较和见解。我们提出了一种基于粒子群优化(PSO)的预测组合方法,该方法将通过加权单个模型产生的预测来使预报掌握能够为手头的任务产生准确的预测。此外,我们将提出的组合方法的性能与现有的预测组合方法进行了比较。使用现实世界中的PV电源数据集进行了全面的评估,该数据集在美国三个位置的25个房屋中测得。在四种不同的分辨率和四个不同视野之间的结果表明,基于PSO的预测组合方法的表现优于使用任何单独的预测模型和其他预测组合的使用,而平均平均绝对规模误差降低了3.81%,而最佳性能则最佳性能单个个人模型。我们的方法使太阳预报员能够为其应用产生准确的预测,而不管预测分辨率或视野如何。
translated by 谷歌翻译
最近有兴趣在计算机视觉任务中使用模型中心(预训练模型的集合)。要使用模型中心,我们首先选择一个源模型,然后调整目标的模型以补偿差异。尽管对计算机视觉任务的模型选择和适应性的研究仍有有限的研究,但对于可再生能源领域而言,这甚至更多。同时,根据数值天气预测的天气特征,为对电力预测的需求不断增长提供预测是一个至关重要的挑战。我们通过进行第一个彻底的实验来弥合这些差距,以进行模型选择和适应性的适应性,以在可再生能力预测中转移学习,从而采用了六个数据集中计算机视觉领域的最新结果。我们根据不同季节的数据采用模型,并限制培训数据的量。作为当前最新状态的扩展,我们利用贝叶斯线性回归来预测基于从神经网络中提取的特征的响应。这种方法的表现仅超过基线,只有7天的培训数据。我们进一步展示了如何通过合奏组合多个模型可以显着改善模型选择和适应方法。实际上,有了超过30天的培训数据,两种提出的模型组合技术都取得了与经过一年的培训数据训练的模型相似的结果。
translated by 谷歌翻译
我们基于技能评分,对确定性太阳预测进行了首次全面的荟萃分析,筛选了Google Scholar的1,447篇论文,并审查了320篇论文的全文以进行数据提取。用多元自适应回归样条模型,部分依赖图和线性回归构建和分析了4,758点的数据库。值得注意的是,分析说明了数据中最重要的非线性关系和交互项。我们量化了对重要变量的预测准确性的影响,例如预测范围,分辨率,气候条件,区域的年度太阳辐照度水平,电力系统大小和容量,预测模型,火车和测试集以及使用不同的技术和投入。通过控制预测之间的关键差异,包括位置变量,可以在全球应用分析的发现。还提供了该领域科学进步的概述。
translated by 谷歌翻译
预测组合在预测社区中蓬勃发展,近年来,已经成为预测研究和活动主流的一部分。现在,由单个(目标)系列产生的多个预测组合通过整合来自不同来源收集的信息,从而提高准确性,从而减轻了识别单个“最佳”预测的风险。组合方案已从没有估计的简单组合方法演变为涉及时间变化的权重,非线性组合,组件之间的相关性和交叉学习的复杂方法。它们包括结合点预测和结合概率预测。本文提供了有关预测组合的广泛文献的最新评论,并参考可用的开源软件实施。我们讨论了各种方法的潜在和局限性,并突出了这些思想如何随着时间的推移而发展。还调查了有关预测组合实用性的一些重要问题。最后,我们以当前的研究差距和未来研究的潜在见解得出结论。
translated by 谷歌翻译
PV power forecasting models are predominantly based on machine learning algorithms which do not provide any insight into or explanation about their predictions (black boxes). Therefore, their direct implementation in environments where transparency is required, and the trust associated with their predictions may be questioned. To this end, we propose a two stage probabilistic forecasting framework able to generate highly accurate, reliable, and sharp forecasts yet offering full transparency on both the point forecasts and the prediction intervals (PIs). In the first stage, we exploit natural gradient boosting (NGBoost) for yielding probabilistic forecasts, while in the second stage, we calculate the Shapley additive explanation (SHAP) values in order to fully comprehend why a prediction was made. To highlight the performance and the applicability of the proposed framework, real data from two PV parks located in Southern Germany are employed. Comparative results with two state-of-the-art algorithms, namely Gaussian process and lower upper bound estimation, manifest a significant increase in the point forecast accuracy and in the overall probabilistic performance. Most importantly, a detailed analysis of the model's complex nonlinear relationships and interaction effects between the various features is presented. This allows interpreting the model, identifying some learned physical properties, explaining individual predictions, reducing the computational requirements for the training without jeopardizing the model accuracy, detecting possible bugs, and gaining trust in the model. Finally, we conclude that the model was able to develop complex nonlinear relationships which follow known physical properties as well as human logic and intuition.
translated by 谷歌翻译
The cyber-physical convergence is opening up new business opportunities for industrial operators. The need for deep integration of the cyber and the physical worlds establishes a rich business agenda towards consolidating new system and network engineering approaches. This revolution would not be possible without the rich and heterogeneous sources of data, as well as the ability of their intelligent exploitation, mainly due to the fact that data will serve as a fundamental resource to promote Industry 4.0. One of the most fruitful research and practice areas emerging from this data-rich, cyber-physical, smart factory environment is the data-driven process monitoring field, which applies machine learning methodologies to enable predictive maintenance applications. In this paper, we examine popular time series forecasting techniques as well as supervised machine learning algorithms in the applied context of Industry 4.0, by transforming and preprocessing the historical industrial dataset of a packing machine's operational state recordings (real data coming from the production line of a manufacturing plant from the food and beverage domain). In our methodology, we use only a single signal concerning the machine's operational status to make our predictions, without considering other operational variables or fault and warning signals, hence its characterization as ``agnostic''. In this respect, the results demonstrate that the adopted methods achieve a quite promising performance on three targeted use cases.
translated by 谷歌翻译
Algorithms that involve both forecasting and optimization are at the core of solutions to many difficult real-world problems, such as in supply chains (inventory optimization), traffic, and in the transition towards carbon-free energy generation in battery/load/production scheduling in sustainable energy systems. Typically, in these scenarios we want to solve an optimization problem that depends on unknown future values, which therefore need to be forecast. As both forecasting and optimization are difficult problems in their own right, relatively few research has been done in this area. This paper presents the findings of the ``IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling," held in 2021. We present a comparison and evaluation of the seven highest-ranked solutions in the competition, to provide researchers with a benchmark problem and to establish the state of the art for this benchmark, with the aim to foster and facilitate research in this area. The competition used data from the Monash Microgrid, as well as weather data and energy market data. It then focused on two main challenges: forecasting renewable energy production and demand, and obtaining an optimal schedule for the activities (lectures) and on-site batteries that lead to the lowest cost of energy. The most accurate forecasts were obtained by gradient-boosted tree and random forest models, and optimization was mostly performed using mixed integer linear and quadratic programming. The winning method predicted different scenarios and optimized over all scenarios jointly using a sample average approximation method.
translated by 谷歌翻译
Wind power forecasting helps with the planning for the power systems by contributing to having a higher level of certainty in decision-making. Due to the randomness inherent to meteorological events (e.g., wind speeds), making highly accurate long-term predictions for wind power can be extremely difficult. One approach to remedy this challenge is to utilize weather information from multiple points across a geographical grid to obtain a holistic view of the wind patterns, along with temporal information from the previous power outputs of the wind farms. Our proposed CNN-RNN architecture combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to extract spatial and temporal information from multi-dimensional input data to make day-ahead predictions. In this regard, our method incorporates an ultra-wide learning view, combining data from multiple numerical weather prediction models, wind farms, and geographical locations. Additionally, we experiment with global forecasting approaches to understand the impact of training the same model over the datasets obtained from multiple different wind farms, and we employ a method where spatial information extracted from convolutional layers is passed to a tree ensemble (e.g., Light Gradient Boosting Machine (LGBM)) instead of fully connected layers. The results show that our proposed CNN-RNN architecture outperforms other models such as LGBM, Extra Tree regressor and linear regression when trained globally, but fails to replicate such performance when trained individually on each farm. We also observe that passing the spatial information from CNN to LGBM improves its performance, providing further evidence of CNN's spatial feature extraction capabilities.
translated by 谷歌翻译
评估能源转型和能源市场自由化对资源充足性的影响是一种越来越重要和苛刻的任务。能量系统的上升复杂性需要足够的能量系统建模方法,从而提高计算要求。此外,随着复杂性,同样调用概率评估和场景分析同样增加不确定性。为了充分和高效地解决这些各种要求,需要来自数据科学领域的新方法来加速当前方法。通过我们的系统文献综述,我们希望缩小三个学科之间的差距(1)电力供应安全性评估,(2)人工智能和(3)实验设计。为此,我们对所选应用领域进行大规模的定量审查,并制作彼此不同学科的合成。在其他发现之外,我们使用基于AI的方法和应用程序的AI方法和应用来确定电力供应模型的复杂安全性的元素,并作为未充分涵盖的应用领域的储存调度和(非)可用性。我们结束了推出了一种新的方法管道,以便在评估电力供应安全评估时充分有效地解决当前和即将到来的挑战。
translated by 谷歌翻译
能源基础架构的数字转换实现了机器学习模型通常支持的新的,数据驱动的应用程序。但是,在现代数据驱动管道中的域特定数据转换,预处理和管理尚待解决。在本文中,我们对能够支持设计功能管理解决方案的通用数据模型进行了首次研究,这些解决方案是开发基于ML的能源应用中最重要的组成部分。我们首先提出了一种针对能源应用的数据模型的分类法,请说明该模型如何支持功能的设计及其后续的专用功能商店的管理。使用短期预测数据集,我们展示了设计更丰富的数据模型和工程性能的功能的好处。最后,我们基准了三个互补功能管理解决方案,包括适合时间序列的开源功能商店。
translated by 谷歌翻译
近年来,随着传感器和智能设备的广泛传播,物联网(IoT)系统的数据生成速度已大大增加。在物联网系统中,必须经常处理,转换和分析大量数据,以实现各种物联网服务和功能。机器学习(ML)方法已显示出其物联网数据分析的能力。但是,将ML模型应用于物联网数据分析任务仍然面临许多困难和挑战,特别是有效的模型选择,设计/调整和更新,这给经验丰富的数据科学家带来了巨大的需求。此外,物联网数据的动态性质可能引入概念漂移问题,从而导致模型性能降解。为了减少人类的努力,自动化机器学习(AUTOML)已成为一个流行的领域,旨在自动选择,构建,调整和更新机器学习模型,以在指定任务上实现最佳性能。在本文中,我们对Automl区域中模型选择,调整和更新过程中的现有方法进行了审查,以识别和总结将ML算法应用于IoT数据分析的每个步骤的最佳解决方案。为了证明我们的发现并帮助工业用户和研究人员更好地实施汽车方法,在这项工作中提出了将汽车应用于IoT异常检测问题的案例研究。最后,我们讨论并分类了该领域的挑战和研究方向。
translated by 谷歌翻译
建筑物和校园的电力负荷预测随着分布式能源(DERs)的渗透而越来越重要。高效的操作和调度DER需要合理准确的未来能耗预测,以便进行现场发电和存储资产的近实时优化派遣。电力公用事业公司传统上对跨越地理区域的负载口袋进行了负荷预测,因此预测不是建筑物和校园运营商的常见做法。鉴于电网交互式高效建筑域中的研究和原型趋势不断发展,超出简单算法预测精度的特点对于确定智能建筑算法的真正效用很重要。其他特性包括部署架构的整体设计和预测系统的运行效率。在这项工作中,我们介绍了一个基于深度学习的负载预测系统,将来预测1小时的时间间隔18小时。我们还讨论了与此类系统的实时部署相关的挑战,以及通过在国家可再生能源实验室智能校园计划中开发的全功能预测系统提供的研究机会。
translated by 谷歌翻译
我们向Facebook先知推出了一位继任者,为可解释,可扩展和用户友好的预测框架制定了一个行业标准。随着时间序列数据的扩散,可说明的预测仍然是企业和运营决策的具有挑战性的任务。需要混合解决方案来弥合可解释的古典方法与可扩展深层学习模型之间的差距。我们将先知视为这样一个解决方案的前兆。然而,先知缺乏本地背景,这对于预测近期未来至关重要,并且由于其斯坦坦后代而挑战。 NeultProphet是一种基于Pytorch的混合预测框架,并用标准的深度学习方法培训,开发人员可以轻松扩展框架。本地上下文使用自动回归和协变量模块引入,可以配置为经典线性回归或作为神经网络。否则,NeultProphet保留了先知的设计理念,提供了相同的基本模型组件。我们的结果表明,NeultProcrophet在一组生成的时间序列上产生了相当或优质的质量的可解释的预测组件。 NeultProphet在各种各样的现实数据集合中占先知。对于中期预测,NeultProclecrophet将预测精度提高55%至92%。
translated by 谷歌翻译
在智能电网和负载平衡的背景下,每日峰值负荷预测已成为能源行业利益相关者的关键活动。对峰值幅度和时序的理解对于实现峰值剃须等智能电网策略至关重要。本文提出的建模方法利用了高分辨率和低分辨率信息来预测每日峰值需求规模和时序。由此产生的多分辨率建模框架可以适应不同的模型类。本文的主要贡献是一般性和正式介绍多分辨率建模方法,b)关于通过广义添加剂模型和神经网络和C)实验结果的不同决议的建模方法的讨论英国电力市场。结果证实,建议的建模方法的预测性能与低分辨率和高分辨率替代品具有竞争力。
translated by 谷歌翻译
A well-performing prediction model is vital for a recommendation system suggesting actions for energy-efficient consumer behavior. However, reliable and accurate predictions depend on informative features and a suitable model design to perform well and robustly across different households and appliances. Moreover, customers' unjustifiably high expectations of accurate predictions may discourage them from using the system in the long term. In this paper, we design a three-step forecasting framework to assess predictability, engineering features, and deep learning architectures to forecast 24 hourly load values. First, our predictability analysis provides a tool for expectation management to cushion customers' anticipations. Second, we design several new weather-, time- and appliance-related parameters for the modeling procedure and test their contribution to the model's prediction performance. Third, we examine six deep learning techniques and compare them to tree- and support vector regression benchmarks. We develop a robust and accurate model for the appliance-level load prediction based on four datasets from four different regions (US, UK, Austria, and Canada) with an equal set of appliances. The empirical results show that cyclical encoding of time features and weather indicators alongside a long-short term memory (LSTM) model offer the optimal performance.
translated by 谷歌翻译
我们基准了一个简单学习模型的亚季节预测工具包,该工具包优于操作实践和最先进的机器学习和深度学习方法。这些模型,由Mouatadid等人引入。 (2022),包括(a)气候++,这是气候学的一种适应性替代品,对于降水而言,准确性9%,比美国运营气候预测系统(CFSV2)高9%,熟练250%; (b)CFSV2 ++,一种学习的CFSV2校正,可将温度和降水精度提高7-8%,技能提高50-275%; (c)持久性++是一种增强的持久性模型,将CFSV2预测与滞后测量相结合,以将温度和降水精度提高6-9%,技能提高40-130%。在整个美国,气候++,CFSV2 ++和持久性++工具包始终优于标准气象基准,最先进的机器和深度学习方法,以及欧洲中等范围的天气预报集合中心。
translated by 谷歌翻译
电价是影响所有市场参与者决策的关键因素。准确的电价预测非常重要,并且由于各种因素,电价高度挥发性,电价也非常具有挑战性。本文提出了一项综合的长期经常性卷积网络(ILRCN)模型,以预测考虑到市场价格的大多数贡献属性的电力价格。所提出的ILRCN模型将卷积神经网络和长短期记忆(LSTM)算法的功能与所提出的新颖的条件纠错项相结合。组合的ILRCN模型可以识别输入数据内的线性和非线性行为。我们使用鄂尔顿批发市场价格数据以及负载型材,温度和其他因素来说明所提出的模型。使用平均绝对误差和准确性等性能/评估度量来验证所提出的ILRCN电价预测模型的性能。案例研究表明,与支持向量机(SVM)模型,完全连接的神经网络模型,LSTM模型和LRCN模型,所提出的ILRCN模型在电价预测中是准确和有效的电力价格预测。
translated by 谷歌翻译
提出了一种使用天气数据实时太阳生成预测的新方法,同时提出了既有空间结构依赖性的依赖。随着时间的推移,观察到的网络被预测到较低维度的表示,在该表示的情况下,在推理阶段使用天气预报时,使用各种天气测量来训练结构化回归模型。从国家太阳辐射数据库获得的德克萨斯州圣安东尼奥地区的288个地点进行了实验。该模型预测具有良好精度的太阳辐照度(夏季R2 0.91,冬季为0.85,全球模型为0.89)。随机森林回归者获得了最佳准确性。进行了多个实验来表征缺失数据的影响和不同的时间范围的影响,这些范围提供了证据表明,新算法不仅在随机的情况下,而且在机制是空间和时间上都丢失的数据是可靠的。
translated by 谷歌翻译
With the evolution of power systems as it is becoming more intelligent and interactive system while increasing in flexibility with a larger penetration of renewable energy sources, demand prediction on a short-term resolution will inevitably become more and more crucial in designing and managing the future grid, especially when it comes to an individual household level. Projecting the demand for electricity for a single energy user, as opposed to the aggregated power consumption of residential load on a wide scale, is difficult because of a considerable number of volatile and uncertain factors. This paper proposes a customized GRU (Gated Recurrent Unit) and Long Short-Term Memory (LSTM) architecture to address this challenging problem. LSTM and GRU are comparatively newer and among the most well-adopted deep learning approaches. The electricity consumption datasets were obtained from individual household smart meters. The comparison shows that the LSTM model performs better for home-level forecasting than alternative prediction techniques-GRU in this case. To compare the NN-based models with contrast to the conventional statistical technique-based model, ARIMA based model was also developed and benchmarked with LSTM and GRU model outcomes in this study to show the performance of the proposed model on the collected time series data.
translated by 谷歌翻译