智能论文笔记

Fine-resolution landscape-scale biomass mapping using a spatiotemporal patchwork of LiDAR coverages

Lucas K. Johnson , Michael J. Mahoney , Eddie Bevilacqua , Stephen V. Stehman , Grant Domke , Colin M. Beier

分类：机器学习

2022-05-17

估计大规模森林AGB和精细的空间决议对于温室气体会计，监测和验证工作以减轻气候变化的范围变得越来越重要。机载LiDAR对于在包括AGB在内的森林结构的属性建模非常有价值，但大多数LiDAR收集都发生在涵盖不规则，不连续的足迹的本地或区域尺度上，导致不同景观细分市场在各个时间点进行拼布。在这里，作为纽约州（美国）全州森林碳评估的一部分，我们解决了利用激光雷达拼布在景观尺度上的雷达拼凑而成的障碍，包括选择培训数据，对预测的区域或覆盖范围的特定模式的调查错误，并绘制与多个量表的现场清单一致。三种机器学习算法和一个集合模型经过FIA场测量，空气传播的激光雷达和地形，气候和心形地理训练。使用一组严格的地块选择标准，选择了801个FIA图，并从17个叶子覆盖范围（2014-2019）的拼布中绘制的共同定位的点云（2014-2019）。我们的合奏模型用于在预测定义的适用性区域（占激光雷达覆盖率的98％）内生成30 m AGB的预测表面，并将所得的AGB图与FIA绘图级别和面积估计值进行比较。我们的模型总体准确（％RMSE 22-45％; MAE 11.6-29.4 mg ha $^{ - 1} $; me 2.4-6.3 mg ha $^{ - 1} $），解释了73-80％的领域 - 观察到的变化，并得出与FIA基于设计的估计值一致的估计值（FIA 95％CI中的估计值的89％）。我们分享实用的解决方案，以使用LIDAR的时空拼布面临的挑战来满足不断增长的AGB映射需求，以支持森林碳会计和生态系统中的应用。

translated by 谷歌翻译

Classification and mapping of low-statured 'shrubland' cover types in post-agricultural landscapes of the US Northeast

Michael J Mahoney , Lucas K Johnson , Abigail Z Guinan , Colin M Beier

分类：计算机视觉 | 机器学习

2022-05-09

Novel plant communities reshape landscapes and pose challenges for land cover classification and mapping that can constrain research and stewardship efforts. In the US Northeast, emergence of low-statured woody vegetation, or shrublands, instead of secondary forests in post-agricultural landscapes is well-documented by field studies, but poorly understood from a landscape perspective, which limits the ability to systematically study and manage these lands. To address gaps in classification/mapping of low-statured cover types where they have been historically rare, we developed models to predict shrubland distributions at 30m resolution across New York State (NYS), using a stacked ensemble combining a random forest, gradient boosting machine, and artificial neural network to integrate remote sensing of structural (airborne LIDAR) and optical (satellite imagery) properties of vegetation cover. We first classified a 1m canopy height model (CHM), derived from a patchwork of available LIDAR coverages, to define shrubland presence/absence. Next, these non-contiguous maps were used to train a model ensemble based on temporally-segmented imagery to predict shrubland probability for the entire study landscape (NYS). Approximately 2.5% of the CHM coverage area was classified as shrubland. Models using Landsat predictors trained on the classified CHM were effective at identifying shrubland (test set AUC=0.893, real-world AUC=0.904), in discriminating between shrub/young forest and other cover classes, and produced qualitatively sensible maps, even when extending beyond the original training data. Our results suggest that incorporation of airborne LiDAR, even from a discontinuous patchwork of coverages, can improve land cover classification of historically rare but increasingly prevalent shrubland habitats across broader areas.

translated by 谷歌翻译

Country-wide Retrieval of Forest Structure From Optical and SAR Satellite Imagery With Bayesian Deep Learning

Alexander Becker , Stefania Russo , Stefano Puliti , Nico Lang , Konrad Schindler , Jan Dirk Wegner

分类：计算机视觉 | 机器学习

2021-11-25

以知情方式监测和管理地球林是解决生物多样性损失和气候变化等挑战的重要要求。虽然森林评估的传统或空中运动提供了在区域一级分析的准确数据，但将其扩展到整个国家，以外的高度分辨率几乎不可能。在这项工作中，我们提出了一种贝叶斯深度学习方法，以10米的分辨率为全国范围的森林结构变量，使用自由可用的卫星图像作为输入。我们的方法将Sentinel-2光学图像和Sentinel-1合成孔径雷达图像共同变换为五种不同的森林结构变量的地图：95th高度百分位，平均高度，密度，基尼系数和分数盖。我们从挪威的41个机载激光扫描任务中培训和测试我们的模型，并证明它能够概括取消测试区域，从而达到11％和15％之间的归一化平均值误差，具体取决于变量。我们的工作也是第一个提出贝叶斯深度学习方法的工作，以预测具有良好校准的不确定性估计的森林结构变量。这些提高了模型的可信度及其适用于需要可靠的信心估计的下游任务，例如知情决策。我们提出了一组广泛的实验，以验证预测地图的准确性以及预测的不确定性的质量。为了展示可扩展性，我们为五个森林结构变量提供挪威地图。

translated by 谷歌翻译

High-resolution canopy height map in the Landes forest (France) based on GEDI, Sentinel-1, and Sentinel-2 data with a deep learning approach

Martin Schwartz , Philippe Ciais , Catherine Ottlé , Aurelien De Truchis , Cedric Vega , Ibrahim Fayad , Martin Brandt , Rasmus Fensholt , Nicolas Baghdadi , François Morneau

分类：计算机视觉

2022-12-20

In intensively managed forests in Europe, where forests are divided into stands of small size and may show heterogeneity within stands, a high spatial resolution (10 - 20 meters) is arguably needed to capture the differences in canopy height. In this work, we developed a deep learning model based on multi-stream remote sensing measurements to create a high-resolution canopy height map over the "Landes de Gascogne" forest in France, a large maritime pine plantation of 13,000 km$^2$ with flat terrain and intensive management. This area is characterized by even-aged and mono-specific stands, of a typical length of a few hundred meters, harvested every 35 to 50 years. Our deep learning U-Net model uses multi-band images from Sentinel-1 and Sentinel-2 with composite time averages as input to predict tree height derived from GEDI waveforms. The evaluation is performed with external validation data from forest inventory plots and a stereo 3D reconstruction model based on Skysat imagery available at specific locations. We trained seven different U-net models based on a combination of Sentinel-1 and Sentinel-2 bands to evaluate the importance of each instrument in the dominant height retrieval. The model outputs allow us to generate a 10 m resolution canopy height map of the whole "Landes de Gascogne" forest area for 2020 with a mean absolute error of 2.02 m on the Test dataset. The best predictions were obtained using all available satellite layers from Sentinel-1 and Sentinel-2 but using only one satellite source also provided good predictions. For all validation datasets in coniferous forests, our model showed better metrics than previous canopy height models available in the same region.

translated by 谷歌翻译

Annual field-scale maps of tall and short crops at the global scale using GEDI and Sentinel-2

Stefania Di Tommaso , Sherrie Wang , Vivek Vajipey , Noel Gorelick , Rob Strey , David B. Lobell

分类：计算机视觉 | 机器学习

2022-12-19

Crop type maps are critical for tracking agricultural land use and estimating crop production. Remote sensing has proven an efficient and reliable tool for creating these maps in regions with abundant ground labels for model training, yet these labels remain difficult to obtain in many regions and years. NASA's Global Ecosystem Dynamics Investigation (GEDI) spaceborne lidar instrument, originally designed for forest monitoring, has shown promise for distinguishing tall and short crops. In the current study, we leverage GEDI to develop wall-to-wall maps of short vs tall crops on a global scale at 10 m resolution for 2019-2021. Specifically, we show that (1) GEDI returns can reliably be classified into tall and short crops after removing shots with extreme view angles or topographic slope, (2) the frequency of tall crops over time can be used to identify months when tall crops are at their peak height, and (3) GEDI shots in these months can then be used to train random forest models that use Sentinel-2 time series to accurately predict short vs. tall crops. Independent reference data from around the world are then used to evaluate these GEDI-S2 maps. We find that GEDI-S2 performed nearly as well as models trained on thousands of local reference training points, with accuracies of at least 87% and often above 90% throughout the Americas, Europe, and East Asia. Systematic underestimation of tall crop area was observed in regions where crops frequently exhibit low biomass, namely Africa and South Asia, and further work is needed in these systems. Although the GEDI-S2 approach only differentiates tall from short crops, in many landscapes this distinction goes a long way toward mapping the main individual crop types. The combination of GEDI and Sentinel-2 thus presents a very promising path towards global crop mapping with minimal reliance on ground data.

translated by 谷歌翻译

Global canopy height regression and uncertainty estimation from GEDI LIDAR waveforms with deep ensembles

Nico Lang , Nikolai Kalischek , John Armston , Konrad Schindler , Ralph Dubayah , Jan Dirk Wegner

分类：机器学习 | 计算机视觉

2021-03-05

美国宇航局的全球生态系统动力学调查（GEDI）是一个关键的气候使命，其目标是推进我们对森林在全球碳循环中的作用的理解。虽然GEDI是第一个基于空间的激光器，明确优化，以测量地上生物质的垂直森林结构预测，这对广泛的观测和环境条件的大量波形数据的准确解释是具有挑战性的。在这里，我们提出了一种新颖的监督机器学习方法来解释GEDI波形和全球标注冠层顶部高度。我们提出了一种基于深度卷积神经网络（CNN）集合的概率深度学习方法，以避免未知效果的显式建模，例如大气噪声。该模型学会提取概括地理区域的强大特征，此外，产生可靠的预测性不确定性估计。最终，我们模型产生的全球顶棚顶部高度估计估计的预期RMSE为2.7米，低偏差。

translated by 谷歌翻译

Deep Learning Based 3D Point Cloud Regression for Estimating Forest Biomass

Stefan Oehmcke , Lei Li , Jaime Revenga , Thomas Nord-Larsen , Katerina Trepekli , Fabian Gieseke , Christian Igel

分类：计算机视觉 | 机器学习

2021-12-21

对森林生物量股票的知识及其发展对于实施有效的气候变化缓解措施是重要的。需要研究驾驶AF的过程，重新砍伐和森林砍伐，是碳核算的先决条件。使用空机激光雷达的遥感可用于测量大规模植被生物量。我们呈现深度学习系统，用于预测木材体积，地上生物量（AGB），随后直接从3D LIDAR点云数据碳。我们设计了不同的神经网络架构进行点云回归，并在遥感数据上评估AGB估计从国家森林库存中的现场测量获得的遥感数据。我们对回归的Minkowski卷积神经网络的调整给出了最佳结果。与在Point云的基本统计中运营的最先进的方法相比，深度神经网络产生了明显更准确的木材体积，AGB和碳估计，我们希望这一发现对基于LIDAR的分析产生了强烈影响陆地生态系统动态。

translated by 谷歌翻译

A CNN based method for Sub-pixel Urban Land Cover Classification using Landsat-5 TM and Resourcesat-1 LISS-IV Imagery

Krishna Kumar Perikamana , Krishnachandran Balakrishnan , Pratyush Tripathy

分类：计算机视觉 | 机器学习

2021-12-16

城市土地覆盖的时间序列数据在分析城市增长模式方面具有很大的效用，不透水表面和植被的分布变化以及对城市微观气候产生影响。虽然Landsat数据非常适于这种分析，但由于长时间系列的免费图像，传统的每像素硬分类未能产生Landsat数据的全部潜力。本文提出了一种子像素分类方法，其利用Landsat-5 TM和Resorational-1 Liss-IV传感器的时间重叠。我们训练卷积神经网络，预测30米Landsat-5 TM数据的分数陆地覆盖。从2011年的Bengaluru的一个艰难的5.8M Liss-IV图像估计参考陆地覆盖分数。此外，我们从2009年使用Mumbai数据并将其与使用的结果进行了概括和卓越的性能随机森林分类器。对于Bengaluru（2011）和Mumbai（2009）数据，我们的CNN模型的平均绝对百分比误差在30M细胞水平上的内置和植被分数预测的7.2至11.3。与最近的最近的研究不同，在使用数据在空间范围进行有限的空间范围进行验证，我们的模型已经过度培训并验证了两个不同时间段的两个Mega城市的完整空间范围的数据。因此，它可以可靠地从Landsat-5 TM时间序列数据中可靠地产生30M内置和植被分数图，以分析长期城市增长模式。

translated by 谷歌翻译

Spatial machine-learning model diagnostics: a model-agnostic distance-based approach

Alexander Brenning

分类：机器学习

2021-11-13

虽然对解释黑盒机学习（ML）模型进行了重大进展，但仍然有明显缺乏诊断工具，阐明了在预测技能和可变重要性方面阐明ML模型的空间行为。该贡献提出了空间预测误差简档（SPEP）和空间可变重要性配置文件（SVIPS）作为用于空间预测模型的新型模型 - 不可止结的评估和解释工具，其侧重于预测距离。它们的适用性在两个案例研究中展示了在环境 - 科学环境中的区域化任务，以及来自远程感应的土地覆盖分类的分类任务。在这些案例研究中，地质统计方法的SPEP和SVIPS，线性模型，随机林和杂交算法显示出惊人的差异，但也是相关的相似之处。概述了相关交叉验证技术的局限性，并且使建模者应将其模型评估和对预定空间预测视野的解释集中起来。相比之下，自相关的范围不是用于定义空间交叉验证测试集的合适标准。新型诊断工具丰富了空间数据科学的工具包，可以改善ML模型解释，选择和设计。

translated by 谷歌翻译

Towards an unsupervised large-scale 2D and 3D building mapping with airborne LiDAR data

Hunsoo Song , Jinha Jung

分类：计算机视觉

2022-05-29

2D和3D建筑图提供了宝贵的信息，以了解人类活动及其对地球及其环境的影响。尽管为提高建筑地图的质量而做出了巨大努力，但自动化方法产生的当前大规模建筑地图仍存在许多错误和不确定性，并且通常仅限于提供2D建筑信息。这项研究提出了一种开源无监督的2D和3D建筑物提取算法，并带有适用于大型建筑物映射的机载LIDAR数据。我们的算法以完全无监督的方式运行，不需要任何培训标签或培训程序。我们的算法由形态过滤和基于平面的过滤组成。因此，计算是有效的，结果易于预测，这可以大大减少所得建筑图中的不确定性。丹佛和纽约市的大规模数据集（> 550 $ km^2 $）的定量和定性评估表明，我们的算法比通过基于深度学习的方法生成的Microsoft Building Footprints可以产生更准确的建筑图。在不同条件下进行的广泛评估证实，我们的算法是可扩展的，可以通过适当的参数选择进一步改进。我们还详细介绍了参数和潜在错误来源的影响，以帮助我们算法的潜在用户。我们的基于激光雷达的算法具有优势，即生成2D和3D构建图在计算上有效，而它产生了准确且可解释的结果。我们提出的算法为带有机载激光雷达数据的全球尺度2D和3D建筑物映射提供了巨大的潜力。

translated by 谷歌翻译

Learning Inter-Annual Flood Loss Risk Models From Historical Flood Insurance Claims and Extreme Rainfall Data

Joaquin Salas , Anamitra Saha , Sai Ravela

分类：机器学习 | (统计)机器学习

2022-12-15

Flooding is one of the most disastrous natural hazards, responsible for substantial economic losses. A predictive model for flood-induced financial damages is useful for many applications such as climate change adaptation planning and insurance underwriting. This research assesses the predictive capability of regressors constructed on the National Flood Insurance Program (NFIP) dataset using neural networks (Conditional Generative Adversarial Networks), decision trees (Extreme Gradient Boosting), and kernel-based regressors (Gaussian Process). The assessment highlights the most informative predictors for regression. The distribution for claims amount inference is modeled with a Burr distribution permitting the introduction of a bias correction scheme and increasing the regressor's predictive capability. Aiming to study the interaction with physical variables, we incorporate Daymet rainfall estimation to NFIP as an additional predictor. A study on the coastal counties in the eight US South-West states resulted in an $R^2=0.807$. Further analysis of 11 counties with a significant number of claims in the NFIP dataset reveals that Extreme Gradient Boosting provides the best results, that bias correction significantly improves the similarity with the reference distribution, and that the rainfall predictor strengthens the regressor performance.

translated by 谷歌翻译

Soil Erosion in the United States. Present and Future (2020-2050)

Shahab Aldin Shojaeezadeh , Malik Al-Wardy , Mohammad Reza Nikoo , Mehrdad Ghorbani Mooselu , Mohammad Reza Alizadeh , Jan Franklin Adamowski , Hamid Moradkhani , Nasrin Alamdari , Amir H. Gandomi

分类：机器学习

2022-07-14

土壤侵蚀是对世界各地环境和长期土地管理的重大威胁。人类活动加速的土壤侵蚀会造成陆地和水生生态系统的极端变化，这在现场阶段（30-m）的当前和可能的未来没有得到充分的调查/预测。在这里，我们使用三种替代方案（2.6、4.5和8.5）估计/预测通过水侵蚀（薄板和RILL侵蚀）的土壤侵蚀速率，共享社会经济途径和代表性浓度途径（SSP-RCP）情景。田间尺度的土壤侵蚀模型（FSSLM）估计依赖于由卫星和基于图像的土地使用和土地覆盖的估计（LULC）集成的高分辨率（30-m）G2侵蚀模型，对长期降水量的规范观察，以及耦合模型比较项目阶段6（CMIP6）的方案。基线模型（2020年）估计土壤侵蚀速率为2.32 mg HA 1年1年，具有当前的农业保护实践（CPS）。当前CPS的未来情况表明，在气候和LULC变化的SSP-RCP方案的不同组合下，增加了8％至21％。 2050年的土壤侵蚀预测表明，所有气候和LULC场景都表明极端事件的增加或极端空间位置的变化很大程度上从南部到美国东部和东北地区。

translated by 谷歌翻译

Mapping smallholder cashew plantations to inform sustainable tree crop expansion in Benin

Leikun Yin , Rahul Ghosh , Chenxi Lin , David Hale , Christoph Weigl , James Obarowski , Junxiong Zhou , Jessica Till , Xiaowei Jia , Troy Mao

分类：计算机视觉 | 机器学习

2023-01-01

Cashews are grown by over 3 million smallholders in more than 40 countries worldwide as a principal source of income. As the third largest cashew producer in Africa, Benin has nearly 200,000 smallholder cashew growers contributing 15% of the country's national export earnings. However, a lack of information on where and how cashew trees grow across the country hinders decision-making that could support increased cashew production and poverty alleviation. By leveraging 2.4-m Planet Basemaps and 0.5-m aerial imagery, newly developed deep learning algorithms, and large-scale ground truth datasets, we successfully produced the first national map of cashew in Benin and characterized the expansion of cashew plantations between 2015 and 2021. In particular, we developed a SpatioTemporal Classification with Attention (STCA) model to map the distribution of cashew plantations, which can fully capture texture information from discriminative time steps during a growing season. We further developed a Clustering Augmented Self-supervised Temporal Classification (CASTC) model to distinguish high-density versus low-density cashew plantations by automatic feature extraction and optimized clustering. Results show that the STCA model has an overall accuracy of 80% and the CASTC model achieved an overall accuracy of 77.9%. We found that the cashew area in Benin has doubled from 2015 to 2021 with 60% of new plantation development coming from cropland or fallow land, while encroachment of cashew plantations into protected areas has increased by 70%. Only half of cashew plantations were high-density in 2021, suggesting high potential for intensification. Our study illustrates the power of combining high-resolution remote sensing imagery and state-of-the-art deep learning algorithms to better understand tree crops in the heterogeneous smallholder landscape.

translated by 谷歌翻译

Landslide Susceptibility Modeling by Interpretable Neural Network

Khaled Youssef , Kevin Shao , Seulgi Moon , Louis-Serge Bouchard

分类：机器学习

2022-01-18

众所周知，由于许多空间和时间变化的因素有助于斜率稳定性，因此难以预测滑坡。人工神经网络（ANN）已被证明可以提高预测准确性。但是，传统的ANN是无法解释的，复杂的黑匣子模型。这使得很难在建模区域中提取有关滑坡控制的机械信息，或在此高风险应用中信任结果。在此，我们介绍了可解释的加性神经网络在滑坡易感性建模中的首次应用。我们介绍了一个新的添加剂ANN优化框架，以及新的数据集除法和结果解释技术，适用于使用空间依赖的数据结构（例如滑坡易感性）建模应用程序。我们将我们的方法称为完全可解释性，高精度，高推广性和低模型复杂性作为超固有神经网络（SNN）优化的方法。我们通过培训模型来验证我们的方法，以评估喜马拉雅山脉最容易受到滑坡的三个不同区域的滑坡敏感性。 SNN生成的可解释的神经网络模型胜过基于物理的稳定性和统计模型，并实现了与最先进的深神经网络相似的性能，同时提供了有关滑坡控制因素的相对重要性的见解。 SNN模型发现，斜坡，降水和山坡方面的产物是对研究区域中高压滑敏感性的重要主要因素。这些确定的控件表明，强烈的斜坡气候耦合以及微气候以及在最东部喜马拉雅山的滑坡事件中起主要作用。

translated by 谷歌翻译

Towards Daily High-resolution Inundation Observations using Deep Learning and EO

Antara Dasgupta , Lasse Hybbeneth , Björn Waske

分类：计算机视觉 | 机器学习

2022-08-10

卫星遥感提供了一种具有成本效益的概要洪水监测的解决方案，卫星衍生的洪水图为传统上使用的数值洪水淹没模型提供了一种计算有效的替代方法。尽管卫星碰巧涵盖正在进行的洪水事件时确实提供了及时的淹没信息，但它们受其时空分辨率的限制，因为它们在各种规模上动态监测洪水演变的能力。不断改善对新卫星数据源的访问以及大数据处理功能，就此问题的数据驱动解决方案而言，已经解锁了前所未有的可能性。具体而言，来自卫星的数据融合，例如哥白尼前哨，它们具有很高的空间和低时间分辨率，以及来自NASA SMAP和GPM任务的数据，它们的空间较低，但时间较高的时间分辨率可能会导致高分辨率的洪水淹没在A处的高分辨率洪水。每日规模。在这里，使用Sentinel-1合成孔径雷达和各种水文，地形和基于土地利用的预测因子衍生出的洪水淹没图对卷积神经网络进行了训练，以预测高分辨率的洪水泛滥概率图。使用Sentinel-1和Sentinel-2衍生的洪水面罩，评估了UNET和SEGNET模型架构的性能，分别具有95％的信心间隔。精确召回曲线（PR-AUC）曲线下的区域（AUC）被用作主要评估指标，这是由于二进制洪水映射问题中类固有的不平衡性质，最佳模型提供了PR-AUC 0.85。

translated by 谷歌翻译

Spatio-temporal estimation of wind speed and wind power using machine learning: predictions, uncertainty and technical potential

Federico Amato , Fabian Guignard , Alina Walch , Nahid Mohajeri , Jean-Louis Scartezzini , Mikhail Kanevski

分类：人工智能 | 机器学习

2021-07-29

在过去的几十年中，风产能的增长表明，风能可以促进世界许多地区的能源过渡。对于模型的高度可变和复杂，对风能的时空变化和相关的不确定性的定量与能源计划者高度相关。机器学习已成为执行风速和功率预测的流行工具。但是，现有方法有几个局限性。其中包括（i）在风速数据中不足以考虑时空相关性，（ii）缺乏量化风速预测不确定性及其对风能估算的不确定性的现有方法，以及（iii）焦点在少于小时的频率上。为了克服这些局限性，我们引入了一个框架，以从不规则分布的风速测量值中的常规网格上重建时空场。将数据分解为时间引用的基础函数及其相应的空间分布系数后，后者是使用极端学习机对空间建模的。然后，对模型和预测不确定性的估计及其在风速转化为风能后的传播的估计值，然后将提供对数据分布模式的任何假设。该方法适用于研究瑞士100米轮毂高度的250 x 250平方米的小时风能潜力，为该国提供了其类型的第一个数据集。潜在的风力发电与风力涡轮机安装的可用区域相结合，以估算瑞士风力发电的技术潜力。此处介绍的风力估算代表了计划人员的重要意见，以支持风力发电增加的未来能源系统的设计。

translated by 谷歌翻译

Applications of Machine Learning in Chemical and Biological Oceanography

Balamurugan Sadaiappan , Preethiya Balakrishnan , Vishal CR , Neethu T Vijayan , Mahendran Subramanian , Mangesh U Gauns

分类：机器学习

2022-09-23

机器学习（ML）是指根据大量数据预测有意义的输出或对复杂系统进行分类的计算机算法。 ML应用于各个领域，包括自然科学，工程，太空探索甚至游戏开发。本文的重点是在化学和生物海洋学领域使用机器学习。在预测全球固定氮水平，部分二氧化碳压力和其他化学特性时，ML的应用是一种有前途的工具。机器学习还用于生物海洋学领域，可从各种图像（即显微镜，流车和视频记录器），光谱仪和其他信号处理技术中检测浮游形式。此外，ML使用其声学成功地对哺乳动物进行了分类，在特定的环境中检测到濒临灭绝的哺乳动物和鱼类。最重要的是，使用环境数据，ML被证明是预测缺氧条件和有害藻华事件的有效方法，这是对环境监测的重要测量。此外，机器学习被用来为各种物种构建许多对其他研究人员有用的数据库，而创建新算法将帮助海洋研究界更好地理解海洋的化学和生物学。

translated by 谷歌翻译

Detecting Crop Burning in India using Satellite Data

Kendra Walker , Ben Moscona , Kelsey Jack , Seema Jayachandran , Namrata Kala , Rohini Pande , Jiani Xue , Marshall Burke

分类：计算机视觉 | 机器学习

2022-09-21

农作物残留物燃烧是世界许多地方的空气污染的主要来源，尤其是南亚。政策制定者，从业人员和研究人员都投资了衡量影响和制定干预措施以减少燃烧。但是，测量燃烧的影响或干预措施的有效性减少燃烧需要数据燃烧的位置。这些数据在成本和可行性方面都在现场收集具有挑战性。我们利用印度旁遮普邦旁遮普邦农作物残留物燃烧的地面监测的数据，以探索使用可访问的卫星图像是否可以更有效地检测到燃烧。具体而言，我们使用了具有高时间分辨率（最多每天）的3M Planetscope数据以及具有每周时间分辨率但光谱信息深度的公共可用Sentinel-2数据。在分析了不同光谱带和燃烧指数单独分离燃烧和未燃烧图的能力之后，我们构建了一个随机森林模型，这些模型确定提供了最大的分离性，并用地面验证的数据评估了模型性能。鉴于测量所带来的挑战，我们的总体模型精度为82％是有利的。基于此过程的见解，我们讨论了检测卫星图像中农作物残留物燃烧的技术挑战，以及衡量燃烧和政策干预措施的影响的挑战。

translated by 谷歌翻译

Generating gapless land surface temperature with a high spatio-temporal resolution by fusing multi-source satellite-observed and model-simulated data

Jun Ma , Huanfeng Shen , Penghai Wu , Jingan Wu , Meiling Gao , Chunlei Meng

分类：人工智能

2021-11-29

陆地温度（LST）是监控土地面过程时的关键参数。然而，云污染和空间和时间分辨率之间的权衡大大妨碍了对高质量的热红外（TIR）遥感数据的访问。尽管采取了巨大的努力来解决这些困境，但仍然难以通过并发空间完整性和高时空分辨率产生LST估计。陆地表面模型（LSM）可用于模拟高度的时间分辨率的Genpless LST，但这通常具有低空间分辨率。在本文中，我们向卫星观察和LSM模拟LST数据提供了一个集成的温度融合框架，以通过60米的空间分辨率和半小时时间分辨率映射Gapless LST。全局线性模型（GLOLM）模型和昼夜陆地表面温度周期（DTC）模型分别作为预处理步骤进行传感器和不同LST数据之间的时间归一化。然后使用基于滤波器的时空集成融合模型融合Landsat LST，适度分辨率成像光谱仪（MODIS）LST和社区土地模型5.0（CLM 5.0）-SIMUTION LST。在一个城市主导地区（中国武汉市）和自然主导地区（中国海河流域）实施了评估，在准确性，空间可变性和日颞动力学方面。结果表明，熔融LST与实际LANDSAT LST数据（原位LST测量）高于Pearson相关系数，在0.94（0.97-0.99）方面，平均绝对误差为0.71-0.98k（0.82-3.17 k ）和根平均误差为0.97-1.26 k（1.09-3.97 k）。

translated by 谷歌翻译

A Machine Learning Tutorial for Operational Meteorology, Part I: Traditional Machine Learning

Randy J. Chase , David R. Harrison , Amanda Burke , Gary M. Lackmann , Amy McGovern

分类：机器学习

2022-04-15

最近，在气象学中使用机器学习大大增加了。尽管许多机器学习方法并不是什么新鲜事物，但有关机器学习的大学课程在很大程度上是气象学专业的学生，不需要成为气象学家。缺乏正式的教学导致人们认为机器学习方法是“黑匣子”，因此最终用户不愿在每天的工作流程中应用机器学习方法。为了减少机器学习方法的不透明性，并降低了对气象学中机器学习的犹豫，本文对一些最常见的机器学习方法进行了调查。一个熟悉的气象示例用于将机器学习方法背景化，同时还使用普通语言讨论机器学习主题。证明了以下机器学习方法：线性回归；逻辑回归；决策树；随机森林；梯度增强了决策树；天真的贝叶斯；并支持向量机。除了讨论不同的方法外，本文还包含有关通用机器学习过程的讨论以及最佳实践，以使读者能够将机器学习应用于自己的数据集。此外，所有代码（以Jupyter笔记本电脑和Google Colaboratory Notebooks的形式）用于在论文中进行示例，以促进气象学中的机器学习使用。

translated by 谷歌翻译