智能论文笔记

Building Matters: Spatial Variability in Machine Learning Based Thermal Comfort Prediction in Winters

Betty Lala , Srikant Manas Kala , Anmol Rastogi , Kunal Dahiya , Hirozumi Yamaguchi , Aya Hagishima

分类：机器学习

2022-06-28

室内环境中的热舒适感会对乘员的健康，福祉和表现产生巨大影响。鉴于对能源效率和实现智能建筑的关注，机器学习（ML）越来越多地用于数据驱动的热舒适度（TC）预测。通常，提出了用于空调或HVAC通风建筑物的基于ML的解决方案，这些模型主要是为成年人设计的。另一方面，在大多数国家 /地区，自然通风（NV）的建筑物是常态。它们也是节能和长期可持续性目标的理想选择。但是，NV建筑物的室内环境缺乏热调节，并且在空间环境中差异很大。这些因素使TC预测极具挑战性。因此，确定建筑环境对TC模型性能的影响很重要。此外，需要研究跨不同NV室内空间的TC预测模型的概括能力。这项工作解决了这些问题。数据是通过在5个自然通风的学校建筑中进行的为期一个月的实地实验，涉及512名小学生。空间变异性对学生舒适度的影响通过预测准确性的变化（高达71％）来证明。还通过特征重要性的变化来证明建筑环境对TC预测的影响。此外，对儿童（我们的数据集）和成人（ASHRAE-II数据库）进行了模型性能的空间变异性比较分析。最后，评估了NV教室中热舒适模型的概括能力，并强调了主要挑战。

translated by 谷歌翻译

Are You Comfortable Now: Deep Learning the Temporal Variation in Thermal Comfort in Winters

Betty Lala , Srikant Manas Kala , Anmol Rastogi , Kunal Dahiya , Aya Hagishima

分类：机器学习 | 人工智能

2022-08-20

智能建筑中的室内热舒适对乘员的健康和表现有重大影响。因此，机器学习（ML）越来越多地用于解决与室内热舒适的挑战。热舒适感的时间变化是调节居住者福祉和能耗的重要问题。但是，在大多数基于ML的热舒适研究中，不考虑时间中的时间方面，例如一天中的时间，昼夜节律和室外温度。这项工作解决了这些问题。它研究了昼夜节律和室外温度对ML模型的预测准确性和分类性能的影响。数据是通过在14个教室中进行的长达一个月的实地实验收集的，其中512名小学生。四个热舒适度指标被认为是深神经网络的输出，并支持数据集的向量机模型。时间变异性对学童舒适性的影响通过“一天中的时间”分析显示。预测准确性的时间差异已显示（多达80％）。此外，我们表明室外温度（随时间变化）对热舒适模型的预测性能产生了积极影响高达30％。时空环境的重要性通过对比的是微观级别（特定于位置）和宏观级别（整个城市的6个位置）的重要性。这项工作的最重要发现是，对于多种热舒适度指标，显示了预测准确性的明确提高，而天空中的时间和天空照明则有所增加。

translated by 谷歌翻译

Cohort comfort models -- Using occupants' similarity to predict personal thermal preference with less data

Matias Quintana , Stefano Schiavon , Federico Tartarini , Joyce Kim , Clayton Miller

分类：机器学习

2022-08-05

我们介绍了队列舒适模型，这是一个新框架，用于预测新乘员如何看待其热环境。队列舒适模型利用从样本人群中收集的历史数据，这些数据具有一些潜在的偏好相似性，以预测新居民的热偏好反应。我们的框架能够利用可用的背景信息，例如物理特征和一次性的登机调查（对生活尺度的满意度，高度敏感的人尺度，五个个性特征）以及新乘员以及生理和环境传感器的测量值与热偏好响应配对。我们在两个公开可用的数据集中实施了框架，其中包含来自55人的纵向数据，其中包括6,000多个单独的热舒适调查。我们观察到，使用背景信息的队列舒适模型几乎没有变化的热偏好预测性能，但没有使用历史数据。另一方面，使用队列舒适模型的每个数据集占用人群的一半和三分之一的占用人群，而目标居民的历史数据较少，同类舒适模型将其热偏好预测增加了8〜 \％，平均为5〜 \％与对整个乘员人群进行训练的通用模型相比，某些乘员最多可容纳36点\％和46〜％。该框架以数据和站点不可知的方式呈现，其不同的组件很容易根据乘员和建筑物的数据可用性定制。队列舒适模型可能是迈向个性化的重要一步，而无需为每个新乘员开发个性化模型。

translated by 谷歌翻译

Understanding occupants' behaviour, engagement, emotion, and comfort indoors with heterogeneous sensors and wearables

Nan Gao , Max Marschall , Jane Burry , Simon Watkins , Flora D. Salim

分类：机器学习

2021-05-14

我们在澳大利亚墨尔本郊区的K-12私立学校进行了一个田间研究。数据捕获包含两个元素：首先，使用两个室外气象站的5个月纵向场研究，以及17个教室的室内气象站和乘员控制的房间空调的通风口上的温度传感器;这些在5分钟的测井频率下为每个教室的各个数据集中的各个数据集，包括乘员存在的额外数据。数据集用于推出乘员如何运营房间空调单元的预测模型。其次，我们在4周的横断面研究en-gage中跟踪了23名学生和6名教师，使用可穿戴传感器来记录生理数据，以及日常调查来查询乘客的热舒适度，学习参与，情绪和座位行为。总的来说，组合的数据集可用于分析校园内室内/室外气候和学生行为/精神状态之间的关系，这为未来设计智能反馈系统的机会为学生和员工受益。

translated by 谷歌翻译

Machine Learning for Smart and Energy-Efficient Buildings

Hari Prasanna Das , Yu-Wen Lin , Utkarsha Agwan , Lucas Spangher , Alex Devonport , Yu Yang , Jan Drgona , Adrian Chong , Stefano Schiavon , Costas J. Spanos

分类：机器学习

2022-11-27

Energy consumption in buildings, both residential and commercial, accounts for approximately 40% of all energy usage in the U.S., and similar numbers are being reported from countries around the world. This significant amount of energy is used to maintain a comfortable, secure, and productive environment for the occupants. So, it is crucial that the energy consumption in buildings must be optimized, all the while maintaining satisfactory levels of occupant comfort, health, and safety. Recently, Machine Learning has been proven to be an invaluable tool in deriving important insights from data and optimizing various systems. In this work, we review the ways in which machine learning has been leveraged to make buildings smart and energy-efficient. For the convenience of readers, we provide a brief introduction of several machine learning paradigms and the components and functioning of each smart building system we cover. Finally, we discuss challenges faced while implementing machine learning algorithms in smart buildings and provide future avenues for research at the intersection of smart buildings and machine learning.

translated by 谷歌翻译

High-resolution synthetic residential energy use profiles for the United States

Swapna Thorve , Young Yun Baek , Samarth Swarup , Henning Mortveit , Achla Marathe , Anil Vullikanti , Madhav Marathe

分类：人工智能

2022-10-14

Efficient energy consumption is crucial for achieving sustainable energy goals in the era of climate change and grid modernization. Thus, it is vital to understand how energy is consumed at finer resolutions such as household in order to plan demand-response events or analyze the impacts of weather, electricity prices, electric vehicles, solar, and occupancy schedules on energy consumption. However, availability and access to detailed energy-use data, which would enable detailed studies, has been rare. In this paper, we release a unique, large-scale, synthetic, residential energy-use dataset for the residential sector across the contiguous United States covering millions of households. The data comprise of hourly energy use profiles for synthetic households, disaggregated into Thermostatically Controlled Loads (TCL) and appliance use. The underlying framework is constructed using a bottom-up approach. Diverse open-source surveys and first principles models are used for end-use modeling. Extensive validation of the synthetic dataset has been conducted through comparisons with reported energy-use data. We present a detailed, open, high-resolution, residential energy-use dataset for the United States.

translated by 谷歌翻译

Reshaping Smart Energy Transition: An analysis of human-building interactions in Qatar Using Machine Learning Techniques

Rateb Jabbar , Esmat Zaidan , Ahmed ben Said , Ali Ghofrani

分类：机器学习

2021-11-16

政策规划有可能为发展中国家的战略发展和经济多样化做出贡献，即使没有相当的结构性变化。在这项研究中，我们分析了一系列以人为本的尺寸，旨在改善与卡塔尔建筑业有关的能源政策。考虑到不同金融和文化背景的高百分比和移民社区与GCC联盟的当地社区相比，有不同的金融和文化背景和行为模式，需要调查人类方面以提出适当的能源政策。本研究探讨了社会经济，行为和人口统计尺寸的相关性，以确定能源使用，职责，动机，习惯和整体福祉差异背后的主要因素。该样本包括卡塔尔的2,200人，它被聚集成两个消费类别：高低。特别是，该研究侧重于探索人类室内舒适感依赖性，具有建筑功能。根据行为模式，探讨了需求计划和能源补贴的金融司机。随后，数据分析导致对干预措施，社会福祉和意识的能源政策的影响。机器学习方法用于执行特征重要性分析以确定人类行为的主要因素。本研究的调查结果表明人类因素如何影响住宅和工作环境，规范，习惯，自责，后果意识和消费的舒适感。该研究对开发有针对性的策略具有重要意义，旨在提高能源政策和可持续性绩效指标的疗效。

translated by 谷歌翻译

Estimating Building Energy Efficiency From Street View Imagery, Aerial Imagery, and Land Surface Temperature Data

Kevin Mayer , Lukas Haas

分类：计算机视觉 | 人工智能

2022-06-05

通过以有针对性和高效的方式改善现有建筑物库存的能源效率来提高建筑业的脱碳化，这仍然具有挑战性。这是因为截至目前，建筑物的能源效率通常取决于经过认证的能源审核员的现场访问，这使得该过程缓慢，昂贵且地理位置上不完整。为了加速大规模识别有希望的改造目标，我们建议仅从远程感知的数据源估算建筑能源效率。为此，我们收集了街景，空中景观，足迹和卫星式土地表面温度（LST）数据，用于英国四个不同地理位置上的近40,000座建筑物。在训练了融合输入数据的多个端到端深度学习模型之后，我们将建筑物分类为节能（EU等级A-D）或效率低下（EU等级E-G），我们在定量和质量上分析了最佳性能模型。最后，我们通过在消融研究中研究每个数据源的预测能力来扩展分析。我们发现，最佳的端到端深度学习模型的F1得分为62.06％，并且胜过基于K-NN和SVM的基线模型的表现分别为5.62至11.47个百分点。因此，这项工作显示了远程感知的数据在预测能源效率方面的潜力和互补性，并为将来的工作打开了新的机会，以整合其他数据源。

translated by 谷歌翻译

Comparison and Evaluation of Methods for a Predict+Optimize Problem in Renewable Energy

Christoph Bergmeir , Frits de Nijs , Abishek Sriramulu , Mahdi Abolghasemi , Richard Bean , John Betts , Quang Bui , Nam Trong Dinh , Nils Einecke , Rasul Esmaeilbeigi

分类：人工智能

2022-12-21

Algorithms that involve both forecasting and optimization are at the core of solutions to many difficult real-world problems, such as in supply chains (inventory optimization), traffic, and in the transition towards carbon-free energy generation in battery/load/production scheduling in sustainable energy systems. Typically, in these scenarios we want to solve an optimization problem that depends on unknown future values, which therefore need to be forecast. As both forecasting and optimization are difficult problems in their own right, relatively few research has been done in this area. This paper presents the findings of the ``IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling," held in 2021. We present a comparison and evaluation of the seven highest-ranked solutions in the competition, to provide researchers with a benchmark problem and to establish the state of the art for this benchmark, with the aim to foster and facilitate research in this area. The competition used data from the Monash Microgrid, as well as weather data and energy market data. It then focused on two main challenges: forecasting renewable energy production and demand, and obtaining an optimal schedule for the activities (lectures) and on-site batteries that lead to the lowest cost of energy. The most accurate forecasts were obtained by gradient-boosted tree and random forest models, and optimization was mostly performed using mixed integer linear and quadratic programming. The winning method predicted different scenarios and optimized over all scenarios jointly using a sample average approximation method.

translated by 谷歌翻译

Machine Learning Application Development: Practitioners' Insights

Md Saidur Rahman , Foutse Khomh , Alaleh Hamidi , Jinghui Cheng , Giuliano Antoniol , Hironori Washizaki

分类：机器学习

2021-12-31

如今，由于最近在人工智能（AI）和机器学习（ML）中的近期突破，因此，智能系统和服务越来越受欢迎。然而，机器学习不仅满足软件工程，不仅具有有希望的潜力，而且还具有一些固有的挑战。尽管最近的一些研究努力，但我们仍然没有明确了解开发基于ML的申请和当前行业实践的挑战。此外，目前尚不清楚软件工程研究人员应将其努力集中起来，以更好地支持ML应用程序开发人员。在本文中，我们报告了一个旨在了解ML应用程序开发的挑战和最佳实践的调查。我们合成从80名从业者（以不同的技能，经验和应用领域）获得的结果为17个调查结果;概述ML应用程序开发的挑战和最佳实践。参与基于ML的软件系统发展的从业者可以利用总结最佳实践来提高其系统的质量。我们希望报告的挑战将通知研究界有关需要调查的主题，以改善工程过程和基于ML的申请的质量。

translated by 谷歌翻译

Machine Learning to Predict the Antimicrobial Activity of Cold Atmospheric Plasma-Activated Liquids

Mehmet Akif Ozdemir , Gizem Dilara Ozdemir , Merve Gul , Onan Guren , Utku Kursat Ercan

分类：机器学习

2022-07-25

血浆定义为物质的第四个状态，在高电场下可以在大气压下产生非热血浆。现在众所周知，血浆激活液体（PAL）的强和广谱抗菌作用。机器学习（ML）在医疗领域的可靠适用性也鼓励其在等离子体医学领域的应用。因此，在PALS上的ML应用可以提出一种新的观点，以更好地了解各种参数对其抗菌作用的影响。在本文中，通过使用先前获得的数据来定性预测PAL的体外抗菌活性，从而介绍了比较监督的ML模型。进行了文献搜索，并从33个相关文章中收集了数据。在所需的预处理步骤之后，将两种监督的ML方法（即分类和回归）应用于数据以获得微生物灭活（MI）预测。对于分类，MI分为四类，对于回归，MI被用作连续变量。为分类和回归模型进行了两种不同的可靠交叉验证策略，以评估所提出的方法。重复分层的K折交叉验证和K折交叉验证。我们还研究了不同特征对模型的影响。结果表明，高参数优化的随机森林分类器（ORFC）和随机森林回归者（ORFR）分别比其他模型进行了分类和回归的模型更好。最后，获得ORFC的最佳测试精度为82.68％，ORFR的R2为0.75。 ML技术可能有助于更好地理解在所需的抗菌作用中具有主要作用的血浆参数。此外，此类发现可能有助于将来的血浆剂量定义。

translated by 谷歌翻译

Auditing the Imputation Effect on Fairness of Predictive Analytics in Higher Education

Hadis Anahideh , Parian Haghighat , Nazanin Nezami , Denisa G`andara

分类：机器学习

2021-09-13

Colleges and universities use predictive analytics in a variety of ways to increase student success rates. Despite the potential for predictive analytics, two major barriers exist to their adoption in higher education: (a) the lack of democratization in deployment, and (b) the potential to exacerbate inequalities. Education researchers and policymakers encounter numerous challenges in deploying predictive modeling in practice. These challenges present in different steps of modeling including data preparation, model development, and evaluation. Nevertheless, each of these steps can introduce additional bias to the system if not appropriately performed. Most large-scale and nationally representative education data sets suffer from a significant number of incomplete responses from the research participants. While many education-related studies addressed the challenges of missing data, little is known about the impact of handling missing values on the fairness of predictive outcomes in practice. In this paper, we set out to first assess the disparities in predictive modeling outcomes for college-student success, then investigate the impact of imputation techniques on the model performance and fairness using a commonly used set of metrics. We conduct a prospective evaluation to provide a less biased estimation of future performance and fairness than an evaluation of historical data. Our comprehensive analysis of a real large-scale education dataset reveals key insights on modeling disparities and how imputation techniques impact the fairness of the student-success predictive outcome under different testing scenarios. Our results indicate that imputation introduces bias if the testing set follows the historical distribution. However, if the injustice in society is addressed and consequently the upcoming batch of observations is equalized, the model would be less biased.

translated by 谷歌翻译

Automated Systems For Diagnosis of Dysgraphia in Children: A Survey and Novel Framework

Jayakanth Kunhoth , Somaya Al-Maadeed , Suchithra Kunhoth , Younus Akbari

分类：机器学习 | 人工智能 | 计算机视觉

2022-06-27

众所周知，学习障碍主要干扰阅读，写作和数学等基本学习技能，会影响世界上约10％的儿童。作为神经发育障碍的一部分的运动技能和运动协调不足可能成为学习写作困难（障碍）的原因因素，从而阻碍了个人的学术轨道。障碍症的体征和症状包括但不限于不规则的笔迹，不正确的写作媒介处理，缓慢或劳力的写作，不寻常的手部位等。所有类型的学习障碍的评估标准是由医学医学进行的检查专家。少数可用的人工智能筛查系统用于障碍症，依赖于相应图像中手写的独特特征。这项工作对文献中儿童的现有自动化障碍诊断系统进行了综述。这项工作的主要重点是审查基于人工智能的儿童诊断的基于人工智能的系统。这项工作讨论了数据收集方法，重要的手写功能，用于诊断障碍症的文献中使用的机器学习算法。除此之外，本文还讨论了一些基于非人工智能的自动化系统。此外，本文讨论了现有系统的缺点，并提出了一个新颖的障碍诊断框架。

translated by 谷歌翻译

A Concurrent CNN-RNN Approach for Multi-Step Wind Power Forecasting

Syed Kazmi , Berk Gorgulu , Mucahit Cevik , Mustafa Gokce Baydogan

分类：机器学习

2023-01-02

Wind power forecasting helps with the planning for the power systems by contributing to having a higher level of certainty in decision-making. Due to the randomness inherent to meteorological events (e.g., wind speeds), making highly accurate long-term predictions for wind power can be extremely difficult. One approach to remedy this challenge is to utilize weather information from multiple points across a geographical grid to obtain a holistic view of the wind patterns, along with temporal information from the previous power outputs of the wind farms. Our proposed CNN-RNN architecture combines convolutional neural networks (CNNs) and recurrent neural networks (RNNs) to extract spatial and temporal information from multi-dimensional input data to make day-ahead predictions. In this regard, our method incorporates an ultra-wide learning view, combining data from multiple numerical weather prediction models, wind farms, and geographical locations. Additionally, we experiment with global forecasting approaches to understand the impact of training the same model over the datasets obtained from multiple different wind farms, and we employ a method where spatial information extracted from convolutional layers is passed to a tree ensemble (e.g., Light Gradient Boosting Machine (LGBM)) instead of fully connected layers. The results show that our proposed CNN-RNN architecture outperforms other models such as LGBM, Extra Tree regressor and linear regression when trained globally, but fails to replicate such performance when trained individually on each farm. We also observe that passing the spatial information from CNN to LGBM improves its performance, providing further evidence of CNN's spatial feature extraction capabilities.

translated by 谷歌翻译

Applications of Machine Learning in Chemical and Biological Oceanography

Balamurugan Sadaiappan , Preethiya Balakrishnan , Vishal CR , Neethu T Vijayan , Mahendran Subramanian , Mangesh U Gauns

分类：机器学习

2022-09-23

机器学习（ML）是指根据大量数据预测有意义的输出或对复杂系统进行分类的计算机算法。 ML应用于各个领域，包括自然科学，工程，太空探索甚至游戏开发。本文的重点是在化学和生物海洋学领域使用机器学习。在预测全球固定氮水平，部分二氧化碳压力和其他化学特性时，ML的应用是一种有前途的工具。机器学习还用于生物海洋学领域，可从各种图像（即显微镜，流车和视频记录器），光谱仪和其他信号处理技术中检测浮游形式。此外，ML使用其声学成功地对哺乳动物进行了分类，在特定的环境中检测到濒临灭绝的哺乳动物和鱼类。最重要的是，使用环境数据，ML被证明是预测缺氧条件和有害藻华事件的有效方法，这是对环境监测的重要测量。此外，机器学习被用来为各种物种构建许多对其他研究人员有用的数据库，而创建新算法将帮助海洋研究界更好地理解海洋的化学和生物学。

translated by 谷歌翻译

Fruit Ripeness Classification: a Survey

Matteo Rizzo , Matteo Marcuzzo , Alessandro Zangari , Andrea Gasparetto , Andrea Albarelli

分类：计算机视觉 | 机器学习

2022-12-29

Fruit is a key crop in worldwide agriculture feeding millions of people. The standard supply chain of fruit products involves quality checks to guarantee freshness, taste, and, most of all, safety. An important factor that determines fruit quality is its stage of ripening. This is usually manually classified by experts in the field, which makes it a labor-intensive and error-prone process. Thus, there is an arising need for automation in the process of fruit ripeness classification. Many automatic methods have been proposed that employ a variety of feature descriptors for the food item to be graded. Machine learning and deep learning techniques dominate the top-performing methods. Furthermore, deep learning can operate on raw data and thus relieve the users from having to compute complex engineered features, which are often crop-specific. In this survey, we review the latest methods proposed in the literature to automatize fruit ripeness classification, highlighting the most common feature descriptors they operate on.

translated by 谷歌翻译

Computer vision-based analysis of buildings and built environments: A systematic review of current approaches

Małgorzata B. Starzyńska , Robin Roussel , Sam Jacoby , Ali Asadipour

分类：计算机视觉

2022-08-01

分析了2011年至2021年发表的88个来源，本文对基于计算机的建筑物和建筑环境进行了首次系统评价，以评估其对建筑和城市设计研究的价值。遵循多阶段的选择过程，讨论了有关建筑应用，例如建筑物分类，详细分类，定性环境分析，建筑条件调查和建筑价值估算等建筑应用程序的类型。这揭示了当前的研究差距和趋势，并突出了研究目标的两个主要类别。首先，要使用或优化计算机视觉方法进行体系结构图像数据，然后可以帮助自动化耗时，劳动密集型或复杂的视觉分析任务。其次，通过查找视觉，统计和定性数据之间的模式和关系来探索机器学习方法的方法论上的好处，以研究有关建筑环境的新问题，这可以克服传统手动分析的局限性。不断增长的研究为建筑和设计研究提供了新的方法，论文确定了未来的研究挑战和方向。

translated by 谷歌翻译

Predicting the Location of Bicycle-sharing Stations using OpenStreetMap Data

Kamil Raczycki

分类：机器学习 | 人工智能

2021-11-02

规划自行车共享站的布局是一个复杂的过程，特别是在刚刚实施自行车共享系统的城市。城市规划者通常必须根据公开可用的数据并私下提供来自管理的数据，然后使用现场流行的位置分配模型。较小城市的许多城市可能难以招聘专家进行此类规划。本文提出了一种新的解决方案来简化和促进通过使用空间嵌入方法来实现这种规划的过程。仅基于来自OpenStreetMap的公开数据，以及来自欧洲34个城市的站布局，已经开发了一种使用优步H3离散全球电网系统将城市分成微区域的方法，并指示其值得放置站的区域在不同城市使用转移学习的现有系统。工作的结果是在规划驻地布局的决策中支持规划者的机制，以选择参考城市。

translated by 谷歌翻译

Artificial Intelligence-Based Analytics for Impacts of COVID-19 and Online Learning on College Students' Mental Health

Mostafa Rezapour , Scott K. Elmshaeuser

分类：机器学习

2022-02-07

Covid-19是由新型冠状病毒（SARS-COV-2）引起的疾病，于2019年12月下旬首次在中国武汉出现。不久之后，该病毒在全球范围内传播，并于3月被世界卫生组织宣布为大流行病。 2020年。这造成了世界各地和美国的许多变化，包括向在线学习的教育转变。在本文中，我们试图了解Covid-19-19的大流行和在线学习的增加如何影响大学生的情感福祉。我们使用几种机器学习和统计模型来分析卢布尔雅那大学公共行政学院，斯洛文尼亚大学，与国际大学，其他高等教育机构和学生协会一起收集的数据。我们的结果表明，与学生的学术生活有关的特征对他们的情感健康产生了最大的影响。其他重要因素包括学生对大学和政府对大流行的处理以及学生的财务安全的满意。

translated by 谷歌翻译

IoT Data Analytics in Dynamic Environments: From An Automated Machine Learning Perspective

Li Yang , Abdallah Shami

分类：机器学习

2022-09-16

近年来，随着传感器和智能设备的广泛传播，物联网（IoT）系统的数据生成速度已大大增加。在物联网系统中，必须经常处理，转换和分析大量数据，以实现各种物联网服务和功能。机器学习（ML）方法已显示出其物联网数据分析的能力。但是，将ML模型应用于物联网数据分析任务仍然面临许多困难和挑战，特别是有效的模型选择，设计/调整和更新，这给经验丰富的数据科学家带来了巨大的需求。此外，物联网数据的动态性质可能引入概念漂移问题，从而导致模型性能降解。为了减少人类的努力，自动化机器学习（AUTOML）已成为一个流行的领域，旨在自动选择，构建，调整和更新机器学习模型，以在指定任务上实现最佳性能。在本文中，我们对Automl区域中模型选择，调整和更新过程中的现有方法进行了审查，以识别和总结将ML算法应用于IoT数据分析的每个步骤的最佳解决方案。为了证明我们的发现并帮助工业用户和研究人员更好地实施汽车方法，在这项工作中提出了将汽车应用于IoT异常检测问题的案例研究。最后，我们讨论并分类了该领域的挑战和研究方向。

translated by 谷歌翻译