智能论文笔记

Systematic Literature Review: Quantum Machine Learning and its applications

David Peral García , Juan Cruz-Benito , Francisco José García-Peñalvo

分类：机器学习

2022-01-11

量子计算是使用量子力学执行计算的过程。该领域研究某些亚杀菌粒子的量子行为，以便随后在执行计算，以及大规模信息处理中使用。这些能力可以在计算时间和经典计算机上的成本方面提供量子计算机的优势。如今，由于计算复杂性或计算所需的时间，具有科学挑战，这是由于古典计算而无法执行，并且量子计算是可能的答案之一。然而，电流量子器件尚未实现必要的QUBITS，并且没有足够的容错才能实现这些目标。尽管如此，还有其他领域，如机器学习或化学，其中量子计算对电流量子器件有用。本手稿旨在展示2017年和2021年之间发布的论文的系统文献综述，以确定，分析和分类量子机器学习和其应用中使用的不同算法。因此，该研究确定了使用量子机器学习技术和算法的52篇文章。发现算法的主要类型是经典机器学习算法的量子实现，例如支持向量机或K最近邻模型，以及古典的深度学习算法，如量子神经网络。许多文章试图解决目前通过古典机器学习回答的问题，但使用量子设备和算法。即使结果很有希望，量子机器学习也远未实现其全部潜力。由于现有量子计算机缺乏足够的质量，速度和比例以允许量子计算来实现其全部潜力，因此需要提高量子硬件。

translated by 谷歌翻译

Proactive Detractor Detection Framework Based on Message-Wise Sentiment Analysis Over Customer Support Interactions

Juan Sebastián Salcedo Gallo , Jesús Solano , Javier Hernán García , David Zarruk-Valencia , Alejandro Correa-Bahnsen

分类：自然语言处理 | 机器学习

2022-11-08

In this work, we propose a framework relying solely on chat-based customer support (CS) interactions for predicting the recommendation decision of individual users. For our case study, we analyzed a total number of 16.4k users and 48.7k customer support conversations within the financial vertical of a large e-commerce company in Latin America. Consequently, our main contributions and objectives are to use Natural Language Processing (NLP) to assess and predict the recommendation behavior where, in addition to using static sentiment analysis, we exploit the predictive power of each user's sentiment dynamics. Our results show that, with respective feature interpretability, it is possible to predict the likelihood of a user to recommend a product or service, based solely on the message-wise sentiment evolution of their CS conversations in a fully automated way.

translated by 谷歌翻译

A Mosquito is Worth 16x16 Larvae: Evaluation of Deep Learning Architectures for Mosquito Larvae Classification

Aswin Surya , David B. Peral , Austin VanLoon , Akhila Rajesh

分类：计算机视觉 | 人工智能 | 机器学习

2022-09-16

蚊子传播的疾病（MBD），例如登革热病毒，基孔肯雅病毒和西尼罗河病毒，每年在全球造成超过100万人死亡。由于许多这样的疾病都被伊蚊和库氏蚊子传播，因此跟踪这些幼虫对于缓解MBD的传播至关重要。即使公民科学成长并获得了较大的蚊子图像数据集，蚊子图像的手动注释变得越来越耗时且效率低下。先前的研究使用计算机视觉识别蚊子物种，卷积神经网络（CNN）已成为图像分类的事实。但是，这些模型通常需要大量的计算资源。这项研究介绍了视觉变压器（VIT）在比较研究中的应用，以改善伊蚊和库尔克斯幼虫的图像分类。在蚊子幼虫图像数据上对两个VIT模型，Vit-Base和CVT-13以及两个CNN模型进行了RESNET-18和CORVNEXT的培训，并比较确定最有效的模型，以将蚊子幼虫区分为AEDES或CULEX。测试表明，Convnext获得了所有分类指标的最大值，证明了其对蚊子幼虫分类的生存能力。基于这些结果，未来的研究包括通过结合CNN和Transformer架构元素来创建专门为蚊子幼虫分类设计的模型。

translated by 谷歌翻译

Vehicle Trajectory Prediction on Highways Using Bird Eye View Representations and Deep Learning

Rubén Izquierdo , Álvaro Quintanar , David Fernández Llorca , Iván García Daza , Noelia Hernández , Ignacio Parra , Miguel Ángel Sotelo

分类：计算机视觉 | 人工智能

2022-07-04

这项工作提出了一种新的方法，可以使用有效的鸟类视图表示和卷积神经网络在高速公路场景中预测车辆轨迹。使用基本的视觉表示，很容易将车辆位置，运动历史，道路配置和车辆相互作用轻松包含在预测模型中。 U-NET模型已被选为预测内核，以使用图像到图像回归方法生成场景的未来视觉表示。已经实施了一种方法来从生成的图形表示中提取车辆位置以实现子像素分辨率。该方法已通过预防数据集（一个板载传感器数据集）进行了培训和评估。已经评估了不同的网络配置和场景表示。这项研究发现，使用线性终端层和车辆的高斯表示，具有6个深度水平的U-NET是最佳性能配置。发现使用车道标记不会改善预测性能。平均预测误差为0.47和0.38米，对于纵向和横向坐标的最终预测误差分别为0.76和0.53米，预测轨迹长度为2.0秒。与基线方法相比，预测误差低至50％。

translated by 谷歌翻译

Towards view-invariant vehicle speed detection from driving simulator images

Antonio Hernández Martínez , David Fernandez Llorca , Iván García Daza

分类：计算机视觉 | 人工智能

2022-06-01

与其他技术（例如电感回路，雷达或激光器）相比，使用摄像头进行车速测量的成本效益要高得多。但是，由于相机的固有局限性提供准确的范围估计值，因此准确的速度测量仍然是一个挑战。此外，基于经典的视觉方法对相机和道路之间的外部校准非常敏感。在这种情况下，使用数据驱动的方法是一种有趣的选择。但是，数据收集需要一个复杂且昂贵的设置，以在与高精度速度传感器同步的相机中录制视频，以生成地面真相速度值。最近已经证明，使用驾驶模拟器（例如Carla）可以用作生成大型合成数据集的强大替代方案，以实现对单个摄像机的车辆速度估算的应用。在本文中，我们在不同的虚拟位置和不同的外部参数中使用多个摄像机研究相同的问题。我们解决了复杂的3D-CNN体系结构是否能够使用单个模型隐式学习视图速度的问题，或者特定于视图的模型是否更合适。结果非常有前途，因为它们表明具有来自多个视图的数据报告的单个模型比摄像机特异性模型更好地准确性，从而铺平了迈向视图的车辆速度测量系统。

translated by 谷歌翻译

Insertion of real agents behaviors in CARLA autonomous driving simulator

Sergio Martín Serrano , David Fernández Llorca , Iván García Daza , Miguel Ángel Sotelo

分类：机器人

2022-06-01

由于需要快速原型制作和广泛的测试，模拟在自主驾驶中的作用变得越来越重要。基于物理的模拟使用涉及多个利益和优势，以合理的成本消除了对原型，驱动因素和脆弱道路使用者的风险。但是，有两个主要局限性。首先，众所周知的现实差距是指现实与模拟之间的差异，这阻止了模拟自主驾驶体验实现有效的现实性能。其次，缺乏有关真实代理商的行为的经验知识，包括备用驾驶员或乘客以及其他道路使用者，例如车辆，行人或骑自行车的人。代理仿真通常是根据实际数据进行确定性，随机概率或生成的预编程的，但它不代表与特定模拟方案相互作用的真实试剂的行为。在本文中，我们提出了一个初步框架，以实现真实试剂与模拟环境（包括自动驾驶汽车）之间的实时互动，并从多个视图中从模拟传感器数据中生成合成序列，这些视图可用于培训依赖行为模型的预测系统。我们的方法将沉浸式的虚拟现实和人类运动捕获系统与Carla模拟器进行自主驾驶。我们描述了提出的硬件和软件体系结构，并讨论所谓的行为差距或存在。我们提出了支持这种方法的潜力并讨论未来步骤的初步但有希望的结果。

translated by 谷歌翻译

Invalidator: Automated Patch Correctness Assessment via Semantic and Syntactic Reasoning

Thanh Le-Cong , Duc-Minh Luong , Xuan Bach D. Le , David Lo , Nhat-Hoa Tran , Bui Quang-Huy , Quyet-Thang Huynh

分类：机器学习

2023-01-03

In this paper, we propose a novel technique, namely INVALIDATOR, to automatically assess the correctness of APR-generated patches via semantic and syntactic reasoning. INVALIDATOR reasons about program semantic via program invariants while it also captures program syntax via language semantic learned from large code corpus using the pre-trained language model. Given a buggy program and the developer-patched program, INVALIDATOR infers likely invariants on both programs. Then, INVALIDATOR determines that a APR-generated patch overfits if: (1) it violates correct specifications or (2) maintains errors behaviors of the original buggy program. In case our approach fails to determine an overfitting patch based on invariants, INVALIDATOR utilizes a trained model from labeled patches to assess patch correctness based on program syntax. The benefit of INVALIDATOR is three-fold. First, INVALIDATOR is able to leverage both semantic and syntactic reasoning to enhance its discriminant capability. Second, INVALIDATOR does not require new test cases to be generated but instead only relies on the current test suite and uses invariant inference to generalize the behaviors of a program. Third, INVALIDATOR is fully automated. We have conducted our experiments on a dataset of 885 patches generated on real-world programs in Defects4J. Experiment results show that INVALIDATOR correctly classified 79% overfitting patches, accounting for 23% more overfitting patches being detected by the best baseline. INVALIDATOR also substantially outperforms the best baselines by 14% and 19% in terms of Accuracy and F-Measure, respectively.

translated by 谷歌翻译

Conservation Tools: The Next Generation of Engineering--Biology Collaborations

Andrew Schulz , Cassie Shriver , Suzanne Stathatos , Benjamin Seleb , Emily Weigel , Young-Hui Chang , M. Saad Bhamla , David Hu , Joseph R. Mendelson III , .

分类：机器学习

2023-01-03

The recent increase in public and academic interest in preserving biodiversity has led to the growth of the field of conservation technology. This field involves designing and constructing tools that utilize technology to aid in the conservation of wildlife. In this article, we will use case studies to demonstrate the importance of designing conservation tools with human-wildlife interaction in mind and provide a framework for creating successful tools. These case studies include a range of complexities, from simple cat collars to machine learning and game theory methodologies. Our goal is to introduce and inform current and future researchers in the field of conservation technology and provide references for educating the next generation of conservation technologists. Conservation technology not only has the potential to benefit biodiversity but also has broader impacts on fields such as sustainability and environmental protection. By using innovative technologies to address conservation challenges, we can find more effective and efficient solutions to protect and preserve our planet's resources.

translated by 谷歌翻译

Posterior Collapse and Latent Variable Non-identifiability

Yixin Wang , David M. Blei , John P. Cunningham

分类： (统计)机器学习 | 机器学习

2023-01-02

Variational autoencoders model high-dimensional data by positing low-dimensional latent variables that are mapped through a flexible distribution parametrized by a neural network. Unfortunately, variational autoencoders often suffer from posterior collapse: the posterior of the latent variables is equal to its prior, rendering the variational autoencoder useless as a means to produce meaningful representations. Existing approaches to posterior collapse often attribute it to the use of neural networks or optimization issues due to variational approximation. In this paper, we consider posterior collapse as a problem of latent variable non-identifiability. We prove that the posterior collapses if and only if the latent variables are non-identifiable in the generative model. This fact implies that posterior collapse is not a phenomenon specific to the use of flexible distributions or approximate inference. Rather, it can occur in classical probabilistic models even with exact inference, which we also demonstrate. Based on these results, we propose a class of latent-identifiable variational autoencoders, deep generative models which enforce identifiability without sacrificing flexibility. This model class resolves the problem of latent variable non-identifiability by leveraging bijective Brenier maps and parameterizing them with input convex neural networks, without special variational inference objectives or optimization tricks. Across synthetic and real datasets, latent-identifiable variational autoencoders outperform existing methods in mitigating posterior collapse and providing meaningful representations of the data.

translated by 谷歌翻译

Mapping smallholder cashew plantations to inform sustainable tree crop expansion in Benin

Leikun Yin , Rahul Ghosh , Chenxi Lin , David Hale , Christoph Weigl , James Obarowski , Junxiong Zhou , Jessica Till , Xiaowei Jia , Troy Mao

分类：计算机视觉 | 机器学习

2023-01-01

Cashews are grown by over 3 million smallholders in more than 40 countries worldwide as a principal source of income. As the third largest cashew producer in Africa, Benin has nearly 200,000 smallholder cashew growers contributing 15% of the country's national export earnings. However, a lack of information on where and how cashew trees grow across the country hinders decision-making that could support increased cashew production and poverty alleviation. By leveraging 2.4-m Planet Basemaps and 0.5-m aerial imagery, newly developed deep learning algorithms, and large-scale ground truth datasets, we successfully produced the first national map of cashew in Benin and characterized the expansion of cashew plantations between 2015 and 2021. In particular, we developed a SpatioTemporal Classification with Attention (STCA) model to map the distribution of cashew plantations, which can fully capture texture information from discriminative time steps during a growing season. We further developed a Clustering Augmented Self-supervised Temporal Classification (CASTC) model to distinguish high-density versus low-density cashew plantations by automatic feature extraction and optimized clustering. Results show that the STCA model has an overall accuracy of 80% and the CASTC model achieved an overall accuracy of 77.9%. We found that the cashew area in Benin has doubled from 2015 to 2021 with 60% of new plantation development coming from cropland or fallow land, while encroachment of cashew plantations into protected areas has increased by 70%. Only half of cashew plantations were high-density in 2021, suggesting high potential for intensification. Our study illustrates the power of combining high-resolution remote sensing imagery and state-of-the-art deep learning algorithms to better understand tree crops in the heterogeneous smallholder landscape.

translated by 谷歌翻译