智能论文笔记

Deep Residual Shrinkage Networks for EMG-based Gesture Identification

Yueying Ma , Chengbo Wang , Chengenze Jiang , Zimo Li

分类：机器学习

2022-02-07

这项工作介绍了一种基于高准确性EMG的手势识别的方法。一种新开发的深度学习方法，即，深层残留的收缩网络用于执行手势识别。基于手势引起的EMG信号的特征，进行了优化以提高识别精度。最后，应用三种不同的算法将EMG信号识别的准确性与DRSN的精度进行比较。结果表明，DRSN在EMG识别准确性方面表现出传统的神经网络。本文提供了一种对EMG信号进行分类以及探索DRSN可能应用的可靠方法。

translated by 谷歌翻译

MP-SeizNet: A Multi-Path CNN Bi-LSTM Network for Seizure-Type Classification Using EEG

Hezam Albaqami , Ghulam Mubashar Hassan , Amitava Datta

分类：机器学习

2022-11-09

Seizure type identification is essential for the treatment and management of epileptic patients. However, it is a difficult process known to be time consuming and labor intensive. Automated diagnosis systems, with the advancement of machine learning algorithms, have the potential to accelerate the classification process, alert patients, and support physicians in making quick and accurate decisions. In this paper, we present a novel multi-path seizure-type classification deep learning network (MP-SeizNet), consisting of a convolutional neural network (CNN) and a bidirectional long short-term memory neural network (Bi-LSTM) with an attention mechanism. The objective of this study was to classify specific types of seizures, including complex partial, simple partial, absence, tonic, and tonic-clonic seizures, using only electroencephalogram (EEG) data. The EEG data is fed to our proposed model in two different representations. The CNN was fed with wavelet-based features extracted from the EEG signals, while the Bi-LSTM was fed with raw EEG signals to let our MP-SeizNet jointly learns from different representations of seizure data for more accurate information learning. The proposed MP-SeizNet was evaluated using the largest available EEG epilepsy database, the Temple University Hospital EEG Seizure Corpus, TUSZ v1.5.2. We evaluated our proposed model across different patient data using three-fold cross-validation and across seizure data using five-fold cross-validation, achieving F1 scores of 87.6% and 98.1%, respectively.

translated by 谷歌翻译

EEG-based Cross-Subject Driver Drowsiness Recognition with an Interpretable Convolutional Neural Network

Jian Cui , Zirui Lan , Olga Sourina , Wolfgang Müller-Wittig

分类：机器学习 | 神经与进化计算

2021-05-30

在脑电图（EEG）的驾驶员的背景下，设计无校准系统仍然具有挑战性，因为EEG信号在不同的主题和录音会话之间显着变化。已经努力使用EEG信号的深度学习方法来利用精神状态识别。然而，现有工作主要将深入学习模型视为黑匣子分类器，而模型已经学习的是什么以及它们在脑电图数据中受到噪声的影响仍然是曝光的。在本文中，我们开发了一种新颖的卷积神经网络，可以通过突出显示包含分类重要信息的输入样本的本地区域来解释其决定。该网络具有紧凑的结构，利用可分离卷曲来处理空间序列中的EEG信号。结果表明，该模型在11个受试者上实现了78.35％的平均准确性，用于休假交叉对象嗜睡识别，其高于传统的基线方法为53.4％-72.68％和最先进的深层学习方法63.90％-65.78％。可视化结果表明，该模型已经学会了识别EEG信号的生物学可解释的特征，例如，α主轴，作为不同受试者的嗜睡的强指标。此外，我们还探讨了一些错误分类的样本背后的原因，具有可视化技术，并讨论了提高识别准确性的潜在方法。我们的作品说明了使用可解释的深度学习模型的有希望的方向，以从复杂的EEG信号发现与不同心理状态相关的有意义的模式。

translated by 谷歌翻译

Automatic COVID-19 disease diagnosis using 1D convolutional neural network and augmentation with human respiratory sound based on parameters: cough, breath, and voice

Kranthi Kumar Lella , Alphonse Pja

分类：机器学习

2021-12-14

呼吸声分类中的问题已在去年的临床科学家和医学研究员团体中获得了良好的关注，以诊断Covid-19疾病。迄今为止，各种模型的人工智能（AI）进入了现实世界，从人类生成的声音等人生成的声音中检测了Covid-19疾病，例如语音/言语，咳嗽和呼吸。实现卷积神经网络（CNN）模型，用于解决基于人工智能（AI）的机器上的许多真实世界问题。在这种情况下，建议并实施一个维度（1D）CNN，以诊断Covid-19的呼吸系统疾病，例如语音，咳嗽和呼吸。应用基于增强的机制来改善Covid-19声音数据集的预处理性能，并使用1D卷积网络自动化Covid-19疾病诊断。此外，使用DDAE（数据去噪自动编码器）技术来产生诸如输入功能的深声特征，而不是采用MFCC（MEL频率跳跃系数）的标准输入，并且它更好地执行比以前的型号的准确性和性能。

translated by 谷歌翻译

Application of Machine Learning to Sleep Stage Classification

Andrew Smith , Hardik Anand , Snezana Milosavljevic , Katherine M. Rentschler , Ana Pocivavsek , Homayoun Valafar

分类：机器学习

2021-11-04

睡眠研究必须携带与睡眠损失相关的表型和有助于精神病理学的露出机制。最常见的是，调查人员手动将多色网络分类为警惕状态，这是耗时的，需要广泛的培训，并且容易出现帧间间变异性。虽然许多作品已经基于多个EEG通道成功开发了自动化状态分类器，但是我们的目标是生产一种自动化和开放式分类器，可以基于来自啮齿动物的单个皮质脑电图（EEG）来可靠地预测警惕状态，以最大限度地减少伴随的缺点通过电线束缚小动物到计算机程序。大约427小时的连续监测的脑电图，电灰度（EMG）和活性由总数据的571小时的域专家标记。在这里，我们评估各种机器学习技术对分类10-秒钟时期的各种机器学习技术的性能，进入三个离散类中的一种：矛盾，慢波或唤醒。我们的调查包括决策树，随机森林，天真贝叶斯分类器，Logistic回归分类器和人工神经网络。这些方法达到了约74％至约96％的精度。最值得注意的是，随机森林和巢穴分别实现了95.78％和93.31％的显着准确性。在这里，我们已经示出了各种机器学习分类器的潜力，以基于单个EEG读数和单一EMG读数自动，准确地和可靠地对警惕状态进行自动。

translated by 谷歌翻译

Ubi-SleepNet: Advanced Multimodal Fusion Techniques for Three-stage Sleep Classification Using Ubiquitous Sensing

Bing Zhai , Yu Guan , Michael Catt , Thomas Ploetz

分类：机器学习 | 人工智能 | 计算机视觉

2021-11-19

睡眠是一种基本的生理过程，对于维持健康的身心至关重要。临床睡眠监测的黄金标准是多核桃摄影（PSG），基于哪个睡眠可以分为五个阶段，包括尾脉冲睡眠（REM睡眠）/非REM睡眠1（N1）/非REM睡眠2 （n2）/非REM睡眠3（n3）。然而，PSG昂贵，繁重，不适合日常使用。对于长期睡眠监测，无处不在的感测可以是解决方案。最近，心脏和运动感测在分类三阶段睡眠方面变得流行，因为两种方式都可以从研究级或消费者级设备中获得（例如，Apple Watch）。但是，为最大准确性融合数据的最佳仍然是一个打开的问题。在这项工作中，我们综合地研究了深度学习（DL）的高级融合技术，包括三种融合策略，三个融合方法以及三级睡眠分类，基于两个公共数据集。实验结果表明，通过融合心脏/运动传感方式可以可靠地分类三阶段睡眠，这可能成为在睡眠中进行大规模睡眠阶段评估研究或长期自动跟踪的实用工具。为了加快普遍存在/可穿戴计算社区的睡眠研究的进展，我们制作了该项目开源，可以在：https://github.com/bzhai/ubi-sleepnet找到代码。

translated by 谷歌翻译

Deep learning for time series classification: a review

Hassan Ismail Fawaz , Germain Forestier , Jonathan Weber , Lhassane Idoumghar , Pierre-Alain Muller

分类：

2018-09-12

Time Series Classification (TSC) is an important and challenging problem in data mining. With the increase of time series data availability, hundreds of TSC algorithms have been proposed. Among these methods, only a few have considered Deep Neural Networks (DNNs) to perform this task. This is surprising as deep learning has seen very successful applications in the last years. DNNs have indeed revolutionized the field of computer vision especially with the advent of novel deeper architectures such as Residual and Convolutional Neural Networks. Apart from images, sequential data such as text and audio can also be processed with DNNs to reach state-of-the-art performance for document classification and speech recognition. In this article, we study the current state-ofthe-art performance of deep learning algorithms for TSC by presenting an empirical study of the most recent DNN architectures for TSC. We give an overview of the most successful deep learning applications in various time series domains under a unified taxonomy of DNNs for TSC. We also provide an open source deep learning framework to the TSC community where we implemented each of the compared approaches and evaluated them on a univariate TSC benchmark (the UCR/UEA archive) and 12 multivariate time series datasets. By training 8,730 deep learning models on 97 time series datasets, we propose the most exhaustive study of DNNs for TSC to date.

translated by 谷歌翻译

In-field early disease recognition of potato late blight based on deep learning and proximal hyperspectral imaging

Chao Qi , Murilo Sandroni , Jesper Cairo Westergaard , Ea Høegh Riis Sundmark , Merethe Bagge , Erik Alexandersson , Junfeng Gao

分类：计算机视觉

2021-11-23

有效的早期检测马铃薯晚枯萎病（PLB）是马铃薯栽培的必要方面。然而，由于缺乏在冠层水平上缺乏视觉线索，在具有传统成像方法的领域的早期阶段来检测晚期枯萎是一项挑战。高光谱成像可以，捕获来自宽范围波长的光谱信号也在视觉波长之外。在这种情况下，通过将2D卷积神经网络（2D-CNN）和3D-CNN与深度合作的网络（PLB-2D-3D-A）组合来提出高光谱图像的深度学习分类架构。首先，2D-CNN和3D-CNN用于提取丰富的光谱空间特征，然后使用注意力块和SE-RESET用于强调特征图中的突出特征，并提高模型的泛化能力。数据集采用15,360张图像（64x64x204）构建，从在实验领域捕获的240个原始图像裁剪，具有超过20种马铃薯基因型。 2000年图像的测试数据集中的精度在全带中达到0.739，特定带中的0.790（492nm，519nm，560nm，592nm，717nm和765nm）。本研究表明，具有深入学习和近端高光谱成像的早期检测PLB的令人鼓舞的结果。

translated by 谷歌翻译

A General End-to-end Diagnosis Framework for Manufacturing Systems

Ye Yuan , Guijun Ma , Cheng Cheng , Beitong Zhou , Huan Zhao , Hai-Tao Zhang , Han Ding

分类：机器学习 | (统计)机器学习

2018-12-17

设想制造部门受到基于人工智能的技术的严重影响，计算能力和数据量的大幅增加。制造业领域的一个核心挑战在于一般框架的要求，以确保满足不同制造应用中的诊断和监视性能。在这里，我们提出了一个通用数据驱动的端到端框架，用于监视制造系统。该框架是从深度学习技术中得出的，评估了融合的感觉测量值，以检测甚至预测故障和磨损条件。这项工作利用了深度学习的预测能力，从嘈杂的时间表数据中自动提取隐藏的降解功能。我们已经在从各种制造应用中绘制的十个代表性数据集上试验了拟议的框架。结果表明，该框架在检查的基准应用中表现良好，可以在不同的情况下应用，这表明其潜在用作智能制造中的关键角石。

translated by 谷歌翻译

Deep conv-attention model for diagnosing left bundle branch block from 12-lead electrocardiograms

Alireza Sadeghi , Alireza Rezaee , Farshid Hajati

分类：机器学习

2022-12-07

Cardiac resynchronization therapy (CRT) is a treatment that is used to compensate for irregularities in the heartbeat. Studies have shown that this treatment is more effective in heart patients with left bundle branch block (LBBB) arrhythmia. Therefore, identifying this arrhythmia is an important initial step in determining whether or not to use CRT. On the other hand, traditional methods for detecting LBBB on electrocardiograms (ECG) are often associated with errors. Thus, there is a need for an accurate method to diagnose this arrhythmia from ECG data. Machine learning, as a new field of study, has helped to increase human systems' performance. Deep learning, as a newer subfield of machine learning, has more power to analyze data and increase systems accuracy. This study presents a deep learning model for the detection of LBBB arrhythmia from 12-lead ECG data. This model consists of 1D dilated convolutional layers. Attention mechanism has also been used to identify important input data features and classify inputs more accurately. The proposed model is trained and validated on a database containing 10344 12-lead ECG samples using the 10-fold cross-validation method. The final results obtained by the model on the 12-lead ECG data are as follows. Accuracy: 98.80+-0.08%, specificity: 99.33+-0.11 %, F1 score: 73.97+-1.8%, and area under the receiver operating characteristics curve (AUC): 0.875+-0.0192. These results indicate that the proposed model in this study can effectively diagnose LBBB with good efficiency and, if used in medical centers, will greatly help diagnose this arrhythmia and early treatment.

translated by 谷歌翻译

EMC2A-Net: An Efficient Multibranch Cross-channel Attention Network for SAR Target Classification

Xiang Yu , Zhe Geng , Xiaohua Huang , Qinglu Wang , Daiyin Zhu

分类：计算机视觉

2022-08-03

近年来，卷积神经网络（CNN）在合成孔径雷达（SAR）目标识别方面表现出巨大的潜力。 SAR图像具有强烈的粒度感，并且具有不同的纹理特征，例如斑点噪声，目标优势散射器和目标轮廓，这些轮廓很少在传统的CNN模型中被考虑。本文提出了两个残留块，即具有多尺度接收场（RFS）的EMC2A块，基于多型结构，然后设计了有效的同位素体系结构深CNN（DCNN），EMC2A-net。 EMC2A阻止使用不同的扩张速率利用平行的扩张卷积，这可以有效地捕获多尺度上下文特征而不会显着增加计算负担。为了进一步提高多尺度功能融合的效率，本文提出了多尺度特征跨通道注意模块，即EMC2A模块，采用了局部的多尺度特征交互策略，而无需降低维度。该策略通过有效的一维（1D） - 圆形卷积和Sigmoid函数适应每个通道的权重，以指导全球通道明智的关注。 MSTAR数据集上的比较结果表明，EMC2A-NET优于相同类型的现有模型，并且具有相对轻巧的网络结构。消融实验结果表明，仅使用一些参数和适当的跨渠道相互作用，EMC2A模块可显着提高模型的性能。

translated by 谷歌翻译

Chronological age estimation of lateral cephalometric radiographs with deep learning

Ningtao Liu

分类：计算机视觉 | 机器学习

2021-01-28

传统的手动年龄估计方法是基于多种X射线图像的关键劳动力。一些目前的研究表明，横向头颅（LC）图像可用于估计年龄。然而，这些方法基于手动测量某些图像特征，并根据经验或得分制定年龄估计。因此，这些方法是耗时和劳动密集型的，效果将受主观意见的影响。在这项工作中，我们提出了显着的图增强年龄估计方法，其可以基于LC图像自动执行年龄估计。同时，它还可以显示年龄估计图像中每个区域的重要性，这无疑会增加方法的解释性。我们的方法在4至40岁以上的3014 LC图像上进行了测试。实验结果的MEA是1.250，这少于最先进的基准的结果，因为它在年龄组中表现得更少，数据较少。此外，我们的模型在每个区域培训，在LC图像中的年龄估计的贡献很高，因此验证了这些不同区域对年龄估计任务的影响。因此，我们得出结论，提出的显着性图增强了横向头颅射线照片的时间年龄估计方法可以很好地在时间年龄估计任务中工作，特别是当数据量很小时。此外，与传统深度学习相比，我们的方法也是可解释的。

translated by 谷歌翻译

A Comparison Study of Deep CNN Architecture in Detecting of Pneumonia

Al Mohidur Rahman Porag , Md. Mahedi Hasan , Dr. Md Taimur Ahad

分类：计算机视觉 | 机器学习

2022-12-30

Pneumonia, a respiratory infection brought on by bacteria or viruses, affects a large number of people, especially in developing and impoverished countries where high levels of pollution, unclean living conditions, and overcrowding are frequently observed, along with insufficient medical infrastructure. Pleural effusion, a condition in which fluids fill the lung and complicate breathing, is brought on by pneumonia. Early detection of pneumonia is essential for ensuring curative care and boosting survival rates. The approach most usually used to diagnose pneumonia is chest X-ray imaging. The purpose of this work is to develop a method for the automatic diagnosis of bacterial and viral pneumonia in digital x-ray pictures. This article first presents the authors' technique, and then gives a comprehensive report on recent developments in the field of reliable diagnosis of pneumonia. In this study, here tuned a state-of-the-art deep convolutional neural network to classify plant diseases based on images and tested its performance. Deep learning architecture is compared empirically. VGG19, ResNet with 152v2, Resnext101, Seresnet152, Mobilenettv2, and DenseNet with 201 layers are among the architectures tested. Experiment data consists of two groups, sick and healthy X-ray pictures. To take appropriate action against plant diseases as soon as possible, rapid disease identification models are preferred. DenseNet201 has shown no overfitting or performance degradation in our experiments, and its accuracy tends to increase as the number of epochs increases. Further, DenseNet201 achieves state-of-the-art performance with a significantly a smaller number of parameters and within a reasonable computing time. This architecture outperforms the competition in terms of testing accuracy, scoring 95%. Each architecture was trained using Keras, using Theano as the backend.

translated by 谷歌翻译

Deep learning and machine learning for Malaria detection: overview, challenges and future directions

Imen Jdey , Ghazala Hcini , Hela Ltifi

分类：机器学习 | 人工智能

2022-09-27

为了产生最大的影响，必须使用基于证据的决策制定公共卫生计划。创建机器学习算法是为了收集，存储，处理和分析数据以提供知识和指导决策。任何监视系统的关键部分是图像分析。截至最近，计算机视觉和机器学习的社区最终对此感到好奇。这项研究使用各种机器学习和图像处理方法来检测和预测疟疾疾病。在我们的研究中，我们发现了深度学习技术作为具有更广泛适用于疟疾检测的智能工具的潜力，通过协助诊断病情，可以使医生受益。我们研究了针对计算机框架和组织的深度学习的共同限制，计算需要准备数据，准备开销，实时执行和解释能力，并发现对这些限制的轴承的未来询问。

translated by 谷歌翻译

A Survey: Deep Learning for Hyperspectral Image Classification with Few Labeled Samples

Sen Jia , Shuguo Jiang , Zhijie Lin , Nanying Li , Meng Xu , Shiqi Yu

分类：计算机视觉 | 人工智能

2021-12-03

随着深度学习技术的快速发展和计算能力的提高，深度学习已广泛应用于高光谱图像（HSI）分类领域。通常，深度学习模型通常包含许多可训练参数，并且需要大量标记的样品来实现最佳性能。然而，关于HSI分类，由于手动标记的难度和耗时的性质，大量标记的样本通常难以获取。因此，许多研究工作侧重于建立一个少数标记样本的HSI分类的深层学习模型。在本文中，我们专注于这一主题，并对相关文献提供系统审查。具体而言，本文的贡献是双重的。首先，相关方法的研究进展根据学习范式分类，包括转移学习，积极学习和少量学习。其次，已经进行了许多具有各种最先进的方法的实验，总结了结果以揭示潜在的研究方向。更重要的是，虽然深度学习模型（通常需要足够的标记样本）和具有少量标记样本的HSI场景之间存在巨大差距，但是通过深度学习融合，可以很好地表征小样本集的问题方法和相关技术，如转移学习和轻量级模型。为了再现性，可以在HTTPS://github.com/shuguoj/hsi-classification中找到纸张中评估的方法的源代码.git。

translated by 谷歌翻译

Two Decades of Bengali Handwritten Digit Recognition: A Survey

A. B. M. Ashikur Rahman , Md. Bakhtiar Hasan , Sabbir Ahmed , Tasnim Ahmed , Md. Hamjajul Ashmafee , Mohammad Ridwan Kabir , Md. Hasanul Kabir

分类：计算机视觉

2022-06-05

手写数字识别（HDR）是光学特征识别（OCR）领域中最具挑战性的任务之一。不管语言如何，HDR都存在一些固有的挑战，这主要是由于个人跨个人的写作风格的变化，编写媒介和环境的变化，无法在反复编写任何数字等时保持相同的笔触。除此之外，特定语言数字的结构复杂性可能会导致HDR的模棱两可。多年来，研究人员开发了许多离线和在线HDR管道，其中不同的图像处理技术与传统的机器学习（ML）基于基于的和/或基于深度学习（DL）的体系结构相结合。尽管文献中存在有关HDR的广泛审查研究的证据，例如：英语，阿拉伯语，印度，法尔西，中文等，但几乎没有对孟加拉人HDR（BHDR）的调查，这缺乏对孟加拉语HDR（BHDR）的研究，而这些调查缺乏对孟加拉语HDR（BHDR）的研究。挑战，基础识别过程以及可能的未来方向。在本文中，已经分析了孟加拉语手写数字的特征和固有的歧义，以及二十年来最先进的数据集的全面见解和离线BHDR的方法。此外，还详细讨论了一些涉及BHDR的现实应用特定研究。本文还将作为对离线BHDR背后科学感兴趣的研究人员的汇编，煽动了对相关研究的新途径的探索，这可能会进一步导致在不同应用领域对孟加拉语手写数字进行更好的离线认识。

translated by 谷歌翻译

A 3D 2D convolutional Neural Network Model for Hyperspectral Image Classification

Jiaxin Cao , Xiaoyan Li

分类：计算机视觉 | 机器学习

2021-11-19

在所提出的Sehybridsn模型中，使用密集块来重用浅特征，并旨在更好地利用分层空间谱特征。随后的深度可分离卷积层用于区分空间信息。通过通道注意方法实现了空间谱特征的进一步改进，该方法在每个3D卷积层和每个2D卷积层后面进行。实验结果表明，我们所提出的模型使用很少的训练数据了解更多辨别的空间谱特征。Sehybridsn使用仅0.05和0.01个标记的训练数据，获得了非常令人满意的性能。

translated by 谷歌翻译

Cross-Subject Domain Adaptation for Classifying Working Memory Load with Multi-Frame EEG Images

Junfu Chen , Xiaoyi Jiang , Yang Chen , Bi Wang

分类：机器学习 | 计算机视觉

2021-06-12

工作记忆（WM）表示在脑海中存储的信息，是人类认知领域的一个基本研究主题。可以监测大脑的电活动的脑电图（EEG）已被广泛用于测量WM的水平。但是，关键的挑战之一是个体差异可能会导致无效的结果，尤其是当既定模型符合陌生主题时。在这项工作中，我们提出了一个具有空间注意力（CS-DASA）的跨主题深层适应模型，以概括跨科目的工作负载分类。首先，我们将EEG时间序列转换为包含空间，光谱和时间信息的多帧EEG图像。首先，CS-DASA中的主题共享模块从源和目标主题中接收多帧的EEG图像数据，并学习了共同的特征表示。然后，在特定于主题的模块中，实现了最大平均差异，以测量重现的内核希尔伯特空间中的域分布差异，这可以为域适应增加有效的罚款损失。此外，采用主题对象的空间注意机制专注于目标图像数据的判别空间特征。在包含13个受试者的公共WM EEG数据集上进行的实验表明，所提出的模型能够达到比现有最新方法更好的性能。

translated by 谷歌翻译

SCAI: A Spectral data Classification framework with Adaptive Inference for the IoT platform

Yundong Sun , Dongjie Zhu , Haiwen Du , Yansong Wang , Zhaoshuo Tian

分类：机器学习

2022-06-24

目前，这是一个热门的研究主题，可以在深度学习和物联网技术的帮助下实现大量光谱数据的准确，高效和实时识别。深度神经网络在光谱分析中起着关键作用。但是，更深层模型的推断是以静态方式进行的，不能根据设备进行调整。并非所有样本都需要分配所有计算以实现自信的预测，这阻碍了最大化整体性能。为了解决上述问题，我们提出了一个具有自适应推理的光谱数据分类框架。具体而言，要为不同样本分配不同的计算，同时更好地利用不同设备之间的协作，我们利用早期外观体系结构，将中间分类器放置在架构的不同深度，并在预测置信度达到预设阈值时输出结果。我们提出了一个自我介绍学习的训练范式，最深的分类器对浅的分类器进行了软监督，以最大程度地提高其性能和训练速度。同时，为了减轻早期外观范式中中间分类器的位置和数字设置的性能脆弱性，我们提出了一个自适应的残留网络。它可以调整不同曲线位置下每个块中的层数，因此它可以专注于曲线的重要位置（例如：拉曼峰），并根据任务性能和计算资源准确地分配适当的计算预算。据我们所知，本文是首次尝试通过自适应推断物联网平台下的光谱检测来进行优化。我们进行了许多实验，实验结果表明，我们所提出的方法可以比现有方法实现更高的计算预算性能。

translated by 谷歌翻译

Palm Vein Recognition via Multi-task Loss Function and Attention Layer

Jiashu Lou , Jie zou , Baohua Wang

分类：计算机视觉 | 机器学习

2022-11-11

With the improvement of arithmetic power and algorithm accuracy of personal devices, biological features are increasingly widely used in personal identification, and palm vein recognition has rich extractable features and has been widely studied in recent years. However, traditional recognition methods are poorly robust and susceptible to environmental influences such as reflections and noise. In this paper, a convolutional neural network based on VGG-16 transfer learning fused attention mechanism is used as the feature extraction network on the infrared palm vein dataset. The palm vein classification task is first trained using palmprint classification methods, followed by matching using a similarity function, in which we propose the multi-task loss function to improve the accuracy of the matching task. In order to verify the robustness of the model, some experiments were carried out on datasets from different sources. Then, we used K-means clustering to determine the adaptive matching threshold and finally achieved an accuracy rate of 98.89% on prediction set. At the same time, the matching is with high efficiency which takes an average of 0.13 seconds per palm vein pair, and that means our method can be adopted in practice.

translated by 谷歌翻译