智能论文笔记

Hybrid Parallel Imaging and Compressed Sensing MRI Reconstruction with GRAPPA Integrated Multi-loss Supervised GAN

Farhan Sadik , Md. Kamrul Hasan

分类：计算机视觉

2022-09-19

目的：并行成像通过用一系列接收器线圈获取其他灵敏度信息，从而加速了磁共振成像（MRI）数据，从而降低了相位编码步骤。压缩传感磁共振成像（CS-MRI）在医学成像领域中获得了普及，因为其数据要求较少，而不是平行成像。并行成像和压缩传感（CS）均通过最大程度地减少K空间中捕获的数据量来加快传统MRI获取。由于采集时间与样品的数量成反比，因此从缩短的K空间样品中的图像的反向形成会导致收购更快，但具有混乱的伪像。本文提出了一种新型的生成对抗网络（GAN），即雷德格尔（Recgan-gr）受到多模式损失的监督，以消除重建的图像。方法：与现有的GAN网络相反，我们提出的方法引入了一种新型的发电机网络，即与双域损耗函数集成的弹药网络，包括加权幅度和相位损耗函数以及基于平行成像的损失，即Grappa一致性损失。提出了K空间校正块，以使GAN网络自动化生成不必要的数据，从而使重建过程的收敛性更快。结果：全面的结果表明，拟议的Recgan-GR在基于GAN的方法中的PSNR有4 dB的改善，并且在文献中可用的传统最先进的CNN方法中有2 dB的改进。结论和意义：拟议的工作有助于显着改善低保留数据的图像质量，从而更快地获取了5倍或10倍。

translated by 谷歌翻译

Computational Sarcasm Analysis on Social Media: A Systematic Review

Faria Binte Kader , Nafisa Hossain Nujat , Tasmia Binte Sogir , Mohsinul Kabir , Hasan Mahmud , Kamrul Hasan

分类：自然语言处理

2022-09-13

讽刺可以被定义为说或写讽刺与一个人真正想表达的相反，通常是为了侮辱，刺激或娱乐某人。由于文本数据中讽刺性的性质晦涩难懂，因此检测到情感分析研究社区的困难和非常感兴趣。尽管讽刺检测的研究跨越了十多年，但最近已经取得了一些重大进步，包括在多模式环境中采用了无监督的预训练的预训练的变压器，并整合了环境以识别讽刺。在这项研究中，我们旨在简要概述英语计算讽刺研究的最新进步和趋势。我们描述了与讽刺有关的相关数据集，方法，趋势，问题，挑战和任务，这些数据集，趋势，问题，挑战和任务是无法检测到的。我们的研究提供了讽刺数据集，讽刺特征及其提取方法以及各种方法的性能分析，这些表可以帮助相关领域的研究人员了解当前的讽刺检测中最新实践。

translated by 谷歌翻译

Skin Lesion Analysis: A State-of-the-Art Survey, Systematic Review, and Future Trends

Md. Kamrul Hasan , Md. Asif Ahamad , Choon Hwai Yap , Guang Yang

分类：计算机视觉 | 机器学习

2022-08-25

用于皮肤病变分析的计算机辅助诊断系统（CAD）系统是一个新兴的研究领域，有可能减轻皮肤癌筛查的负担和成本。研究人员最近表示，对开发此类CAD系统的兴趣日益增加，目的是向皮肤科医生提供用户友好的工具，以减少手动检查提出的挑战。本文的目的是提供对2011年至2020年之间发表的尖端CAD技术的完整文献综述。使用系统评价和荟萃分析的首选报告项目（PRISMA）方法用于确定总共365个出版物，221用于皮肤病变细分，144用于皮肤病变分类。这些文章以多种不同的方式进行分析和汇总，以便我们可以贡献有关CAD系统发展方法的重要信息。这些方式包括：相关和基本的定义和理论，输入数据（数据集利用，预处理，增强和解决不平衡问题），方法配置（技术，体系结构，模块框架和损失），培训策略（超级表格设置）以及评估（评估）标准（指标）。我们还打算研究各种增强性能的方法，包括合奏和后处理。此外，在这项调查中，我们强调了使用最小数据集评估皮肤病变细分和分类系统的主要问题，以及对这些困境的潜在解决方案。总之，讨论了启发性发现，建议和趋势，目的是为了在关注的相关领域进行未来的研究监视。可以预见的是，它将在开发自动化和健壮的CAD系统进行皮肤病变分析的过程中指导从初学者到专家的各个级别的研究人员。

translated by 谷歌翻译

HTML版本

Lung-Originated Tumor Segmentation from Computed Tomography Scan (LOTUS) Benchmark

Parnian Afshar , Arash Mohammadi , Konstantinos N. Plataniotis , Keyvan Farahani , Justin Kirby , Anastasia Oikonomou , Amir Asif , Leonard Wee , Andre Dekker , Xin Wu

分类：计算机视觉 | 机器学习

2022-01-03

肺癌是最致命的癌症之一，部分诊断和治疗取决于肿瘤的准确描绘。目前是最常见的方法的人以人为本的分割，须遵守观察者间变异性，并且考虑到专家只能提供注释的事实，也是耗时的。最近展示了有前途的结果，自动和半自动肿瘤分割方法。然而，随着不同的研究人员使用各种数据集和性能指标验证了其算法，可靠地评估这些方法仍然是一个开放的挑战。通过2018年IEEE视频和图像处理（VIP）杯竞赛创建的计算机断层摄影扫描（LOTUS）基准测试的肺起源肿瘤分割的目标是提供唯一的数据集和预定义的指标，因此不同的研究人员可以开发和以统一的方式评估他们的方法。 2018年VIP杯始于42个国家的全球参与，以获得竞争数据。在注册阶段，有129名成员组成了来自10个国家的28个团队，其中9个团队将其达到最后阶段，6队成功完成了所有必要的任务。简而言之，竞争期间提出的所有算法都是基于深度学习模型与假阳性降低技术相结合。三种决赛选手开发的方法表明，有希望的肿瘤细分导致导致越来越大的努力应降低假阳性率。本次竞争稿件概述了VIP-Cup挑战，以及所提出的算法和结果。

translated by 谷歌翻译

Is Speech Emotion Recognition Language-Independent? Analysis of English and Bangla Languages using Language-Independent Vocal Features

Fardin Saad , Hasan Mahmud , Md. Alamin Shaheen , Md. Kamrul Hasan , Paresha Farastu

分类：自然语言处理 | 机器学习

2021-11-21

识别语音情绪的语言不可知论的方法仍然是一个不完整和具有挑战性的任务。在本文中，我们使用Bangla和英语语言来评估与语音中的情感是否与语言无关。这项研究分类了以下情绪：幸福，愤怒，中立，悲伤，厌恶和恐惧。我们雇用了三种情绪言论，其中前两组是由孟加拉和英语语言的本土孟加拉语扬声器开发的。第三个是多伦多情感演讲（苔丝），由加拿大母语的英语发言者开发。我们仔细选择了语言无关的韵律特征，采用了支持向量机（SVM）模型，并进行了三个实验来执行我们的主张。在第一个实验中，我们单独测量三种语音组的性能。接下来是第二种实验，我们通过组合语音集来记录分类率。最后，在第三个实验中，我们通过培训和测试不同语音集来测量识别率。虽然这项研究表明，言语情感认可（SER）大多是语言无关的，但在识别出在这两种语言中的厌恶和恐惧之类的情绪状态时存在一些差异。此外，我们的调查推断出非母语人员通过言语传达情绪，就像以其母语在母语中表达自己。

translated by 谷歌翻译

Explainable Artificial Intelligence for Smart City Application: A Secure and Trusted Platform

M. Humayn Kabir , Khondokar Fida Hasan , Mohammad Kamrul Hasan , Keyvan Ansari

分类：机器学习

2021-10-31

人工智能（AI）是塑造未来的颠覆性技术之一。它在主要智能城市解决方案中的数据驱动决策越来越多，包括运输，教育，医疗保健，公共治理和电力系统。与此同时，它在保护Cyber威胁，攻击，损害或未授权访问中保护关键网络基础设施时越来越受欢迎。然而，那些传统的AI技术的重要问题之一（例如，深度学习）是，复杂性和复杂性的快速进展推进，并原始是不可诠释的黑匣子。在很多场合，了解控制和信任系统意外或看似不可预测的输出的决策和偏见是非常具有挑战性的。承认，对决策可解释性的控制丧失成为许多数据驱动自动化应用的重要问题。但它可能会影响系统的安全性和可信度吗？本章对网络安全的机器学习应用进行了全面的研究，以表示需要解释来解决这个问题。在这样做的同时，本章首先探讨了智能城市智能城市安全应用程序的AI技术的黑匣子问题。后来，考虑到新的技术范式，解释说明的人工智能（XAI），本章讨论了从黑盒到白盒的过渡。本章还讨论了关于智能城市应用不同自治系统在应用基于AI的技术的解释性，透明度，可辨能力和解释性的过渡要求。最后，它介绍了一些商业XAI平台，在提出未来的挑战和机遇之前，对传统的AI技术提供解释性。

translated by 谷歌翻译

A Deep Learning-based Multimodal Depth-Aware Dynamic Hand Gesture Recognition System

Hasan Mahmud , Mashrur M. Morshed , Md. Kamrul Hasan

分类：计算机视觉 | 机器学习

2021-07-06

动态手势识别任务已经看过各种单向和多式联运方法的研究。此前，研究人员已经探索了深度和基于2D骨架的多模式融合CRNN（卷积经常性神经网络），但在获得预期识别结果方面存在局限性。在本文中，我们重新审视了这种方法来手势识别并提出了几种改进。我们观察到原始深度图像在感兴趣的手区域（ROI）中具有低对比度。它们不突出显示重要的精细细节，例如手指方向，在手指和手掌之间重叠，或在多个手指之间重叠。因此，我们提出将深度值量化到几个离散区域中，以在手的若干关键部分之间产生更高的对比度。此外，我们提出了几种方法来解决现有多模式融合CRNN架构中的高方差问题。我们在两个基准上评估我们的方法：DHG-14/28数据集和SHREC'17 Track DataSet。我们的方法显示了先前类似的多模式方法的准确性和参数效率的显着提高，与最先进的结果相当的结果。

translated by 谷歌翻译

Look, Listen, and Attack: Backdoor Attacks Against Video Action Recognition

Hasan Abed Al Kader Hammoud , Shuming Liu , Mohammad Alkhrasi , Fahad AlBalawi , Bernard Ghanem

分类：计算机视觉 | 机器学习

2023-01-03

Deep neural networks (DNNs) are vulnerable to a class of attacks called "backdoor attacks", which create an association between a backdoor trigger and a target label the attacker is interested in exploiting. A backdoored DNN performs well on clean test images, yet persistently predicts an attacker-defined label for any sample in the presence of the backdoor trigger. Although backdoor attacks have been extensively studied in the image domain, there are very few works that explore such attacks in the video domain, and they tend to conclude that image backdoor attacks are less effective in the video domain. In this work, we revisit the traditional backdoor threat model and incorporate additional video-related aspects to that model. We show that poisoned-label image backdoor attacks could be extended temporally in two ways, statically and dynamically, leading to highly effective attacks in the video domain. In addition, we explore natural video backdoors to highlight the seriousness of this vulnerability in the video domain. And, for the first time, we study multi-modal (audiovisual) backdoor attacks against video action recognition models, where we show that attacking a single modality is enough for achieving a high attack success rate.

translated by 谷歌翻译

Relevance Classification of Flood-related Twitter Posts via Multiple Transformers

Wisal Mukhtiar , Waliiya Rizwan , Aneela Habib , Yasir Saleem Afridi , Laiq Hasan , Kashif Ahmad

分类：自然语言处理

2023-01-01

In recent years, social media has been widely explored as a potential source of communication and information in disasters and emergency situations. Several interesting works and case studies of disaster analytics exploring different aspects of natural disasters have been already conducted. Along with the great potential, disaster analytics comes with several challenges mainly due to the nature of social media content. In this paper, we explore one such challenge and propose a text classification framework to deal with Twitter noisy data. More specifically, we employed several transformers both individually and in combination, so as to differentiate between relevant and non-relevant Twitter posts, achieving the highest F1-score of 0.87.

translated by 谷歌翻译

A Comparison Study of Deep CNN Architecture in Detecting of Pneumonia

Al Mohidur Rahman Porag , Md. Mahedi Hasan , Dr. Md Taimur Ahad

分类：计算机视觉 | 机器学习

2022-12-30

Pneumonia, a respiratory infection brought on by bacteria or viruses, affects a large number of people, especially in developing and impoverished countries where high levels of pollution, unclean living conditions, and overcrowding are frequently observed, along with insufficient medical infrastructure. Pleural effusion, a condition in which fluids fill the lung and complicate breathing, is brought on by pneumonia. Early detection of pneumonia is essential for ensuring curative care and boosting survival rates. The approach most usually used to diagnose pneumonia is chest X-ray imaging. The purpose of this work is to develop a method for the automatic diagnosis of bacterial and viral pneumonia in digital x-ray pictures. This article first presents the authors' technique, and then gives a comprehensive report on recent developments in the field of reliable diagnosis of pneumonia. In this study, here tuned a state-of-the-art deep convolutional neural network to classify plant diseases based on images and tested its performance. Deep learning architecture is compared empirically. VGG19, ResNet with 152v2, Resnext101, Seresnet152, Mobilenettv2, and DenseNet with 201 layers are among the architectures tested. Experiment data consists of two groups, sick and healthy X-ray pictures. To take appropriate action against plant diseases as soon as possible, rapid disease identification models are preferred. DenseNet201 has shown no overfitting or performance degradation in our experiments, and its accuracy tends to increase as the number of epochs increases. Further, DenseNet201 achieves state-of-the-art performance with a significantly a smaller number of parameters and within a reasonable computing time. This architecture outperforms the competition in terms of testing accuracy, scoring 95%. Each architecture was trained using Keras, using Theano as the backend.

translated by 谷歌翻译