智能论文笔记

Reliable Joint Segmentation of Retinal Edema Lesions in OCT Images

Meng Wang , Kai Yu , Chun-Mei Feng , Ke Zou , Yanyu Xu , Qingquan Meng , Rick Siow Mong Goh , Yong Liu , Xinxing Xu , Huazhu Fu

分类：计算机视觉

2022-12-01

Focusing on the complicated pathological features, such as blurred boundaries, severe scale differences between symptoms, background noise interference, etc., in the task of retinal edema lesions joint segmentation from OCT images and enabling the segmentation results more reliable. In this paper, we propose a novel reliable multi-scale wavelet-enhanced transformer network, which can provide accurate segmentation results with reliability assessment. Specifically, aiming at improving the model's ability to learn the complex pathological features of retinal edema lesions in OCT images, we develop a novel segmentation backbone that integrates a wavelet-enhanced feature extractor network and a multi-scale transformer module of our newly designed. Meanwhile, to make the segmentation results more reliable, a novel uncertainty segmentation head based on the subjective logical evidential theory is introduced to generate the final segmentation results with a corresponding overall uncertainty evaluation score map. We conduct comprehensive experiments on the public database of AI-Challenge 2018 for retinal edema lesions segmentation, and the results show that our proposed method achieves better segmentation accuracy with a high degree of reliability as compared to other state-of-the-art segmentation approaches. The code will be released on: https://github.com/LooKing9218/ReliableRESeg.

translated by 谷歌翻译

BinImg2Vec: Augmenting Malware Binary Image Classification with Data2Vec

Joon Sern Lee , Kai Keng Tay , Zong Fu Chua

分类：计算机视觉 | 机器学习

2022-09-02

COVID-19大流行刺激的快速数字化导致了更多的网络犯罪。现在，恶意软件即服务是网络犯罪分子的蓬勃发展的业务。随着恶意软件活动的激增，对于网络辩护人来说，更多地了解他们手头的恶意软件样本，因为这些信息可以极大地影响他们在违规过程中的下一步行动。最近，研究人员展示了如何通过将恶意软件二进制文件转换为灰度图像，然后通过神经网络进行分类来完成恶意软件家庭分类。但是，大多数工作着重于研究不同神经网络体系结构对分类性能的影响。在去年，研究人员表明，通过自我监督学习来增强监督学习可以提高绩效。甚至最近，Data2Vec被提议为一种训练神经网络的情态自我监督框架。在本文中，我们介绍了Binimg2Vec，这是一个培训恶意软件二进制图像分类器的框架，该框架既包含了自我监督的学习和监督学习，又可以产生一个模型，该模型始终优于仅通过监督学习而受过培训的模型。我们能够在分类性能上提高4％，并在多次运行中降低0.5％的性能差异。我们还展示了我们的框架如何产生可以很好地聚类的嵌入，从而促进模型的解释。

translated by 谷歌翻译

HTML版本

Transfer learning to decode brain states reflecting the relationship between cognitive tasks

Youzhi Qu , Xinyao Jian , Wenxin Che , Penghui Du , Kai Fu , Quanying Liu

分类：人工智能 | 机器学习

2022-06-07

转移学习通过利用特定源任务的数据来提高目标任务的性能：源和目标任务之间的关系越接近，通过转移学习的绩效提高越大。在神经科学中，认知任务之间的关系通常由激活的大脑区域或神经表示的相似性表示。但是，没有研究将转移学习和神经科学联系起来，以揭示认知任务之间的关系。在这项研究中，我们提出了一个转移学习框架，以反映认知任务之间的关系，并比较通过转移学习和大脑区域（例如Neurosynth）反映的任务关系。我们的转移学习结果创建了认知任务，以反映认知任务之间的关系，这与来自神经合成的任务关系非常一致。如果源和目标认知任务激活相似的大脑区域，则转移学习在任务解码方面的性能更好。我们的研究发现了多个认知任务的关系，并为基于小样本数据的神经解码转移学习中的源任务选择提供了指导。

translated by 谷歌翻译

Enhancing Quality of Pose-varied Face Restoration with Local Weak Feature Sensing and GAN Prior

Kai Hu , Yu Liu , Renhe Liu , Wei Lu , Gang Yu , Bin Fu

分类：计算机视觉

2022-05-28

近年来，面部语义指导（包括面部地标，面部热图和面部解析图）和面部生成对抗网络（GAN）近年来已广泛用于盲面修复（BFR）。尽管现有的BFR方法在普通案例中取得了良好的性能，但这些解决方案在面对严重降解和姿势变化的图像时具有有限的弹性（例如，在现实世界情景中看起来右，左看，笑等）。在这项工作中，我们提出了一个精心设计的盲人面部修复网络，具有生成性面部先验。所提出的网络主要由非对称编解码器和stylegan2先验网络组成。在非对称编解码器中，我们采用混合的多路残留块（MMRB）来逐渐提取输入图像的弱纹理特征，从而可以更好地保留原始面部特征并避免过多的幻想。 MMRB也可以在其他网络中插入插件。此外，多亏了StyleGAN2模型的富裕和多样化的面部先验，我们采用了微调的方法来灵活地恢复自然和现实的面部细节。此外，一种新颖的自我监督训练策略是专门设计用于面部修复任务的，以使分配更接近目标并保持训练稳定性。关于合成和现实世界数据集的广泛实验表明，我们的模型在面部恢复和面部超分辨率任务方面取得了卓越的表现。

translated by 谷歌翻译

PPA: Preference Profiling Attack Against Federated Learning

Chunyi Zhou , Yansong Gao , Anmin Fu , Kai Chen , Zhiyang Dai , Zhi Zhang , Minhui Xue , Yuqing Zhang

分类：机器学习

2022-02-10

联合学习（FL）在许多分散的用户中训练全球模型，每个用户都有本地数据集。与传统的集中学习相比，FL不需要直接访问本地数据集，因此旨在减轻数据隐私问题。但是，由于推理攻击，包括成员推理，属性推理和数据反演，FL中的数据隐私泄漏仍然存在。在这项工作中，我们提出了一种新型的隐私推理攻击，创造的偏好分析攻击（PPA），它准确地介绍了本地用户的私人偏好，例如，最喜欢（不喜欢）来自客户的在线购物中的（不喜欢）项目和最常见的表达式从用户的自拍照中。通常，PPA可以在本地客户端（用户）的特征上介绍top-k（即，尤其是k = 1、2、3和k = 1）的偏好。我们的关键见解是，本地用户模型的梯度变化对给定类别的样本比例（尤其是大多数（少数）类别的样本比例具有明显的敏感性。通过观察用户模型对类的梯度敏感性，PPA可以介绍用户本地数据集中类的样本比例，从而公开用户对类的偏好。 FL的固有统计异质性进一步促进了PPA。我们使用四个数据集（MNIST，CIFAR10，RAF-DB和PRODUCTS-10K）广泛评估了PPA的有效性。我们的结果表明，PPA分别达到了MNIST和CIFAR10的90％和98％的TOP-1攻击精度。更重要的是，在实际的购物商业商业场景（即产品-10k）和社交网络（即RAF-DB）中，PPA在前一种情况下，PPA获得了78％的TOP-1攻击精度，以推断出最有序的物品（即作为商业竞争对手），在后一种情况下，有88％来推断受害者用户最常见的面部表情，例如恶心。

translated by 谷歌翻译

Learning to Navigate in a VUCA Environment: Hierarchical Multi-expert Approach

Wenqi Zhang , Kai Zhao , Peng Li , Xiao Zhu , Faping Ye , Weijie Jiang , Huiqiao Fu , Tao Wang

分类：机器人

2021-11-16

尽管数十年的努力，但在真正的情景中的机器人导航具有波动性，不确定性，复杂性和歧义（vuca短暂），仍然是一个具有挑战性的话题。受到中枢神经系统（CNS）的启发，我们提出了一个在Vuca环境中的自主导航的分层多专家学习框架。通过考虑目标位置，路径成本和安全水平的启发式探索机制，上层执行同时映射探索和路线规划，以避免陷入盲巷，类似于CNS中的大脑。使用本地自适应模型融合多种差异策略，下层追求碰撞 - 避免和直接策略之间的平衡，作为CNS中的小脑。我们在多个平台上进行仿真和实际实验，包括腿部和轮式机器人。实验结果表明我们的算法在任务成就，时间效率和安全性方面优于现有方法。

translated by 谷歌翻译

Exploring Separable Attention for Multi-Contrast MR Image Super-Resolution

Chun-Mei Feng , Yunlu Yan , Kai Yu , Yong Xu , Ling Shao , Huazhu Fu

分类：计算机视觉

2021-09-03

在相应的辅助对比的指导下，目标对比度的超级分辨磁共振（MR）图像（提供了其他解剖信息）是快速MR成像的新解决方案。但是，当前的多对比超分辨率（SR）方法倾向于直接连接不同的对比度，从而忽略了它们在不同的线索中的关系，例如在高强度和低强度区域中。在这项研究中，我们提出了一个可分离的注意网络（包括高强度的优先注意力和低强度分离注意力），名为SANET。我们的卫生网可以借助辅助对比度探索“正向”和“反向”方向中高强度和低强度区域的区域，同时学习目标对比MR的SR的更清晰的解剖结构和边缘信息图片。 SANET提供了三个吸引人的好处：（1）这是第一个探索可分离的注意机制的模型，该机制使用辅助对比来预测高强度和低强度区域，将更多的注意力转移到精炼这些区域和这些区域之间的任何不确定细节和纠正重建结果中的细小区域。（2）提出了一个多阶段集成模块，以学习多个阶段的多对比度融合的响应，获得融合表示之间的依赖性，并提高其表示能力。（3）在FastMRI和Clinical \ textit {in Vivo}数据集上进行了各种最先进的多对比度SR方法的广泛实验，证明了我们模型的优势。

translated by 谷歌翻译

On Learning the Right Attention Point for Feature Enhancement

Liqiang Lin , Pengdi Huang , Chi-Wing Fu , Kai Xu , Hao Zhang , Hui Huang

分类：计算机视觉

2020-12-11

我们提出了一种基于注意力的新型机制，可以学习用于点云处理任务的增强点特征，例如分类和分割。与先前的作品不同，该作品经过培训以优化预选的一组注意点的权重，我们的方法学会了找到最佳的注意点，以最大程度地提高特定任务的性能，例如点云分类。重要的是，我们主张使用单个注意点来促进语义理解在点特征学习中。具体而言，我们制定了一种新的简单卷积，该卷积结合了输入点及其相应学习的注意点或膝盖的卷积特征。我们的注意机制可以轻松地纳入最新的点云分类和分割网络中。对诸如ModelNet40，ShapenetPart和S3DIS之类的常见基准测试的广泛实验都表明，我们的支持LAP的网络始终优于各自的原始网络，以及其他竞争性替代方案，这些替代方案在我们的膝盖下采用了多个注意力框架。

translated by 谷歌翻译

Image Super-Resolution Using Very Deep Residual Channel Attention Networks

Yulun Zhang , Kunpeng Li , Kai Li , Lichen Wang , Bineng Zhong , Yun Fu

分类：

2018-07-08

Convolutional neural network (CNN) depth is of crucial importance for image super-resolution (SR). However, we observe that deeper networks for image SR are more difficult to train. The lowresolution inputs and features contain abundant low-frequency information, which is treated equally across channels, hence hindering the representational ability of CNNs. To solve these problems, we propose the very deep residual channel attention networks (RCAN). Specifically, we propose a residual in residual (RIR) structure to form very deep network, which consists of several residual groups with long skip connections. Each residual group contains some residual blocks with short skip connections. Meanwhile, RIR allows abundant low-frequency information to be bypassed through multiple skip connections, making the main network focus on learning high-frequency information. Furthermore, we propose a channel attention mechanism to adaptively rescale channel-wise features by considering interdependencies among channels. Extensive experiments show that our RCAN achieves better accuracy and visual improvements against state-of-the-art methods.

translated by 谷歌翻译

MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark

Shuhao Shi , Kai Qiao , Jian Chen , Shuai Yang , Jie Yang , Baojie Song , Linyuan Wang , Bin Yan

分类：计算机视觉

2023-01-03

The development of social media user stance detection and bot detection methods rely heavily on large-scale and high-quality benchmarks. However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, suppressing graph-based account detection research. To address these issues, we propose a Multi-Relational Graph-Based Twitter Account Detection Benchmark (MGTAB), the first standardized graph-based benchmark for account detection. To our knowledge, MGTAB was built based on the largest original data in the field, with over 1.55 million users and 130 million tweets. MGTAB contains 10,199 expert-annotated users and 7 types of relationships, ensuring high-quality annotation and diversified relations. In MGTAB, we extracted the 20 user property features with the greatest information gain and user tweet features as the user features. In addition, we performed a thorough evaluation of MGTAB and other public datasets. Our experiments found that graph-based approaches are generally more effective than feature-based approaches and perform better when introducing multiple relations. By analyzing experiment results, we identify effective approaches for account detection and provide potential future research directions in this field. Our benchmark and standardized evaluation procedures are freely available at: https://github.com/GraphDetec/MGTAB.

translated by 谷歌翻译