We consider polynomial optimization problems (POP) on a semialgebraic set contained in the nonnegative orthant (every POP on a compact set can be put in this format by a simple translation of the origin). Such a POP can be converted to an equivalent POP by squaring each variable. Using even symmetry and the concept of factor width, we propose a hierarchy of semidefinite relaxations based on the extension of Pólya's Positivstellensatz by Dickinson-Povh. As its distinguishing and crucial feature, the maximal matrix size of each resulting semidefinite relaxation can be chosen arbitrarily; in addition, we prove that the sequence of values returned by the new hierarchy converges to the optimal value of the original POP at a rate of $O(\varepsilon^{-c})$ if the semialgebraic set has nonempty interior. When applied to (i) robustness certification of multi-layer neural networks and (ii) computation of positive maximal singular values, our method based on Pólya's Positivstellensatz provides better bounds and runs several hundred times faster than the standard moment hierarchy.
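The origin-translation and variable-squaring steps mentioned above can be made explicit; a sketch in standard notation (not taken verbatim from the paper):

```latex
\min_{x \in S} f(x), \quad S \subseteq \mathbb{R}^n_{\ge 0}
\quad\Longleftrightarrow\quad
\min_{y \in \mathbb{R}^n} f(y_1^2,\dots,y_n^2)
\ \text{ s.t. }\ (y_1^2,\dots,y_n^2) \in S ,
```

where the substituted problem is invariant under each sign flip $y_i \mapsto -y_i$, which is the even symmetry the hierarchy exploits.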
translated by 谷歌翻译
Pareto Front Learning (PFL) was recently introduced as an effective approach for obtaining a mapping from a given trade-off vector to a solution on the Pareto front, thereby solving the multi-objective optimization (MOO) problem. Due to the inherent trade-offs between conflicting objectives, PFL offers a flexible approach in many scenarios where decision makers cannot specify a preference for one Pareto solution over another and must switch between them depending on the situation. However, existing PFL methods ignore the relationship between solutions during the optimization process, which limits the quality of the obtained front. To overcome this issue, we propose a novel PFL framework, namely \ourmodel, which employs a hypernetwork to generate multiple solutions from a set of diverse trade-off preferences and enhances the quality of the Pareto front by maximizing the hypervolume indicator defined by these solutions. Experimental results on several MOO machine learning tasks show that the proposed framework significantly outperforms the baselines in producing the trade-off Pareto front.
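The hypervolume indicator maximized above has a simple exact form in the bi-objective case; a minimal sketch for minimization problems (the solution set and reference point are illustrative, not from the paper):

```python
def hypervolume_2d(solutions, ref):
    """Exact hypervolume of a set of 2-objective (minimization) solutions
    w.r.t. a reference point `ref` dominated by every solution: the area
    of the region dominated by the set, bounded by `ref`."""
    hv = 0.0
    prev_f2 = ref[1]
    # Walk the staircase: sort by the first objective ascending.
    for f1, f2 in sorted(solutions):
        if f2 < prev_f2:  # non-dominated point adds a new rectangle
            hv += (ref[0] - f1) * (prev_f2 - f2)
            prev_f2 = f2
    return hv
```

Dominated points are skipped automatically because they never lower `prev_f2`; larger hypervolume means a front closer to the true Pareto front and better spread.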
Diabetic Retinopathy (DR) is a leading cause of vision loss in the world, and early DR detection is necessary to prevent vision loss and support appropriate treatment. In this work, we leverage interactive machine learning and introduce a joint learning framework, termed DRG-Net, to effectively learn both disease grading and multi-lesion segmentation. Our DRG-Net consists of two modules: (i) DRG-AI-System to classify DR grading, localize lesion areas, and provide visual explanations; (ii) DRG-Expert-Interaction to receive feedback from expert users and improve the DRG-AI-System. To deal with sparse data, we utilize transfer learning mechanisms to extract invariant feature representations by using Wasserstein distance and adversarial learning-based entropy minimization. In addition, we propose a novel attention strategy at both low- and high-level features to automatically select the most significant lesion information and provide explainable properties. In terms of human interaction, we further develop DRG-Net as a tool that enables expert users to correct the system's predictions, which may then be used to update the system as a whole. Moreover, thanks to the attention mechanism and the loss-function constraints between lesion features and classification features, our approach is robust to a certain level of noise in user feedback. We have benchmarked DRG-Net on the two largest DR datasets, i.e., IDRID and FGADR, and compared it to various state-of-the-art deep learning networks. In addition to outperforming other SOTA approaches, DRG-Net is effectively updated using user feedback, even in a weakly-supervised manner.
Adversarial machine learning has been both a major concern and a hot topic recently, especially with the ubiquitous use of deep neural networks in the current landscape. Adversarial attacks and defenses are usually likened to a cat-and-mouse game in which defenders and attackers evolve over time. On one hand, the goal is to develop strong and robust deep networks that are resistant to malicious actors. On the other hand, to achieve that, we need to devise even stronger adversarial attacks to challenge these defense models. Most existing attacks employ a single $\ell_p$ distance (commonly, $p\in\{1,2,\infty\}$) to define the concept of closeness and perform steepest gradient ascent w.r.t. this $p$-norm to update all pixels in an adversarial example in the same way. These $\ell_p$ attacks each have their own pros and cons, and there is no single attack that can successfully break through defense models that are robust against multiple $\ell_p$ norms simultaneously. Motivated by these observations, we come up with a natural approach: combining various $\ell_p$ gradient projections at the pixel level to achieve a joint adversarial perturbation. Specifically, we learn how to perturb each pixel to maximize the attack performance, while maintaining the overall visual imperceptibility of adversarial examples. Finally, through various experiments with standardized benchmarks, we show that our method outperforms most current strong attacks across state-of-the-art defense mechanisms, while remaining visually clean.
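The idea of combining $\ell_p$ steepest-ascent directions can be sketched with two norms and scalar mixing weights; the paper learns per-pixel weights, so this is a simplified, illustrative stand-in:

```python
import numpy as np

def joint_lp_step(grad, weights, alpha=0.01):
    """One hedged sketch of a joint perturbation step: mix the steepest
    gradient-ascent directions under l_inf and l_2 with weights
    (w_inf, w_2). `grad` is the loss gradient w.r.t. the input."""
    d_inf = np.sign(grad)                          # l_inf steepest ascent
    d_2 = grad / (np.linalg.norm(grad) + 1e-12)    # l_2 steepest ascent
    w_inf, w_2 = weights
    return alpha * (w_inf * d_inf + w_2 * d_2)     # combined update
```

In an actual attack this step would be applied iteratively, followed by a projection back into the allowed perturbation budget; both are omitted here for brevity.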
Collecting large-scale medical datasets with fully annotated samples for training deep networks is prohibitively expensive, especially for 3D volume data. Recent breakthroughs in self-supervised learning (SSL) offer the ability to overcome the lack of labeled training samples by learning feature representations from unlabeled data. However, most current SSL techniques in the medical field have been designed for either 2D images or 3D volumes. In practice, this restricts the capability to fully leverage unlabeled data from numerous sources, which may include both 2D and 3D data. Additionally, the use of these pre-trained networks is constrained to downstream tasks with compatible data dimensions. In this paper, we propose a novel framework for unsupervised joint learning on 2D and 3D data modalities. Given a set of 2D images or 2D slices extracted from 3D volumes, we construct an SSL task based on a 2D contrastive clustering problem for distinct classes. The 3D volumes are exploited by computing a vector embedding at each slice and then assembling a holistic feature through deformable self-attention mechanisms in a Transformer, allowing the incorporation of long-range dependencies between slices within 3D volumes. These holistic features are further utilized to define a novel 3D clustering-agreement-based SSL task and masked embedding prediction inspired by pre-trained language models. Experiments on downstream tasks, such as 3D brain segmentation, lung nodule detection, 3D heart structure segmentation, and abnormal chest X-ray detection, demonstrate the effectiveness of our joint 2D and 3D SSL approach. We improve plain 2D DeepClusterV2 and SwAV by a significant margin and also surpass various modern 2D and 3D SSL approaches.
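Assembling a holistic volume feature from per-slice embeddings can be illustrated with plain scaled dot-product self-attention; the paper uses deformable self-attention inside a Transformer, so this is a deliberately simplified stand-in:

```python
import numpy as np

def attention_pool(slice_embs):
    """Pool per-slice embeddings (n_slices, d) into one holistic volume
    feature: each slice attends to every other slice (capturing
    long-range inter-slice dependencies), then the attended features
    are averaged."""
    E = np.asarray(slice_embs, dtype=float)         # (n_slices, d)
    d = E.shape[1]
    scores = E @ E.T / np.sqrt(d)                   # slice-to-slice scores
    scores -= scores.max(axis=1, keepdims=True)     # numerical stability
    A = np.exp(scores)
    A /= A.sum(axis=1, keepdims=True)               # row-wise softmax
    ctx = A @ E                                     # attended slice features
    return ctx.mean(axis=0)                         # holistic feature (d,)
```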
We present the first empirical study investigating the influence of disfluency detection on the downstream tasks of intent detection and slot filling. We perform this study for Vietnamese, a low-resource language that has no previous study as well as no public dataset available for disfluency detection. First, we extend the fluent Vietnamese intent detection and slot filling dataset PhoATIS by manually adding contextual disfluencies and annotating them. Then, we conduct experiments using strong baselines for disfluency detection and joint intent detection and slot filling, which are based on pre-trained language models. We find that: (i) disfluencies produce negative effects on the performance of the downstream intent detection and slot filling tasks, and (ii) in the disfluency context, the pre-trained multilingual language model XLM-R helps produce better intent detection and slot filling performance than the pre-trained monolingual language model PhoBERT, which is the opposite of what is generally found in the fluent context.
Meta-learning is an effective method to handle imbalanced and noisy-label learning, but it depends on a validation set containing randomly selected, manually labelled, and balanced-distributed samples. The random selection, manual labelling, and balancing of this validation set is not only sub-optimal for meta-learning, but also scales poorly with the number of classes. Hence, recent meta-learning papers have proposed ad-hoc heuristics to automatically build and label this validation set, but these heuristics remain sub-optimal for meta-learning. In this paper, we analyse the meta-learning algorithm and propose new criteria to characterise the utility of the validation set, based on: 1) the informativeness of the validation set; 2) the class-distribution balance of the set; and 3) the correctness of the labels of the set. Furthermore, we propose a new imbalanced noisy-label meta-learning (INOLML) algorithm that automatically builds a validation set by maximising its utility using the criteria above. Our method shows significant improvements over previous meta-learning approaches and sets a new state of the art on several benchmarks.
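Two of the three utility criteria above admit simple proxies; a toy scoring sketch where the combination weights and the use of per-sample loss as an informativeness proxy are illustrative assumptions, not the paper's formulation:

```python
import numpy as np

def validation_utility(labels, losses, num_classes):
    """Toy utility of a candidate validation set combining:
    - balance: normalized entropy of the class histogram (1.0 = uniform),
    - informativeness: mean per-sample loss (harder samples score higher).
    Label correctness, the third criterion, needs a noise model and is
    omitted here."""
    counts = np.bincount(np.asarray(labels), minlength=num_classes)
    p = counts / counts.sum()
    nz = p[p > 0]
    balance = -(nz * np.log(nz)).sum() / np.log(num_classes)
    informativeness = float(np.mean(losses))
    return balance + informativeness  # illustrative equal weighting
```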
Wireless sensor networks consist of randomly distributed sensor nodes for monitoring targets or areas of interest. Maintaining the network for continuous surveillance is a challenge due to the limited battery capacity of each sensor. Wireless power transfer technology is emerging as a reliable solution for energizing the sensors by deploying a mobile charger (MC) to recharge them. However, designing an optimal charging path for the MC is challenging because of uncertainties arising in the network. The energy consumption rate of the sensors may fluctuate significantly due to unpredictable changes in the network topology, such as node failures. These changes also lead to shifts in the importance of each sensor, which existing works typically assume to be identical. In this paper, we address these challenges by proposing a novel adaptive charging scheme using a deep reinforcement learning (DRL) approach. Specifically, we endow the MC with a charging policy that determines the next sensor to charge conditioned on the current state of the network. We then use a deep neural network to parametrize this charging policy, which is trained by reinforcement learning techniques. Our model can adapt to spontaneous changes in the network topology. Empirical results show that the proposed algorithm outperforms existing on-demand algorithms by a significant margin.
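The policy described above maps the network state to a choice of next sensor; a minimal linear stand-in (the paper trains a deep network with RL, and the features and weights here are hypothetical):

```python
import numpy as np

def next_sensor(state, W):
    """Score each sensor from its state features (e.g. residual energy,
    consumption rate, distance to the MC) with weight vector W, turn the
    scores into a softmax distribution, and act greedily. During RL
    training one would sample from `probs` instead of taking argmax."""
    logits = state @ W                       # (n_sensors,) one score each
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()                     # softmax over sensors
    return int(np.argmax(probs))             # greedy action: sensor index
```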
The research reported in this paper transforms an ordinary trash bin into a smarter one by applying computer vision technology. With the support of sensor and actuator devices, the trash bin can automatically classify garbage. In particular, a camera on the trash bin takes pictures of the trash, and the central processing unit then analyzes them and decides which compartment to drop the trash into. The accuracy of our bin system reaches 90%. In addition, our model is connected to the Internet to update the bin status for further management. A mobile application is developed for managing bins.
The COVID-19 pandemic has exposed the vulnerability of healthcare services worldwide, raising the need to develop novel tools that provide rapid and cost-effective screening and diagnosis. Clinical reports indicate that COVID-19 infection may cause cardiac injury, and electrocardiograms (ECG) may serve as a diagnostic biomarker for COVID-19. This study aims to utilize ECG signals to detect COVID-19 automatically. We propose a novel method to extract ECG signals from ECG paper records, which are then fed into a one-dimensional convolutional neural network (1D-CNN) to learn and diagnose the disease. To evaluate the quality of the digitized signals, R peaks in the paper-based ECG images are labeled. Afterward, the RR intervals calculated from each image are compared to the RR intervals of the corresponding digitized signal. Experiments on the COVID-19 ECG image dataset demonstrate that the proposed digitization method captures the original signals correctly, with a mean absolute error of 28.11 ms. Our proposed 1D-CNN model, trained on the digitized ECG signals, accurately identifies individuals with COVID-19, with classification accuracies of 98.42%, 95.63%, and 98.50% for classifying COVID-19 vs. Normal, COVID-19 vs. Abnormal Heartbeats, and COVID-19 vs. other classes, respectively. Furthermore, the proposed approach also achieves high performance on the multi-classification task. Our findings indicate that a deep learning system trained on digitized ECG signals can serve as a potential tool for diagnosing COVID-19.
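The RR-interval evaluation described above reduces to differencing the labeled R-peak positions and comparing the two sequences; a minimal sketch (the peak timestamps are illustrative):

```python
import numpy as np

def rr_mae(peaks_image_ms, peaks_signal_ms):
    """Mean absolute error (ms) between RR intervals derived from R peaks
    labeled in the paper ECG image and those of the digitized signal.
    An RR interval is the difference between consecutive R-peak times."""
    rr_img = np.diff(np.asarray(peaks_image_ms, dtype=float))
    rr_sig = np.diff(np.asarray(peaks_signal_ms, dtype=float))
    n = min(len(rr_img), len(rr_sig))       # align in case of missed peaks
    return float(np.mean(np.abs(rr_img[:n] - rr_sig[:n])))
```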