智能论文笔记

Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies

Runzhe Wan , Yingying Li , Wenbin Lu , Rui Song

分类：机器学习

2022-12-25

Latent factor model estimation typically relies on either using domain knowledge to manually pick several observed covariates as factor proxies, or purely conducting multivariate analysis such as principal component analysis. However, the former approach may suffer from the bias while the latter can not incorporate additional information. We propose to bridge these two approaches while allowing the number of factor proxies to diverge, and hence make the latent factor model estimation robust, flexible, and statistically more accurate. As a bonus, the number of factors is also allowed to grow. At the heart of our method is a penalized reduced rank regression to combine information. To further deal with heavy-tailed data, a computationally attractive penalized robust reduced rank regression method is proposed. We establish faster rates of convergence compared with the benchmark. Extensive simulations and real examples are used to illustrate the advantages.

translated by 谷歌翻译

Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples

Meng Chen , Li Lu , Jiadi Yu , Yingying Chen , Zhongjie Ba , Feng Lin , Kui Ren

分类：机器学习

2022-11-10

Faced with the threat of identity leakage during voice data publishing, users are engaged in a privacy-utility dilemma when enjoying convenient voice services. Existing studies employ direct modification or text-based re-synthesis to de-identify users' voices, but resulting in inconsistent audibility in the presence of human participants. In this paper, we propose a voice de-identification system, which uses adversarial examples to balance the privacy and utility of voice services. Instead of typical additive examples inducing perceivable distortions, we design a novel convolutional adversarial example that modulates perturbations into real-world room impulse responses. Benefit from this, our system could preserve user identity from exposure by Automatic Speaker Identification (ASI) while remaining the voice perceptual quality for non-intrusive de-identification. Moreover, our system learns a compact speaker distribution through a conditional variational auto-encoder to sample diverse target embeddings on demand. Combining diverse target generation and input-specific perturbation construction, our system enables any-to-any identify transformation for adaptive de-identification. Experimental results show that our system could achieve 98% and 79% successful de-identification on mainstream ASIs and commercial systems with an objective Mel cepstral distortion of 4.31dB and a subjective mean opinion score of 4.48.

translated by 谷歌翻译

Unified Multi-View Orthonormal Non-Negative Graph Based Clustering Framework

Liangchen Liu , Qiuhong Ke , Chaojie Li , Feiping Nie , Yingying Zhu

分类：计算机视觉

2022-11-03

Spectral clustering is an effective methodology for unsupervised learning. Most traditional spectral clustering algorithms involve a separate two-step procedure and apply the transformed new representations for the final clustering results. Recently, much progress has been made to utilize the non-negative feature property in real-world data and to jointly learn the representation and clustering results. However, to our knowledge, no previous work considers a unified model that incorporates the important multi-view information with those properties, which severely limits the performance of existing methods. In this paper, we formulate a novel clustering model, which exploits the non-negative feature property and, more importantly, incorporates the multi-view information into a unified joint learning framework: the unified multi-view orthonormal non-negative graph based clustering framework (Umv-ONGC). Then, we derive an effective three-stage iterative solution for the proposed model and provide analytic solutions for the three sub-problems from the three stages. We also explore, for the first time, the multi-model non-negative graph-based approach to clustering data based on deep features. Extensive experiments on three benchmark data sets demonstrate the effectiveness of the proposed method.

translated by 谷歌翻译

View-Disentangled Transformer for Brain Lesion Detection

Haofeng Li , Junjia Huang , Guanbin Li , Zhou Liu , Yihong Zhong , Yingying Chen , Yunfei Wang , Xiang Wan

分类：计算机视觉

2022-09-20

深度神经网络（DNN）已在脑病变检测和分割中广泛采用。但是，在2D MRI切片中定位小病变是具有挑战性的，需要在3D上下文聚集的粒度和计算复杂性之间取得平衡。在本文中，我们提出了一种新型的视角变压器，以增强MRI特征的提取，以进行更准确的肿瘤检测。首先，所提出的变压器在3D脑扫描中收获了不同位置之间的远程相关性。其次，变压器将一堆切片功能堆叠为多个2D视图，并增强这些特征的视图，该功能大致以有效的方式实现了3D相关计算。第三，我们将提出的变压器模块部署在变压器主链中，该模块可以有效地检测到脑损伤周围的2D区域。实验结果表明，我们提出的观看式变压器在具有挑战性的大脑MRI数据集上对大脑病变检测表现良好。

translated by 谷歌翻译

RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN

Huy Phan , Cong Shi , Yi Xie , Tianfang Zhang , Zhuohang Li , Tianming Zhao , Jian Liu , Yan Wang , Yingying Chen , Bo Yuan

分类：计算机视觉

2022-08-22

最近，后门攻击已成为对深神经网络（DNN）模型安全性的新兴威胁。迄今为止，大多数现有研究都集中于对未压缩模型的后门攻击。尽管在实际应用中广泛使用的压缩DNN的脆弱性尚未得到利用。在本文中，我们建议研究和发展针对紧凑型DNN模型（RIBAC）的强大和不可感知的后门攻击。通过对重要设计旋钮进行系统分析和探索，我们提出了一个框架，该框架可以有效地学习适当的触发模式，模型参数和修剪口罩。从而同时达到高触发隐形性，高攻击成功率和高模型效率。跨不同数据集的广泛评估，包括针对最先进的防御机制的测试，证明了RIBAC的高鲁棒性，隐身性和模型效率。代码可从https://github.com/huyvnphan/eccv2022-ribac获得

translated by 谷歌翻译

Memory Efficient Temporal & Visual Graph Model for Unsupervised Video Domain Adaptation

Xinyue Hu , Lin Gu , Liangchen Liu , Ruijiang Li , Chang Su , Tatsuya Harada , Yingying Zhu

分类：计算机视觉

2022-08-13

现有的视频域改编（DA）方法需要存储视频帧的所有时间组合或配对源和目标视频，这些视频和目标视频成本昂贵，无法扩展到长时间的视频。为了解决这些局限性，我们建议采用以下记忆高效的基于图形的视频DA方法。首先，我们的方法模型每个源或目标视频通过图：节点表示视频帧和边缘表示帧之间的时间或视觉相似性关系。我们使用图形注意力网络来了解单个帧的重量，并同时将源和目标视频对齐到域不变的图形特征空间中。我们的方法没有存储大量的子视频，而是仅构建一个图形，其中一个视频的图形注意机制，从而大大降低了内存成本。广泛的实验表明，与最先进的方法相比，我们在降低内存成本的同时取得了卓越的性能。

translated by 谷歌翻译

Exploring Resolution and Degradation Clues as Self-supervised Signal for Low Quality Object Detection

Ziteng Cui , Yingying Zhu , Lin Gu , Guo-Jun Qi , Xiaoxiao Li , Renrui Zhang , Zenghui Zhang , Tatsuya Harada

分类：计算机视觉

2022-08-05

图像恢复算法（例如超级分辨率（SR））是低质量图像中对象检测的必不可少的预处理模块。这些算法中的大多数假定降解是固定的，并且已知先验。但是，实际上，实际降解或最佳的上采样率是未知或与假设不同的，导致预处理模块和随之而来的高级任务（例如对象检测）的性能恶化。在这里，我们提出了一个新颖的自我监督框架，以检测低分辨率图像降解的对象。我们利用下采样降解作为一种自我监督信号的一种转换，以探索针对各种分辨率和其他退化条件的模棱两可的表示。自我设计（AERIS）框架中的自动编码分辨率可以进一步利用高级SR体系结构，并使用任意分辨率恢复解码器，以从退化的输入图像中重建原始对应关系。表示学习和对象检测均以端到端的培训方式共同优化。通用AERIS框架可以在具有不同骨架的各种主流对象检测架构上实现。广泛的实验表明，与现有方法相比，我们的方法在面对变化降解情况时取得了卓越的性能。代码将在https://github.com/cuiziteng/eccv_aeris上发布。

translated by 谷歌翻译

Multi-Forgery Detection Challenge 2022: Push the Frontier of Unconstrained and Diverse Forgery Detection

Jianshu Li , Man Luo , Jian Liu , Tao Chen , Chengjie Wang , Ziwei Liu , Shuo Liu , Kewei Yang , Xuning Shao , Kang Chen

分类：计算机视觉

2022-07-27

在本文中，我们提出了与IEEE计算机协会在CVPR 2022上同时与IEEE计算机协会研讨会同时举行的多手术检测挑战。我们的多手术检测挑战旨在检测自动图像操作，包括但不限于图像编辑，图像合成，图像合成，图像，图像，图像，图像合成，图像，图像编辑一代，图像Photoshop等。我们的挑战吸引了来自世界各地的674支团队，约有2000个有效的结果提交数量。我们邀请了前十支球队为挑战提供解决方案，其中三支球队在大结局中获得了奖项。在本文中，我们介绍了前三名团队的解决方案，以增强图像伪造检测领域的研究工作。

translated by 谷歌翻译

Explainable COVID-19 Infections Identification and Delineation Using Calibrated Pseudo Labels

Ming Li , Yingying Fang , Zeyu Tang , Chibudom Onuorah , Jun Xia , Javier Del Ser , Simon Walsh , Guang Yang

分类：计算机视觉 | 机器学习

2022-02-11

在过去的两年中，Covid-19-19的到来引起的动荡继续带来新的挑战。在这次COVID-19大流行期间，需要快速鉴定感染患者和计算机断层扫描（CT）图像中感染区域的特定描述。尽管已迅速建立了深层监督的学习方法，但图像级和像素级标签的稀缺性以及缺乏可解释的透明度仍然阻碍了AI的适用性。我们可以识别受感染的患者并以极端的监督描绘感染吗？半监督的学习表明，在有限的标记数据和足够的未标记数据下，表现出了有希望的表现。受到半监督学习的启发，我们提出了一种模型不可静止的校准伪标记策略，并将其应用于一致性正则化框架下，以生成可解释的识别和描述结果。我们通过有限的标记数据和足够的未标记数据或弱标记数据的组合证明了模型的有效性。广泛的实验表明，我们的模型可以有效利用有限的标记数据，并为临床常规中的决策提供可解释的分类和分割结果。该代码可从https://github.com/ayanglab/xai covid-11获得。

translated by 谷歌翻译

Statistical Learning for Individualized Asset Allocation

Yi Ding , Yingying Li , Rui Song

分类： (统计)机器学习

2022-01-20

我们为个性化资产分配建立了高维统计学习框架。我们提出的方法可以解决具有大量特征的连续行动决策。我们开发了一种离散化方法，以模拟连续动作的效果，并允许离散频率很大，并且与观测值的数量分歧。使用惩罚回归估算连续行动的价值函数，并通过我们提出的广义惩罚对模型系数的线性转换施加。我们表明，我们提出的离散和回归是在效应不连续性（Drove）方法上以广义折叠式惩罚（DROVE）方法具有理想的理论属性，并允许对与最佳决策相关的最佳价值进行统计推断。从经验上讲，提出的框架是通过健康和退休研究数据来寻找个性化最佳资产分配的。结果表明，我们个性化的最佳战略改善了人口财务状况。

translated by 谷歌翻译