智能论文笔记

PhysiNet: A Combination of Physics-based Model and Neural Network Model for Digital Twins

Chao Sun , Victor Guang Shi

分类：机器学习

2021-06-28

作为物理系统或过程的实时数字对应物，用于系统仿真和优化的数字双胞胎。神经网络是通过使用数据构建数字双胞胎模型的一种方法，尤其是当基于物理的模型不准确甚至不可用时，尤其是当基于物理的模型时。但是，对于新设计的系统，需要累积足够的神经网络模型的数据需要时间，并且只有近似的基于物理的模型。为了利用两种模型，本文提出了一种模型，它结合了基于物理的模型和神经网络模型，以提高系统的整个生命周期的预测精度。所提出的混合模型（Physeinet）能够自动结合模型并提高其预测性能。实验表明，物理体既优于基于物理的模型和神经网络模型。

translated by 谷歌翻译

AdaFocusV3: On Unified Spatial-temporal Dynamic Video Recognition

Yulin Wang , Yang Yue , Xinhong Xu , Ali Hassani , Victor Kulikov , Nikita Orlov , Shiji Song , Humphrey Shi , Gao Huang

分类：计算机视觉 | 人工智能 | 机器学习

2022-09-27

最近的研究表明，减少时间和空间冗余都是有效的视频识别方法的有效方法，例如，将大多数计算分配给与任务相关的框架或每个帧中最有价值的图像区域。但是，在大多数现有的作品中，任何一种类型的冗余通常都是用另一个缺失建模的。本文探讨了在最近提出的ADAFOCUSV2算法之上的时空动态计算的统一配方，从而有助于改进的ADAFOCUSV3框架。我们的方法仅在一些小但有益的3D视频立方体上激活昂贵的高容量网络来降低计算成本。这些立方体是从框架高度，宽度和视频持续时间形成的空间中裁剪的，而它们的位置则以每样本样本为基础的轻加权政策网络自适应地确定。在测试时间，与每个视频相对应的立方体的数量是动态配置的，即，对视频立方体进行顺序处理，直到产生足够可靠的预测为止。值得注意的是，可以通过近似可插入深度特征的插值来有效地训练adafocusv3。六个基准数据集（即ActivityNet，FCVID，Mini-Kinetics，Something Something V1＆V2和潜水48）上的广泛经验结果表明，我们的模型比竞争性基线要高得多。

translated by 谷歌翻译

Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

Masatoshi Uehara , Haruka Kiyohara , Andrew Bennett , Victor Chernozhukov , Nan Jiang , Nathan Kallus , Chengchun Shi , Wen Sun

分类：机器学习 | (统计)机器学习

2022-07-26

我们研究了具有一般函数近似的部分可观察的MDP（POMDP）的外部评估（OPE）。现有的方法，例如顺序重要性采样估计器和拟合-Q评估，受POMDP中的地平线的诅咒。为了解决这个问题，我们通过引入将未来代理作为输入的未来依赖性值函数来开发一种新颖的无模型OPE方法。未来依赖性的价值函数在完全可观察的MDP中起着与经典价值函数相似的角色。我们为未来依赖性价值作为条件矩方程提供了一个新的Bellman方程，将历史记录代理用作仪器变量。我们进一步提出了一种最小值学习方法，以使用新的Bellman方程来学习未来依赖的价值函数。我们获得PAC结果，这意味着我们的OPE估计器是一致的，只要期货和历史包含有关潜在状态和Bellman完整性的足够信息。最后，我们将方法扩展到学习动力学，并在POMDP中建立我们的方法与众所周知的光谱学习方法之间的联系。

translated by 谷歌翻译

Cancer Subtyping by Improved Transcriptomic Features Using Vector Quantized Variational Autoencoder

Zheng Chen , Ziwei Yang , Lingwei Zhu , Guang Shi , Kun Yue , Takashi Matsubara , Shigehiko Kanaya , MD Altaf-Ul-Amin

分类：机器学习 | 人工智能

2022-07-20

定义和分离癌症亚型对于促进个性化治疗方式和患者预后至关重要。由于我们深入了解，子类型的定义一直在经常重新校准。在此重新校准期间，研究人员通常依靠癌症数据的聚类来提供直观的视觉参考，以揭示亚型的内在特征。聚集的数据通常是OMICS数据，例如与基本生物学机制有很强相关性的转录组学。但是，尽管现有的研究显示出令人鼓舞的结果，但它们却遭受了与OMICS数据相关的问题：样本稀缺性和高维度。因此，现有方法通常会施加不切实际的假设来从数据中提取有用的特征，同时避免过度拟合虚假相关性。在本文中，我们建议利用最近的强生成模型量化量化自动编码器（VQ-VAE），以解决数据问题并提取信息的潜在特征，这些特征对于后续聚类的质量至关重要，仅保留与重建有关的信息相关的信息输入。 VQ-VAE不会施加严格的假设，因此其潜在特征是输入的更好表示，能够使用任何主流群集方法产生出色的聚类性能。在包括10种不同癌症的多个数据集上进行的广泛实验和医学分析表明，VQ-VAE聚类结果可以显着，稳健地改善对普遍的亚型系统的预后。

translated by 谷歌翻译

ACMP: Allen-Cahn Message Passing for Graph Neural Networks with Particle Phase Transition

Yuelin Wang , Kai Yi , Xinliang Liu , Yu Guang Wang , Shi Jin

分类：机器学习 | 人工智能

2022-06-11

神经消息传递是用于图形结构数据的基本功能提取单元，它考虑了相邻节点特征在网络传播中从一层到另一层的影响。我们通过相互作用的粒子系统与具有吸引力和排斥力的相互作用粒子系统以及在相变建模中产生的艾伦 - 卡恩力进行建模。该系统是一个反应扩散过程，可以将颗粒分离为不同的簇。这会导致图形神经网络的艾伦 - 卡恩消息传递（ACMP），其中解决方案的数值迭代构成了消息传播。 ACMP背后的机制是颗粒的相变，该颗粒能够形成多群集，从而实现GNNS预测进行节点分类。 ACMP可以将网络深度推向数百个层，理论上证明了严格的dirichlet能量下限。因此，它提供了GNN的深层模型，该模型避免了GNN过度厚度的常见问题。具有高均匀难度的各种实际节点分类数据集的实验表明，具有ACMP的GNN可以实现最先进的性能，而不会衰减Dirichlet Energy。

translated by 谷歌翻译

Alternately Optimized Graph Neural Networks

Haoyu Han , Xiaorui Liu , Torkamani Ali , Feng Shi , Victor Lee , Jiliang Tang

分类：机器学习

2022-06-08

图形神经网络（GNN）在许多基于图的任务中表现出强大的表示能力。具体而言，由于其简单性和性能优势，GNN（例如APPNP）的解耦结构变得流行。但是，这些GNN的端到端培训使它们在计算和记忆消耗方面效率低下。为了应对这些局限性，在这项工作中，我们为图形神经网络提供了交替的优化框架，不需要端到端培训。在不同设置下进行的广泛实验表明，所提出的算法的性能与现有的最新算法相当，但具有更好的计算和记忆效率。此外，我们表明我们的框架可以利用优势来增强现有的脱钩GNN。

translated by 谷歌翻译

AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

Yulin Wang , Yang Yue , Yuanze Lin , Haojun Jiang , Zihang Lai , Victor Kulikov , Nikita Orlov , Humphrey Shi , Gao Huang

分类：计算机视觉 | 人工智能 | 机器学习

2021-12-28

最近的作品表明，通过降低空间冗余，可以显着提高视频识别的计算效率。作为代表性的工作，自适应焦点方法（Adafocus）通过动态识别和参加每个视频帧中的信息区域来实现精度和推理速度之间的有利权衡。然而，除非领需要一个复杂的三阶段训练管道（涉及强化学习），导致收敛缓慢，对从业者不友好。这项工作通过引入基于分配的内插的补丁选择操作来重新重新培训ADAFOCUS作为简单的单级算法，实现有效的端到端优化。我们进一步提出了一种改进的培训计划，以解决一级制定的问题，包括缺乏监督，投入多样性和培训稳定性。此外，提出了一种条件 - 退出技术，用于在没有额外训练的情况下在Adafocus的顶部执行时间自适应计算。在六个基准数据集（即，ActivityNet，FCVID，Mini-Kinetics，Something-V1＆V2和Jesters）上进行了广泛的实验表明，我们的模型显着优于原始的Adafocus和其他竞争基础，同时培训更简单和有效。代码可在https://github.com/leaplabthu/adafocusv2获得。

translated by 谷歌翻译

MGTAB: A Multi-Relational Graph-Based Twitter Account Detection Benchmark

Shuhao Shi , Kai Qiao , Jian Chen , Shuai Yang , Jie Yang , Baojie Song , Linyuan Wang , Bin Yan

分类：计算机视觉

2023-01-03

The development of social media user stance detection and bot detection methods rely heavily on large-scale and high-quality benchmarks. However, in addition to low annotation quality, existing benchmarks generally have incomplete user relationships, suppressing graph-based account detection research. To address these issues, we propose a Multi-Relational Graph-Based Twitter Account Detection Benchmark (MGTAB), the first standardized graph-based benchmark for account detection. To our knowledge, MGTAB was built based on the largest original data in the field, with over 1.55 million users and 130 million tweets. MGTAB contains 10,199 expert-annotated users and 7 types of relationships, ensuring high-quality annotation and diversified relations. In MGTAB, we extracted the 20 user property features with the greatest information gain and user tweet features as the user features. In addition, we performed a thorough evaluation of MGTAB and other public datasets. Our experiments found that graph-based approaches are generally more effective than feature-based approaches and perform better when introducing multiple relations. By analyzing experiment results, we identify effective approaches for account detection and provide potential future research directions in this field. Our benchmark and standardized evaluation procedures are freely available at: https://github.com/GraphDetec/MGTAB.

translated by 谷歌翻译

OccluMix: Towards De-Occlusion Virtual Try-on by Semantically-Guided Mixup

Zhijing Yang , Junyang Chen , Yukai Shi , Hao Li , Tianshui Chen , Liang Lin

分类：计算机视觉

2023-01-03

Image Virtual try-on aims at replacing the cloth on a personal image with a garment image (in-shop clothes), which has attracted increasing attention from the multimedia and computer vision communities. Prior methods successfully preserve the character of clothing images, however, occlusion remains a pernicious effect for realistic virtual try-on. In this work, we first present a comprehensive analysis of the occlusions and categorize them into two aspects: i) Inherent-Occlusion: the ghost of the former cloth still exists in the try-on image; ii) Acquired-Occlusion: the target cloth warps to the unreasonable body part. Based on the in-depth analysis, we find that the occlusions can be simulated by a novel semantically-guided mixup module, which can generate semantic-specific occluded images that work together with the try-on images to facilitate training a de-occlusion try-on (DOC-VTON) framework. Specifically, DOC-VTON first conducts a sharpened semantic parsing on the try-on person. Aided by semantics guidance and pose prior, various complexities of texture are selectively blending with human parts in a copy-and-paste manner. Then, the Generative Module (GM) is utilized to take charge of synthesizing the final try-on image and learning to de-occlusion jointly. In comparison to the state-of-the-art methods, DOC-VTON achieves better perceptual quality by reducing occlusion effects.

translated by 谷歌翻译

Deep Spectral Q-learning with Application to Mobile Health

Yuhe Gao , Chengchun Shi , Rui Song

分类： (统计)机器学习 | 机器学习

2023-01-03

Dynamic treatment regimes assign personalized treatments to patients sequentially over time based on their baseline information and time-varying covariates. In mobile health applications, these covariates are typically collected at different frequencies over a long time horizon. In this paper, we propose a deep spectral Q-learning algorithm, which integrates principal component analysis (PCA) with deep Q-learning to handle the mixed frequency data. In theory, we prove that the mean return under the estimated optimal policy converges to that under the optimal one and establish its rate of convergence. The usefulness of our proposal is further illustrated via simulations and an application to a diabetes dataset.

translated by 谷歌翻译