智能论文笔记

Consensus Knowledge Graph Learning via Multi-view Sparse Low Rank Block Model

Tianxi Cai , Dong Xia , Luwan Zhang , Doudou Zhou

分类： (统计)机器学习 | 机器学习

2022-09-28

网络分析一直是揭示大量对象之间关系和交互的强大工具。然而，它在准确识别重要节点节点相互作用的有效性受到快速增长的网络规模的挑战，数据以空前的粒度和规模收集。克服这种高维度的共同智慧是将节点崩溃成较小的群体，并在小组级别进行连通性分析。将努力分为两个阶段不可避免地打开了一致性的差距，并降低了效率。共识学习是通用知识发现的新常态，并具有多个可用的数据源。为此，本文以组合多个数据源来开发同时分组和连接分析的统一框架。该算法还保证了统计上最佳的估计器。

translated by 谷歌翻译

Federated Offline Reinforcement Learning

Doudou Zhou , Yufeng Zhang , Aaron Sonabend-W , Zhaoran Wang , Junwei Lu , Tianxi Cai

分类： (统计)机器学习 | 机器学习

2022-06-11

基于循证或数据驱动的动态治疗方案对于个性化医学至关重要，这可以受益于离线增强学习（RL）。尽管可以在医疗机构之间获得大量医疗保健数据，但由于隐私限制，它们被禁止共享。此外，异质性存在于不同的站点。结果，需要联合的离线RL算法，并且有望解决这些问题。在本文中，我们提出了一个多站点的马尔可夫决策过程模型，该模型允许跨站点的均质和异质效应。提出的模型使对站点级特征的分析成为可能。我们设计了具有样本复杂性的离线RL的第一个联合政策优化算法。所提出的算法是通信效率和隐私性的，它仅需要通过交换摘要统计信息进行一轮通信交互。我们为所提出的算法提供理论保证，而没有足够的动作覆盖率，在这种情况下，学到的策略的次优率与速率相当，就好像数据没有分布一样。广泛的模拟证明了拟议算法的有效性。该方法应用于多个站点中的败血症数据集，以说明其在临床环境中的使用。

translated by 谷歌翻译

Multimodal Learning on Graphs for Disease Relation Extraction

Yucong Lin , Keming Lu , Sheng Yu , Tianxi Cai , Marinka Zitnik

分类：机器学习 | 人工智能 | 自然语言处理

2022-03-16

目的：疾病知识图是一种连接，组织和访问有关疾病的不同信息的方式，对人工智能（AI）有很多好处。为了创建知识图，有必要以疾病概念之间的关系形式从多模式数据集中提取知识，并使概念和关系类型正常化。方法：我们介绍了Remap，这是一种多式模式提取和分类的方法。重新启动机器学习方法将部分不完整的知识图和医学语言数据集嵌入紧凑的潜在矢量空间中，然后将多模式嵌入以进行最佳疾病关系提取。结果：我们将重新映射方法应用于具有96,913个关系的疾病知识图和124万个句子的文本数据集。在由人类专家注释的数据集中，Remap通过将疾病知识图与文本信息融合，将基于文本的疾病关系提取提高了10.0％（准确性）和17.2％（F1分数）。此外，重建利用文本信息以推荐知识图中的新关系，优于基于图的方法，高于8.4％（准确性）和10.4％（F1得分）。结论：重塑是通过融合结构化知识和文本信息来提取和分类疾病关系的多模式方法。重映提供了灵活的神经体系结构，可轻松找到，访问和验证疾病概念之间的AI驱动关系。

translated by 谷歌翻译

Rethinking Rotation Invariance with Point Cloud Registration

Jianhui Yu , Chaoyi Zhang , Weidong Cai

分类：计算机视觉

2022-12-31

Recent investigations on rotation invariance for 3D point clouds have been devoted to devising rotation-invariant feature descriptors or learning canonical spaces where objects are semantically aligned. Examinations of learning frameworks for invariance have seldom been looked into. In this work, we review rotation invariance in terms of point cloud registration and propose an effective framework for rotation invariance learning via three sequential stages, namely rotation-invariant shape encoding, aligned feature integration, and deep feature registration. We first encode shape descriptors constructed with respect to reference frames defined over different scales, e.g., local patches and global topology, to generate rotation-invariant latent shape codes. Within the integration stage, we propose Aligned Integration Transformer to produce a discriminative feature representation by integrating point-wise self- and cross-relations established within the shape codes. Meanwhile, we adopt rigid transformations between reference frames to align the shape codes for feature consistency across different scales. Finally, the deep integrated feature is registered to both rotation-invariant shape codes to maximize feature similarities, such that rotation invariance of the integrated feature is preserved and shared semantic information is implicitly extracted from shape codes. Experimental results on 3D shape classification, part segmentation, and retrieval tasks prove the feasibility of our work. Our project page is released at: https://rotation3d.github.io/.

translated by 谷歌翻译

An Analysis of Attention via the Lens of Exchangeability and Latent Variable Models

Yufeng Zhang , Boyi Liu , Qi Cai , Lingxiao Wang , Zhaoran Wang

分类：机器学习

2022-12-30

With the attention mechanism, transformers achieve significant empirical successes. Despite the intuitive understanding that transformers perform relational inference over long sequences to produce desirable representations, we lack a rigorous theory on how the attention mechanism achieves it. In particular, several intriguing questions remain open: (a) What makes a desirable representation? (b) How does the attention mechanism infer the desirable representation within the forward pass? (c) How does a pretraining procedure learn to infer the desirable representation through the backward pass? We observe that, as is the case in BERT and ViT, input tokens are often exchangeable since they already include positional encodings. The notion of exchangeability induces a latent variable model that is invariant to input sizes, which enables our theoretical analysis. - To answer (a) on representation, we establish the existence of a sufficient and minimal representation of input tokens. In particular, such a representation instantiates the posterior distribution of the latent variable given input tokens, which plays a central role in predicting output labels and solving downstream tasks. - To answer (b) on inference, we prove that attention with the desired parameter infers the latent posterior up to an approximation error, which is decreasing in input sizes. In detail, we quantify how attention approximates the conditional mean of the value given the key, which characterizes how it performs relational inference over long sequences. - To answer (c) on learning, we prove that both supervised and self-supervised objectives allow empirical risk minimization to learn the desired parameter up to a generalization error, which is independent of input sizes. Particularly, in the self-supervised setting, we identify a condition number that is pivotal to solving downstream tasks.

translated by 谷歌翻译

Heterogeneous Synthetic Learner for Panel Data

Ye Shen , Runzhe Wan , Hengrui Cai , Rui Song

分类： (统计)机器学习 | 机器学习

2022-12-30

In the new era of personalization, learning the heterogeneous treatment effect (HTE) becomes an inevitable trend with numerous applications. Yet, most existing HTE estimation methods focus on independently and identically distributed observations and cannot handle the non-stationarity and temporal dependency in the common panel data setting. The treatment evaluators developed for panel data, on the other hand, typically ignore the individualized information. To fill the gap, in this paper, we initialize the study of HTE estimation in panel data. Under different assumptions for HTE identifiability, we propose the corresponding heterogeneous one-side and two-side synthetic learner, namely H1SL and H2SL, by leveraging the state-of-the-art HTE estimator for non-panel data and generalizing the synthetic control method that allows flexible data generating process. We establish the convergence rates of the proposed estimators. The superior performance of the proposed methods over existing ones is demonstrated by extensive numerical studies.

translated by 谷歌翻译

Feature Selection Approaches for Optimising Music Emotion Recognition Methods

Le Cai , Sam Ferguson , Haiyan Lu , Gengfa Fang

分类：机器学习

2022-12-27

The high feature dimensionality is a challenge in music emotion recognition. There is no common consensus on a relation between audio features and emotion. The MER system uses all available features to recognize emotion; however, this is not an optimal solution since it contains irrelevant data acting as noise. In this paper, we introduce a feature selection approach to eliminate redundant features for MER. We created a Selected Feature Set (SFS) based on the feature selection algorithm (FSA) and benchmarked it by training with two models, Support Vector Regression (SVR) and Random Forest (RF) and comparing them against with using the Complete Feature Set (CFS). The result indicates that the performance of MER has improved for both Random Forest (RF) and Support Vector Regression (SVR) models by using SFS. We found using FSA can improve performance in all scenarios, and it has potential benefits for model efficiency and stability for MER task.

translated by 谷歌翻译

Data-Driven Linear Complexity Low-Rank Approximation of General Kernel Matrices: A Geometric Approach

Difeng Cai , Edmond Chow , Yuanzhe Xi

分类：机器学习

2022-12-24

A general, {\em rectangular} kernel matrix may be defined as $K_{ij} = \kappa(x_i,y_j)$ where $\kappa(x,y)$ is a kernel function and where $X=\{x_i\}_{i=1}^m$ and $Y=\{y_i\}_{i=1}^n$ are two sets of points. In this paper, we seek a low-rank approximation to a kernel matrix where the sets of points $X$ and $Y$ are large and are not well-separated (e.g., the points in $X$ and $Y$ may be ``intermingled''). Such rectangular kernel matrices may arise, for example, in Gaussian process regression where $X$ corresponds to the training data and $Y$ corresponds to the test data. In this case, the points are often high-dimensional. Since the point sets are large, we must exploit the fact that the matrix arises from a kernel function, and avoid forming the matrix, and thus ruling out most algebraic techniques. In particular, we seek methods that can scale linearly, i.e., with computational complexity $O(m)$ or $O(n)$ for a fixed accuracy or rank. The main idea in this paper is to {\em geometrically} select appropriate subsets of points to construct a low rank approximation. An analysis in this paper guides how this selection should be performed.

translated by 谷歌翻译

Reversible Column Networks

Yuxuan Cai , Yizhuang Zhou , Qi Han , Jianjian Sun , Xiangwen Kong , Jun Li , Xiangyu Zhang

分类：计算机视觉

2022-12-22

We propose a new neural network design paradigm Reversible Column Network (RevCol). The main body of RevCol is composed of multiple copies of subnetworks, named columns respectively, between which multi-level reversible connections are employed. Such architectural scheme attributes RevCol very different behavior from conventional networks: during forward propagation, features in RevCol are learned to be gradually disentangled when passing through each column, whose total information is maintained rather than compressed or discarded as other network does. Our experiments suggest that CNN-style RevCol models can achieve very competitive performances on multiple computer vision tasks such as image classification, object detection and semantic segmentation, especially with large parameter budget and large dataset. For example, after ImageNet-22K pre-training, RevCol-XL obtains 88.2% ImageNet-1K accuracy. Given more pre-training data, our largest model RevCol-H reaches 90.0% on ImageNet-1K, 63.8% APbox on COCO detection minival set, 61.0% mIoU on ADE20k segmentation. To our knowledge, it is the best COCO detection and ADE20k segmentation result among pure (static) CNN models. Moreover, as a general macro architecture fashion, RevCol can also be introduced into transformers or other neural networks, which is demonstrated to improve the performances in both computer vision and NLP tasks. We release code and models at https://github.com/megvii-research/RevCol

translated by 谷歌翻译

Trajectory Generation and Tracking Control for Aggressive Tail-Sitter Flights

Guozheng Lu , Yixi Cai , Nan Chen , Fanze Kong , Yunfan Ren , Fu Zhang

分类：机器人

2022-12-22

We address the theoretical and practical problems related to the trajectory generation and tracking control of tail-sitter UAVs. Theoretically, we focus on the differential flatness property with full exploitation of actual UAV aerodynamic models, which lays a foundation for generating dynamically feasible trajectory and achieving high-performance tracking control. We have found that a tail-sitter is differentially flat with accurate aerodynamic models within the entire flight envelope, by specifying coordinate flight condition and choosing the vehicle position as the flat output. This fundamental property allows us to fully exploit the high-fidelity aerodynamic models in the trajectory planning and tracking control to achieve accurate tail-sitter flights. Particularly, an optimization-based trajectory planner for tail-sitters is proposed to design high-quality, smooth trajectories with consideration of kinodynamic constraints, singularity-free constraints and actuator saturation. The planned trajectory of flat output is transformed to state trajectory in real-time with consideration of wind in environments. To track the state trajectory, a global, singularity-free, and minimally-parameterized on-manifold MPC is developed, which fully leverages the accurate aerodynamic model to achieve high-accuracy trajectory tracking within the whole flight envelope. The effectiveness of the proposed framework is demonstrated through extensive real-world experiments in both indoor and outdoor field tests, including agile SE(3) flight through consecutive narrow windows requiring specific attitude and with speed up to 10m/s, typical tail-sitter maneuvers (transition, level flight and loiter) with speed up to 20m/s, and extremely aggressive aerobatic maneuvers (Wingover, Loop, Vertical Eight and Cuban Eight) with acceleration up to 2.5g.

translated by 谷歌翻译