智能论文笔记

Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation

Ying Chen , Siwei Qiang , Mingming Ha , Xiaolei Liu , Shaoshuai Li , Lingfeng Yuan , Xiaobo Guo , Zhenfeng Zhu

分类：机器学习 | 人工智能

2022-11-30

In recent years, semi-supervised graph learning with data augmentation (DA) is currently the most commonly used and best-performing method to enhance model robustness in sparse scenarios with few labeled samples. Differing from homogeneous graph, DA in heterogeneous graph has greater challenges: heterogeneity of information requires DA strategies to effectively handle heterogeneous relations, which considers the information contribution of different types of neighbors and edges to the target nodes. Furthermore, over-squashing of information is caused by the negative curvature that formed by the non-uniformity distribution and strong clustering in complex graph. To address these challenges, this paper presents a novel method named Semi-Supervised Heterogeneous Graph Learning with Multi-level Data Augmentation (HG-MDA). For the problem of heterogeneity of information in DA, node and topology augmentation strategies are proposed for the characteristics of heterogeneous graph. And meta-relation-based attention is applied as one of the indexes for selecting augmented nodes and edges. For the problem of over-squashing of information, triangle based edge adding and removing are designed to alleviate the negative curvature and bring the gain of topology. Finally, the loss function consists of the cross-entropy loss for labeled data and the consistency regularization for unlabeled data. In order to effectively fuse the prediction results of various DA strategies, the sharpening is used. Existing experiments on public datasets, i.e., ACM, DBLP, OGB, and industry dataset MB show that HG-MDA outperforms current SOTA models. Additionly, HG-MDA is applied to user identification in internet finance scenarios, helping the business to add 30% key users, and increase loans and balances by 3.6%, 11.1%, and 9.8%.

translated by 谷歌翻译

Ctrl-VIO: Continuous-Time Visual-Inertial Odometry for Rolling Shutter Cameras

Xiaolei Lang , Jiajun Lv , Jianxin Huang , Yukai Ma , Yong Liu , Xingxing Zuo

分类：机器人

2022-08-25

在本文中，我们提出了用于滚动快门摄像机的概率连续时间视觉惯性频道（VIO）。连续的时轨迹公式自然促进异步高频IMU数据和运动延伸的滚动快门图像的融合。为了防止棘手的计算负载，提出的VIO是滑动窗口和基于密钥帧的。我们建议概率地将控制点边缘化，以保持滑动窗口中恒定的密钥帧数。此外，可以在我们的连续时间VIO中在线校准滚动快门相机的线曝光时间差（线延迟）。为了广泛检查我们的连续时间VIO的性能，对公共可用的WHU-RSVI，TUM-RSVI和Sensetime-RSVI Rolling快门数据集进行了实验。结果表明，提出的连续时间VIO显着优于现有的最新VIO方法。本文的代码库也将通过\ url {https://github.com/april-zju/ctrl-vio}开源。

translated by 谷歌翻译

End-to-end Graph-constrained Vectorized Floorplan Generation with Panoptic Refinement

Jiachen Liu , Yuan Xue , Jose Duarte , Krishnendra Shekhawat , Zihan Zhou , Xiaolei Huang

分类：计算机视觉

2022-07-27

给定的用户输入的自动生成平面图在建筑设计中具有很大的潜力，最近在计算机视觉社区中探索了。但是，大多数现有方法以栅格化图像格式合成平面图，这些图像很难编辑或自定义。在本文中，我们旨在将平面图合成为1-D向量的序列，从而简化用户的互动和设计自定义。为了产生高保真矢量化的平面图，我们提出了一个新颖的两阶段框架，包括草稿阶段和多轮精炼阶段。在第一阶段，我们使用图形卷积网络（GCN）编码用户的房间连接图输入，然后应用自回归变压器网络以生成初始平面图。为了抛光最初的设计并生成更具视觉吸引力的平面图，我们进一步提出了一个由GCN和变压器网络组成的新颖的全景精炼网络（PRN）。 PRN将初始生成的序列作为输入，并完善了平面图设计，同时鼓励我们提出的几何损失来鼓励正确的房间连接。我们已经对现实世界平面图数据集进行了广泛的实验，结果表明，我们的方法在不同的设置和评估指标下实现了最先进的性能。

translated by 谷歌翻译

A Multi-attribute Controllable Generative Model for Histopathology Image Synthesis

Jiarong Ye , Yuan Xue , Peter Liu , Richard Zaino , Keith Cheng , Xiaolei Huang

分类：计算机视觉

2021-11-10

已经在医学成像结构域中应用了生成模型，用于各种图像识别和综合任务。然而，对于诸如协助医学训练的重要应用，仍然需要更可控和可解释的图像合成模型。在这项工作中，我们利用了有效的自我关注和对比学习模块，并在最先进的生成的对抗网络（GAN）上建立，以实现一个属性感知的图像综合模型，称为attributegan，它可以产生高质量基于多属性输入的组织病理学图像。与现有的单个属性条件生成模型相比，我们提出的模型更好地反映了输入属性，并实现了属性值之间的更平滑的插值。我们对尿液癌的染色H＆E图像的组织病理学数据集进行实验，并通过与最先进的模型以及我们模型的不同变体来展示我们提出的模型的有效性。代码可在https://github.com/karenyyy/miccai2021AttribUtegan获得。

translated by 谷歌翻译

Aligning Visual and Lexical Semantics

Fausto Giunchiglia , Mayukh Bagchi , Xiaolei Diao

分类：计算机视觉 | 人工智能

2022-12-13

We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on how humans build concepts of the same target reality through the use of language. The lack of coincidence between visual and lexical semantics, in turn, has a major impact on CV systems in the form of the Semantic Gap Problem (SGP). The paper, while extensively exemplifying the lack of coincidence as above, introduces a general, domain-agnostic methodology to enforce alignment between visual and lexical semantics.

translated by 谷歌翻译

Forecasting User Interests Through Topic Tag Predictions in Online Health Communities

Amogh Subbakrishna Adishesha , Lily Jakielaszek , Fariha Azhar , Peixuan Zhang , Vasant Honavar , Fenglong Ma , Chandra Belani , Prasenjit Mitra , Sharon Xiaolei Huang

分类：机器学习

2022-11-05

The increasing reliance on online communities for healthcare information by patients and caregivers has led to the increase in the spread of misinformation, or subjective, anecdotal and inaccurate or non-specific recommendations, which, if acted on, could cause serious harm to the patients. Hence, there is an urgent need to connect users with accurate and tailored health information in a timely manner to prevent such harm. This paper proposes an innovative approach to suggesting reliable information to participants in online communities as they move through different stages in their disease or treatment. We hypothesize that patients with similar histories of disease progression or course of treatment would have similar information needs at comparable stages. Specifically, we pose the problem of predicting topic tags or keywords that describe the future information needs of users based on their profiles, traces of their online interactions within the community (past posts, replies) and the profiles and traces of online interactions of other users with similar profiles and similar traces of past interaction with the target users. The result is a variant of the collaborative information filtering or recommendation system tailored to the needs of users of online health communities. We report results of our experiments on an expert curated data set which demonstrate the superiority of the proposed approach over the state of the art baselines with respect to accurate and timely prediction of topic tags (and hence information sources of interest).

translated by 谷歌翻译

Modeling sequential annotations for sequence labeling with crowds

Xiaolei Lu , Tommy W. S. Chow

分类：自然语言处理 | 机器学习

2022-09-20

人群顺序注释可能是一种有效且具有成本效益的方式，用于构建用于序列标签的大型数据集。不同于标记独立实例，对于人群顺序注释，标签序列的质量取决于注释者在捕获序列中每个令牌的内部依赖性方面的专业知识水平。在本文中，我们提出了与人群（SA-SLC）进行序列标记的序列注释。首先，开发了有条件的概率模型，以共同模拟顺序数据和注释者的专业知识，其中引入分类分布以估计每个注释者在捕获局部和非本地标签依赖性以进行顺序注释时的可靠性。为了加速所提出模型的边缘化，提出了有效的标签序列推理（VLSE）方法，以从人群顺序注释中得出有效的地面真相标签序列。 VLSE从令牌级别中得出了可能的地面真相标签，并在标签序列解码的正向推断中进一步介绍了李子标签。 VLSE减少了候选标签序列的数量，并提高了可能的地面真实标签序列的质量。自然语言处理的几个序列标记任务的实验结果显示了所提出的模型的有效性。

translated by 谷歌翻译

Weak Disambiguation for Partial Structured Output Learning

Xiaolei Lu , Tommy W. S. Chow

分类：自然语言处理 | 机器学习

2022-09-20

局部结构化输出学习的现有歧义策略不能很好地概括地解决有些候选人可能是假阳性或与地面真相标签相似的问题。在本文中，我们提出了针对部分结构化输出学习（WD-PSL）的新型弱歧义。首先，分段较大的边距公式被推广到部分结构化输出学习，该学习有效地避免处理大量的复杂结构候选结构化输出。其次，在拟议的弱歧义策略中，每个候选标签都具有一个置信值，表明其真实标签的可能性是多大的，该标签旨在减少学习过程中错误地面真相标签分配的负面影响。然后配制了两个大边缘，以结合两种类型的约束，这是候选人和非候选者之间的歧义，以及候选人的弱歧义。在交替优化的框架中，开发了一种新的2N-SLACK变量切割平面算法，以加速每种优化的迭代。自然语言处理的几个序列标记任务的实验结果显示了所提出的模型的有效性。

translated by 谷歌翻译

Partial sequence labeling with structured Gaussian Processes

Xiaolei Lu , Tommy W. S. Chow

分类：机器学习

2022-09-20

现有的部分序列标记模型主要集中在最大边缘框架上，该框架未能提供对预测的不确定性估计。此外，这些模型采用的独特地面真理歧义策略可能包括用于参数学习的错误标签信息。在本文中，我们提出了部分序列标签（SGPPSL）的结构化高斯过程，该过程编码了预测中的不确定性，并且不需要额外的努力来选择模型选择和超参数学习。该模型采用因子式近似，将线性链图结构划分为一组，从而保留了基本的马尔可夫随机场结构，并有效地避免处理由部分注释数据生成的大量候选输出序列。然后在模型中引入了置信度度量，以解决候选标签的不同贡献，这使得能够在参数学习中使用地面真相标签信息。基于所提出模型的变异下限的派生下限，在交替优化的框架中估计了变分参数和置信度度量。此外，提出了加权viterbi算法将置信度度量纳入序列预测，该预测考虑了训练数据中的多个注释，从而考虑了标签歧义，从而有助于提高性能。 SGPPSL在几个序列标记任务上进行了评估，实验结果显示了所提出的模型的有效性。

translated by 谷歌翻译

A Supervised Tensor Dimension Reduction-Based Prognostics Model for Applications with Incomplete Imaging Data

Chengyu Zhou , Xiaolei Fang

分类：机器学习 | (统计)机器学习

2022-07-22

本文提出了一种张量数据的监督尺寸减小方法，该方法比大多数基于图像的预后模型具有两个优点。首先，该模型不需要张量数据完成，从而将其应用程序扩展到不完整的数据。其次，它利用时间出现（TTF）来监督低维特征的提取，这使得提取的特征对后续预后更有效。此外，提出了一种优化算法以进行参数估计，并在某些分布中得出了封闭形式的溶液。

translated by 谷歌翻译