智能论文笔记

Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain

Chengzhi Zhang , Yi Xiang , Wenke Hao , Zhicheng Li , Yuchen Qian , Yuzhuo Wang

分类：自然语言处理

2022-12-28

Future work sentences (FWS) are the particular sentences in academic papers that contain the author's description of their proposed follow-up research direction. This paper presents methods to automatically extract FWS from academic papers and classify them according to the different future directions embodied in the paper's content. FWS recognition methods will enable subsequent researchers to locate future work sentences more accurately and quickly and reduce the time and cost of acquiring the corpus. The current work on automatic identification of future work sentences is relatively small, and the existing research cannot accurately identify FWS from academic papers, and thus cannot conduct data mining on a large scale. Furthermore, there are many aspects to the content of future work, and the subdivision of the content is conducive to the analysis of specific development directions. In this paper, Nature Language Processing (NLP) is used as a case study, and FWS are extracted from academic papers and classified into different types. We manually build an annotated corpus with six different types of FWS. Then, automatic recognition and classification of FWS are implemented using machine learning models, and the performance of these models is compared based on the evaluation metrics. The results show that the Bernoulli Bayesian model has the best performance in the automatic recognition task, with the Macro F1 reaching 90.73%, and the SCIBERT model has the best performance in the automatic classification task, with the weighted average F1 reaching 72.63%. Finally, we extract keywords from FWS and gain a deep understanding of the key content described in FWS, and we also demonstrate that content determination in FWS will be reflected in the subsequent research work by measuring the similarity between future work sentences and the abstracts.

translated by 谷歌翻译

CAT: Learning to Collaborate Channel and Spatial Attention from Multi-Information Fusion

Zizhang Wu , Man Wang , Weiwei Sun , Yuchen Li , Tianhao Xu , Fan Wang , Keke Huang

分类：计算机视觉

2022-12-13

Channel and spatial attention mechanism has proven to provide an evident performance boost of deep convolution neural networks (CNNs). Most existing methods focus on one or run them parallel (series), neglecting the collaboration between the two attentions. In order to better establish the feature interaction between the two types of attention, we propose a plug-and-play attention module, which we term "CAT"-activating the Collaboration between spatial and channel Attentions based on learned Traits. Specifically, we represent traits as trainable coefficients (i.e., colla-factors) to adaptively combine contributions of different attention modules to fit different image hierarchies and tasks better. Moreover, we propose the global entropy pooling (GEP) apart from global average pooling (GAP) and global maximum pooling (GMP) operators, an effective component in suppressing noise signals by measuring the information disorder of feature maps. We introduce a three-way pooling operation into attention modules and apply the adaptive mechanism to fuse their outcomes. Extensive experiments on MS COCO, Pascal-VOC, Cifar-100, and ImageNet show that our CAT outperforms existing state-of-the-art attention mechanisms in object detection, instance segmentation, and image classification. The model and code will be released soon.

translated by 谷歌翻译

Improving Training and Inference of Face Recognition Models via Random Temperature Scaling

Lei Shang , Mouxiao Huang , Wu Shi , Yuchen Liu , Yang Liu , Fei Wang , Baigui Sun , Xuansong Xie , Yu Qiao

分类：计算机视觉 | 人工智能

2022-12-02

Data uncertainty is commonly observed in the images for face recognition (FR). However, deep learning algorithms often make predictions with high confidence even for uncertain or irrelevant inputs. Intuitively, FR algorithms can benefit from both the estimation of uncertainty and the detection of out-of-distribution (OOD) samples. Taking a probabilistic view of the current classification model, the temperature scalar is exactly the scale of uncertainty noise implicitly added in the softmax function. Meanwhile, the uncertainty of images in a dataset should follow a prior distribution. Based on the observation, a unified framework for uncertainty modeling and FR, Random Temperature Scaling (RTS), is proposed to learn a reliable FR algorithm. The benefits of RTS are two-fold. (1) In the training phase, it can adjust the learning strength of clean and noisy samples for stability and accuracy. (2) In the test phase, it can provide a score of confidence to detect uncertain, low-quality and even OOD samples, without training on extra labels. Extensive experiments on FR benchmarks demonstrate that the magnitude of variance in RTS, which serves as an OOD detection metric, is closely related to the uncertainty of the input image. RTS can achieve top performance on both the FR and OOD detection tasks. Moreover, the model trained with RTS can perform robustly on datasets with noise. The proposed module is light-weight and only adds negligible computation cost to the model.

translated by 谷歌翻译

DAMO-NLP at NLPCC-2022 Task 2: Knowledge Enhanced Robust NER for Speech Entity Linking

Shen Huang , Yuchen Zhai , Xinwei Long , Yong Jiang , Xiaobin Wang , Yin Zhang , Pengjun Xie

分类：自然语言处理 | 人工智能 | 机器学习

2022-09-27

链接的语音实体旨在识别和消除语言中的命名实体。常规方法严重遭受了不受限制的语音样式和ASR系统产生的嘈杂笔录。在本文中，我们提出了一种名为“知识增强命名实体识别”（KENER）的新颖方法，该方法致力于通过在实体识别阶段无痛地纳入适当的知识来改善鲁棒性，从而改善实体联系的整体性能。肯纳（Kener）首先检索未提及的句子的候选实体，然后利用实体描述作为额外的信息来帮助识别提及。当输入短或嘈杂时，由密集检索模块检索的候选实体特别有用。此外，我们研究了各种数据采样策略和设计有效的损失功能，以提高识别和歧义阶段中检索实体的质量。最后，将与过滤模块的链接作为最终保障措施应用，从而可以过滤出错误认可的提及。我们的系统在NLPCC-2022共享任务2的轨道1中获得第一名，并在轨道1中获得第一名。

translated by 谷歌翻译

Whodunit? Learning to Contrast for Authorship Attribution

Bo Ai , Yuchen Wang , Yugin Tan , Samson Tan

分类：自然语言处理 | 人工智能 | 机器学习

2022-09-23

作者归因是确定给定文本的作者的任务。大多数现有方法都使用手动设计的功能来捕获数据集的内容和样式。但是，这种依赖数据集的方法会产生不一致的性能。因此，我们建议使用对比度学习和监督学习（Contra-X）的结合来微调预训练的语言表示。我们表明，Contra-X在多个人类和机器作者身份归因基准上提高了最先进的方法，从而提高了高达6.8％的改善。我们还表明，在不同的数据方案中，Contra-X始终优于跨凝性微调。至关重要的是，我们介绍了这些改进的定性和定量分析。我们博学的表示形成了不同作者的高度可分开的群集。但是，我们发现对比度学习以牺牲某些作者的牺牲成本提高了整体准确性。解决这种紧张关系将是未来工作的重要方向。据我们所知，我们是第一个分析将对比度学习与跨凝性微调相结合的作者归因的效果。

translated by 谷歌翻译

Masked Imitation Learning: Discovering Environment-Invariant Modalities in Multimodal Demonstrations

Yilun Hao , Ruinan Wang , Zhangjie Cao , Zihan Wang , Yuchen Cui , Dorsa Sadigh

分类：机器学习 | 机器人

2022-09-16

多模式演示为机器人提供了大量信息，以使世界有意义。但是，当从人类示威中学习感觉运动控制政策时，这种丰度可能并不总是会导致良好的表现。无关的数据模式可能导致状态过度规格，在该状态中包含的模式不仅可以在决策中无用，而且可以改变跨环境的数据分布。州过度规格会导致诸如学习的政策之类的问题，而不是在培训数据分布之外推广。在这项工作中，我们提出了掩盖的模仿学习（MIL），以选择性地使用信息方式来解决状态过度指定。具体来说，我们设计了带有二进制掩码的蒙版策略网络，以阻止某些方式。我们开发了一种双层优化算法，该算法可以学习此面具以准确过滤过度指定的模态。我们从经验上证明，使用Robomimic数据集在包括Mujoco和机器人ARM环境在内的模拟域中的基线算法均优于基线算法，并有效地在收集在真实机器人上收集的多模式数据集中有效地恢复了环境不变的模式。我们的项目网站在以下网址介绍了我们的结果的补充详细信息和视频：https：//tinyurl.com/masked-il

translated by 谷歌翻译

AACC: Asymmetric Actor-Critic in Contextual Reinforcement Learning

Wangyang Yue , Yuan Zhou , Xiaochuan Zhang , Yuchen Hua , Zhiyuan Wang , Guang Kou

分类：机器学习 | (统计)机器学习

2022-08-03

强化学习（RL）技术在许多具有挑战性的任务中引起了极大的关注，但是当应用于现实世界问题时，它们的性能急剧恶化。已经提出了各种方法，例如域随机化，以通过不同的环境设置下的培训代理来应对这种情况，因此在部署过程中可以将它们推广到不同的环境。但是，它们通常不包含与代理人正确相互作用的潜在环境因素信息，因此在面对周围环境变化时可能会过于保守。在本文中，我们首先将适应RL中的环境动态的任务形式化为使用上下文Markov决策过程（CMDP）的概括问题。然后，我们在上下文RL（AACC）中提出了不对称的参与者 - 作为处理此类概括任务的端到端参与者的方法。我们在一系列模拟环境中证明了AACC对现有基线的性能的基本改进。

translated by 谷歌翻译

Fully Decentralized Model-based Policy Optimization for Networked Systems

Yali Du , Chengdong Ma , Yuchen Liu , Runji Lin , Hao Dong , Jun Wang , Yaodong Yang

分类：机器学习 | 人工智能 | (统计)机器学习

2022-07-13

增强学习算法需要大量样品；这通常会限制他们的现实应用程序在简单的任务上。在多代理任务中，这种挑战更为出色，因为操作的每个步骤都需要进行沟通，转移或资源。这项工作旨在通过基于模型的学习来提高多代理控制的数据效率。我们考虑了代理商合作并仅与邻居进行当地交流的网络系统，并提出了基于模型的政策优化框架（DMPO）。在我们的方法中，每个代理都会学习一个动态模型，以预测未来的状态并通过通信广播其预测，然后在模型推出下训练策略。为了减轻模型生成数据的偏见，我们限制了用于产生近视推出的模型使用量，从而减少了模型生成的复合误差。为了使策略更新的独立性有关，我们引入了扩展的价值函数，理论上证明了由此产生的策略梯度是与真实策略梯度的紧密近似。我们在几个智能运输系统的基准上评估了我们的算法，这些智能运输系统是连接的自动驾驶汽车控制任务（FLOW和CACC）和自适应交通信号控制（ATSC）。经验结果表明，我们的方法可以实现卓越的数据效率，并使用真实模型匹配无模型方法的性能。

translated by 谷歌翻译

Joint Super-Resolution and Inverse Tone-Mapping: A Feature Decomposition Aggregation Network and A New Benchmark

Gang Xu , Yuchen Yang , Jun Xu , Liang Wang , Xian-Tong Zhen , Ming-Ming Cheng

分类：计算机视觉

2022-07-07

联合超分辨率和反音调映射（联合SR-ITM）旨在增加低分辨率和标准动态范围图像的分辨率和动态范围。重点方法主要是诉诸图像分解技术，使用多支化的网络体系结构。，这些方法采用的刚性分解在很大程度上将其力量限制在各种图像上。为了利用其潜在能力，在本文中，我们将分解机制从图像域概括为更广泛的特征域。为此，我们提出了一个轻巧的特征分解聚合网络（FDAN）。特别是，我们设计了一个功能分解块（FDB），可以实现功能细节和对比度的可学习分离。通过级联FDB，我们可以建立一个用于强大的多级特征分解的分层功能分解组。联合SR-ITM，\ ie，SRITM-4K的新基准数据集，该数据集是大规模的，为足够的模型培训和评估提供了多功能方案。两个基准数据集的实验结果表明，我们的FDAN表明我们的FDAN有效，并且胜过了以前的方法sr-itm.ar代码和数据集将公开发布。

translated by 谷歌翻译

Key-frame Guided Network for Thyroid Nodule Recognition using Ultrasound Videos

Yuchen Wang , Zhongyu Li , Xiangxiang Cui , Liangliang Zhang , Xiang Luo , Meng Yang , Shi Chang

分类：计算机视觉

2022-06-27

超声检查广泛用于甲状腺结节（良性/恶性）的临床诊断。但是，准确性在很大程度上取决于放射科医生的经验。尽管已经研究了甲状腺结节识别的深度学习技术。当前的解决方案主要基于静态超声图像，其时间信息有限，并且与临床诊断不一致。本文提出了一种通过详尽的超声视频和钥匙框架进行详尽的探索来自动识别甲状腺结节的新方法。我们首先提出一个检测 - 定位框架，以自动识别每个超声视频中典型结节的临床密钥框架。根据本地化的键框架，我们为甲状腺结节识别开发了一个钥匙框引导的视频分类模型。此外，我们引入了运动注意模块，以帮助网络关注超声视频中的重要帧，这与临床诊断一致。拟议的甲状腺结节识别框架已在临床收集的超声视频上进行了验证，与其他最先进的方法相比，表现出卓越的性能。

translated by 谷歌翻译