Smart Paper Notes

Extracting Medication Changes in Clinical Narratives using Pre-trained Language Models

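Giridhar Kaushik Ramachandran, Kevin Lybarger, Yaya Liu, Diwakar Mahajan, Jennifer J. Liang, Ching-Huei Tsou, Meliha Yetisgen, Özlem Uzuner

Category: Natural Language Processing

2022-08-17

An accurate and detailed account of patient medications, including medication changes within the patient timeline, is essential for healthcare providers to deliver appropriate patient care. Healthcare providers or the patients themselves may initiate changes to patient medication. Medication changes take many forms, including changes to the prescribed medication and associated dosage modifications. These changes provide information about the overall health of the patient and the rationale that led to the current care. Future care can then build on the resulting state of the patient. This work explores the automatic extraction of medication change information from free-text clinical notes. The Contextual Medication Event Dataset (CMED) is a corpus of clinical notes with annotations that characterize medication changes through multiple change-related attributes, including the type of change (start, stop, increase, etc.), the initiator of the change, temporality, change likelihood, and negation. Using CMED, we identify medication mentions in clinical text and propose three novel BERT-based systems that resolve the annotated medication change characteristics. We demonstrate that our proposed architectures improve medication change classification performance over the initial work on CMED. We identify medication mentions with high performance at 0.959 F1, and our proposed systems classify medication changes and their attributes at 0.827 F1.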

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

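Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso

Categories: Natural Language Processing | Artificial Intelligence | Machine Learning | Machine Learning (Statistics)

2022-06-09

Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 442 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks in order to provide a strong baseline. Findings include: model performance and calibration both improve with scale, but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting.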

BKinD-3D: Self-Supervised 3D Keypoint Discovery from Multi-View Videos

Jennifer J. Sun, Pierre Karashchuk, Amil Dravid, Serim Ryou, Sonia Fereidooni, John Tuthill, Aggelos Katsaggelos, Bingni W. Brunton, Georgia Gkioxari, Ann Kennedy

Categories: Computer Vision | Artificial Intelligence

2022-12-14

Quantifying motion in 3D is important for studying the behavior of humans and other animals, but manual pose annotations are expensive and time-consuming to obtain. Self-supervised keypoint discovery is a promising strategy for estimating 3D poses without annotations. However, current keypoint discovery approaches commonly process single 2D views and do not operate in the 3D space. We propose a new method to perform self-supervised keypoint discovery in 3D from multi-view videos of behaving agents, without any keypoint or bounding box supervision in 2D or 3D. Our method uses an encoder-decoder architecture with a 3D volumetric heatmap, trained to reconstruct spatiotemporal differences across multiple views, in addition to joint length constraints on a learned 3D skeleton of the subject. In this way, we discover keypoints without requiring manual supervision in videos of humans and rats, demonstrating the potential of 3D keypoint discovery for studying behavior.

Generating Holistic 3D Human Motion from Speech

Hongwei Yi, Hualin Liang, Yifei Liu, Qiong Cao, Yandong Wen, Timo Bolkart, Dacheng Tao, Michael J. Black

Category: Computer Vision

2022-12-08

This work addresses the problem of generating 3D holistic body motions from human speech. Given a speech recording, we synthesize sequences of 3D body poses, hand gestures, and facial expressions that are realistic and diverse. To achieve this, we first build a high-quality dataset of 3D holistic body meshes with synchronous speech. We then define a novel speech-to-motion generation framework in which the face, body, and hands are modeled separately. The separated modeling stems from the fact that face articulation strongly correlates with human speech, while body poses and hand gestures are less correlated. Specifically, we employ an autoencoder for face motions, and a compositional vector-quantized variational autoencoder (VQ-VAE) for the body and hand motions. The compositional VQ-VAE is key to generating diverse results. Additionally, we propose a cross-conditional autoregressive model that generates body poses and hand gestures, leading to coherent and realistic motions. Extensive experiments and user studies demonstrate that our proposed approach achieves state-of-the-art performance both qualitatively and quantitatively. Our novel dataset and code will be released for research purposes at https://talkshow.is.tue.mpg.de.

Improving astroBERT using Semantic Textual Similarity

Felix Grezes, Thomas Allen, Sergi Blanco-Cuaresma, Alberto Accomazzi, Michael J. Kurtz, Golnaz Shapurian, Edwin Henneken, Carolyn S. Grant, Donna M. Thompson, Timothy W. Hostetler

Category: Natural Language Processing

2022-11-29

The NASA Astrophysics Data System (ADS) is an essential tool for researchers that allows them to explore the astronomy and astrophysics scientific literature, but it has yet to exploit recent advances in natural language processing. At ADASS 2021, we introduced astroBERT, a machine learning language model tailored to the text used in astronomy papers in ADS. In this work we announce the first public release of the astroBERT language model; show how astroBERT improves over existing public language models on astrophysics-specific tasks; and detail how ADS plans to harness the unique structure of scientific papers, the citation graph and citation context, to further improve astroBERT.

Neurosymbolic Programming for Science

Jennifer J. Sun, Megan Tjandrasuwita, Atharva Sehgal, Armando Solar-Lezama, Swarat Chaudhuri, Yisong Yue, Omar Costilla-Reyes

Category: Artificial Intelligence

2022-10-10

Neurosymbolic Programming (NP) techniques have the potential to accelerate scientific discovery. These models combine neural and symbolic components to learn complex patterns and representations from data, using high-level concepts or known constraints. NP techniques can interface with symbolic domain knowledge from scientists, such as prior knowledge and experimental context, to produce interpretable outputs. We identify opportunities and challenges between current NP models and scientific workflows, drawing on real-world examples from behavior analysis in science, to enable the broad use of NP in workflows across the natural and social sciences.

Multi-level Adversarial Spatio-temporal Learning for Footstep Pressure based FoG Detection

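Kun Hu, Shaohui Mei, Wei Wang, Kaylena A. Ehgoetz Martens, Liang Wang, Simon J. G. Lewis, David D. Feng, Zhiyong Wang

Categories: Computer Vision | Artificial Intelligence

2022-09-22

Freezing of gait (FoG) is one of the most common symptoms of Parkinson's disease, a neurodegenerative disorder of the central nervous system that affects millions of people around the world. To address the pressing need to improve the quality of treatment for FoG, devising a computer-aided detection and quantification tool for FoG has become increasingly important. As a non-invasive technique for collecting motion patterns, the footstep pressure sequences obtained from pressure-sensitive gait mats provide a great opportunity for evaluating FoG in the clinic and potentially in the home environment. In this study, FoG detection is formulated as a sequential modelling task, and a novel deep learning architecture, the Adversarial Spatio-temporal Network (ASTN), is proposed to learn FoG patterns across multiple levels. A novel adversarial training scheme is introduced with a multi-level subject discriminator to obtain subject-independent FoG representations, which helps reduce the over-fitting risk caused by high inter-subject variance. As a result, robust FoG detection can be achieved for unseen subjects. The proposed scheme also sheds light on improving subject-level clinical studies in other scenarios, as it can be integrated with many existing deep architectures. To the best of our knowledge, this is one of the first studies of footstep pressure-based FoG detection, and the ASTN approach is the first deep neural network architecture in pursuit of subject-independent representations. Experimental results on 393 trials collected from 21 subjects demonstrate encouraging performance of the proposed ASTN for FoG detection, with an AUC of 0.85.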

Associative Learning for Network Embedding

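Yuchen Liang, Dmitry Krotov, Mohammed J. Zaki

Categories: Machine Learning | Neural and Evolutionary Computing

2022-08-30

The network embedding task is to represent each node in a network as a low-dimensional vector while incorporating topological and structural information. Most existing approaches solve this problem by factorizing a proximity matrix, either directly or implicitly. In this work, we introduce a network embedding method from a new perspective, which leverages Modern Hopfield Networks (MHN) for associative learning. Our network learns associations between the content of each node and that node's neighbors. These associations serve as memories in the MHN. The recurrent dynamics of the network make it possible to recover a masked node given that node's neighbors. Our proposed method is evaluated on different downstream tasks such as node classification and link prediction. The results show competitive performance compared to common matrix factorization techniques and deep learning based methods.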
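The recall step described in this abstract can be illustrated with the commonly cited modern Hopfield update, in which a query is replaced by a softmax-weighted average of the stored memories. The sketch below is only an assumption-laden toy, not the authors' model: the function names, the `beta` value, and the layout of each memory as `[node content | neighbor indicator]` are all hypothetical choices made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def hopfield_retrieve(memories, query, beta=8.0, steps=1):
    """Apply the update q <- sum_i softmax(beta * <m_i, q>)_i * m_i.

    memories: list of equal-length vectors (the stored associations).
    query:    partial pattern, here with the node-content part zeroed out.
    beta and steps are illustrative defaults, not values from the paper.
    """
    q = list(query)
    for _ in range(steps):
        scores = [beta * sum(m_k * q_k for m_k, q_k in zip(m, q)) for m in memories]
        weights = softmax(scores)
        q = [sum(w * m[k] for w, m in zip(weights, memories)) for k in range(len(q))]
    return q


# Hypothetical layout: first 2 entries = node content, last 4 = neighbor indicator.
memories = [
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 0, 1, 1],
]
masked_query = [0, 0, 1, 1, 0, 0]  # content masked, neighbors known
recovered = hopfield_retrieve(memories, masked_query)
```

In this toy setting the query overlaps only with the first memory, so the softmax weight concentrates there and the masked content entries are filled in from that memory.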

The MABe22 Benchmarks for Representation Learning of Multi-Agent Behavior

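Jennifer J. Sun, Andrew Ulmer, Dipam Chakraborty, Brian Geuther, Edward Hayes, Heng Jia, Vivek Kumar, Zachary Partridge, Alice Robie, Catherine E. Schretter

Categories: Machine Learning | Artificial Intelligence | Computer Vision

2022-07-21

Real-world behavior is often shaped by complex interactions between multiple agents. To study multi-agent behavior scalably, advances in unsupervised and self-supervised learning have enabled a variety of behavioral representations to be learned from trajectory data. To date, there is no unified set of benchmarks that enables methods to be compared quantitatively and systematically across a broad range of behavior analysis settings. We aim to address this by introducing a large-scale, multi-agent trajectory dataset from real-world behavioral neuroscience experiments that covers a range of behavior analysis tasks. Our dataset consists of trajectory data from common model organisms, with 9.6 million frames of mouse data and 4.4 million frames of fly data, in a variety of experimental settings, such as different strains, lengths of interaction, and optogenetic stimulation. A subset of the frames also includes expert-annotated behavior labels. Improvements on our dataset correspond to behavioral representations that work across multiple organisms and are able to capture differences for common behavior analysis tasks.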

Deep Squared Euclidean Approximation to the Levenshtein Distance for DNA Storage

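Alan J. X. Guo, Cong Liang, Qing-Hu Hou

Category: Machine Learning

2022-07-11

Storing information in DNA molecules is of great interest because of its advantages in longevity, high storage density, and low maintenance cost. A key step in the DNA storage pipeline is to efficiently cluster the retrieved DNA sequences according to their similarities. The Levenshtein distance is the most suitable metric for the similarity between two DNA sequences, but it is inferior in terms of computational complexity and less compatible with mature clustering algorithms. In this work, we propose a novel deep squared Euclidean embedding for DNA sequences using a Siamese neural network, squared Euclidean embedding, and chi-squared regression. The Levenshtein distance is approximated by the squared Euclidean distance between the embedding vectors, which is fast to compute and friendly to clustering algorithms. The proposed approach is analyzed theoretically and experimentally. The results show that the proposed embedding is efficient and robust.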
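To make the two quantities in this abstract concrete, the sketch below computes the exact Levenshtein distance with the standard dynamic program and, separately, a squared Euclidean distance between embedding vectors. The `toy_embedding` function is a hypothetical stand-in (simple base counts), not the learned Siamese embedding from the paper; it ignores base order and is included only to show where a learned encoder would plug in.

```python
def levenshtein(a: str, b: str) -> int:
    """Exact edit distance via the two-row dynamic program, O(len(a) * len(b))."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def squared_euclidean(u, v) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(u, v))


def toy_embedding(seq: str):
    """Hypothetical placeholder for a learned encoder: counts of A, C, G, T."""
    return [seq.count(base) for base in "ACGT"]


exact = levenshtein("ACGT", "AGT")
approx = squared_euclidean(toy_embedding("ACGT"), toy_embedding("AGT"))
```

The exact distance costs time proportional to the product of the sequence lengths for every pair, whereas the embedding distance is a single pass over fixed-length vectors once the sequences are encoded. The toy encoder also shows why a learned embedding is needed: "AC" and "CA" have a Levenshtein distance of 2 but identical base counts.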
