Content-Controllable Summarization generates summaries focused on the given controlling signals. Due to the lack of large-scale training corpora for the task, we propose a plug-and-play module RelAttn to adapt any general summarizers to the content-controllable summarization task. RelAttn first identifies the relevant content in the source documents, and then makes the model attend to the right context by directly steering the attention weight. We further apply an unsupervised online adaptive parameter searching algorithm to determine the degree of control in the zero-shot setting, while such parameters are learned in the few-shot setting. By applying the module to three backbone summarization models, experiments show that our method effectively improves all the summarizers, and outperforms the prefix-based method and a widely used plug-and-play model in both zero- and few-shot settings. Tellingly, more benefit is observed in the scenarios when more control is needed.
translated by 谷歌翻译
过渡到成年是许多家庭的重要生活阶段。先前的研究表明,具有智力或发展的年轻人(IDD)比同龄人面临的挑战更多。这项研究是为了探索如何使用自然语言处理(NLP)方法,尤其是无监督的机器学习,以帮助心理学家分析情绪和情感,并使用主题建模来确定年轻人IDD及其家人所拥有的常见问题和挑战。此外,将结果与从没有IDD的年轻人那里获得的结果进行了比较。研究结果表明,NLP方法对于心理学家分析情绪,进行跨案例分析并从对话数据中汇总关键主题非常有用。我们的Python代码可在https://github.com/mlaricheva/emotion_topic_modeling上找到。
translated by 谷歌翻译
最近的神经监督主题细分模型具有优于无监督方法的杰出有效性,并从Wikipedia采样了大规模培训语料库。但是,这些模型可能会因利用简单的语言线索进行预测而引起的鲁棒性和可传递性有限,但忽略了更重要的索引间局部一致性。为了解决这个问题,我们提出了一种语言意识到的神经主题细分模型,并注入了句子上的话语依赖性结构,以鼓励模型使主题边界预测更多地基于句子之间的局部一致性。我们对英语评估数据集的实证研究表明,通过我们提出的策略将上述句子话语结构注入神经主题分段者可以实质上改善其在域内和外域数据上的性能,而模型的复杂性很小。
translated by 谷歌翻译
尽管最近的抽象性摘要在自动评估指标上取得了成功,但生成的摘要仍然与源文档呈现事实不一致。在本文中,我们专注于实体级别的事实不一致,即减少生成的摘要与源文档之间的不匹配实体。因此,我们提出了一种基于实体的新型跨度机制,并通过全球相关成分探索其扩展。四个摘要数据集的实验结果表明,跨度可以有效地改善实体级别的事实一致性,而单词级别和实体级别的显着性基本上没有变化。该代码可在https://github.com/wendy-xiao/entity基于基础上找到
translated by 谷歌翻译
会话数据在心理学中至关重要,因为它可以帮助研究人员了解个人的认知过程,情感和行为。话语标签是分析此类数据的常见策略。 NLP算法的开发使研究人员可以自动化此任务。但是,心理对话数据给NLP研究人员带来了一些挑战,包括多标签分类,大量类别和有限的可用数据。这项研究探讨了NLP方法生成的自动标签如何与人类在成年过渡的对话的背景下与人类标签相媲美。我们提出了应对心理学研究中提出的三个共同挑战的策略。我们的发现表明,具有领域适应性的深度学习方法(Roberta-Con)优于所有其他机器学习方法。我们提出的分层标签系统被证明可帮助研究人员战略性地分析对话数据。我们的Python代码和NLP模型可在https://github.com/mlaricheva/automated_labeling上获得。
translated by 谷歌翻译
RST样式话语解析在许多NLP任务中发挥着重要作用,揭示了潜在复杂和多样化的文件的潜在语义/务实结构。尽管重要的是,现代话语解析中最普遍的限制之一是缺乏大规模的数据集。为了克服数据稀疏问题,最近已经提出了远离情绪分析和概括的任务的远端监督方法。在这里,我们通过利用主题分割的遥远监督来扩展这一研究,可以可以说可以提供高级话语结构的强大和多个互补信号。两个人注释的话语TreeBanks的实验证实,我们的提案在句子和段落中产生了准确的树结构,始终如一地展现在句子到文件任务上的先前监督模型,偶尔在句子到段落上达到更高的分数。
translated by 谷歌翻译
最近经过彻底调查了变压器多头自我关注机制。一方面,研究人员对理解为什么以及变压器如何工作。另一方面,他们提出了新的注意增强方法,使变压器更准确,高效和可解释。在本文中,我们在循环管道中协同促使这两条研究线,首先找到了重要的任务特定的注意模式。然后应用那些模式,不仅应用于原始模型,还应用于较小的模型,作为人类引导的知识蒸馏过程。在提取摘要任务的情况下,在案例研究中对我们的管道的好处。在受欢迎的Bertsum模型中找到三种有意义的关注模式之后,实验表明,当我们注入这种模式时,原始和较小模型都显示出性能的改进,并且可以说是可争议的解释性。
translated by 谷歌翻译
Background: Image analysis applications in digital pathology include various methods for segmenting regions of interest. Their identification is one of the most complex steps, and therefore of great interest for the study of robust methods that do not necessarily rely on a machine learning (ML) approach. Method: A fully automatic and optimized segmentation process for different datasets is a prerequisite for classifying and diagnosing Indirect ImmunoFluorescence (IIF) raw data. This study describes a deterministic computational neuroscience approach for identifying cells and nuclei. It is far from the conventional neural network approach, but it is equivalent to their quantitative and qualitative performance, and it is also solid to adversative noise. The method is robust, based on formally correct functions, and does not suffer from tuning on specific data sets. Results: This work demonstrates the robustness of the method against the variability of parameters, such as image size, mode, and signal-to-noise ratio. We validated the method on two datasets (Neuroblastoma and NucleusSegData) using images annotated by independent medical doctors. Conclusions: The definition of deterministic and formally correct methods, from a functional to a structural point of view, guarantees the achievement of optimized and functionally correct results. The excellent performance of our deterministic method (NeuronalAlg) to segment cells and nuclei from fluorescence images was measured with quantitative indicators and compared with those achieved by three published ML approaches.
translated by 谷歌翻译
The broad usage of mobile devices nowadays, the sensitiveness of the information contained in them, and the shortcomings of current mobile user authentication methods are calling for novel, secure, and unobtrusive solutions to verify the users' identity. In this article, we propose TypeFormer, a novel Transformer architecture to model free-text keystroke dynamics performed on mobile devices for the purpose of user authentication. The proposed model consists in Temporal and Channel Modules enclosing two Long Short-Term Memory (LSTM) recurrent layers, Gaussian Range Encoding (GRE), a multi-head Self-Attention mechanism, and a Block-Recurrent structure. Experimenting on one of the largest public databases to date, the Aalto mobile keystroke database, TypeFormer outperforms current state-of-the-art systems achieving Equal Error Rate (EER) values of 3.25% using only 5 enrolment sessions of 50 keystrokes each. In such way, we contribute to reducing the traditional performance gap of the challenging mobile free-text scenario with respect to its desktop and fixed-text counterparts. Additionally, we analyse the behaviour of the model with different experimental configurations such as the length of the keystroke sequences and the amount of enrolment sessions, showing margin for improvement with more enrolment data. Finally, a cross-database evaluation is carried out, demonstrating the robustness of the features extracted by TypeFormer in comparison with existing approaches.
translated by 谷歌翻译
Digital media have enabled the access to unprecedented literary knowledge. Authors, readers, and scholars are now able to discover and share an increasing amount of information about books and their authors. Notwithstanding, digital archives are still unbalanced: writers from non-Western countries are less represented, and such a condition leads to the perpetration of old forms of discrimination. In this paper, we present the Under-Represented Writers Knowledge Graph (URW-KG), a resource designed to explore and possibly amend this lack of representation by gathering and mapping information about works and authors from Wikidata and three other sources: Open Library, Goodreads, and Google Books. The experiments based on KG embeddings showed that the integrated information encoded in the graph allows scholars and users to be more easily exposed to non-Western literary works and authors with respect to Wikidata alone. This opens to the development of fairer and effective tools for author discovery and exploration.
translated by 谷歌翻译