Timely and effective response to humanitarian crises requires quick and accurate analysis of large amounts of text data - a process that can highly benefit from expert-assisted NLP systems trained on validated and annotated data in the humanitarian response domain. To enable creation of such NLP systems, we introduce and release HumSet, a novel and rich multilingual dataset of humanitarian response documents annotated by experts in the humanitarian response community. The dataset provides documents in three languages (English, French, Spanish) and covers a variety of humanitarian crises from 2018 to 2021 across the globe. For each document, HUMSET provides selected snippets (entries) as well as assigned classes to each entry annotated using common humanitarian information analysis frameworks. HUMSET also provides novel and challenging entry extraction and multi-label entry classification tasks. In this paper, we take a first step towards approaching these tasks and conduct a set of experiments on Pre-trained Language Models (PLM) to establish strong baselines for future research in this domain. The dataset is available at https://blog.thedeep.io/humset/.
translated by 谷歌翻译
协作过滤算法捕获了基本的消费模式,包括特定的特定人口统计信息或用户的受保护信息,例如性别,种族和位置。这些编码的偏见可以影响推荐系统(RS)的决策,以进一步分离提供给各种人口统计亚组的内容,并提出有关披露用户受保护属性的隐私问题。在这项工作中,我们研究了从RS算法的学习交互表示中删除用户特定保护信息的可能性和挑战,同时保持其有效性。具体而言,我们将对抗性训练纳入最先进的多体架构中,从而产生了一种新颖的模型,具有多项式可能性(Adv-Multvae)的对抗性变异自动编码器(Adv-Multvae),旨在消除在保存受保护属性的隐含信息的同时建议性能。我们对Movielens-1M和LFM-2B - demobias数据集进行了实验,并根据外部攻击者无法揭示模型中用户的性别信息来评估偏差缓解方法的有效性。与基线多腔相比,结果表明,adv-multvae的性能边缘恶化(W.R.T. NDCG和召回),在两个数据集中都大大减轻了模型中固有的偏见。
translated by 谷歌翻译
近年来,语言模型已在各种自然语言处理任务上实现了最先进的表现。随着这些模型的尺寸不断增长,探索方法使其更有效的方法变得越来越重要。同时,它们的增强认知能力增加了模型权重中隐式编码数据集中存在的社会偏见的危险。我们提出了一种架构,该体系结构同时使用两种技术来处理这两个挑战:差异和对抗性培训。结果是一个模块化体系结构,该体系结构将原始的差异设置扩展到使用,并将其他稀疏子网应用于掩盖,以减少推理时预定义的受保护属性的效果。
translated by 谷歌翻译
我们提出了一种以最小计算成本提高广泛检索模型的性能的框架。它利用由基本密度检索方法提取的预先提取的文档表示,并且涉及训练模型以共同评分每个查询的一组检索到的候选文档,同时在其他候选的上下文中暂时转换每个文档的表示。以及查询本身。当基于其与查询的相似性进行评分文档表示时,该模型因此意识到其“对等”文档的表示。我们表明,我们的方法导致基本方法的检索性能以及彼此隔离的评分候选文档进行了大量改善,如在一对培训环境中。至关重要的是,与基于伯特式编码器的术语交互重型器不同,它在运行时在任何第一阶段方法的顶部引发可忽略不计的计算开销,允许它与任何最先进的密集检索方法容易地结合。最后,同时考虑给定查询的一组候选文档,可以在检索中进行额外的有价值的功能,例如评分校准和减轻排名中的社会偏差。
translated by 谷歌翻译
最近,大型预用语言模型(LMS)越来越受欢迎。培训这些模型需要更多的计算资源,并且大多数现有模型仅在英文文本上培训。以其他语言训练这些模型非常昂贵。为了缓解这个问题,我们介绍了一种叫做威施塞的方法 - 将英语模型传输到新语言。我们将英语模型的销量与目标语言中的销量交换,并初始化令牌嵌入式,以便通过利用覆盖英语和目标语言的多语言静态字嵌入来初始化令牌嵌入式。我们使用Wechsel将GPT-2和Roberta模型转移到4种其他语言(法语,德语,中文和斯瓦希里语)。 Wechsel通过以前提出的跨语言参数转移和优于比较大小的模型来改善从目标语言的划痕训练的相当大小的型号,距离培训速度较小。我们的方法使培训大型语言模型为新语言更容易访问,更少损害环境。我们宣传我们的代码和型号。
translated by 谷歌翻译
The field of autonomous mobile robots has undergone dramatic advancements over the past decades. Despite achieving important milestones, several challenges are yet to be addressed. Aggregating the achievements of the robotic community as survey papers is vital to keep the track of current state-of-the-art and the challenges that must be tackled in the future. This paper tries to provide a comprehensive review of autonomous mobile robots covering topics such as sensor types, mobile robot platforms, simulation tools, path planning and following, sensor fusion methods, obstacle avoidance, and SLAM. The urge to present a survey paper is twofold. First, autonomous navigation field evolves fast so writing survey papers regularly is crucial to keep the research community well-aware of the current status of this field. Second, deep learning methods have revolutionized many fields including autonomous navigation. Therefore, it is necessary to give an appropriate treatment of the role of deep learning in autonomous navigation as well which is covered in this paper. Future works and research gaps will also be discussed.
translated by 谷歌翻译
Regularising the parameter matrices of neural networks is ubiquitous in training deep models. Typical regularisation approaches suggest initialising weights using small random values, and to penalise weights to promote sparsity. However, these widely used techniques may be less effective in certain scenarios. Here, we study the Koopman autoencoder model which includes an encoder, a Koopman operator layer, and a decoder. These models have been designed and dedicated to tackle physics-related problems with interpretable dynamics and an ability to incorporate physics-related constraints. However, the majority of existing work employs standard regularisation practices. In our work, we take a step toward augmenting Koopman autoencoders with initialisation and penalty schemes tailored for physics-related settings. Specifically, we propose the "eigeninit" initialisation scheme that samples initial Koopman operators from specific eigenvalue distributions. In addition, we suggest the "eigenloss" penalty scheme that penalises the eigenvalues of the Koopman operator during training. We demonstrate the utility of these schemes on two synthetic data sets: a driven pendulum and flow past a cylinder; and two real-world problems: ocean surface temperatures and cyclone wind fields. We find on these datasets that eigenloss and eigeninit improves the convergence rate by up to a factor of 5, and that they reduce the cumulative long-term prediction error by up to a factor of 3. Such a finding points to the utility of incorporating similar schemes as an inductive bias in other physics-related deep learning approaches.
translated by 谷歌翻译
Backpropagation is widely used to train artificial neural networks, but its relationship to synaptic plasticity in the brain is unknown. Some biological models of backpropagation rely on feedback projections that are symmetric with feedforward connections, but experiments do not corroborate the existence of such symmetric backward connectivity. Random feedback alignment offers an alternative model in which errors are propagated backward through fixed, random backward connections. This approach successfully trains shallow models, but learns slowly and does not perform well with deeper models or online learning. In this study, we develop a novel meta-plasticity approach to discover interpretable, biologically plausible plasticity rules that improve online learning performance with fixed random feedback connections. The resulting plasticity rules show improved online training of deep models in the low data regime. Our results highlight the potential of meta-plasticity to discover effective, interpretable learning rules satisfying biological constraints.
translated by 谷歌翻译
We consider a radio resource management (RRM) problem in a multi-user wireless network, where the goal is to optimize a network-wide utility function subject to constraints on the ergodic average performance of users. We propose a state-augmented parameterization for the RRM policy, where alongside the instantaneous network states, the RRM policy takes as input the set of dual variables corresponding to the constraints. We provide theoretical justification for the feasibility and near-optimality of the RRM decisions generated by the proposed state-augmented algorithm. Focusing on the power allocation problem with RRM policies parameterized by a graph neural network (GNN) and dual variables sampled from the dual descent dynamics, we numerically demonstrate that the proposed approach achieves a superior trade-off between mean, minimum, and 5th percentile rates than baseline methods.
translated by 谷歌翻译
随着人工智能的最新进展,可以在人类日常生活的各个方面看到其应用。从语音助手到移动医疗保健和自动驾驶,我们依靠AI方法的性能来完成许多关键任务;因此,必须以适当的手段进行预防损坏的方式主张模型的性能。通常,AI模型的短缺,尤其是深度机器学习,当面对数据分布的变化时,性能下降。尽管如此,在现实世界应用中始终期望这些转变。因此,已经出现了一个研究领域,重点是检测分布外数据子集并实现更全面的概括。此外,由于许多基于深度学习的模型在基准数据集上取得了近乎完美的结果,因此需要评估这些模型的可靠性和可靠性以推向现实世界应用程序的需求,这比以往任何时候都更加强烈。这引起了越来越多的研究领域的研究和领域的概括,这引起了对从各个角度比较这些研究进行比较的调查的需求,并突出了它们的平直和弱点。本文提出了一项调查,除了审查该领域的70多篇论文外,还提出了未来作品的挑战和方向,并为各种类型的数据转移和解决方案提供了统一的外观,以更好地泛化。
translated by 谷歌翻译