智能论文笔记

A Bayesian decision support system for counteracting activities of terrorist groups

Aditi Shenvi , F. Oliver Bunnin , Jim Q. Smith

分类： (统计)机器学习

2020-07-08

恐怖群体的活动对公众的安全和福祉带来了严重的威胁。反恐当局旨在在投入行动之前识别和挫败恐怖群体的计划。虽然恐怖群体的活动可能被隐藏和伪装，但这些群体的成员需要沟通和协调组织他们的活动。当局可以利用这种可观察行为和通信数据来估计恐怖组织构成的威胁。然而，为了可信，任何此类统计模型需要折叠在本集团的每个成员构成的威胁水平。与其他良性形式的社交网络不同，考虑到恐怖主义群体作为可更换的成员，给出了该集团造成伤害的综合能力的不完整图片。在这里，我们开发了一个贝叶斯集成决策支持系统，可以将与恐怖主义组的每个成员相关的信息以及集团的组合活动。

translated by 谷歌翻译

Constructing a Chain Event Graph from a Staged Tree

Aditi Shenvi , Jim Q. Smith

分类： (统计)机器学习 | 机器学习

2020-06-29

链事件图（CEGS）是最近的概率图形模型 - 贝叶斯网络的概括 - 在图形拓扑中提供了结构零，结构缺失值和上下文的条件独立性的显式表示。通过从事件树的顶点的着色开始以识别一步转变对称的变换，从事件树构成CEG。这个彩色的事件树，也称为阶段树是用于这个家庭的学习算法的输出。令人惊讶的是，尚未设计一般算法，它会自动将任何分阶段的树转换为CEG表示。在本文中，我们为该转换提供了一种简单的迭代反向算法。此外，我们表明，没有任何信息从将阶段的树转换成CEG。最后，我们证明，通过最佳停止标准，我们的算法比Silander和Leong（2013）中出现的特殊情况的概率更有效。我们还提供使用此算法的Python代码从任何暂存树中获取CEG以及使用采样零添加边缘的功能。

translated by 谷歌翻译

Safe machine learning model release from Trusted Research Environments: The AI-SDC package

Jim Smith , Richard Preen , Andrew McCarthy , Alba Crespi Boixander , James Liley , Simon Rogers

分类：机器学习

2022-12-02

We present AI-SDC, an integrated suite of open source Python tools to facilitate Statistical Disclosure Control (SDC) of Machine Learning (ML) models trained on confidential data prior to public release. AI-SDC combines (i) a SafeModel package that extends commonly used ML models to provide ante-hoc SDC by assessing the vulnerability of disclosure posed by the training regime; and (ii) an Attacks package that provides post-hoc SDC by rigorously assessing the empirical disclosure risk of a model through a variety of simulated attacks after training. The AI-SDC code and documentation are available under an MIT license at https://github.com/AI-SDC/AI-SDC.

translated by 谷歌翻译

Machine Learning Models Disclosure from Trusted Research Environments (TRE), Challenges and Opportunities

Esma Mansouri-Benssassi , Simon Rogers , Jim Smith , Felix Ritchie , Emily Jefferson , University of Dundee , NHS National Services Scotland , University of the West of England

分类：人工智能

2021-11-10

可信的研究环境（TRE）S是安全和安全的环境，其中研究人员可以访问敏感数据。随着电子健康记录（EHR），医学成像和基因组数据等医疗数据的增长和多样性，通常在使用人工智能（AI）和机器学习子场（ML）的使用增加医疗领域。这产生了披露从TRES的新类型输出的希望，例如培训的机器学习模型。虽然特定的指导方针和政策存在于TRES中的统计披露控制，但它们并不令人满意地涵盖这些新类型的输出请求。在本文中，我们定义了在TRES内医疗保健机器学习的应用程序和披露的一些挑战。我们描述了各种漏洞，引入AI带来了TRES。我们还提供了与培训ML模型的披露相关的不同类型和风险水平的介绍。我们终于描述了开发和调整政策和工具的新研究机会，以安全地披露从TRES的机器学习输出。

translated by 谷歌翻译

Latent Discretization for Continuous-time Sequence Compression

Ricky T. Q. Chen , Matthew Le , Matthew Muckley , Maximilian Nickel , Karen Ullrich

分类：机器学习 | (统计)机器学习

2022-12-28

Neural compression offers a domain-agnostic approach to creating codecs for lossy or lossless compression via deep generative models. For sequence compression, however, most deep sequence models have costs that scale with the sequence length rather than the sequence complexity. In this work, we instead treat data sequences as observations from an underlying continuous-time process and learn how to efficiently discretize while retaining information about the full sequence. As a consequence of decoupling sequential information from its temporal discretization, our approach allows for greater compression rates and smaller computational complexity. Moreover, the continuous-time approach naturally allows us to decode at different time intervals. We empirically verify our approach on multiple domains involving compression of video and motion capture sequences, showing that our approaches can automatically achieve reductions in bit rates by learning how to discretize.

translated by 谷歌翻译

VQA and Visual Reasoning: An Overview of Recent Datasets, Methods and Challenges

Rufai Yusuf Zakari , Jim Wilson Owusu , Hailin Wang , Ke Qin , Zaharaddeen Karami Lawal , Yuezhou Dong

分类：计算机视觉

2022-12-26

Artificial Intelligence (AI) and its applications have sparked extraordinary interest in recent years. This achievement can be ascribed in part to advances in AI subfields including Machine Learning (ML), Computer Vision (CV), and Natural Language Processing (NLP). Deep learning, a sub-field of machine learning that employs artificial neural network concepts, has enabled the most rapid growth in these domains. The integration of vision and language has sparked a lot of attention as a result of this. The tasks have been created in such a way that they properly exemplify the concepts of deep learning. In this review paper, we provide a thorough and an extensive review of the state of the arts approaches, key models design principles and discuss existing datasets, methods, their problem formulation and evaluation measures for VQA and Visual reasoning tasks to understand vision and language representation learning. We also present some potential future paths in this field of research, with the hope that our study may generate new ideas and novel approaches to handle existing difficulties and develop new applications.

translated by 谷歌翻译

Metadata-guided Consistency Learning for High Content Images

Johan Fredin Haslum , Christos Matsoukas , Karl-Johan Leuchowius , Erik Müllers , Kevin Smith

分类：计算机视觉

2022-12-22

High content imaging assays can capture rich phenotypic response data for large sets of compound treatments, aiding in the characterization and discovery of novel drugs. However, extracting representative features from high content images that can capture subtle nuances in phenotypes remains challenging. The lack of high-quality labels makes it difficult to achieve satisfactory results with supervised deep learning. Self-Supervised learning methods, which learn from automatically generated labels has shown great success on natural images, offer an attractive alternative also to microscopy images. However, we find that self-supervised learning techniques underperform on high content imaging assays. One challenge is the undesirable domain shifts present in the data known as batch effects, which may be caused by biological noise or uncontrolled experimental conditions. To this end, we introduce Cross-Domain Consistency Learning (CDCL), a novel approach that is able to learn in the presence of batch effects. CDCL enforces the learning of biological similarities while disregarding undesirable batch-specific signals, which leads to more useful and versatile representations. These features are organised according to their morphological changes and are more useful for downstream tasks - such as distinguishing treatments and mode of action.

translated by 谷歌翻译

Forecasting West Nile Virus with Graph Neural Networks: Harnessing Spatial Dependence in Irregularly Sampled Geospatial Data

Adam Tonks , Trevor Harris , Bo Li , William Brown , Rebecca Smith

分类：机器学习

2022-12-21

Machine learning methods have seen increased application to geospatial environmental problems, such as precipitation nowcasting, haze forecasting, and crop yield prediction. However, many of the machine learning methods applied to mosquito population and disease forecasting do not inherently take into account the underlying spatial structure of the given data. In our work, we apply a spatially aware graph neural network model consisting of GraphSAGE layers to forecast the presence of West Nile virus in Illinois, to aid mosquito surveillance and abatement efforts within the state. More generally, we show that graph neural networks applied to irregularly sampled geospatial data can exceed the performance of a range of baseline methods including logistic regression, XGBoost, and fully-connected neural networks.

translated by 谷歌翻译

Does unsupervised grammar induction need pixels?

Boyi Li , Rodolfo Corona , Karttikeya Mangalam , Catherine Chen , Daniel Flaherty , Serge Belongie , Kilian Q. Weinberger , Jitendra Malik , Trevor Darrell , Dan Klein

分类：自然语言处理 | 人工智能 | 机器学习

2022-12-20

Are extralinguistic signals such as image pixels crucial for inducing constituency grammars? While past work has shown substantial gains from multimodal cues, we investigate whether such gains persist in the presence of rich information from large language models (LLMs). We find that our approach, LLM-based C-PCFG (LC-PCFG), outperforms previous multi-modal methods on the task of unsupervised constituency parsing, achieving state-of-the-art performance on a variety of datasets. Moreover, LC-PCFG results in an over 50% reduction in parameter count, and speedups in training time of 1.7x for image-aided models and more than 5x for video-aided models, respectively. These results challenge the notion that extralinguistic signals such as image pixels are needed for unsupervised grammar induction, and point to the need for better text-only baselines in evaluating the need of multi-modality for the task.

translated by 谷歌翻译

Self-Instruct: Aligning Language Model with Self Generated Instructions

Yizhong Wang , Yeganeh Kordi , Swaroop Mishra , Alisa Liu , Noah A. Smith , Daniel Khashabi , Hannaneh Hajishirzi

分类：自然语言处理 | 人工智能

2022-12-20

Large "instruction-tuned" language models (finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off its own generations. Our pipeline generates instruction, input, and output samples from a language model, then prunes them before using them to finetune the original model. Applying our method to vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT_001, which is trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT_001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning.

translated by 谷歌翻译