智能论文笔记

The Multiscenario Multienvironment BioSecure Multimodal Database (BMDB)

Javier Ortega-Garcia , Julian Fierrez , Fernando Alonso-Fernandez , Javier Galbally , Manuel R Freire , Joaquin Gonzalez-Rodriguez , Carmen Garcia-Mateo , Jose-Luis Alba-Castro , Elisardo Gonzalez-Agulla , Enrique Otero-Muras

分类：计算机视觉

2021-11-17

展示了在欧洲生物安全卓越网络框架内设计和获取的新的多模态生物识别数据库。它由600多个个人在三种情况下在三种情况下获得：1）在互联网上，2）在带台式PC的办公环境中，以及3）在室内/室外环境中，具有移动便携式硬件。这三种方案包括音频/视频数据的共同部分。此外，已使用桌面PC和移动便携式硬件获取签名和指纹数据。此外，使用桌面PC在第二个方案中获取手和虹膜数据。收购事项已于11名欧洲机构进行。 BioSecure多模式数据库（BMDB）的其他功能有：两个采集会话，在某些方式的几种传感器，均衡性别和年龄分布，多式化现实情景，每种方式，跨欧洲多样性，人口统计数据的可用性，以及人口统计数据的可用性与其他多模式数据库的兼容性。 BMDB的新型收购条件允许我们对单币或多模式生物识别系统进行新的具有挑战性的研究和评估，如最近的生物安全的多模式评估活动。还给出了该活动的描述，包括来自新数据库的单个模式的基线结果。预计数据库将通过2008年通过生物安全协会进行研究目的

translated by 谷歌翻译

Towards automated brain aneurysm detection in TOF-MRA: open data, weak labels, and anatomical knowledge

Tommaso Di Noto , Guillaume Marie , Sebastien Tourbier , Yasser Alemán-Gómez , Oscar Esteban , Guillaume Saliou , Meritxell Bach Cuadra , Patric Hagmann , Jonas Richiardi

分类：计算机视觉

2021-03-10

飞行时间磁共振血管造影（TOF-MRA）的脑动脉瘤检测经历了剧烈的改善，深入学习（DL）。然而，监督DL模型的性能严重依赖于标记样品的数量。为了减轻Voxel-Wise标签创建的反复瓶颈，我们调查了弱标签的使用：这些是超大的注释，这些注释是更快的创造。我们为在训练期间利用弱标签的动脉瘤检测提供了深入的学习算法。此外，我们的模型通过仅关注动脉瘤发生的合理地点来利用先前的解剖知识。我们创建了284个TOF-MRA受试者（170名女性）的回顾性数据集，其中157例是患者（带198个动脉瘤），127个是对照。我们开放的TOF-MRA DataSet，社区中最大的数据集在Openneuro上发布。为了评估型号的概括性，我们参与了具有TOF-MRA数据的动脉瘤检测的挑战（93例，20例，125例，125个动脉瘤）。弱标签比其voxel-Wise对应物速度快4倍。使用先前解剖知识时，我们的网络在内部数据上实现了80％的灵敏度，每位患者的假阳性（FP）率为1.2。在公共挑战上，敏感度为68％（FP率= 2.5），排名第4/18位开放排行榜。我们发现动脉瘤破裂群（P = 0.75），位置（P = 0.72）或大小（P = 0.15）之间没有显着差异。我们的代码可用于可重复性。

translated by 谷歌翻译

NISQ-ready community detection based on separation-node identification

Jonas Stein , Dominik Ott , Mirco Schoenfeld , Sebastian Feld

分类：机器学习

2022-12-30

The analysis of network structure is essential to many scientific areas, ranging from biology to sociology. As the computational task of clustering these networks into partitions, i.e., solving the community detection problem, is generally NP-hard, heuristic solutions are indispensable. The exploration of expedient heuristics has led to the development of particularly promising approaches in the emerging technology of quantum computing. Motivated by the substantial hardware demands for all established quantum community detection approaches, we introduce a novel QUBO based approach that only needs number-of-nodes many qubits and is represented by a QUBO-matrix as sparse as the input graph's adjacency matrix. The substantial improvement on the sparsity of the QUBO-matrix, which is typically very dense in related work, is achieved through the novel concept of separation-nodes. Instead of assigning every node to a community directly, this approach relies on the identification of a separation-node set, which -- upon its removal from the graph -- yields a set of connected components, representing the core components of the communities. Employing a greedy heuristic to assign the nodes from the separation-node sets to the identified community cores, subsequent experimental results yield a proof of concept. This work hence displays a promising approach to NISQ ready quantum community detection, catalyzing the application of quantum computers for the network structure analysis of large scale, real world problem instances.

translated by 谷歌翻译

Non-intrusive surrogate modelling using sparse random features with applications in crashworthiness analysis

Maternus Herold , Anna Veselovska , Jonas Jehle , Felix Krahmer

分类：机器学习 | (统计)机器学习

2022-12-30

Efficient surrogate modelling is a key requirement for uncertainty quantification in data-driven scenarios. In this work, a novel approach of using Sparse Random Features for surrogate modelling in combination with self-supervised dimensionality reduction is described. The method is compared to other methods on synthetic and real data obtained from crashworthiness analyses. The results show a superiority of the here described approach over state of the art surrogate modelling techniques, Polynomial Chaos Expansions and Neural Networks.

translated by 谷歌翻译

Restricting to the chip architecture maintains the quantum neural network accuracy, if the parameterization is a $2$-design

Lucas Friedrich , Jonas Maziero

分类：人工智能 | 机器学习

2022-12-29

In the era of noisy intermediate scale quantum devices, variational quantum circuits (VQCs) are currently one of the main strategies for building quantum machine learning models. These models are made up of a quantum part and a classical part. The quantum part is given by a parametrization $U$, which, in general, is obtained from the product of different quantum gates. By its turn, the classical part corresponds to an optimizer that updates the parameters of $U$ in order to minimize a cost function $C$. However, despite the many applications of VQCs, there are still questions to be answered, such as for example: What is the best sequence of gates to be used? How to optimize their parameters? Which cost function to use? How the architecture of the quantum chips influences the final results? In this article, we focus on answering the last question. We will show that, in general, the cost function will tend to a typical average value the closer the parameterization used is from a $2$-design. Therefore, the closer this parameterization is to a $2$-design, the less the result of the quantum neural network model will depend on its parametrization. As a consequence, we can use the own architecture of the quantum chips to defined the VQC parametrization, avoiding the use of additional swap gates and thus diminishing the VQC depth and the associated errors.

translated by 谷歌翻译

Cramming: Training a Language Model on a Single GPU in One Day

Jonas Geiping , Tom Goldstein

分类：自然语言处理 | 机器学习

2022-12-28

Recent trends in language modeling have focused on increasing performance through scaling, and have resulted in an environment where training language models is out of reach for most researchers and practitioners. While most in the community are asking how to push the limits of extreme computation, we ask the opposite question: How far can we get with a single GPU in just one day? We investigate the downstream performance achievable with a transformer-based language model trained completely from scratch with masked language modeling for a single day on a single consumer GPU. Aside from re-analyzing nearly all components of the pretraining pipeline for this scenario and providing a modified pipeline with performance close to BERT, we investigate why scaling down is hard, and which modifications actually improve performance in this scenario. We provide evidence that even in this constrained setting, performance closely follows scaling laws observed in large-compute settings. Through the lens of scaling laws, we categorize a range of recent improvements to training and architecture and discuss their merit and practical applicability (or lack thereof) for the limited compute setting.

translated by 谷歌翻译

Continual Causal Abstractions

Matej Zečević , Moritz Willig , Jonas Seng , Florian Peter Busch

分类：人工智能

2022-12-23

This short paper discusses continually updated causal abstractions as a potential direction of future research. The key idea is to revise the existing level of causal abstraction to a different level of detail that is both consistent with the history of observed data and more effective in solving a given task.

translated by 谷歌翻译

ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models

Jonas Belouadi , Steffen Eger

分类：自然语言处理

2022-12-20

State-of-the-art poetry generation systems are often complex. They either consist of task-specific model pipelines, incorporate prior knowledge in the form of manually created constraints or both. In contrast, end-to-end models would not suffer from the overhead of having to model prior knowledge and could learn the nuances of poetry from data alone, reducing the degree of human supervision required. In this work, we investigate end-to-end poetry generation conditioned on styles such as rhyme, meter, and alliteration. We identify and address lack of training data and mismatching tokenization algorithms as possible limitations of past attempts. In particular, we successfully pre-train and release ByGPT5, a new token-free decoder-only language model, and fine-tune it on a large custom corpus of English and German quatrains annotated with our styles. We show that ByGPT5 outperforms other models such as mT5, ByT5, GPT-2 and ChatGPT, while also being more parameter efficient and performing favorably compared to humans. In addition, we analyze its runtime performance and introspect the model's understanding of style conditions. We make our code, models, and datasets publicly available.

translated by 谷歌翻译

VoronoiPatches: Evaluating A New Data Augmentation Method

Steffen Illium , Gretchen Griffin , Michael Kölle , Maximilian Zorn , Jonas Nüßlein , Claudia Linnhoff-Popien

分类：计算机视觉 | 机器学习

2022-12-20

Overfitting is a problem in Convolutional Neural Networks (CNN) that causes poor generalization of models on unseen data. To remediate this problem, many new and diverse data augmentation methods (DA) have been proposed to supplement or generate more training data, and thereby increase its quality. In this work, we propose a new data augmentation algorithm: VoronoiPatches (VP). We primarily utilize non-linear recombination of information within an image, fragmenting and occluding small information patches. Unlike other DA methods, VP uses small convex polygon-shaped patches in a random layout to transport information around within an image. Sudden transitions created between patches and the original image can, optionally, be smoothed. In our experiments, VP outperformed current DA methods regarding model variance and overfitting tendencies. We demonstrate data augmentation utilizing non-linear re-combination of information within images, and non-orthogonal shapes and structures improves CNN model robustness on unseen data.

translated by 谷歌翻译

Attention-based Multiple Instance Learning for Survival Prediction on Lung Cancer Tissue Microarrays

Jonas Ammeling , Lars-Henning Schmidt , Jonathan Ganz , Tanja Niedermair , Christoph Brochhausen-Delius , Christian Schulz , Katharina Breininger , Marc Aubreville

分类：计算机视觉

2022-12-15

Attention-based multiple instance learning (AMIL) algorithms have proven to be successful in utilizing gigapixel whole-slide images (WSIs) for a variety of different computational pathology tasks such as outcome prediction and cancer subtyping problems. We extended an AMIL approach to the task of survival prediction by utilizing the classical Cox partial likelihood as a loss function, converting the AMIL model into a nonlinear proportional hazards model. We applied the model to tissue microarray (TMA) slides of 330 lung cancer patients. The results show that AMIL approaches can handle very small amounts of tissue from a TMA and reach similar C-index performance compared to established survival prediction methods trained with highly discriminative clinical factors such as age, cancer grade, and cancer stage

translated by 谷歌翻译