Smart Paper Notes

Cell Attention Networks

Lorenzo Giusti , Claudio Battiloro , Lucia Testa , Paolo Di Lorenzo , Stefania Sardellitti , Sergio Barbarossa

Categories: Machine Learning | Artificial Intelligence

2022-09-16

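Since their introduction, graph attention networks have achieved outstanding results in graph representation tasks. However, these networks consider only pairwise relationships between nodes, so they cannot fully exploit the higher-order interactions present in many real-world datasets. In this paper, we introduce Cell Attention Networks (CANs), a neural architecture that operates on data defined over a graph, representing the graph as the 1-skeleton of a cell complex introduced to capture higher-order interactions. In particular, we exploit the lower and upper neighborhoods in the cell complex to design two independent masked self-attention mechanisms, thereby generalizing the conventional graph attention strategy. The approach used in CANs is hierarchical and combines the following steps: i) a lifting algorithm that learns edge features from node features; ii) a cell attention mechanism that finds the optimal combination of edge features over both lower and upper neighbors; iii) a hierarchical edge pooling mechanism that extracts a compact, meaningful set of features. The experimental results show that CAN is a low-complexity strategy that compares favorably with state-of-the-art results on graph-based learning tasks.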
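
The lower and upper neighborhoods mentioned above can be illustrated with a small sketch. It assumes the common convention that two edges are lower neighbors when they share a node and upper neighbors when they lie on the boundary of the same 2-cell; the function name `edge_neighborhoods` and the toy complex are hypothetical and are not taken from the paper. The resulting sets are the kind of masks that two separate attention mechanisms could be restricted to.

```python
from itertools import combinations

def edge_neighborhoods(edges, cells):
    """Return (lower, upper) neighbor sets for each edge index.

    edges: list of (u, v) node pairs.
    cells: list of 2-cells, each given as a list of edge indices on its boundary.
    """
    lower = {i: set() for i in range(len(edges))}
    upper = {i: set() for i in range(len(edges))}
    # Lower neighbors: two edges that share an endpoint (a common node).
    for i, j in combinations(range(len(edges)), 2):
        if set(edges[i]) & set(edges[j]):
            lower[i].add(j)
            lower[j].add(i)
    # Upper neighbors: two edges on the boundary of the same 2-cell.
    for boundary in cells:
        for i, j in combinations(boundary, 2):
            upper[i].add(j)
            upper[j].add(i)
    return lower, upper

# Toy complex: a triangle 0-1-2 filled by one 2-cell, plus a dangling edge 2-3.
edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
cells = [[0, 1, 2]]
lower, upper = edge_neighborhoods(edges, cells)
print(sorted(lower[3]))  # [1, 2]: the edges that touch node 2
print(sorted(upper[0]))  # [1, 2]: the other edges of the filled triangle
print(sorted(upper[3]))  # []: the dangling edge bounds no 2-cell
```

In this toy case the dangling edge has lower neighbors but no upper neighbors, which is why the two neighborhoods carry different information.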

Translated by Google Translate

Multiscale Causal Structure Learning

Gabriele D'Acunto , Paolo Di Lorenzo , Sergio Barbarossa

Categories: Machine Learning | Machine Learning (Statistics)

2022-07-16

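Inferring causal structures from observed data plays a key role in unveiling the underlying dynamics of a system. This paper presents a novel method, named Multiscale-Causal Structure Learning (MS-CASTLE), to estimate the structure of linear causal relationships occurring at different time scales. Unlike existing approaches, MS-CASTLE explicitly takes into account instantaneous and lagged inter-relations between multiple time series, represented at different scales, building on the stationary wavelet transform and non-convex optimization. MS-CASTLE incorporates, as a special case, a single-scale version named SS-CASTLE, which performs favorably in terms of computational efficiency, performance, and robustness on synthetic data. We use MS-CASTLE to study the multiscale causal structure of the risk of 15 global equity markets during the COVID-19 pandemic, illustrating how MS-CASTLE can extract meaningful information through its multiscale analysis, outperforming SS-CASTLE. We find that the most persistent and strongest interactions occur at mid-term time resolutions. Moreover, we identify the stock markets that drove risk during the considered period: Brazil, Canada, and Italy. The proposed approach can be exploited by financial investors who, depending on their investment horizon, can manage the risk within equity portfolios from a causal perspective.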
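
To make the "represented at different scales" step concrete, here is a minimal sketch of an undecimated (stationary) Haar wavelet transform. It is an illustration only: the Haar filter, the 1/2 normalization, the circular boundary handling, and the name `haar_swt` are assumptions chosen for readability, not details taken from the paper.

```python
def haar_swt(signal, levels):
    """Undecimated Haar transform with circular boundary handling.

    Returns (approx, details), where details[j] is the detail band at
    scale j + 1 and every band has the same length as the input.
    """
    approx = list(signal)
    n = len(approx)
    details = []
    for level in range(levels):
        shift = 2 ** level  # the filter is dilated; the signal is never decimated
        smooth = [(approx[t] + approx[(t - shift) % n]) / 2 for t in range(n)]
        detail = [(approx[t] - approx[(t - shift) % n]) / 2 for t in range(n)]
        details.append(detail)
        approx = smooth
    return approx, details

series = [1.0, 3.0, 2.0, 6.0, 4.0, 8.0, 5.0, 7.0]
approx, details = haar_swt(series, levels=2)

# With this normalization the bands add back up to the original series.
rebuilt = [approx[t] + sum(d[t] for d in details) for t in range(len(series))]
print(rebuilt == series)
```

Because no band is downsampled, each scale keeps one value per time step, so a per-scale analysis can be aligned with the original timestamps.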


Surveillance Face Anti-spoofing

Hao Fang , Ajian Liu , Jun Wan , Sergio Escalera , Chenxu Zhao , Xu Zhang , Stan Z. Li , Zhen Lei

Categories: Computer Vision

2023-01-03

Face Anti-spoofing (FAS) is essential to secure face recognition systems against various physical attacks. However, recent research generally focuses on short-distance applications (e.g., phone unlocking) and gives little consideration to long-distance scenes (e.g., surveillance security checks). To promote relevant research and fill this gap in the community, we collect a large-scale Surveillance High-Fidelity Mask (SuHiFiMask) dataset captured under 40 surveillance scenes, which has 101 subjects from different age groups with 232 3D attacks (high-fidelity masks), 200 2D attacks (posters, portraits, and screens), and 2 adversarial attacks. In this setting, low image resolution and noise interference are new challenges for surveillance FAS. Together with the SuHiFiMask dataset, we propose a Contrastive Quality-Invariance Learning (CQIL) network to alleviate the performance degradation caused by image quality from three aspects: (1) An Image Quality Variable module (IQV) is introduced to recover image information associated with discrimination by combining the super-resolution network. (2) Generated sample pairs are used to simulate quality variance distributions, helping contrastive learning strategies obtain robust feature representations under quality variation. (3) A Separate Quality Network (SQN) is designed to learn discriminative features independent of image quality. Finally, a large number of experiments verify the quality of the SuHiFiMask dataset and the superiority of the proposed CQIL.


Measuring and Estimating Key Quality Indicators in Cloud Gaming services

Carlos Baena , O. S. Peñaherrera-Pulla , Raquel Barco , Sergio Fortes

Categories: Machine Learning

2022-12-28

User equipment is one of the main bottlenecks facing the gaming industry nowadays. The extremely realistic games that are currently available place high computational requirements on user devices. As a consequence, the game industry has proposed the concept of Cloud Gaming, a paradigm that improves the gaming experience on devices with limited hardware. To this end, games are hosted on remote servers, relegating users' devices to the role of a peripheral for interacting with the game. However, this paradigm overloads the communication links connecting the users with the cloud. Therefore, service experience becomes highly dependent on network connectivity. To overcome this, Cloud Gaming will be boosted by the promised performance of 5G and future 6G networks, together with the flexibility provided by mobility in multi-RAT scenarios, such as WiFi. In this scope, the present work proposes a framework for measuring and estimating the main E2E metrics of the Cloud Gaming service, namely its key quality indicators (KQIs). In addition, different machine learning techniques are assessed for predicting KQIs related to the Cloud Gaming user's experience. To this end, the main KQIs of the service, such as input lag, freeze percent, or perceived video frame rate, are collected in a real environment. Based on these, results show that machine learning techniques provide a good estimation of these indicators solely from network-based metrics. This is considered a valuable asset to guide the delivery of Cloud Gaming services through cellular communications networks even without access to the user's device, as is expected for telecom operators.


Reinforcement Learning in System Identification

Jose Antonio Martin H. , Oscar Fernandez Vicente , Sergio Perez , Anas Belfadil , Cristina Ibanez-Llano , Freddy Jose Perozo Rondon , Jose Javier Valle , Javier Arechalde Pelaz

Categories: Machine Learning | Artificial Intelligence

2022-12-14

System identification, also known as learning forward models, transfer functions, system dynamics, etc., has a long tradition in both science and engineering across different fields. In particular, it is a recurring theme in Reinforcement Learning research, where forward models approximate the state transition function of a Markov Decision Process by learning a mapping from the current state and action to the next state. This problem is commonly framed directly as a Supervised Learning problem. This common approach faces several difficulties due to the inherent complexities of the dynamics to be learned, for example, delayed effects, high non-linearity, non-stationarity, partial observability and, more importantly, error accumulation over large time horizons when using bootstrapped predictions (predictions based on past predictions). Here we explore the use of Reinforcement Learning for this problem. We elaborate on why and how this problem fits naturally and soundly as a Reinforcement Learning problem, and present some experimental results that demonstrate RL is a promising technique for solving this kind of problem.
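
The error accumulation described above can be seen in a toy example. The sketch below assumes a hypothetical scalar linear system x[t+1] = a * x[t] and a forward model whose coefficient is slightly off; the function name `rollout_errors` and all parameter values are illustrative and do not come from the paper.

```python
def rollout_errors(a_true, a_model, x0, horizon):
    """Compare one-step and bootstrapped prediction errors for x[t+1] = a * x[t]."""
    x_true, x_boot = x0, x0
    one_step, bootstrapped = [], []
    for _ in range(horizon):
        # One-step prediction starts from the true current state.
        one_step.append(abs(a_model * x_true - a_true * x_true))
        # Bootstrapped prediction starts from the model's own previous output.
        x_boot = a_model * x_boot
        x_true = a_true * x_true
        bootstrapped.append(abs(x_boot - x_true))
    return one_step, bootstrapped

one_step, bootstrapped = rollout_errors(a_true=1.0, a_model=1.02, x0=1.0, horizon=50)
print(f"one-step error after 50 steps:     {one_step[-1]:.4f}")
print(f"bootstrapped error after 50 steps: {bootstrapped[-1]:.4f}")
```

The one-step error stays at the size of the model's coefficient error, while the bootstrapped error compounds with every step, which is the difficulty a purely supervised one-step objective does not penalize.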


Sharing Linkable Learning Objects with the use of Metadata and a Taxonomy Assistant for Categorization

Valentina Franzoni , Sergio Tasso , Simonetta Pallottelli , Damiano Perri

Categories: Artificial Intelligence

2022-12-09

In this work, a re-design of the Moodledata module functionalities is presented to share learning objects between e-learning content platforms, e.g., Moodle and G-Lorep, in a linkable object format. The e-learning course content of the Drupal-based Content Management System G-Lorep for academic learning is exchanged by designing an object that incorporates metadata to support reuse and classification in its context. In such an Artificial Intelligence environment, the exchange of Linkable Learning Objects can be used for dialogue between Learning Systems to obtain information, especially with the use of semantic or structural similarity measures to enhance the existing Taxonomy Assistant for advanced automated classification.


Towards a learning-based performance modeling for accelerating Deep Neural Networks

Damiano Perri , Paolo Sylos Labini , Osvaldo Gervasi , Sergio Tasso , Flavio Vella

Categories: Machine Learning

2022-12-09

Emerging applications such as Deep Learning are often data-driven, so traditional approaches based on auto-tuners do not perform effectively across the wide range of inputs used in practice. In the present paper, we begin an investigation of predictive models based on machine learning techniques in order to optimize Convolutional Neural Networks (CNNs). As a use case, we focus on the ARM Compute Library, which provides three different implementations of the convolution operator at different numeric precisions. Starting from a collation of benchmarks, we build and validate models learned by Decision Tree and naive Bayes classifiers. Preliminary experiments on a Midgard-based ARM Mali GPU show that our predictive model outperforms all the convolution operators manually selected by the library.


Transformer-based normative modelling for anomaly detection of early schizophrenia

Pedro F Da Costa , Jessica Dafflon , Sergio Leonardo Mendes , João Ricardo Sato , M. Jorge Cardoso , Robert Leech , Emily JH Jones , Walter H. L. Pinaya

Categories: Machine Learning | Artificial Intelligence

2022-12-08

Despite the impact of psychiatric disorders on clinical health, early-stage diagnosis remains a challenge. Machine learning studies have shown that classifiers tend to be overly narrow in the diagnosis prediction task. The overlap between conditions leads to high heterogeneity among participants that is not adequately captured by classification models. To address this issue, normative approaches have surged as an alternative method. By using a generative model to learn the distribution of healthy brain data patterns, we can identify the presence of pathologies as deviations or outliers from the distribution learned by the model. In particular, deep generative models showed great results as normative models to identify neurological lesions in the brain. However, unlike most neurological lesions, psychiatric disorders present subtle changes widespread in several brain regions, making these alterations challenging to identify. In this work, we evaluate the performance of transformer-based normative models to detect subtle brain changes expressed in adolescents and young adults. We trained our model on 3D MRI scans of neurotypical individuals (N=1,765). Then, we obtained the likelihood of neurotypical controls and psychiatric patients with early-stage schizophrenia from an independent dataset (N=93) from the Human Connectome Project. Using the predicted likelihood of the scans as a proxy for a normative score, we obtained an AUROC of 0.82 when assessing the difference between controls and individuals with early-stage schizophrenia. Our approach surpassed recent normative methods based on brain age and Gaussian Process, showing the promising use of deep generative models to help in individualised analyses.
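
As an illustration of how a likelihood can act as a normative score, the sketch below computes an AUROC from two groups of scores using the pairwise (Mann-Whitney) definition. The log-likelihood values are made-up placeholders, not data from the study, and the function name `auroc` is hypothetical.

```python
def auroc(scores_negative, scores_positive):
    """Probability that a random positive scores higher than a random negative.

    Ties count as one half, which matches the Mann-Whitney U formulation.
    """
    wins = 0.0
    for p in scores_positive:
        for n in scores_negative:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_positive) * len(scores_negative))

# Hypothetical log-likelihoods: a lower likelihood means further from the norm.
controls = [-10.2, -9.8, -11.0, -10.5]
patients = [-12.1, -10.9, -13.4]

# Negate so that a higher score means "more anomalous".
value = auroc([-s for s in controls], [-s for s in patients])
print(round(value, 3))  # 0.917
```

A value of 0.5 would mean the score does not separate the groups at all, and 1.0 would mean every patient scan is less likely under the model than every control scan.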


When & How to Transfer with Transfer Learning

Adrian Tormos , Dario Garcia-Gasulla , Victor Gimenez-Abalos , Sergio Alvarez-Napagao

Categories: Computer Vision | Artificial Intelligence

2022-11-08

In deep learning, transfer learning (TL) has become the de facto approach when dealing with image-related tasks. Visual features learnt for one task have been shown to be reusable for other tasks, improving performance significantly. By reusing deep representations, TL enables the use of deep models in domains with limited data availability, limited computational resources and/or limited access to human experts, domains which include the vast majority of real-life applications. This paper conducts an experimental evaluation of TL, exploring its trade-offs with respect to performance, environmental footprint, human hours and computational requirements. Results highlight the cases where a cheap feature extraction approach is preferable, and the situations where an expensive fine-tuning effort may be worth the added cost. Finally, a set of guidelines on the use of TL is proposed.


GoRela: Go Relative for Viewpoint-Invariant Motion Forecasting

Alexander Cui , Sergio Casas , Kelvin Wong , Simon Suo , Raquel Urtasun

Categories: Robotics | Artificial Intelligence | Computer Vision | Machine Learning

2022-11-04

The task of motion forecasting is critical for self-driving vehicles (SDVs) to be able to plan a safe maneuver. Towards this goal, modern approaches reason about the map, the agents' past trajectories and their interactions in order to produce accurate forecasts. The predominant approach has been to encode the map and other agents in the reference frame of each target agent. However, this approach is computationally expensive for multi-agent prediction as inference needs to be run for each agent. To tackle the scaling challenge, the solution thus far has been to encode all agents and the map in a shared coordinate frame (e.g., the SDV frame). However, this is sample inefficient and vulnerable to domain shift (e.g., when the SDV visits uncommon states). In contrast, in this paper, we propose an efficient shared encoding for all agents and the map without sacrificing accuracy or generalization. Towards this goal, we leverage pair-wise relative positional encodings to represent geometric relationships between the agents and the map elements in a heterogeneous spatial graph. This parameterization allows us to be invariant to scene viewpoint, and save online computation by re-using map embeddings computed offline. Our decoder is also viewpoint agnostic, predicting agent goals on the lane graph to enable diverse and context-aware multimodal prediction. We demonstrate the effectiveness of our approach on the urban Argoverse 2 benchmark as well as a novel highway dataset.
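
The viewpoint invariance claimed for pair-wise relative encodings can be checked with a small sketch. It assumes poses are planar (x, y, heading) triples and uses a generic relative-pose computation; the abstract does not give the paper's actual parameterization, so the functions `relative_pose` and `change_viewpoint` and all numeric values are hypothetical.

```python
import math

def relative_pose(src, dst):
    """Pose of dst expressed in the local frame of src; poses are (x, y, heading)."""
    dx, dy = dst[0] - src[0], dst[1] - src[1]
    cos_h, sin_h = math.cos(src[2]), math.sin(src[2])
    # Rotate the displacement by -heading to express it in src's frame.
    local_x = cos_h * dx + sin_h * dy
    local_y = -sin_h * dx + cos_h * dy
    # Wrap the heading difference into [-pi, pi].
    dtheta = math.atan2(math.sin(dst[2] - src[2]), math.cos(dst[2] - src[2]))
    return local_x, local_y, dtheta

def change_viewpoint(pose, tx, ty, angle):
    """Re-express a pose after rotating the scene by angle and translating it."""
    x, y, h = pose
    return (math.cos(angle) * x - math.sin(angle) * y + tx,
            math.sin(angle) * x + math.cos(angle) * y + ty,
            h + angle)

agent = (2.0, 1.0, 0.3)
lane_node = (5.0, 4.0, 1.1)

before = relative_pose(agent, lane_node)
after = relative_pose(change_viewpoint(agent, 10.0, -3.0, 0.7),
                      change_viewpoint(lane_node, 10.0, -3.0, 0.7))
print(all(abs(a - b) < 1e-9 for a, b in zip(before, after)))
```

Because the pair-wise quantity does not change when the whole scene is rotated or shifted, features built from it do not depend on which global frame (for example, the SDV frame) the scene was recorded in.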
