智能论文笔记

Bayesian learning of feature spaces for multitasks problems

Carlos Sevilla-Salcedo , Ascensión Gallardo-Antolín , Vanessa Gómez-Verdejo , Emilio Parrado-Hernández

分类： (统计)机器学习 | 机器学习

2022-09-07

本文提出了一个贝叶斯框架，用于构建非线性，简约的浅层模型，用于多任务回归。提出的框架依赖于这样一个事实，即随机傅立叶特征（RFF）可以通过极端学习机器将RBF内核近似，其隐藏层由RFF形成。主要思想是将同一模型的两个双重视图结合在单个贝叶斯公式下，将稀疏的贝叶斯极限学习机器扩展到多任务问题。从内核方法的角度来看，提出的公式有助于通过RBF内核参数引入先前的域知识。从极端的学习机的角度来看，新的配方有助于控制过度拟合并实现简约的总体模型（服务每个任务的模型共享联合贝叶斯优化中选择的相同的RFF集合）。实验结果表明，在同一框架内将内核方法和极端学习机器的优势相结合可能会导致这两个范式中的每一个范式独立地取得的性能显着改善。

translated by 谷歌翻译

Multi-view hierarchical Variational AutoEncoders with Factor Analysis latent space

Alejandro Guerrero-López , Carlos Sevilla-Salcedo , Vanessa Gómez-Verdejo , Pablo M. Olmos

分类：机器学习 | 人工智能

2022-07-19

现实世界数据库很复杂，它们通常会呈现冗余，并在同一数据的异质和多个表示之间共享相关性。因此，在视图之间利用和解开共享信息至关重要。为此，最近的研究经常将所有观点融合到共享的非线性复杂潜在空间中，但它们失去了解释性。为了克服这一局限性，我们在这里提出了一种新的方法，将多个变异自动编码器（VAE）结构与因子分析潜在空间（FA-VAE）相结合。具体而言，我们使用VAE在连续的潜在空间中学习每个异质观点的私人表示。然后，我们通过使用线性投影矩阵将每个私有变量投影到低维的潜在空间来对共享潜在空间进行建模。因此，我们在私人信息和共享信息之间创建了可解释的层次依赖性。这样，新型模型可以同时：（i）从多种异质观点中学习，（ii）获得可解释的层次共享空间，以及（iii）在生成模型之间执行传输学习。

translated by 谷歌翻译

Multi-task longitudinal forecasting with missing values on Alzheimer's Disease

Carlos Sevilla-Salcedo , Vandad Imani , Pablo M. Olmos , Vanessa Gómez-Verdejo , Jussi Tohka

分类： (统计)机器学习 | 机器学习

2022-01-13

机器学习技术通常应用于痴呆症预测缺乏其能力，共同学习多个任务，处理时间相关的异构数据和缺失值。在本文中，我们建议使用最近呈现的SShiba模型提出了一个框架，用于在缺失值的纵向数据上联合学习不同的任务。该方法使用贝叶斯变分推理来赋予缺失值并组合多个视图的信息。这样，我们可以将不同的数据视图与共同的潜在空间中的不同时间点相结合，并在同时建模和预测若干输出变量的同时学习每个时间点之间的关系。我们应用此模型以预测痴呆症中的诊断，心室体积和临床评分。结果表明，SSHIBA能够学习缺失值的良好归因，同时预测三个不同任务的同时表现出基线。

translated by 谷歌翻译

An Empirical Investigation into the Use of Image Captioning for Automated Software Documentation

Kevin Moran , Ali Yachnes , George Purnell , Junayed Mahmud , Michele Tufano , Carlos Bernal-Cárdenas , Denys Poshyvanyk , Zach H'Doubler

分类：人工智能 | 计算机视觉 | 机器学习

2023-01-03

Existing automated techniques for software documentation typically attempt to reason between two main sources of information: code and natural language. However, this reasoning process is often complicated by the lexical gap between more abstract natural language and more structured programming languages. One potential bridge for this gap is the Graphical User Interface (GUI), as GUIs inherently encode salient information about underlying program functionality into rich, pixel-based data representations. This paper offers one of the first comprehensive empirical investigations into the connection between GUIs and functional, natural language descriptions of software. First, we collect, analyze, and open source a large dataset of functional GUI descriptions consisting of 45,998 descriptions for 10,204 screenshots from popular Android applications. The descriptions were obtained from human labelers and underwent several quality control mechanisms. To gain insight into the representational potential of GUIs, we investigate the ability of four Neural Image Captioning models to predict natural language descriptions of varying granularity when provided a screenshot as input. We evaluate these models quantitatively, using common machine translation metrics, and qualitatively through a large-scale user study. Finally, we offer learned lessons and a discussion of the potential shown by multimodal models to enhance future techniques for automated software documentation.

translated by 谷歌翻译

Measuring and Estimating Key Quality Indicators in Cloud Gaming services

Carlos Baena , O. S. Peñaherrera-Pulla , Raquel Barco , Sergio Fortes

分类：机器学习

2022-12-28

User equipment is one of the main bottlenecks facing the gaming industry nowadays. The extremely realistic games which are currently available trigger high computational requirements of the user devices to run games. As a consequence, the game industry has proposed the concept of Cloud Gaming, a paradigm that improves gaming experience in reduced hardware devices. To this end, games are hosted on remote servers, relegating users' devices to play only the role of a peripheral for interacting with the game. However, this paradigm overloads the communication links connecting the users with the cloud. Therefore, service experience becomes highly dependent on network connectivity. To overcome this, Cloud Gaming will be boosted by the promised performance of 5G and future 6G networks, together with the flexibility provided by mobility in multi-RAT scenarios, such as WiFi. In this scope, the present work proposes a framework for measuring and estimating the main E2E metrics of the Cloud Gaming service, namely KQIs. In addition, different machine learning techniques are assessed for predicting KQIs related to Cloud Gaming user's experience. To this end, the main key quality indicators (KQIs) of the service such as input lag, freeze percent or perceived video frame rate are collected in a real environment. Based on these, results show that machine learning techniques provide a good estimation of these indicators solely from network-based metrics. This is considered a valuable asset to guide the delivery of Cloud Gaming services through cellular communications networks even without access to the user's device, as it is expected for telecom operators.

translated by 谷歌翻译

On the Level Sets and Invariance of Neural Tuning Landscapes

Binxu Wang , Carlos R. Ponce

分类：人工智能 | 计算机视觉 | 神经与进化计算

2022-12-26

Visual representations can be defined as the activations of neuronal populations in response to images. The activation of a neuron as a function over all image space has been described as a "tuning landscape". As a function over a high-dimensional space, what is the structure of this landscape? In this study, we characterize tuning landscapes through the lens of level sets and Morse theory. A recent study measured the in vivo two-dimensional tuning maps of neurons in different brain regions. Here, we developed a statistically reliable signature for these maps based on the change of topology in level sets. We found this topological signature changed progressively throughout the cortical hierarchy, with similar trends found for units in convolutional neural networks (CNNs). Further, we analyzed the geometry of level sets on the tuning landscapes of CNN units. We advanced the hypothesis that higher-order units can be locally regarded as isotropic radial basis functions, but not globally. This shows the power of level sets as a conceptual tool to understand neuronal activations over image space.

translated by 谷歌翻译

Principled and Efficient Transfer Learning of Deep Models via Neural Collapse

Xiao Li , Sheng Liu , Jinxin Zhou , Xinyu Lu , Carlos Fernandez-Granda , Zhihui Zhu , Qing Qu

分类：机器学习 | 人工智能 | 计算机视觉 | (统计)机器学习

2022-12-23

With the ever-growing model size and the limited availability of labeled training data, transfer learning has become an increasingly popular approach in many science and engineering domains. For classification problems, this work delves into the mystery of transfer learning through an intriguing phenomenon termed neural collapse (NC), where the last-layer features and classifiers of learned deep networks satisfy: (i) the within-class variability of the features collapses to zero, and (ii) the between-class feature means are maximally and equally separated. Through the lens of NC, our findings for transfer learning are the following: (i) when pre-training models, preventing intra-class variability collapse (to a certain extent) better preserves the intrinsic structures of the input data, so that it leads to better model transferability; (ii) when fine-tuning models on downstream tasks, obtaining features with more NC on downstream data results in better test accuracy on the given task. The above results not only demystify many widely used heuristics in model pre-training (e.g., data augmentation, projection head, self-supervised learning), but also leads to more efficient and principled fine-tuning method on downstream tasks that we demonstrate through extensive experimental results.

translated by 谷歌翻译

Ensemble learning techniques for intrusion detection system in the context of cybersecurity

Andricson Abeline Moreira , Carlos A. C. Tojeiro , Carlos J. Reis , Gustavo Henrique Massaro , Igor Andrade Brito e Kelton A. P. da Costa

分类：机器学习

2022-12-21

Recently, there has been an interest in improving the resources available in Intrusion Detection System (IDS) techniques. In this sense, several studies related to cybersecurity show that the environment invasions and information kidnapping are increasingly recurrent and complex. The criticality of the business involving operations in an environment using computing resources does not allow the vulnerability of the information. Cybersecurity has taken on a dimension within the universe of indispensable technology in corporations, and the prevention of risks of invasions into the environment is dealt with daily by Security teams. Thus, the main objective of the study was to investigate the Ensemble Learning technique using the Stacking method, supported by the Support Vector Machine (SVM) and k-Nearest Neighbour (kNN) algorithms aiming at an optimization of the results for DDoS attack detection. For this, the Intrusion Detection System concept was used with the application of the Data Mining and Machine Learning Orange tool to obtain better results

translated by 谷歌翻译

AI applications in forest monitoring need remote sensing benchmark datasets

Emily R. Lines , Matt Allen , Carlos Cabo , Kim Calders , Amandine Debus , Stuart W. D. Grieve , Milto Miltiadou , Adam Noach , Harry J. F. Owen , Stefano Puliti

分类：人工智能

2022-12-20

With the rise in high resolution remote sensing technologies there has been an explosion in the amount of data available for forest monitoring, and an accompanying growth in artificial intelligence applications to automatically derive forest properties of interest from these datasets. Many studies use their own data at small spatio-temporal scales, and demonstrate an application of an existing or adapted data science method for a particular task. This approach often involves intensive and time-consuming data collection and processing, but generates results restricted to specific ecosystems and sensor types. There is a lack of widespread acknowledgement of how the types and structures of data used affects performance and accuracy of analysis algorithms. To accelerate progress in the field more efficiently, benchmarking datasets upon which methods can be tested and compared are sorely needed. Here, we discuss how lack of standardisation impacts confidence in estimation of key forest properties, and how considerations of data collection need to be accounted for in assessing method performance. We present pragmatic requirements and considerations for the creation of rigorous, useful benchmarking datasets for forest monitoring applications, and discuss how tools from modern data science can improve use of existing data. We list a set of example large-scale datasets that could contribute to benchmarking, and present a vision for how community-driven, representative benchmarking initiatives could benefit the field.

translated by 谷歌翻译

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu , Chia-Chih Chen , Zeyuan Chen , Rui Meng , Gang Wu , Paul Josel , Juan Carlos Niebles , Caiming Xiong , Ran Xu

分类：计算机视觉

2022-12-19

Graphic layout designs play an essential role in visual communication. Yet handcrafting layout designs are skill-demanding, time-consuming, and non-scalable to batch production. Although generative models emerge to make design automation no longer utopian, it remains non-trivial to customize designs that comply with designers' multimodal desires, i.e., constrained by background images and driven by foreground contents. In this study, we propose \textit{LayoutDETR} that inherits the high quality and realism from generative modeling, in the meanwhile reformulating content-aware requirements as a detection problem: we learn to detect in a background image the reasonable locations, scales, and spatial relations for multimodal elements in a layout. Experiments validate that our solution yields new state-of-the-art performance for layout generation on public benchmarks and on our newly-curated ads banner dataset. For practical usage, we build our solution into a graphical system that facilitates user studies. We demonstrate that our designs attract more subjective preference than baselines by significant margins. Our code, models, dataset, graphical system, and demos are available at https://github.com/salesforce/LayoutDETR.

translated by 谷歌翻译