Intelligent Paper Notes

Autonomously Untangling Long Cables

Vainavi Viswanath , Kaushik Shivakumar , Justin Kerr , Brijen Thananjeyan , Ellen Novoseller , Jeffrey Ichnowski , Alejandro Escontrela , Michael Laskey , Joseph E. Gonzalez , Ken Goldberg

Categories: Robotics | Artificial Intelligence

2022-07-16

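Cables are ubiquitous in many settings, but they are prone to self-occlusions and knots, which makes them difficult to perceive and manipulate. The challenge often increases with cable length: long cables require more complex slack management and strategies to facilitate observability and reachability. In this paper, we focus on autonomously untangling cables up to 3 meters in length using a bilateral robot. We develop new motion primitives to efficiently untangle long cables and novel gripper jaws specialized for this task. We present Sliding and Grasping for Tangle Manipulation (SGTM), an algorithm that composes these primitives with RGBD vision to iteratively untangle cables. SGTM untangles cables with success rates of 67% on isolated overhand and figure-eight knots and 50% on more complex configurations. Supplementary material, visualizations, and videos can be found at https://sites.google.com/view/rss-2022-untangling/home.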

Translated by Google Translate

DayDreamer: World Models for Physical Robot Learning

Philipp Wu , Alejandro Escontrela , Danijar Hafner , Ken Goldberg , Pieter Abbeel

Categories: Robotics | Artificial Intelligence | Machine Learning

2022-06-28

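To solve tasks in complex environments, robots need to learn from experience. Deep reinforcement learning is a common approach to robot learning, but it requires a large amount of trial and error, which limits its deployment in the physical world. As a consequence, many advances in robot learning rely on simulators. On the other hand, learning inside simulators fails to capture the complexity of the real world, is prone to simulator inaccuracies, and produces behaviors that do not adapt to changes in the world. The Dreamer algorithm has recently shown great promise for learning from small amounts of interaction by planning within a learned world model, outperforming pure reinforcement learning in video games. Learning a world model to predict the outcomes of potential actions enables planning in imagination, reducing the amount of trial and error needed in the real environment. However, it is unknown whether Dreamer can facilitate faster learning on physical robots. In this paper, we apply Dreamer to 4 robots to learn online and directly in the real world, without simulators. Dreamer trains a quadruped robot to roll off its back, stand up, and walk from scratch and without resets in only 1 hour. We then push the robot and find that Dreamer adapts within 10 minutes to withstand perturbations or to quickly roll over and stand back up. On two different robotic arms, Dreamer learns to pick and place multiple objects directly from camera images and sparse rewards, approaching human performance. On a wheeled robot, Dreamer learns to navigate to a goal position purely from camera images, automatically resolving ambiguity about the robot's orientation. Using the same hyperparameters across all experiments, we find that Dreamer is capable of online learning in the real world, establishing a strong baseline. We release our infrastructure for future applications of world models to robot learning.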

Design and analysis of tweet-based election models for the 2021 Mexican legislative election

Alejandro Vigna-Gómez , Javier Murillo , Manelik Ramirez , Alberto Borbolla , Ian Márquez , Prasun K. Ray

Categories: Natural Language Processing

2023-01-02

Modelling and forecasting real-life human behaviour using online social media is an active endeavour of interest in politics, government, academia, and industry. Since its creation in 2006, Twitter has been proposed as a potential laboratory that could be used to gauge and predict social behaviour. During the last decade, the user base of Twitter has been growing and becoming more representative of the general population. Here we analyse this user base in the context of the 2021 Mexican Legislative Election. To do so, we use a dataset of 15 million election-related tweets in the six months preceding election day. We explore different election models that assign political preference to either the ruling parties or the opposition. We find that models using data with geographical attributes determine the results of the election with better precision and accuracy than conventional polling methods. These results demonstrate that analysis of public online data can outperform conventional polling methods, and that political analysis and general forecasting would likely benefit from incorporating such data in the immediate future. Moreover, the same Twitter dataset with geographical attributes is positively correlated with results from official census data on population and internet usage in Mexico. These findings suggest that we have reached a period in time when online activity, appropriately curated, can provide an accurate representation of offline behaviour.

Unsupervised 4D LiDAR Moving Object Segmentation in Stationary Settings with Multivariate Occupancy Time Series

Thomas Kreutz , Max Mühlhäuser , Alejandro Sanchez Guinea

Categories: Computer Vision

2022-12-30

In this work, we address the problem of unsupervised moving object segmentation (MOS) in 4D LiDAR data recorded from a stationary sensor, where no ground truth annotations are involved. Deep learning-based state-of-the-art methods for LiDAR MOS strongly depend on annotated ground truth data, which is expensive to obtain and scarce in existence. To close this gap in the stationary setting, we propose a novel 4D LiDAR representation based on multivariate time series that relaxes the problem of unsupervised MOS to a time series clustering problem. More specifically, we propose modeling the change in occupancy of a voxel by a multivariate occupancy time series (MOTS), which captures spatio-temporal occupancy changes on the voxel level and its surrounding neighborhood. To perform unsupervised MOS, we train a neural network in a self-supervised manner to encode MOTS into voxel-level feature representations, which can be partitioned by a clustering algorithm into moving or stationary. Experiments on stationary scenes from the Raw KITTI dataset show that our fully unsupervised approach achieves performance that is comparable to that of supervised state-of-the-art approaches.

Countering Malicious Content Moderation Evasion in Online Social Networks: Simulation and Detection of Word Camouflage

Álvaro Huertas-García , Alejandro Martín , Javier Huertas Tato , David Camacho

Categories: Natural Language Processing | Artificial Intelligence

2022-12-27

Content moderation is the process of screening and monitoring user-generated content online. It plays a crucial role in stopping content resulting from unacceptable behaviors such as hate speech, harassment, violence against specific groups, terrorism, racism, xenophobia, homophobia, or misogyny, to name a few, on Online Social Platforms. These platforms use a plethora of tools to detect and manage malicious information; however, malicious actors also improve their skills, developing strategies to circumvent these barriers and continue spreading misleading information. Twisting and camouflaging keywords are among the most used techniques to evade platform content moderation systems. In response to this ongoing issue, this paper presents an approach to address this linguistic trend in social networks through the simulation of different content evasion techniques and a multilingual Transformer model for content evasion detection. We share with the scientific community a multilingual public tool, named "pyleetspeak", that generates and simulates, in a customizable way, the phenomenon of content evasion through automatic word camouflage, together with a multilingual Named-Entity Recognition (NER) Transformer-based model tuned for its recognition and detection. The multilingual NER model is evaluated in different textual scenarios, detecting different types and mixtures of camouflage techniques and achieving an overall weighted F1 score of 0.8795. This article contributes to countering malicious information by developing multilingual tools to simulate and detect new methods of content moderation evasion on social networks, making the fight against information disorders more effective.
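
To make the idea of word camouflage concrete, here is a minimal sketch of character-substitution ("leetspeak") camouflage. It does not use or reproduce the pyleetspeak API; the substitution table `LEET_MAP` and the function `camouflage` are hypothetical names chosen for illustration only.

```python
import random

# Hypothetical look-alike substitution table (an assumption for this sketch,
# not taken from pyleetspeak).
LEET_MAP = {"a": "4", "e": "3", "i": "1", "o": "0", "s": "5", "t": "7"}


def camouflage(word, prob=1.0, rng=None):
    """Replace each mappable character with its look-alike with probability `prob`."""
    rng = rng or random.Random(0)  # fixed seed keeps the sketch reproducible
    out = []
    for ch in word:
        sub = LEET_MAP.get(ch.lower())
        # rng.random() is in [0, 1), so prob=1.0 always substitutes
        # and prob=0.0 never does.
        if sub is not None and rng.random() < prob:
            out.append(sub)
        else:
            out.append(ch)
    return "".join(out)


print(camouflage("vaccine"))            # every mappable character replaced
print(camouflage("vaccine", prob=0.0))  # word left unchanged
```

Lowering `prob` yields partially camouflaged variants, which is one simple way to produce the mixtures of camouflaged and plain text that a detector would need to handle.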

Comparison and Evaluation of Methods for a Predict+Optimize Problem in Renewable Energy

Christoph Bergmeir , Frits de Nijs , Abishek Sriramulu , Mahdi Abolghasemi , Richard Bean , John Betts , Quang Bui , Nam Trong Dinh , Nils Einecke , Rasul Esmaeilbeigi

Categories: Artificial Intelligence

2022-12-21

Algorithms that involve both forecasting and optimization are at the core of solutions to many difficult real-world problems, such as in supply chains (inventory optimization), traffic, and the transition towards carbon-free energy generation in battery/load/production scheduling in sustainable energy systems. Typically, in these scenarios we want to solve an optimization problem that depends on unknown future values, which therefore need to be forecast. As both forecasting and optimization are difficult problems in their own right, relatively little research has been done in this area. This paper presents the findings of the "IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling," held in 2021. We present a comparison and evaluation of the seven highest-ranked solutions in the competition, to provide researchers with a benchmark problem and to establish the state of the art for this benchmark, with the aim of fostering and facilitating research in this area. The competition used data from the Monash Microgrid, as well as weather data and energy market data. It then focused on two main challenges: forecasting renewable energy production and demand, and obtaining an optimal schedule for the activities (lectures) and on-site batteries that leads to the lowest cost of energy. The most accurate forecasts were obtained by gradient-boosted tree and random forest models, and optimization was mostly performed using mixed integer linear and quadratic programming. The winning method predicted different scenarios and optimized over all scenarios jointly using a sample average approximation method.
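
As a rough illustration of sample average approximation (SAA), the sketch below picks the decision that minimizes the average cost over a set of forecast scenarios. It is a toy problem, not the winning team's formulation: the cost function, the prices, and the names `saa_decision` and `energy_cost` are all assumptions made for this example.

```python
def saa_decision(scenarios, candidates, cost):
    """Return the candidate decision with the lowest average cost over all scenarios."""
    def average_cost(x):
        return sum(cost(x, s) for s in scenarios) / len(scenarios)
    return min(candidates, key=average_cost)


def energy_cost(purchased, demand, price=1.0, shortfall_price=4.0):
    """Toy cost: pay `price` per unit bought ahead, `shortfall_price` per unit short."""
    return price * purchased + shortfall_price * max(demand - purchased, 0)


# Three hypothetical demand scenarios produced by some forecaster.
demand_scenarios = [8, 10, 12]
best = saa_decision(demand_scenarios, range(0, 21), energy_cost)
print(best)
```

Optimizing jointly over scenarios hedges against forecast error: with a shortfall price well above the purchase price, the chosen quantity leans towards the higher-demand scenarios rather than the mean forecast.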

BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics

Liang Ma , Shuyang Cao , Robert L. Logan IV , Di Lu , Shihao Ran , Ke Zhang , Joel Tetreault , Aoife Cahill , Alejandro Jaimes

Categories: Natural Language Processing

2022-12-20

The proliferation of automatic faithfulness metrics for summarization has produced a need for benchmarks to evaluate them. While existing benchmarks measure the correlation with human judgements of faithfulness on model-generated summaries, they are insufficient for diagnosing whether metrics are: 1) consistent, i.e., decrease as errors are introduced into a summary, 2) effective on human-written texts, and 3) sensitive to different error types (as summaries can contain multiple errors). To address these needs, we present a benchmark of unfaithful minimal pairs (BUMP), a dataset of 889 human-written, minimally different summary pairs, where a single error (from an ontology of 7 types) is introduced to a summary from the CNN/DailyMail dataset to produce an unfaithful summary. We find BUMP complements existing benchmarks in a number of ways: 1) the summaries in BUMP are harder to discriminate and less probable under SOTA summarization models, 2) BUMP enables measuring the consistency of metrics, and reveals that the most discriminative metrics tend not to be the most consistent, 3) BUMP enables the measurement of metrics' performance on individual error types and highlights areas of weakness for future work.

Two-sample test based on Self-Organizing Maps

Alejandro Álvarez-Ayllón , Manuel Palomo-Duarte , Juan-Manuel Dodero

Categories: Machine Learning | Neural and Evolutionary Computing

2022-12-17

Machine-learning classifiers can be leveraged as a two-sample statistical test. Suppose each sample is assigned a different label and that a classifier can obtain a better-than-chance result discriminating them. In this case, we can infer that the two samples originate from different populations. However, many types of models, such as neural networks, behave as a black box for the user: they can reject the hypothesis that both samples originate from the same population, but they do not offer insight into how the samples differ. Self-Organizing Maps are a dimensionality reduction algorithm initially devised as a data visualization tool that displays emergent properties, and they are also useful for classification tasks. Since they can be used as classifiers, they can also be used as a two-sample statistical test. And since their original purpose is visualization, they can also offer insights.
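
The classifier-as-test idea can be sketched in a few lines. The example below swaps the Self-Organizing Map for a much simpler nearest-centroid classifier on scalar data, purely for illustration; the function names are hypothetical. Held-out accuracy is compared against chance with a one-sided binomial tail, which assumes that under the null hypothesis each held-out prediction is correct with probability 0.5.

```python
import math
import random


def binomial_tail(k, n, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(math.comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def classifier_two_sample_test(sample_a, sample_b, seed=0):
    """Return (held-out accuracy, p-value) for a nearest-centroid classifier test."""
    rng = random.Random(seed)
    # Label the two samples 0 and 1, shuffle, and split into train and test halves.
    data = [(x, 0) for x in sample_a] + [(x, 1) for x in sample_b]
    rng.shuffle(data)
    half = len(data) // 2
    train, test = data[:half], data[half:]
    # "Training": one centroid (mean) per label.
    xs0 = [x for x, y in train if y == 0]
    xs1 = [x for x, y in train if y == 1]
    mean0, mean1 = sum(xs0) / len(xs0), sum(xs1) / len(xs1)
    # Predict label 1 when the point is closer to the label-1 centroid.
    correct = sum(
        1 for x, y in test if (abs(x - mean1) < abs(x - mean0)) == (y == 1)
    )
    return correct / len(test), binomial_tail(correct, len(test))


rng = random.Random(42)
a = [rng.gauss(0.0, 1.0) for _ in range(200)]
b = [rng.gauss(3.0, 1.0) for _ in range(200)]
accuracy, p_value = classifier_two_sample_test(a, b)
print(accuracy, p_value)  # high accuracy and a tiny p-value: reject "same population"
```

A small p-value says only that the samples differ; like the black-box models described above, this sketch gives no picture of how they differ, which is the gap a visual model such as a Self-Organizing Map is meant to fill.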

Graphon Pooling for Reducing Dimensionality of Signals and Convolutional Operators on Graphs

Alejandro Parada-Mayorga , Zhiyang Wang , Alejandro Ribeiro

Categories: Machine Learning

2022-12-15

In this paper we propose a pooling approach for convolutional information processing on graphs relying on the theory of graphons and limits of dense graph sequences. We present three methods that exploit the induced graphon representation of graphs and graph signals on partitions of $[0, 1]^2$ in the graphon space. As a result we derive low dimensional representations of the convolutional operators, while a dimensionality reduction of the signals is achieved by simple local interpolation of functions in $L^2([0, 1])$. We prove that those low dimensional representations constitute a convergent sequence of graphs and graph signals, respectively. The methods proposed and the theoretical guarantees that we provide show that the reduced graphs and signals inherit spectral-structural properties of the original quantities. We evaluate our approach with a set of numerical experiments performed on graph neural networks (GNNs) that rely on graphon pooling. We observe that graphon pooling performs significantly better than other approaches proposed in the literature when dimensionality reduction ratios between layers are large. We also observe that when graphon pooling is used we have, in general, less overfitting and lower computational cost.

Constraints on the design of neuromorphic circuits set by the properties of neural population codes

Stefano Panzeri , Ella Janotte , Alejandro Pequeño-Zurro , Jacopo Bonato , Chiara Bartolozzi

Categories: Neural and Evolutionary Computing

2022-12-08

In the brain, information is encoded, transmitted and used to inform behaviour at the level of the timing of action potentials distributed over populations of neurons. To implement neural-like systems in silico, to emulate neural function, and to interface successfully with the brain, neuromorphic circuits need to encode information in a way compatible with that used by populations of neurons in the brain. To facilitate the cross-talk between neuromorphic engineering and neuroscience, in this Review we first critically examine and summarize recent findings about how populations of neurons encode and transmit information. We examine the effects on encoding and readout of information for different features of neural population activity, namely the sparseness of neural representations, the heterogeneity of neural properties, the correlations among neurons, and the time scales (from short to long) at which neurons encode information and maintain it consistently over time. Finally, we critically elaborate on how these facts constrain the design of information coding in neuromorphic circuits. We focus primarily on the implications for designing neuromorphic circuits that communicate with the brain, as in this case it is essential that artificial and biological neurons use compatible neural codes. However, we also discuss implications for the design of neuromorphic systems for implementation or emulation of neural computation.
