Smart Paper Notes

Efficient Truncated Linear Regression with Unknown Noise Variance

Constantinos Daskalakis , Patroklos Stefanou , Rui Yao , Manolis Zampetakis

Category: Machine Learning

2022-08-25

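Truncated linear regression is a classical challenge in statistics, wherein a label, $y = w^T x + \varepsilon$, and its corresponding feature vector, $x \in \mathbb{R}^k$, are observed only if the label falls in some subset $S \subseteq \mathbb{R}$; otherwise the existence of the pair $(x, y)$ is hidden from observation. Linear regression with truncated observations has remained a challenge, in its general form, since the early works of Tobin (1958) and Amemiya (1973). When the distribution of the error is normal with known variance, recent work of Daskalakis et al. (2019) provides computationally and statistically efficient estimators of the linear model, $w$. In this paper, we provide the first computationally and statistically efficient estimators for truncated linear regression when the noise variance is unknown, estimating both the linear model and the variance of the noise. Our estimator is based on an efficient implementation of projected stochastic gradient descent on the negative log-likelihood of the truncated sample. Importantly, we show that the error of our estimates is asymptotically normal, and we use this to provide explicit confidence regions for our estimates.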
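To make the truncation setting concrete, the following is a minimal simulation sketch, not the paper's estimator. It assumes a one-sided survival set $S = [\text{lower}, \infty)$, standard normal features, and hypothetical helper names (`simulate_truncated`, `ols_slopes`). It only illustrates why ordinary least squares on the surviving pairs is a poor substitute for a likelihood-based estimator.

```python
import numpy as np

def simulate_truncated(n, w, sigma, lower, seed=0):
    """Draw y = w^T x + eps and keep only pairs whose label lies in S = [lower, inf).

    The one-sided set S and the Gaussian features are assumptions for illustration.
    """
    rng = np.random.default_rng(seed)
    x = rng.normal(size=(n, w.shape[0]))
    y = x @ w + sigma * rng.normal(size=n)
    keep = y >= lower  # pairs with y outside S are never observed
    return x[keep], y[keep]

def ols_slopes(x, y):
    """Ordinary least squares with an intercept; returns only the slope coefficients."""
    design = np.column_stack([x, np.ones(len(y))])
    coef = np.linalg.lstsq(design, y, rcond=None)[0]
    return coef[:-1]

if __name__ == "__main__":
    w_true = np.array([1.0, -2.0])
    x_obs, y_obs = simulate_truncated(100_000, w_true, sigma=1.0, lower=2.0)
    print("true w:                ", w_true)
    print("OLS on truncated data: ", ols_slopes(x_obs, y_obs))
```

With `lower=-np.inf` nothing is truncated and the slopes are recovered closely; with a finite threshold the surviving sample is no longer representative and the OLS slopes shrink toward zero.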

translated by Google Translate

Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problem

Giannis Daras , Yuval Dagan , Alexandros G. Dimakis , Constantinos Daskalakis

Categories: Machine Learning | Artificial Intelligence

2022-06-18

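We prove fast mixing and characterize the stationary distribution of the Langevin algorithm for inverting random weighted DNN generators. This result extends the work of Hand and Voroninski from efficient inversion to efficient posterior sampling. In practice, to allow for increased expressivity, we propose to do posterior sampling in the latent space of a pre-trained generative model. To achieve this, we train a score-based model in the latent space of a StyleGAN-2 and use it to solve inverse problems. Our framework, Score-Guided Intermediate Layer Optimization (SGILO), extends prior work by replacing the sparsity regularization with a generative prior in the intermediate layer. Experimentally, we obtain significant improvements over the previous state of the art, especially in the low-measurement regime.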
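As a rough illustration of posterior sampling with Langevin dynamics, the sketch below runs the unadjusted Langevin algorithm on a toy linear inverse problem. It is not SGILO: the linear forward map `A`, the standard normal prior on the latent `z`, and the function names are all assumptions chosen so that the posterior is Gaussian and its mean can be checked in closed form.

```python
import numpy as np

def langevin_posterior_samples(A, y, noise_std, step, n_steps, burn_in, seed=0):
    """Unadjusted Langevin iterates targeting p(z | y) for y = A z + noise, z ~ N(0, I).

    The linear map A stands in for a generator; this is an illustrative assumption.
    """
    rng = np.random.default_rng(seed)
    z = np.zeros(A.shape[1])
    samples = []
    for t in range(n_steps):
        # Gradient of the log posterior: likelihood term plus standard normal prior term.
        grad_log_post = A.T @ (y - A @ z) / noise_std**2 - z
        z = z + step * grad_log_post + np.sqrt(2.0 * step) * rng.normal(size=z.shape)
        if t >= burn_in:
            samples.append(z.copy())
    return np.array(samples)

def exact_posterior_mean(A, y, noise_std):
    """Closed-form posterior mean for the same Gaussian model, used as a reference."""
    precision = A.T @ A / noise_std**2 + np.eye(A.shape[1])
    return np.linalg.solve(precision, A.T @ y / noise_std**2)

if __name__ == "__main__":
    A = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
    y = np.array([1.0, -1.0, 0.5])
    samples = langevin_posterior_samples(A, y, 1.0, step=0.05, n_steps=20_000, burn_in=1_000)
    print("Langevin sample mean:", samples.mean(axis=0))
    print("exact posterior mean:", exact_posterior_mean(A, y, 1.0))
```

In the framework described above, the prior gradient `-z` would be replaced by a learned score in the latent space of a pre-trained model; here it is the analytic score of a standard normal.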


What Makes A Good Fisherman? Linear Regression under Self-Selection Bias

Yeshwanth Cherapanamjeri , Constantinos Daskalakis , Andrew Ilyas , Manolis Zampetakis

Categories: Machine Learning | Machine Learning (Statistics)

2022-05-06

In the classical setting of self-selection, the goal is to learn $k$ models, simultaneously from observations $(x^{(i)}, y^{(i)})$ where $y^{(i)}$ is the output of one of $k$ underlying models on input $x^{(i)}$. In contrast to mixture models, where we observe the output of a randomly selected model, here the observed model depends on the outputs themselves, and is determined by some known selection criterion. For example, we might observe the highest output, the smallest output, or the median output of the $k$ models. In known-index self-selection, the identity of the observed model output is observable; in unknown-index self-selection, it is not. Self-selection has a long history in Econometrics and applications in various theoretical and applied fields, including treatment effect estimation, imitation learning, learning from strategically reported data, and learning from markets at disequilibrium. In this work, we present the first computationally and statistically efficient estimation algorithms for the most standard setting of this problem where the models are linear. In the known-index case, we require poly$(1/\varepsilon, k, d)$ sample and time complexity to estimate all model parameters to accuracy $\varepsilon$ in $d$ dimensions, and can accommodate quite general selection criteria. In the more challenging unknown-index case, even the identifiability of the linear models (from infinitely many samples) was not known. We show three results in this case for the commonly studied $\max$ self-selection criterion: (1) we show that the linear models are indeed identifiable, (2) for general $k$ we provide an algorithm with poly$(d) \exp(\text{poly}(k))$ sample and time complexity to estimate the regression parameters up to error $1/\text{poly}(k)$, and (3) for $k = 2$ we provide an algorithm for any error $\varepsilon$ and poly$(d, 1/\varepsilon)$ sample and time complexity.
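The data model for the $\max$ selection criterion can be sketched as follows. This is only a simulator under assumed Gaussian features and noise, with a hypothetical function name; it does not implement any of the estimation algorithms. The returned `idx` is what a learner would see in the known-index case and would be hidden in the unknown-index case.

```python
import numpy as np

def simulate_max_self_selection(n, W, noise_std, seed=0):
    """Simulate max self-selection for k linear models.

    W has shape (k, d), one row per model. Returns the features x, the observed
    label y (the largest of the k noisy outputs), and the index of that model.
    """
    rng = np.random.default_rng(seed)
    k, d = W.shape
    x = rng.normal(size=(n, d))
    outputs = x @ W.T + noise_std * rng.normal(size=(n, k))  # all k model outputs
    idx = outputs.argmax(axis=1)                              # which model is observed
    y = outputs[np.arange(n), idx]                            # only the maximum is seen
    return x, y, idx

if __name__ == "__main__":
    W = np.array([[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]])
    x, y, idx = simulate_max_self_selection(5, W, noise_std=0.1)
    print(y, idx)
```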


How Good are Low-Rank Approximations in Gaussian Process Regression?

Constantinos Daskalakis , Petros Dellaportas , Aristeidis Panos

Categories: Machine Learning (Statistics) | Machine Learning

2021-12-13

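We provide guarantees for approximate Gaussian Process (GP) regression resulting from two common low-rank kernel approximations: one based on random Fourier features, and one based on truncating the kernel's Mercer expansion. In particular, we bound the Kullback-Leibler divergence between an exact GP and one resulting from either of these low-rank approximations to its kernel, as well as between their corresponding predictive densities. We also bound the error between the predictive mean vectors and between the predictive covariance matrices computed using the exact versus the approximate GP. We provide experiments on both simulated data and standard benchmarks to evaluate the effectiveness of our theoretical bounds.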
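For readers unfamiliar with the first of the two approximations, here is a minimal sketch of random Fourier features for the squared-exponential (RBF) kernel. The kernel choice, the lengthscale parameterization, and the function names are assumptions for illustration; the paper's bounds are not reproduced here.

```python
import numpy as np

def rbf_kernel(X, Y, lengthscale=1.0):
    """Exact RBF kernel matrix k(x, y) = exp(-||x - y||^2 / (2 * lengthscale^2))."""
    sq_dists = np.sum(X**2, axis=1)[:, None] + np.sum(Y**2, axis=1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-sq_dists / (2.0 * lengthscale**2))

def random_fourier_features(X, n_features, lengthscale=1.0, seed=0):
    """Feature map Z such that Z @ Z.T approximates the RBF kernel matrix.

    Frequencies are drawn from the kernel's spectral density, N(0, I / lengthscale^2),
    and phases uniformly from [0, 2*pi).
    """
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=1.0 / lengthscale, size=(X.shape[1], n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

if __name__ == "__main__":
    X = np.random.default_rng(1).normal(size=(50, 3))
    K = rbf_kernel(X, X)
    for m in (50, 500, 5000):
        Z = random_fourier_features(X, m)
        print(m, "features, mean abs error:", np.abs(Z @ Z.T - K).mean())
```

The resulting kernel matrix `Z @ Z.T` has rank at most `n_features`, which is what makes GP regression with it cheaper than with the exact kernel.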


Fast Rates for Nonparametric Online Learning: From Realizability to Learning in Games

Constantinos Daskalakis , Noah Golowich

Categories: Machine Learning | Machine Learning (Statistics)

2021-11-17

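We study fast rates of convergence in the setting of nonparametric online regression, namely where regret is defined with respect to an arbitrary function class that has bounded complexity. Our contributions are two-fold:
- In the realizable setting of nonparametric online regression with the absolute loss, we propose a randomized proper learning algorithm that attains a near-optimal mistake bound in terms of the sequential fat-shattering dimension of the hypothesis class. In the setting of online classification with a class of Littlestone dimension $d$, our bound reduces to $d \cdot {\rm poly} \log T$. This result answers the question of whether proper learners can achieve near-optimal mistake bounds; previously, even for online classification, the best known mistake bound was $\tilde O(\sqrt{dT})$. Further, for the real-valued (regression) setting, the optimal mistake bound was not known even for improper learners prior to this work.
- Using the above result, we exhibit an independent learning algorithm for general-sum binary games of Littlestone dimension $d$, for which each player achieves regret $\tilde O(d^{3/4} \cdot T^{1/4})$. This result generalizes analogous results of Syrgkanis et al. (2015), who showed that in finite games the optimal regret can be accelerated from $O(\sqrt{T})$ in the adversarial setting to $O(T^{1/4})$ in the game setting.
To establish the above results, we introduce several new techniques, including: a hierarchical aggregation rule to achieve the optimal mistake bound for real-valued classes, a multi-scale extension of the proper online realizable learner of Hanneke et al. (2021), an approach to show that the output of such nonparametric learning algorithms is stable, and a proof that the minimax theorem holds in all online learnable games.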


Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-Player General-Sum Games

Ioannis Anagnostides , Constantinos Daskalakis , Gabriele Farina , Maxwell Fishelson , Noah Golowich , Tuomas Sandholm

Category: Machine Learning

2021-11-11

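Recently, Daskalakis, Fishelson, and Golowich (DFG) (NeurIPS'21) showed that if all agents in a multi-player general-sum normal-form game employ Optimistic Multiplicative Weights Update (OMWU), the external regret of every player is $O(\textrm{polylog}(T))$ after $T$ repetitions of the game. We extend their result from external regret to internal regret and swap regret, thereby establishing uncoupled learning dynamics that converge to an approximate correlated equilibrium at the rate of $\tilde{O}(T^{-1})$. This substantially improves over the prior best rate of convergence to correlated equilibria, due to Chen and Peng (NeurIPS'20), and it is optimal, within the no-regret framework, up to polylogarithmic factors in $T$. To obtain these results, we develop new techniques for establishing higher-order smoothness for learning dynamics involving fixed-point operations. Specifically, we establish that the no-internal-regret learning dynamics of Stoltz and Lugosi (Mach Learn'05) are equivalently simulated by no-external-regret dynamics on a combinatorial space. This allows us to trade the computation of the stationary distribution of a polynomial-sized Markov chain for a (much better-behaved) linear transformation on an exponential-sized set, enabling us to leverage techniques similar to those of DFG to near-optimally bound the internal regret. Moreover, we establish an $O(\textrm{polylog}(T))$ no-swap-regret bound for the classic algorithm of Blum and Mansour (BM) (JMLR'07). We do so by introducing a technique based on the Cauchy integral formula that circumvents the more limited combinatorial arguments of DFG. In addition to clarifying the near-optimal regret guarantees of BM, our arguments provide insights into the various ways in which the techniques of DFG can be extended and leveraged in the analysis of more involved learning algorithms.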
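The OMWU update that the abstract builds on can be sketched for a two-player game as below. This is an illustrative toy, not the internal-regret or swap-regret dynamics of the paper: the matching-pennies style loss matrices, the step size, and the function name are assumptions, and only the row player's external regret is tracked.

```python
import numpy as np

def omwu_self_play(loss_a, loss_b, x0, y0, eta, n_rounds):
    """Both players run Optimistic Multiplicative Weights Update against each other.

    loss_a[i, j] is the row player's loss and loss_b[i, j] the column player's loss
    when the actions (i, j) are played. Returns the final mixed strategies and the
    row player's external regret.
    """
    x, y = np.array(x0, dtype=float), np.array(y0, dtype=float)
    prev_lx, prev_ly = np.zeros_like(x), np.zeros_like(y)
    cum_loss_x = np.zeros_like(x)  # cumulative loss of each fixed row action
    incurred_x = 0.0               # cumulative expected loss actually incurred
    for _ in range(n_rounds):
        lx = loss_a @ y            # loss vector faced by the row player
        ly = loss_b.T @ x          # loss vector faced by the column player
        incurred_x += x @ lx
        cum_loss_x += lx
        # Optimistic step: the latest loss is counted twice, the previous one subtracted.
        x = x * np.exp(-eta * (2.0 * lx - prev_lx))
        x /= x.sum()
        y = y * np.exp(-eta * (2.0 * ly - prev_ly))
        y /= y.sum()
        prev_lx, prev_ly = lx, ly
    return x, y, incurred_x - cum_loss_x.min()

if __name__ == "__main__":
    loss_a = np.array([[1.0, 0.0], [0.0, 1.0]])  # row player loses when actions match
    loss_b = 1.0 - loss_a                        # column player loses otherwise
    x, y, regret = omwu_self_play(loss_a, loss_b, [0.8, 0.2], [0.3, 0.7], eta=0.1, n_rounds=2000)
    print("final strategies:", x, y, "row-player regret:", regret)
```

Dropping the `- prev_lx` correction and the factor of 2 recovers plain multiplicative weights, which is the natural baseline to compare regret against.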


How Good are Low-Rank Approximations in Gaussian Process Regression?

Constantinos Daskalakis , Petros Dellaportas , Aristeidis Panos

Categories: Machine Learning (Statistics) | Machine Learning

2020-04-03


Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue

Daxin Tan , Nikos Kargas , David McHardy , Constantinos Papayiannis , Antonio Bonafonte , Marek Strelec , Jonas Rohnke , Agis Oikonomou Filandras , Trevor Wood

Category: Natural Language Processing

2022-12-07

Entrainment is the phenomenon by which an interlocutor adapts their speaking style to align with their partner in conversation. It has been found along different dimensions, such as acoustic, prosodic, lexical, or syntactic. In this work, we explore and utilize the entrainment phenomenon to improve spoken dialogue systems for voice assistants. We first examine the existence of entrainment in human-to-human dialogues with respect to acoustic features and then extend the analysis to emotion features. The analysis shows strong evidence of entrainment in terms of both acoustic and emotion features. Based on these findings, we implement two entrainment policies and assess whether integrating the entrainment principle into a Text-to-Speech (TTS) system improves the synthesis performance and the user experience. We find that integrating the entrainment principle into a TTS system improves performance when considering acoustic features, while no obvious improvement is observed when considering emotion features.


Dataset: Impact Events for Structural Health Monitoring of a Plastic Thin Plate

Ioannis Katsidimas , Thanasis Kotzakolios , Sotiris Nikoletseas , Stefanos H. Panagiotou , Konstantinos Timpilis , Constantinos Tsakonas

Category: Machine Learning

2022-09-20

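Nowadays, more and more datasets are published for the research and development of systems and models, enabling direct comparisons, continuous improvement of solutions, and researchers' engagement with experimental, real-life data. However, especially in the Structural Health Monitoring (SHM) domain, there are many cases where new research projects have a unique combination of structure design and implementation, sensor selection, and technological enablers that does not fit the configuration of relevant individual studies in the literature. Thus, since we did not find any relevant repository, we share the data from our case study with the research community. More specifically, in this paper we present a novel time-series dataset for impact detection and localization on a plastic thin plate, towards SHM applications, using ceramic piezoelectric transducers (PZTs) connected to an Internet of Things (IoT) device. The dataset was collected from an experimental procedure of low-velocity, low-energy impact events that includes at least 3 repetitions for each unique experiment, while the input measurements come from 4 PZT sensors placed at the corners of the plate. For each repetition and sensor, 5000 values are stored at a 100 kHz sampling rate. The system is excited with a steel ball, and the height from which it is released varies from 10 cm to 20 cm. The dataset is available on GitHub (https://github.com/smart-objects/impact-events-dataset).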
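The acquisition figures quoted above imply a short recording window per signal, which the following sketch works out. The constants come from the abstract; the function names are hypothetical, the file layout of the repository is not assumed, and the impact speed uses the ideal free-fall formula $v = \sqrt{2gh}$ with air drag neglected.

```python
import numpy as np

SAMPLING_RATE_HZ = 100_000   # 100 kHz, from the abstract
SAMPLES_PER_SIGNAL = 5_000   # values stored per repetition and sensor
NUM_SENSORS = 4              # PZT sensors at the plate corners
GRAVITY_M_PER_S2 = 9.81

def window_duration_ms():
    """Length of one recorded signal in milliseconds."""
    return SAMPLES_PER_SIGNAL / SAMPLING_RATE_HZ * 1e3

def time_axis_ms():
    """Timestamp of every sample in milliseconds, starting at zero."""
    return np.arange(SAMPLES_PER_SIGNAL) / SAMPLING_RATE_HZ * 1e3

def impact_speed_m_per_s(height_m):
    """Ideal free-fall speed of the ball at impact (drag neglected, an assumption)."""
    return float(np.sqrt(2.0 * GRAVITY_M_PER_S2 * height_m))

if __name__ == "__main__":
    print("window per signal (ms):", window_duration_ms())
    print("values per repetition:", NUM_SENSORS * SAMPLES_PER_SIGNAL)
    print("impact speed at 10 cm and 20 cm (m/s):",
          impact_speed_m_per_s(0.10), impact_speed_m_per_s(0.20))
```

Each signal therefore covers 50 ms, and the release heights correspond to impact speeds of roughly 1.4 to 2.0 m/s under the free-fall assumption.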


Energy-Efficient Trajectory Design of a Multi-IRS Assisted Portable Access Point

Nithin Babu , Marco Virgili , Mohammad Al-jarrah , Xiaoye Jing , Emad Alsusa , Petar Popovski , Andrew Forsyth , Christos Masouros , Constantinos B. Papadias

Category: Robotics

2022-09-01

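In this work, we propose a framework for the energy-efficient trajectory design of an unmanned aerial vehicle (UAV)-based portable access point (PAP) deployed to serve a set of ground nodes (GNs). In addition to the PAP and GNs, the system consists of a set of intelligent reflecting surfaces (IRSs) mounted on man-made structures to increase the number of bits transmitted per joule of energy consumed, measured as the global energy efficiency (GEE). The GEE trajectory for the PAP is designed by considering the UAV propulsion energy consumption and the Peukert effect of the PAP battery, which gives an accurate battery discharge profile as a nonlinear function of the UAV power consumption profile. The GEE trajectory design problem is solved in two phases. In the first phase, a path for the PAP and feasible positions are found using a multi-tier circle packing method, and the required IRS phase shift values are calculated using an alternating optimization method that accounts for the interdependence between the amplitude and phase responses of an IRS element. In the second phase, the PAP flying velocity and the user scheduling are calculated using a novel multi-lap trajectory design algorithm. Numerical evaluations show that: neglecting the Peukert effect overestimates the available flight time of the PAP; beyond a certain threshold, increasing the battery size reduces the available flight time of the PAP; the presence of IRS modules improves the GEE of the system compared to other baseline scenarios; and the multi-lap trajectory saves more energy than a single-lap trajectory developed using a combination of sequential convex programming and the Dinkelbach algorithm.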
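The first numerical finding, that neglecting the Peukert effect overestimates flight time, can be illustrated with the textbook form of Peukert's law, $t = H \left( C / (I H) \right)^k$. The sketch below is an assumption-laden simplification with a constant discharge current and a hypothetical function name; the battery model actually used in the paper may differ.

```python
def peukert_discharge_hours(capacity_ah, rated_hours, current_a, peukert_k):
    """Discharge time from the textbook form of Peukert's law.

    capacity_ah: rated capacity C, specified for a discharge over rated_hours (H).
    current_a:   actual constant discharge current I (constant draw is an assumption).
    peukert_k:   Peukert exponent; k = 1 recovers the ideal battery t = C / I.
    """
    return rated_hours * (capacity_ah / (current_a * rated_hours)) ** peukert_k

if __name__ == "__main__":
    # Hypothetical battery: 10 Ah rated at a 1 h discharge, drawn at twice the rated current.
    ideal = peukert_discharge_hours(10.0, 1.0, 20.0, peukert_k=1.0)
    with_peukert = peukert_discharge_hours(10.0, 1.0, 20.0, peukert_k=1.2)
    print("ideal battery (h):      ", ideal)
    print("with Peukert effect (h):", with_peukert)
```

For this hypothetical battery the ideal model predicts 0.5 h, while an exponent of 1.2 gives about 0.435 h, so a planner that ignores the effect would budget more flight time than is available at high current draw.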
