智能论文笔记

Location analysis of players in UEFA EURO 2020 and 2022 using generalized valuation of defense by estimating probabilities

Rikuhei Umemoto , Kazushi Tsutsui , Keisuke Fujii

分类：机器学习

2022-11-30

Analyzing defenses in team sports is generally challenging because of the limited event data. Researchers have previously proposed methods to evaluate football team defense by predicting the events of ball gain and being attacked using locations of all players and the ball. However, they did not consider the importance of the events, assumed the perfect observation of all 22 players, and did not fully investigated the influence of the diversity (e.g., nationality and sex). Here, we propose a generalized valuation method of defensive teams by score-scaling the predicted probabilities of the events. Using the open-source location data of all players in broadcast video frames in football games of men's Euro 2020 and women's Euro 2022, we investigated the effect of the number of players on the prediction and validated our approach by analyzing the games. Results show that for the predictions of being attacked, scoring, and conceding, all players' information was not necessary, while that of ball gain required information on three to four offensive and defensive players. With game analyses we explained the excellence in defense of finalist teams in Euro 2020. Our approach might be applicable to location data from broadcast video frames in football games.

translated by 谷歌翻译

Evaluation of creating scoring opportunities for teammates in soccer via trajectory prediction

Masakiyo Teranishi , Kazushi Tsutsui , Kazuya Takeda , Keisuke Fujii

分类：人工智能 | 机器学习

2022-06-04

评估足球运动员队友的个人运动对于评估队伍，侦察和粉丝的参与至关重要。据说，在90分钟的比赛中，球员平均没有大约87分钟的球。但是，在不接球的情况下评估进攻球员并揭示运动如何为队友创造得分机会的贡献一直很困难。在本文中，我们评估了通过将实际动作与通过轨迹预测产生的参考运动进行比较来评估创建球外评分机会的玩家。首先，我们使用图形差异神经网络预测玩家的轨迹，该神经网络可以准确地模拟玩家之间的关系并预测长期轨迹。接下来，基于实际运动轨迹和预测轨迹之间修改的外球评估指数的差异，我们评估实际运动与预测运动相比如何促进得分机会。为了进行验证，我们研究了专家一年中专业球队的所有比赛的年薪，目标和比赛的关系。结果表明，年薪和拟议的指标与现有指标和目标无法解释。我们的结果表明，该方法作为没有球的球员为队友创造得分机会的指标的有效性。

translated by 谷歌翻译

Complete Cross-triplet Loss in Label Space for Audio-visual Cross-modal Retrieval

Donghuo Zeng , Yanan Wang , Jianming Wu , Kazushi Ikeda

分类：人工智能

2022-11-07

The heterogeneity gap problem is the main challenge in cross-modal retrieval. Because cross-modal data (e.g. audiovisual) have different distributions and representations that cannot be directly compared. To bridge the gap between audiovisual modalities, we learn a common subspace for them by utilizing the intrinsic correlation in the natural synchronization of audio-visual data with the aid of annotated labels. TNN-CCCA is the best audio-visual cross-modal retrieval (AV-CMR) model so far, but the model training is sensitive to hard negative samples when learning common subspace by applying triplet loss to predict the relative distance between inputs. In this paper, to reduce the interference of hard negative samples in representation learning, we propose a new AV-CMR model to optimize semantic features by directly predicting labels and then measuring the intrinsic correlation between audio-visual data using complete cross-triple loss. In particular, our model projects audio-visual features into label space by minimizing the distance between predicted label features after feature projection and ground label representations. Moreover, we adopt complete cross-triplet loss to optimize the predicted label features by leveraging the relationship between all possible similarity and dissimilarity semantic information across modalities. The extensive experimental results on two audio-visual double-checked datasets have shown an improvement of approximately 2.1% in terms of average MAP over the current state-of-the-art method TNN-CCCA for the AV-CMR task, which indicates the effectiveness of our proposed model.

translated by 谷歌翻译

Action Recognition based on Cross-Situational Action-object Statistics

Satoshi Tsutsui , Xizi Wang , Guangyuan Weng , Yayun Zhang , David Crandall , Chen Yu

分类：计算机视觉

2022-08-15

通常对视觉动作识别的机器学习模型进行了对与某些对象相关联的特定情况的数据训练和测试。这是一个悬而未决的问题，训练集中的行动对象关联如何影响模型超出受过训练情况的能力。我们着手确定培训数据的属性，这些训练数据可导致具有更大泛化能力的行动识别模型。为此，我们从一种称为跨态学习的认知机制中汲取灵感，该机制指出，人类学习者通过在不同情况下观察相同概念的实例来提取概念的含义。我们对各种类型的动作对象关联进行受控实验，并在训练数据中识别动作对象共发生的关键特性，从而导致更好的分类器。鉴于数据集中缺少这些属性，这些属性通常用于培训计算机视觉文献中的动作分类器，因此我们的工作提供了有关如何最好地构建数据集以有效培训以进行更好概括的有用见解。

translated by 谷歌翻译

Generalizable and Robust Deep Learning Algorithm for Atrial Fibrillation Diagnosis Across Ethnicities, Ages and Sexes

Shany Biton , Mohsin Aldhafeeri , Erez Marcusohn , Kenta Tsutsui , Tom Szwagier , Adi Elias , Julien Oster , Jean Marc Sellal , Mahmoud Suleiman , Joachim A. Behar

分类：机器学习 | 人工智能

2022-07-20

为了推动满足所有人需求并使医疗保健民主化的健康创新，有必要评估各种分配转变的深度学习（DL）算法的概括性能，以确保这些算法具有强大的态度。据我们所知，这项回顾性研究是第一个开发和评估从跨种族，年龄和性别的长期跳动间隔的AF事件检测的深度学习模型（DL）模型的概括性能（DL）模型的概括。新的复发DL模型（表示为ARNET2）是在2,147名患者的大型回顾性数据集中开发的，总计51,386小时连续心电图（ECG）。对来自四个中心（美国，以色列，日本和中国）的手动注释测试集评估了模型的概括，总计402名患者。该模型在以色列海法的Rambam医院Holter Clinic的1,730个Consecutives Holter记录中进一步验证了该模型。该模型的表现优于最先进的模型，并且在种族，年龄和性别之间进行了广泛的良好。女性的表现高于男性和年轻人（不到60岁），并且在种族之间显示出一些差异。解释这些变化的主要发现是心房颤动患病率更高（AFL）的群体的性能受损。我们关于跨组的ARNET2相对性能的发现可能对选择相对于感兴趣群的首选AF检查方法具有临床意义。

translated by 谷歌翻译

Multi-task manifold learning for small sample size datasets

Hideaki Ishibashi , Kazushi Higa , Tetsuo Furukawa

分类：机器学习 | (统计)机器学习

2021-11-23

在这项研究中，我们开发了一种用于多任务歧管学习的方法。该方法旨在提高多项任务的歧管学习的性能，特别是当每个任务具有少量样本时。此外，除了用于现有任务的新样本之外，该方法还旨在为新任务生成新的样本。在所提出的方法中，我们使用两种不同类型的信息传输：实例传输和模型传输。例如，转移，数据集在类似的任务之间合并，而对于模型传输，歧管模型在类似的任务之间取平均值。为此目的，所提出的方法包括一组与任务相对应的一组生成歧管模型，其集成到光纤束的一般模型中。我们将所提出的方法应用于人工数据集和面部图像集，结果表明该方法能够估计歧管，即使对于微小的样品。

translated by 谷歌翻译

Multi-Level Attention Pooling for Graph Neural Networks: Unifying Graph Representations with Multiple Localities

Takeshi D. Itoh , Takatomi Kubo , Kazushi Ikeda

分类：机器学习

2021-03-02

图表神经网络（GNN）已被广泛用于学习图形结构数据的矢量表示，并实现比传统方法更好的任务性能。 GNN的基础是消息传递过程，它将节点中的信息传播到其邻居。由于该过程每层进行一个步骤，因此节点之间的信息传播的范围在下层中很小，并且它朝向更高的层扩展。因此，GNN模型必须深入地捕获图中的全局结构信息。另一方面，众所周知，深入的GNN模型遭受性能下降，因为它们丢失了节点的本地信息，这对于良好的模型性能至关重要，通过许多消息传递步骤。在本研究中，我们提出了用于图形级分类任务的多级注意汇总（MLAP），这可以适应图表中的本地和全局结构信息。对于每个消息传递步骤，它具有注意池层，通过统一层方格图表示来计算最终图表示。 MLAP架构允许模型利用具有多个级别的本地图形的结构信息，因为它在由于过度的过天气丢失时保留了层面信息。我们的实验结果表明，与基线架构相比，MLAP架构提高了图形分类性能。此外，图表表示的分析表明，来自多个级别的地方的聚合信息确实具有提高学习图表表示的可怜的潜力。

translated by 谷歌翻译

Noise-induced degeneration in online learning

Yuzuru Sato , Daiji Tsutsui , Akio Fujiwara

分类：机器学习 | (统计)机器学习

2020-08-24

为了阐明消失梯度引起的平台现象，我们在本文中分析了多层的渐变子空间附近的随机梯度下降的稳定性。在Fukumizu-Amari模型的随机梯度下降中，这是呈现非琐碎的高原现象的最小多层摄影，我们表明（1）吸引地区存在于繁殖的子空间中，（2）强大的平台现象作为噪音出现 - 在确定性梯度下降中未观察到的同步，（3）存在最佳波动，以最小化退化子空间的逃生时间。预计本文观察到的噪声引起的变性将在广泛的机器学习中找到通过神经网络。

translated by 谷歌翻译