Smart Paper Notes

The multi-modal universe of fast-fashion: the Visuelle 2.0 benchmark

Geri Skenderi, Christian Joppi, Matteo Denitto, Berniero Scarpa, Marco Cristani

Categories: Computer Vision | Machine Learning

2022-04-14

We present Visuelle 2.0, the first dataset useful for facing the diverse prediction problems that a fast-fashion company has to manage routinely. Furthermore, we demonstrate that computer vision plays a substantial role in this scenario. Visuelle 2.0 contains data for 6 seasons / 5355 clothing products of Nuna Lie, a famous Italian company with hundreds of shops located in different areas within the country. In particular, we focus on a specific prediction problem, namely short-observation new product sale forecasting (SO-fore). SO-fore assumes that the season has started and a set of new products is on the shelves of the different stores. The goal is to forecast the sales for a particular horizon, given a short available past (a few weeks), since no earlier statistics are available. To be successful, SO-fore approaches should capture this short past and exploit other modalities or exogenous data. To these aims, Visuelle 2.0 is equipped with disaggregated data at the item-shop level and multi-modal information for each clothing item, allowing computer vision approaches to come into play. The main message that we deliver is that the use of image data with deep networks boosts the performance obtained when using the time series in long-term forecasting scenarios, improving the WAPE and MAE by up to 5.48% and 7% respectively compared to competitive baseline methods. The dataset is available at https://humaticslab.github.io/forecasting/visuelle
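
The abstract reports its gains in WAPE and MAE without defining them. As a minimal sketch, assuming the common textbook definitions (the paper may normalize or scale differently, for example by reporting WAPE as a percentage), the two metrics can be computed as follows. The function names and the sales figures are hypothetical and are not taken from Visuelle 2.0.

```python
def mae(actual, forecast):
    """Mean absolute error: the average of |actual - forecast|."""
    return sum(abs(a - f) for a, f in zip(actual, forecast)) / len(actual)


def wape(actual, forecast):
    """Weighted absolute percentage error: total absolute error divided by
    total absolute actual sales (returned as a fraction, not a percentage)."""
    total_abs_error = sum(abs(a - f) for a, f in zip(actual, forecast))
    return total_abs_error / sum(abs(a) for a in actual)


# Made-up weekly unit sales for a single item, for illustration only.
actual_sales = [10, 20, 30, 40]
forecast_sales = [12, 18, 33, 37]

print("MAE :", mae(actual_sales, forecast_sales))
print("WAPE:", wape(actual_sales, forecast_sales))
```

Because WAPE divides by total sales rather than by each observation, it stays well defined when individual item-shop series contain zero-sale weeks, which is one reason it is often preferred over per-point percentage errors for disaggregated retail data.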

translated by Google Translate

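In many numerical simulations, stochastic gradient descent (SGD) type optimization methods perform very effectively in the training of deep neural networks (DNNs), but to this day it remains an open research problem to provide a mathematical convergence analysis that rigorously explains the success of SGD type optimization methods in the training of DNNs. In this work we study SGD type optimization methods in the training of fully connected feedforward DNNs with rectified linear unit (ReLU) activation. We first establish general regularity properties for the risk functions and their generalized gradient functions appearing in the training of such DNNs, and thereafter we investigate the plain vanilla SGD optimization method in the training of such DNNs under the assumption that the target function under consideration is a constant function. Specifically, we prove, under the assumption that the learning rates (the step sizes of the SGD optimization method) are sufficiently small but not $L^1$-summable and under the assumption that the target function is a constant function, that the expected risk of the considered SGD process converges to zero in the training of such DNNs as the number of SGD steps increases to infinity.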
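
The setting described above can be illustrated numerically. The following is a minimal sketch, not the authors' construction: it uses a network with a single hidden ReLU layer instead of a deep one, inputs drawn uniformly from [0, 1], and step sizes `base_rate / sqrt(n + 1)`, which are small and whose sum diverges. The width, target value, step count, and the choice of 0 as the ReLU derivative at the kink are all assumptions made for this example.

```python
import random


def relu(z):
    return z if z > 0.0 else 0.0


def network(params, x):
    """One-hidden-layer ReLU network: sum_j v_j * relu(w_j * x + b_j) + c."""
    w, b, v, c = params
    return sum(vj * relu(wj * x + bj) for wj, bj, vj in zip(w, b, v)) + c


def estimate_risk(params, target, grid_size=101):
    """Mean squared error against the constant target on an even grid over [0, 1]."""
    xs = [i / (grid_size - 1) for i in range(grid_size)]
    return sum((network(params, x) - target) ** 2 for x in xs) / grid_size


def train_sgd(target=3.0, width=4, steps=5000, base_rate=0.02, seed=0):
    """Run plain SGD on the squared error and return (initial_risk, final_risk)."""
    rng = random.Random(seed)
    w = [rng.uniform(-0.5, 0.5) for _ in range(width)]
    b = [rng.uniform(-0.5, 0.5) for _ in range(width)]
    v = [rng.uniform(-0.5, 0.5) for _ in range(width)]
    params = [w, b, v, 0.0]
    initial_risk = estimate_risk(params, target)
    for n in range(steps):
        # Step sizes decay like n^(-1/2): small, but their sum diverges.
        rate = base_rate / (n + 1) ** 0.5
        x = rng.random()  # one fresh sample per step
        err = network(params, x) - target
        for j in range(width):
            pre = w[j] * x + b[j]
            active = 1.0 if pre > 0.0 else 0.0  # assumed ReLU derivative (0 at the kink)
            grad_v = 2.0 * err * relu(pre)
            grad_w = 2.0 * err * v[j] * active * x
            grad_b = 2.0 * err * v[j] * active
            v[j] -= rate * grad_v
            w[j] -= rate * grad_w
            b[j] -= rate * grad_b
        params[3] -= rate * 2.0 * err  # output bias c
    return initial_risk, estimate_risk(params, target)


initial_risk, final_risk = train_sgd()
print("risk before training:", initial_risk)
print("risk after training: ", final_risk)
```

A single run with a fixed seed only shows one sample path, whereas the result summarized above concerns the expectation of the risk, so this experiment is an illustration rather than a check of the theorem.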


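In recent years there has been growing interest in deep learning-based pansharpening. Research has mainly focused on architectures. However, since no ground truth is available, model training is also a major issue. A popular approach is to train networks in a reduced-resolution domain, using the original data as ground truth. The trained networks are then applied to full-resolution data, relying on an implicit scale-invariance assumption. Results are generally good at reduced resolution but more questionable at full resolution. Here we propose a full-resolution training framework for deep learning-based pansharpening. Training takes place in the high-resolution domain and relies only on the original data, with no loss of information. To ensure spectral and spatial fidelity, suitable losses are defined that force the pansharpened output to be consistent with the available panchromatic and multispectral inputs. Experiments carried out on WorldView-3, WorldView-2, and GeoEye-1 images show that methods trained with the proposed framework guarantee excellent performance in terms of both full-resolution numerical indexes and visual quality. The framework is fully general and can be used to train and fine-tune any deep learning-based pansharpening network.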
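
The idea of forcing the pansharpened output to agree with both inputs can be sketched with two toy consistency terms. These are illustrative stand-ins, not the losses defined by the authors: the spectral term compares a block-averaged copy of the output band with the multispectral band, and the spatial term is one minus the global Pearson correlation with the panchromatic image. All function names, the block-averaging degradation, and the tiny 4x4 images are assumptions made for this example.

```python
def block_average(image, ratio):
    """Downsample a 2-D list by averaging non-overlapping ratio x ratio blocks."""
    rows, cols = len(image) // ratio, len(image[0]) // ratio
    return [
        [
            sum(image[r * ratio + i][c * ratio + j]
                for i in range(ratio) for j in range(ratio)) / ratio ** 2
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def spectral_loss(fused_band, ms_band, ratio):
    """Mean absolute difference between the degraded fused band and the MS band."""
    degraded = block_average(fused_band, ratio)
    diffs = [abs(d - m)
             for drow, mrow in zip(degraded, ms_band)
             for d, m in zip(drow, mrow)]
    return sum(diffs) / len(diffs)


def spatial_loss(fused_band, pan):
    """One minus the Pearson correlation between the fused band and the PAN image.
    Undefined (division by zero) if either image is constant."""
    f = [p for row in fused_band for p in row]
    g = [p for row in pan for p in row]
    mean_f, mean_g = sum(f) / len(f), sum(g) / len(g)
    cov = sum((a - mean_f) * (b - mean_g) for a, b in zip(f, g))
    var_f = sum((a - mean_f) ** 2 for a in f)
    var_g = sum((b - mean_g) ** 2 for b in g)
    return 1.0 - cov / (var_f * var_g) ** 0.5


# Toy data: a 4x4 panchromatic image and a 2x2 multispectral band (resolution ratio 2).
pan = [[1, 2, 3, 4], [2, 3, 4, 5], [3, 4, 5, 6], [4, 5, 6, 7]]
ms_band = [[2, 4], [4, 6]]
fused_band = [row[:] for row in pan]  # a candidate output that matches both inputs

total = spectral_loss(fused_band, ms_band, 2) + spatial_loss(fused_band, pan)
print("combined consistency loss:", total)
```

Neither term needs a full-resolution ground truth, which is the point of training directly in the high-resolution domain; in an actual training loop both terms would have to be written with differentiable tensor operations.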


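Between 2015 and 2019, members of the Horizon 2020-funded Innovative Training Network named "AMVA4NewPhysics" studied the customization and application of advanced multivariate analysis methods and statistical learning tools to high-energy physics problems, and developed entirely new ones. Many of these methods have been successfully used to improve the sensitivity of data analyses performed by the ATLAS and CMS experiments at the CERN Large Hadron Collider; several others, still in the testing phase, promise to further improve the precision of measurements of fundamental physics parameters and the reach of searches for new phenomena. In this paper, the most relevant new tools among those studied and developed are presented, along with an evaluation of their performance.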
