在过去几十年中,功能选择吸引了很多关注,因为它可以降低数据维度,同时保持功能的原始物理含义,这比功能提取可以更好地解释性。但是,大多数现有的功能选择方法,尤其是基于深度学习的方法,通常集中在仅具有很高分数的功能上,但忽略了那些在训练过程中得分较低的人以及重要的候选功能的顺序。这可能是有风险的,因为不幸的是,在培训过程中可能会忽略一些重要和相关的功能,从而导致次优的解决方案或误导性选择。在我们的工作中,我们通过利用较少重要性分数的功能来处理功能选择,并根据新颖的互补功能掩码提出功能选择框架。我们的方法是通用的,可以轻松地集成到现有的基于深度学习的特征选择方法中,以提高其性能。实验是在基准数据集上进行的,并表明所提出的方法可以选择比艺术状态更具代表性和信息性的特征。
translated by 谷歌翻译
在后硅验证中,调整是为了找到调整旋钮的值,这可能是过程参数和/或已知操作条件的函数。从这个意义上讲,更有效的调整需要根据测试的设备(DUT)来确定最关键的调整旋钮和过程参数。这通常是由经验丰富的专家手动进行的。但是,随着越来越复杂的芯片,对大量原始变量的手动检查变得更具挑战性。在这项工作中,我们利用神经网络有效地选择最相关的变量,并呈现相应的深学习辅助管道进行有效的调整。
translated by 谷歌翻译
智能测试需要大规模的高维数据有效分析。传统上,该分析通常由人类专家进行,但在大数据时代不可扩展。为了应对这一挑战,最近将可变选择引入了智能测试。但是,在实践中,我们遇到的方案在变量选择后必须保持某些变量(例如,测试设备的某些特定处理条件)。我们称此条件变量选择为嵌入式或深度学习的变量选择方法尚未得到很好的研究。在本文中,我们讨论了一个新颖的条件变量选择框架,该框架可以选择一组预选变量,可以选择最重要的候选变量。
translated by 谷歌翻译
近年来,由于增加了计算能力,允许在合理的时间框架中培训大型集合的培训,所应用的集合学习的使用已经显着增加。许多应用,例如恶意软件检测,面部识别或财务决策,使用有限的学习算法,并以比任何其他单独的学习算法获得更好的预测性能的方式聚合它们。在半导体器件(PSV)的硅后验证领域中,通常提供数据集,其包括各种装置,例如,例如不同的制造线的芯片。在PSV中,任务是近似于多学习算法的数据的基础功能,每个数据在设备特定的子集上训练,而不是提高整个数据集上任意分类器的性能。此外,期望是,未知数量的子集描述了显示非常不同特征的函数。相应的集合成员称为异常值,可以严重影响近似值。我们的方法旨在找到对异常值强大的合适近似,并且代表了适用于尽可能多的类型的方式最佳或最坏的情况。使用“软最大”或“软MIN”功能代替最大或最小操作员。培训神经网络(NN)以在两阶段过程中学习此“软功能”。首先,我们选择代表最佳或最坏情况的集合成员的子集。其次,我们组合这些成员并定义使用本地异常因素系数(LOF)属性的加权来增加非异常值的影响并减少异常值。加权可确保对异常值的鲁棒性,并确保近似适用于大多数类型。
translated by 谷歌翻译
越来越多的现代芯片复杂性使设计验证更加困难。现有方法不再能够应对硅后验证中稳健性能调整等任务的复杂性。因此,我们提出了一种基于学习优化和加强学习的新方法,以便以高效且稳健的方式解决复杂和混合式调整任务。
translated by 谷歌翻译
Physics-Informed Neural Networks (PINNs) have gained much attention in various fields of engineering thanks to their capability of incorporating physical laws into the models. PINNs integrate the physical constraints by minimizing the partial differential equations (PDEs) residuals on a set of collocation points. The distribution of these collocation points appears to have a huge impact on the performance of PINNs and the assessment of the sampling methods for these points is still an active topic. In this paper, we propose a Fixed-Budget Online Adaptive Mesh Learning (FBOAML) method, which decomposes the domain into sub-domains, for training collocation points based on local maxima and local minima of the PDEs residuals. The stopping criterion is based on a data set of reference, which leads to an adaptive number of iterations for each specific problem. The effectiveness of FBOAML is demonstrated in the context of non-parameterized and parameterized problems. The impact of the hyper-parameters in FBOAML is investigated in this work. The comparison with other adaptive sampling methods is also illustrated. The numerical results demonstrate important gains in terms of accuracy of PINNs with FBOAML over the classical PINNs with non-adaptive collocation points. We also apply FBOAML in a complex industrial application involving coupling between mechanical and thermal fields. We show that FBOAML is able to identify the high-gradient location and even give better prediction for some physical fields than the classical PINNs with collocation points taken on a pre-adapted finite element mesh.
translated by 谷歌翻译
Common measures of brain functional connectivity (FC) including covariance and correlation matrices are semi-positive definite (SPD) matrices residing on a cone-shape Riemannian manifold. Despite its remarkable success for Euclidean-valued data generation, use of standard generative adversarial networks (GANs) to generate manifold-valued FC data neglects its inherent SPD structure and hence the inter-relatedness of edges in real FC. We propose a novel graph-regularized manifold-aware conditional Wasserstein GAN (GR-SPD-GAN) for FC data generation on the SPD manifold that can preserve the global FC structure. Specifically, we optimize a generalized Wasserstein distance between the real and generated SPD data under an adversarial training, conditioned on the class labels. The resulting generator can synthesize new SPD-valued FC matrices associated with different classes of brain networks, e.g., brain disorder or healthy control. Furthermore, we introduce additional population graph-based regularization terms on both the SPD manifold and its tangent space to encourage the generator to respect the inter-subject similarity of FC patterns in the real data. This also helps in avoiding mode collapse and produces more stable GAN training. Evaluated on resting-state functional magnetic resonance imaging (fMRI) data of major depressive disorder (MDD), qualitative and quantitative results show that the proposed GR-SPD-GAN clearly outperforms several state-of-the-art GANs in generating more realistic fMRI-based FC samples. When applied to FC data augmentation for MDD identification, classification models trained on augmented data generated by our approach achieved the largest margin of improvement in classification accuracy among the competing GANs over baselines without data augmentation.
translated by 谷歌翻译
We aim at improving reasoning on inconsistent and uncertain data. We focus on knowledge-graph data, extended with time intervals to specify their validity, as regularly found in historical sciences. We propose principles on semantics for efficient Maximum A-Posteriori inference on the new Temporal Markov Logic Networks (TMLN) which extend the Markov Logic Networks (MLN) by uncertain temporal facts and rules. We examine total and partial temporal (in)consistency relations between sets of temporal formulae. Then we propose a new Temporal Parametric Semantics, which may combine several sub-functions, allowing to use different assessment strategies. Finally, we expose the constraints that semantics must respect to satisfy our principles.
translated by 谷歌翻译
光伏(PV)能量产生在能量转变中起着至关重要的作用。小规模的PV安装以空前的速度部署,并且它们在电网中的集成可能会具有挑战性,因为公共当局通常缺乏有关它们的质量数据。越来越多的机器学习模型能够自动映射这些安装,越来越多地用于改善住宅PV安装的知识。但是,由于图像采集的差异,这些模型不能轻易地从一个区域或数据源转移到另一个区域。为了解决此问题,称为域移动并促进了PV阵列映射管道的开发,我们提出了一个包含空中图像,注释和分割掩码的数据集。我们为28,000多个安装提供安装元数据。我们为13,000个装置提供地面真理细分面具,其中包括7,000个带有两个不同图像提供商的注释。最后,我们提供了与8,000多个安装的注释相匹配的安装元数据。数据集应用程序包括端到端的PV注册表构建,强大的PV安装映射以及众包数据集的分析。
translated by 谷歌翻译
我们介绍Audiolm,这是具有长期一致性高质量音频产生的框架。 Audiolm将输入音频映射到一系列离散令牌,并将音频生成作为此表示空间中的语言建模任务。我们展示了现有的音频令牌如何在重建质量和长期结构之间提供不同的权衡,我们提出了一个混合代币化计划来实现这两个目标。也就是说,我们利用在音频中预先训练的蒙版语言模型的离散激活来捕获长期结构和神经音频编解码器产生的离散代码,以实现高质量的合成。通过培训大型原始音频波形,Audiolm学会了在简短的提示下产生自然和连贯的连续性。当接受演讲训练时,没有任何笔录或注释,Audiolm会在句法和语义上产生可行的语音连续性,同时还为看不见的说话者保持说话者身份和韵律。此外,我们演示了我们的方法如何通过产生连贯的钢琴音乐连续性来超越语音,尽管受过训练而没有任何象征性的音乐代表。
translated by 谷歌翻译