智能论文笔记

Biblio-Analysis of Cohort Intelligence (CI) Algorithm and its allied applications from Scopus and Web of Science Perspective

Ishaan Kale , Rahul Joshi , Kalyani Kadam

分类：人工智能

2022-09-07

队列智能或CI是这种新型优化算法之一。自成立以来，在很短的范围内成功地应用于各个领域，并且观察到与同类算法相比，其结果是有效的。到目前为止，在CI及其相关应用程序上还没有进行过这种类型的文献计量分析。因此，对于那些希望将CI提升到新水平的人来说，这篇研究论文将是破冰船。在这篇研究论文中，Scopus中可用的CI出版物通过图表，有关作者，源标题，关键字的网络图进行分析，这些年来，期刊和期刊。在某种程度上，该文献计量学论文以其文献计量详细信息来展示CI，其应用和详细的系统审查。

translated by 谷歌翻译

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

Qiucheng Wu , Yujian Liu , Handong Zhao , Ajinkya Kale , Trung Bui , Tong Yu , Zhe Lin , Yang Zhang , Shiyu Chang

分类：计算机视觉

2022-12-16

Generative models have been widely studied in computer vision. Recently, diffusion models have drawn substantial attention due to the high quality of their generated images. A key desired property of image generative models is the ability to disentangle different attributes, which should enable modification towards a style without changing the semantic content, and the modification parameters should generalize to different images. Previous studies have found that generative adversarial networks (GANs) are inherently endowed with such disentanglement capability, so they can perform disentangled image editing without re-training or fine-tuning the network. In this work, we explore whether diffusion models are also inherently equipped with such a capability. Our finding is that for stable diffusion models, by partially changing the input text embedding from a neutral description (e.g., "a photo of person") to one with style (e.g., "a photo of person with smile") while fixing all the Gaussian random noises introduced during the denoising process, the generated images can be modified towards the target style without changing the semantic content. Based on this finding, we further propose a simple, light-weight image editing algorithm where the mixing weights of the two text embeddings are optimized for style matching and content preservation. This entire process only involves optimizing over around 50 parameters and does not fine-tune the diffusion model itself. Experiments show that the proposed method can modify a wide range of attributes, with the performance outperforming diffusion-model-based image-editing algorithms that require fine-tuning. The optimized weights generalize well to different images. Our code is publicly available at https://github.com/UCSB-NLP-Chang/DiffusionDisentanglement.

translated by 谷歌翻译

Adversarial Attacks and Defences for Skin Cancer Classification

Vinay Jogani , Joy Purohit , Ishaan Shivhare , Samina Attari , Shraddha Surtkar

分类：计算机视觉 | 机器学习

2022-12-13

There has been a concurrent significant improvement in the medical images used to facilitate diagnosis and the performance of machine learning techniques to perform tasks such as classification, detection, and segmentation in recent years. As a result, a rapid increase in the usage of such systems can be observed in the healthcare industry, for instance in the form of medical image classification systems, where these models have achieved diagnostic parity with human physicians. One such application where this can be observed is in computer vision tasks such as the classification of skin lesions in dermatoscopic images. However, as stakeholders in the healthcare industry, such as insurance companies, continue to invest extensively in machine learning infrastructure, it becomes increasingly important to understand the vulnerabilities in such systems. Due to the highly critical nature of the tasks being carried out by these machine learning models, it is necessary to analyze techniques that could be used to take advantage of these vulnerabilities and methods to defend against them. This paper explores common adversarial attack techniques. The Fast Sign Gradient Method and Projected Descent Gradient are used against a Convolutional Neural Network trained to classify dermatoscopic images of skin lesions. Following that, it also discusses one of the most popular adversarial defense techniques, adversarial training. The performance of the model that has been trained on adversarial examples is then tested against the previously mentioned attacks, and recommendations to improve neural networks robustness are thus provided based on the results of the experiment.

translated by 谷歌翻译

FAIR AI Models in High Energy Physics

Javier Duarte , Haoyang Li , Avik Roy , Ruike Zhu , E. A. Huerta , Daniel Diaz , Philip Harris , Raghav Kansal , Daniel S. Katz , Ishaan H. Kavoori

分类：机器学习

2022-12-09

The findable, accessible, interoperable, and reusable (FAIR) data principles have provided a framework for examining, evaluating, and improving how we share data with the aim of facilitating scientific discovery. Efforts have been made to generalize these principles to research software and other digital products. Artificial intelligence (AI) models -- algorithms that have been trained on data rather than explicitly programmed -- are an important target for this because of the ever-increasing pace with which AI is transforming scientific and engineering domains. In this paper, we propose a practical definition of FAIR principles for AI models and create a FAIR AI project template that promotes adherence to these principles. We demonstrate how to implement these principles using a concrete example from experimental high energy physics: a graph neural network for identifying Higgs bosons decaying to bottom quarks. We study the robustness of these FAIR AI models and their portability across hardware architectures and software frameworks, and report new insights on the interpretability of AI predictions by studying the interplay between FAIR datasets and AI models. Enabled by publishing FAIR AI models, these studies pave the way toward reliable and automated AI-driven scientific discovery.

translated by 谷歌翻译

Task Bias in Vision-Language Models

Sachit Menon , Ishaan Preetam Chandratreya , Carl Vondrick

分类：计算机视觉 | 机器学习

2022-12-08

Incidental supervision from language has become a popular approach for learning generic visual representations that can be prompted to perform many recognition tasks in computer vision. We conduct an in-depth exploration of the CLIP model and show that its visual representation is often strongly biased towards solving some tasks more than others. Moreover, which task the representation will be biased towards is unpredictable, with little consistency across images. To resolve this task bias, we show how to learn a visual prompt that guides the representation towards features relevant to their task of interest. Our results show that these visual prompts can be independent of the input image and still effectively provide a conditioning mechanism to steer visual representations towards the desired task.

translated by 谷歌翻译

A Hybrid Deep Learning Anomaly Detection Framework for Intrusion Detection

Rahul Kale , Zhi Lu , Kar Wai Fok , Vrizlynn L. L. Thing

分类：人工智能 | 机器学习

2022-12-02

Cyber intrusion attacks that compromise the users' critical and sensitive data are escalating in volume and intensity, especially with the growing connections between our daily life and the Internet. The large volume and high complexity of such intrusion attacks have impeded the effectiveness of most traditional defence techniques. While at the same time, the remarkable performance of the machine learning methods, especially deep learning, in computer vision, had garnered research interests from the cyber security community to further enhance and automate intrusion detections. However, the expensive data labeling and limitation of anomalous data make it challenging to train an intrusion detector in a fully supervised manner. Therefore, intrusion detection based on unsupervised anomaly detection is an important feature too. In this paper, we propose a three-stage deep learning anomaly detection based network intrusion attack detection framework. The framework comprises an integration of unsupervised (K-means clustering), semi-supervised (GANomaly) and supervised learning (CNN) algorithms. We then evaluated and showed the performance of our implemented framework on three benchmark datasets: NSL-KDD, CIC-IDS2018, and TON_IoT.

translated by 谷歌翻译

ART/ATK: A research platform for assessing and mitigating the sim-to-real gap in robotics and autonomous vehicle engineering

Asher Elmquist , Aaron Young , Thomas Hansen , Sriram Ashokkumar , Stefan Caldararu , Abhiraj Dashora , Ishaan Mahajan , Harry Zhang , Luning Fang , He Shen

分类：机器人

2022-11-09

We discuss a platform that has both software and hardware components, and whose purpose is to support research into characterizing and mitigating the sim-to-real gap in robotics and vehicle autonomy engineering. The software is operating-system independent and has three main components: a simulation engine called Chrono, which supports high-fidelity vehicle and sensor simulation; an autonomy stack for algorithm design and testing; and a development environment that supports visualization and hardware-in-the-loop experimentation. The accompanying hardware platform is a 1/6th scale vehicle augmented with reconfigurable mountings for computing, sensing, and tracking. Since this vehicle platform has a digital twin within the simulation environment, one can test the same autonomy perception, state estimation, or controls algorithms, as well as the processors they run on, in both simulation and reality. A demonstration is provided to show the utilization of this platform for autonomy research. Future work will concentrate on augmenting ART/ATK with support for a full-sized Chevy Bolt EUV, which will be made available to this group in the immediate future.

translated by 谷歌翻译

CLSE: Corpus of Linguistically Significant Entities

Aleksandr Chuklin , Justin Zhao , Mihir Kale

分类：自然语言处理

2022-11-04

One of the biggest challenges of natural language generation (NLG) is the proper handling of named entities. Named entities are a common source of grammar mistakes such as wrong prepositions, wrong article handling, or incorrect entity inflection. Without factoring linguistic representation, such errors are often underrepresented when evaluating on a small set of arbitrarily picked argument values, or when translating a dataset from a linguistically simpler language, like English, to a linguistically complex language, like Russian. However, for some applications, broadly precise grammatical correctness is critical -- native speakers may find entity-related grammar errors silly, jarring, or even offensive. To enable the creation of more linguistically diverse NLG datasets, we release a Corpus of Linguistically Significant Entities (CLSE) annotated by linguist experts. The corpus includes 34 languages and covers 74 different semantic types to support various applications from airline ticketing to video games. To demonstrate one possible use of CLSE, we produce an augmented version of the Schema-Guided Dialog Dataset, SGD-CLSE. Using the CLSE's entities and a small number of human translations, we create a linguistically representative NLG evaluation benchmark in three languages: French (high-resource), Marathi (low-resource), and Russian (highly inflected language). We establish quality baselines for neural, template-based, and hybrid NLG systems and discuss the strengths and weaknesses of each approach.

translated by 谷歌翻译

Generalized Probabilistic U-Net for medical image segementation

Ishaan Bhat , Josien P. W. Pluim , Hugo J. Kuijf

分类：计算机视觉 | 机器学习

2022-07-26

我们提出了广义的概率U-NET，该概率U-NET通过将高斯分布的更通用形式作为潜在空间分布来扩展概率的U-NET，可以更好地近似参考分段中的不确定性。我们研究了潜在空间分布的选择对使用LIDC-IDRI数据集捕获参考分割中的不确定性的效果。我们表明，分布的选择会影响预测的样本多样性及其相对于参考分割的重叠。对于LIDC-IDRI数据集，我们表明，使用高斯人的混合物会导致广义能量距离（GED）度量相对于标准概率U-NET的统计显着改善。我们已经在https://github.com/ishaanb92/generalizedprobabilisticunet上提供了实施。

translated by 谷歌翻译

Private Matrix Approximation and Geometry of Unitary Orbits

Oren Mangoubi , Yikai Wu , Satyen Kale , Abhradeep Guha Thakurta , Nisheeth K. Vishnoi

分类：机器学习 | (统计)机器学习

2022-07-06

考虑以下优化问题：给定$ n \ times n $矩阵$ a $和$ \ lambda $，最大化$ \ langle a，u \ lambda u^*\ rangle $，其中$ u $ $ u $在unital Group $ \ mathrm上变化{u}（n）$。这个问题试图通过矩阵大约$ a $，其频谱与$ \ lambda $相同，并且通过将$ \ lambda $设置为适当的对角矩阵，可以恢复矩阵近似问题，例如pca和等级$ k $近似。我们研究了在使用用户的私人数据构建矩阵$ a $的设置中，为这种优化问题设计差异化私有算法的问题。我们给出有效的私有算法，在近似误差上带有上和下限。我们的结果统一并改进了有关私人矩阵近似问题的几项先前的作品。他们依靠格拉斯曼尼亚人的包装/覆盖数量范围扩展到应该具有独立利益的单一轨道。

translated by 谷歌翻译