智能论文笔记

SS-IL: Separated Softmax for Incremental Learning

Hongjoon Ahn , Jihwan Kwak , Subin Lim , Hyeonsu Bang , Hyojun Kim , Taesup Moon

分类：计算机视觉

2020-03-31

我们考虑了类增量学习（CIL）问题，其中学习代理人通过逐步到达的培训数据批次不断学习新课程，并旨在在迄今为止所学的所有课程中很好地预测。问题的主要挑战是灾难性的遗忘，对于基于典范的示例性记忆方法，通常众所周知，遗忘通常是由于分类评分偏差引起的，该分类得分偏差是由于新类和新类之间的数据失衡而注射的旧课（在示例记忆中）。尽管已经提出了几种方法来通过一些其他后处理（例如，得分重新缩放或平衡的微调）来纠正这种分数偏见，但没有对这种偏见的根本原因进行系统分析。为此，我们分析了通过组合所有旧类和新类的输出得分来计算SoftMax概率的主要原因。然后，我们提出了一种新方法，称为分离的软磁性学习（SS-IL），该方法由分离的SoftMax（SS）输出层组成，结合了任务知识蒸馏（TKD）来解决此类偏见。在几个大规模CIL基准数据集的广泛实验结果中，我们通过在没有任何其他后处理的情况下获得更加平衡的预测分数来表明我们的SS-IL实现了强大的最新准确性。

translated by 谷歌翻译

Automated Precision Localization of Peripherally Inserted Central Catheter Tip through Model-Agnostic Multi-Stage Networks

Subin Park , Yoon Ki Cha , Soyoung Park , Kyung-Su Kim , Myung Jin Chung

分类：计算机视觉

2022-06-14

外围插入的中央导管（PICC）由于其长期的血管内渗透感具有低感染率，因此已被广泛用作代表性的中央静脉线（CVC）之一。但是，PICC的尖端错位频率很高，增加了刺穿，栓塞和心律不齐等并发症的风险。为了自动，精确地检测到它，使用最新的深度学习（DL）技术进行了各种尝试。但是，即使采用了这些方法，实际上仍然很难确定尖端位置，因为多个片段现象（MFP）发生在预测和提取PICC线之前预测尖端之前所需的PICC线的过程。这项研究旨在开发一种通常应用于现有模型的系统，并通过删除模型输出的MF来更准确地恢复PICC线路，从而精确地定位了检测其处置的实际尖端位置。为此，我们提出了一个基于多阶段DL的框架后处理，以后处理现有技术的PICC线提取结果。根据是否将MFCN应用于五个常规模型，将每个均方根误差（RMSE）和MFP发病率比较性能。在内部验证中，当将MFCN应用于现有单个模型时，MFP平均提高了45％。 RMSE从平均26.85mm（17.16至35.80mm）到9.72mm（9.37至10.98mm）的平均增长了63％以上。在外部验证中，当应用MFCN时，MFP的发病率平均下降32％，RMSE平均下降了65 \％。因此，通过应用提出的MFCN，我们观察到与现有模型相比，PICC尖端位置的显着/一致检测性能提高。

translated by 谷歌翻译

Transformer Network-based Reinforcement Learning Method for Power Distribution Network (PDN) Optimization of High Bandwidth Memory (HBM)

Hyunwook Park , Minsu Kim , Seongguk Kim , Keunwoo Kim , Haeyeon Kim , Taein Shin , Keeyoung Son , Boogyo Sim , Subin Kim , Seungtaek Jeong

分类：机器学习

2022-03-29

在本文中，我们首次提出了一种基于变压器网络的增强学习（RL）方法，用于对高带宽内存（HBM）优化功率分配网络（PDN）。所提出的方法可以提供最佳的去耦电容器（DITAP）设计，以最大程度地减少在多个端口上看到的PDN自我自我和转移阻抗。实现了基于注意力的变压器网络，以直接参数化decap优化策略。最佳性能得到了显着改善，因为注意力机制具有强大的表达，可以探索大规模的组合空间以进行脱离分配。此外，它可以捕获DETAP分配之间的顺序关系。由于可重复使用的网络在探测端口和decap分配候选者的位置上，可重复使用的网络大大减少了优化的计算时间。这是因为变压器网络具有上下文嵌入过程，可以捕获包括探测端口位置在内的元功能。此外，该网络接受了随机生成的数据集训练。因此，如果没有额外的培训，训练有素的网络可以解决新的脱邮件优化问题。由于网络的可扩展性，培训和数据成本的计算时间严重减少。由于其共同的体重属性，该网络可以在没有其他培训的情况下适应更大的问题。为了进行验证，我们将结果与常规遗传算法（GA），随机搜索（RS）和所有先前基于RL的方法进行比较。结果，所提出的方法在以下所有方面都优于：最佳性能，计算时间和数据效率。

translated by 谷歌翻译

Method to Classify Skin Lesions using Dermoscopic images

Dusa Sai Charan , Hemanth Nadipineni , Subin Sahayam , Umarani Jayaraman

分类：计算机视觉 | 机器学习

2020-08-21

Skin cancer is the most common cancer in the existing world constituting one-third of the cancer cases. Benign skin cancers are not fatal, can be cured with proper medication. But it is not the same as the malignant skin cancers. In the case of malignant melanoma, in its peak stage, the maximum life expectancy is less than or equal to 5 years. But, it can be cured if detected in early stages. Though there are numerous clinical procedures, the accuracy of diagnosis falls between 49% to 81% and is time-consuming. So, dermoscopy has been brought into the picture. It helped in increasing the accuracy of diagnosis but could not demolish the error-prone behaviour. A quick and less error-prone solution is needed to diagnose this majorly growing skin cancer. This project deals with the usage of deep learning in skin lesion classification. In this project, an automated model for skin lesion classification using dermoscopic images has been developed with CNN(Convolution Neural Networks) as a training model. Convolution neural networks are known for capturing features of an image. So, they are preferred in analyzing medical images to find the characteristics that drive the model towards success. Techniques like data augmentation for tackling class imbalance, segmentation for focusing on the region of interest and 10-fold cross-validation to make the model robust have been brought into the picture. This project also includes usage of certain preprocessing techniques like brightening the images using piece-wise linear transformation function, grayscale conversion of the image, resize the image. This project throws a set of valuable insights on how the accuracy of the model hikes with the bringing of new input strategies, preprocessing techniques. The best accuracy this model could achieve is 0.886.

translated by 谷歌翻译

X-MAS: Extremely Large-Scale Multi-Modal Sensor Dataset for Outdoor Surveillance in Real Environments

DongKi Noh , Changki Sung , Teayoung Uhm , WooJu Lee , Hyungtae Lim , Jaeseok Choi , Kyuewang Lee , Dasol Hong , Daeho Um , Inseop Chung

分类：机器人

2022-12-30

In robotics and computer vision communities, extensive studies have been widely conducted regarding surveillance tasks, including human detection, tracking, and motion recognition with a camera. Additionally, deep learning algorithms are widely utilized in the aforementioned tasks as in other computer vision tasks. Existing public datasets are insufficient to develop learning-based methods that handle various surveillance for outdoor and extreme situations such as harsh weather and low illuminance conditions. Therefore, we introduce a new large-scale outdoor surveillance dataset named eXtremely large-scale Multi-modAl Sensor dataset (X-MAS) containing more than 500,000 image pairs and the first-person view data annotated by well-trained annotators. Moreover, a single pair contains multi-modal data (e.g. an IR image, an RGB image, a thermal image, a depth image, and a LiDAR scan). This is the first large-scale first-person view outdoor multi-modal dataset focusing on surveillance tasks to the best of our knowledge. We present an overview of the proposed dataset with statistics and present methods of exploiting our dataset with deep learning-based algorithms. The latest information on the dataset and our study are available at https://github.com/lge-robot-navi, and the dataset will be available for download through a server.

translated by 谷歌翻译

Diagnosis of COVID-19 based on Chest Radiography

Mei Gah Lim , Hoi Leong Lee

分类：计算机视觉 | 机器学习

2022-12-26

The Coronavirus disease 2019 (COVID-19) was first identified in Wuhan, China, in early December 2019 and now becoming a pandemic. When COVID-19 patients undergo radiography examination, radiologists can observe the present of radiographic abnormalities from their chest X-ray (CXR) images. In this study, a deep convolutional neural network (CNN) model was proposed to aid radiologists in diagnosing COVID-19 patients. First, this work conducted a comparative study on the performance of modified VGG-16, ResNet-50 and DenseNet-121 to classify CXR images into normal, COVID-19 and viral pneumonia. Then, the impact of image augmentation on the classification results was evaluated. The publicly available COVID-19 Radiography Database was used throughout this study. After comparison, ResNet-50 achieved the highest accuracy with 95.88%. Next, after training ResNet-50 with rotation, translation, horizontal flip, intensity shift and zoom augmented dataset, the accuracy dropped to 80.95%. Furthermore, an ablation study on the effect of image augmentation on the classification results found that the combinations of rotation and intensity shift augmentation methods obtained an accuracy higher than baseline, which is 96.14%. Finally, ResNet-50 with rotation and intensity shift augmentations performed the best and was proposed as the final classification model in this work. These findings demonstrated that the proposed classification model can provide a promising result for COVID-19 diagnosis.

translated by 谷歌翻译

Feature Acquisition using Monte Carlo Tree Search

Sungsoo Lim , Diego Klabjan , Mark Shapiro

分类：机器学习

2022-12-21

Feature acquisition algorithms address the problem of acquiring informative features while balancing the costs of acquisition to improve the learning performances of ML models. Previous approaches have focused on calculating the expected utility values of features to determine the acquisition sequences. Other approaches formulated the problem as a Markov Decision Process (MDP) and applied reinforcement learning based algorithms. In comparison to previous approaches, we focus on 1) formulating the feature acquisition problem as a MDP and applying Monte Carlo Tree Search, 2) calculating the intermediary rewards for each acquisition step based on model improvements and acquisition costs and 3) simultaneously optimizing model improvement and acquisition costs with multi-objective Monte Carlo Tree Search. With Proximal Policy Optimization and Deep Q-Network algorithms as benchmark, we show the effectiveness of our proposed approach with experimental study.

translated by 谷歌翻译

Automatic Network Adaptation for Ultra-Low Uniform-Precision Quantization

Seongmin Park , Beomseok Kwon , Jieun Lim , Kyuyoung Sim , Taeho Kim , Jungwook Choi

分类：计算机视觉 | 人工智能

2022-12-21

Uniform-precision neural network quantization has gained popularity since it simplifies densely packed arithmetic unit for high computing capability. However, it ignores heterogeneous sensitivity to the impact of quantization errors across the layers, resulting in sub-optimal inference accuracy. This work proposes a novel neural architecture search called neural channel expansion that adjusts the network structure to alleviate accuracy degradation from ultra-low uniform-precision quantization. The proposed method selectively expands channels for the quantization sensitive layers while satisfying hardware constraints (e.g., FLOPs, PARAMs). Based on in-depth analysis and experiments, we demonstrate that the proposed method can adapt several popular networks channels to achieve superior 2-bit quantization accuracy on CIFAR10 and ImageNet. In particular, we achieve the best-to-date Top-1/Top-5 accuracy for 2-bit ResNet50 with smaller FLOPs and the parameter size.

translated by 谷歌翻译

Artificial Intelligence for Health Message Generation: Theory, Method, and an Empirical Study Using Prompt Engineering

Sue Lim , Ralf Schmälzle

分类：自然语言处理

2022-12-14

This study introduces and examines the potential of an AI system to generate health awareness messages. The topic of folic acid, a vitamin that is critical during pregnancy, served as a test case. Using prompt engineering, we generated messages that could be used to raise awareness and compared them to retweeted human-generated messages via computational and human evaluation methods. The system was easy to use and prolific, and computational analyses revealed that the AI-generated messages were on par with human-generated ones in terms of sentiment, reading ease, and semantic content. Also, the human evaluation study showed that AI-generated messages ranked higher in message quality and clarity. We discuss the theoretical, practical, and ethical implications of these results.

translated by 谷歌翻译

Cross-Domain Transfer via Semantic Skill Imitation

Karl Pertsch , Ruta Desai , Vikash Kumar , Franziska Meier , Joseph J. Lim , Dhruv Batra , Akshara Rai

分类：机器学习 | 机器人

2022-12-14

We propose an approach for semantic imitation, which uses demonstrations from a source domain, e.g. human videos, to accelerate reinforcement learning (RL) in a different target domain, e.g. a robotic manipulator in a simulated kitchen. Instead of imitating low-level actions like joint velocities, our approach imitates the sequence of demonstrated semantic skills like "opening the microwave" or "turning on the stove". This allows us to transfer demonstrations across environments (e.g. real-world to simulated kitchen) and agent embodiments (e.g. bimanual human demonstration to robotic arm). We evaluate on three challenging cross-domain learning problems and match the performance of demonstration-accelerated RL approaches that require in-domain demonstrations. In a simulated kitchen environment, our approach learns long-horizon robot manipulation tasks, using less than 3 minutes of human video demonstrations from a real-world kitchen. This enables scaling robot learning via the reuse of demonstrations, e.g. collected as human videos, for learning in any number of target domains.

translated by 谷歌翻译