智能论文笔记

Click Prediction Boosting via Ensemble Learning Pipelines

Çağatay Demirel , A. Aylin Tokuç , Ahmet Tezcan Tekin

分类：机器学习

2022-06-07

在线旅行社（OTA）的网站在元搜索竞标引擎上宣传。预测酒店将收到的单击数量的给定出价金额的问题是管理元搜索引擎上OTA广告活动的重要一步，因为出价时间的点击次数定义了要生成的成本。在这项工作中，各种回归器都结束了，以提高点击预测性能。按照预处理程序，将功能集分为火车和测试组，具体取决于样品的记录日期。然后，将数据收集进行基于XGBoost的缩小降低，从而大大降低了特征的维度。然后通过将贝叶斯高参数优化应用于XGBoost，LightGBM和SGD模型来找到最佳的高参数。单独测试了十种不同的机器学习模型，并将它们组合在一起以创建合奏模型。提出了三种替代合奏解决方案。相同的测试集用于测试单个和集合模型，46个模型组合的结果表明，堆栈集合模型得出所有的R2分数。总之，整体模型将预测性能提高了约10％。

translated by 谷歌翻译

ESCADA: Efficient Safety and Context Aware Dose Allocation for Precision Medicine

Ilker Demirel , Ahmet Alparslan Celik , Cem Tekin

分类：机器学习

2021-11-26

寻找最佳个性化的治疗方案被认为是最具挑战性的精确药物问题之一。各种患者特征会影响对治疗的反应，因此，没有一种尺寸适合 - 所有方案。此外，甚至在治疗过程中均不服用单一不安全剂量可能对患者的健康产生灾难性后果。因此，个性化治疗模型必须确保患者{\ EM安全} {\ EM有效}优化疗程。在这项工作中，我们研究了一种普遍的和基本的医学问题，其中治疗旨在在范围内保持生理变量，优选接近目标水平。这样的任务也与其他域中相关。我们提出ESCADA，这是一个用于这个问题结构的通用算法，在确保患者安全的同时制作个性化和背景感知最佳剂量推荐。我们在Escada的遗憾中获得了高概率的上限以及安全保证。最后，我们对1型糖尿病疾病的{\ em推注胰岛素剂量}分配问题进行了广泛的模拟，并比较ESCADA对汤普森采样，规则的剂量分配者和临床医生的表现。

translated by 谷歌翻译

Beyond the Golden Ratio for Variational Inequality Algorithms

Ahmet Alacaoglu , Axel Böhm , Yura Malitsky

分类：机器学习 | (统计)机器学习

2022-12-28

We improve the understanding of the $\textit{golden ratio algorithm}$, which solves monotone variational inequalities (VI) and convex-concave min-max problems via the distinctive feature of adapting the step sizes to the local Lipschitz constants. Adaptive step sizes not only eliminate the need to pick hyperparameters, but they also remove the necessity of global Lipschitz continuity and can increase from one iteration to the next. We first establish the equivalence of this algorithm with popular VI methods such as reflected gradient, Popov or optimistic gradient descent-ascent in the unconstrained case with constant step sizes. We then move on to the constrained setting and introduce a new analysis that allows to use larger step sizes, to complete the bridge between the golden ratio algorithm and the existing algorithms in the literature. Doing so, we actually eliminate the link between the golden ratio $\frac{1+\sqrt{5}}{2}$ and the algorithm. Moreover, we improve the adaptive version of the algorithm, first by removing the maximum step size hyperparameter (an artifact from the analysis) to improve the complexity bound, and second by adjusting it to nonmonotone problems with weak Minty solutions, with superior empirical performance.

translated by 谷歌翻译

Building Segmentation on Satellite Images and Performance of Post-Processing Methods

Metehan Yalçın , Ahmet Alp Kindiroglu , Furkan Burak Bağcı , Ufuk Uyan , Mahiye Uluyağmur Öztürk

分类：计算机视觉

2022-12-28

Researchers are doing intensive work on satellite images due to the information it contains with the development of computer vision algorithms and the ease of accessibility to satellite images. Building segmentation of satellite images can be used for many potential applications such as city, agricultural, and communication network planning. However, since no dataset exists for every region, the model trained in a region must gain generality. In this study, we trained several models in China and post-processing work was done on the best model selected among them. These models are evaluated in the Chicago region of the INRIA dataset. As can be seen from the results, although state-of-art results in this area have not been achieved, the results are promising. We aim to present our initial experimental results of a building segmentation from satellite images in this study.

translated by 谷歌翻译

Semi-Supervised Domain Adaptation for Semantic Segmentation of Roads from Satellite Images

Ahmet Alp Kindiroglu , Metehan Yalçın , Furkan Burak Bağcı , Mahiye Uluyağmur Öztürk

分类：计算机视觉

2022-12-26

This paper presents the preliminary findings of a semi-supervised segmentation method for extracting roads from sattelite images. Artificial Neural Networks and image segmentation methods are among the most successful methods for extracting road data from satellite images. However, these models require large amounts of training data from different regions to achieve high accuracy rates. In cases where this data needs to be of more quantity or quality, it is a standard method to train deep neural networks by transferring knowledge from annotated data obtained from different sources. This study proposes a method that performs path segmentation with semi-supervised learning methods. A semi-supervised field adaptation method based on pseudo-labeling and Minimum Class Confusion method has been proposed, and it has been observed to increase performance in targeted datasets.

translated by 谷歌翻译

Normalized Stochastic Gradient Descent Training of Deep Neural Networks

Salih Atici , Hongyi Pan , Ahmet Enis Cetin

分类：机器学习

2022-12-20

In this paper, we introduce a novel optimization algorithm for machine learning model training called Normalized Stochastic Gradient Descent (NSGD) inspired by Normalized Least Mean Squares (NLMS) from adaptive filtering. When we train a high-complexity model on a large dataset, the learning rate is significantly important as a poor choice of optimizer parameters can lead to divergence. The algorithm updates the new set of network weights using the stochastic gradient but with $\ell_1$ and $\ell_2$-based normalizations on the learning rate parameter similar to the NLMS algorithm. Our main difference from the existing normalization methods is that we do not include the error term in the normalization process. We normalize the update term using the input vector to the neuron. Our experiments present that the model can be trained to a better accuracy level on different initial settings using our optimization algorithm. In this paper, we demonstrate the efficiency of our training algorithm using ResNet-20 and a toy neural network on different benchmark datasets with different initializations. The NSGD improves the accuracy of the ResNet-20 from 91.96\% to 92.20\% on the CIFAR-10 dataset.

translated by 谷歌翻译

Building Height Prediction with Instance Segmentation

Furkan Burak Bagci , Ahmet Alp Kindriroglu , Metehan Yalcin , Ufuk Uyan , Mahiye Uluyagmur Ozturk

分类：计算机视觉

2022-12-19

Extracting building heights from satellite images is an active research area used in many fields such as telecommunications, city planning, etc. Many studies utilize DSM (Digital Surface Models) generated with lidars or stereo images for this purpose. Predicting the height of the buildings using only RGB images is challenging due to the insufficient amount of data, low data quality, variations of building types, different angles of light and shadow, etc. In this study, we present an instance segmentation-based building height extraction method to predict building masks with their respective heights from a single RGB satellite image. We used satellite images with building height annotations of certain cities along with an open-source satellite dataset with the transfer learning approach. We reached, the bounding box mAP 59, the mask mAP 52.6, and the average accuracy value of 70% for buildings belonging to each height class in our test set.

translated by 谷歌翻译

REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory

Ziniu Hu , Ahmet Iscen , Chen Sun , Zirui Wang , Kai-Wei Chang , Yizhou Sun , Cordelia Schmid , David A. Ross , Alireza Fathi

分类：计算机视觉 | 人工智能

2022-12-10

In this paper, we propose an end-to-end Retrieval-Augmented Visual Language Model (REVEAL) that learns to encode world knowledge into a large-scale memory, and to retrieve from it to answer knowledge-intensive queries. REVEAL consists of four key components: the memory, the encoder, the retriever and the generator. The large-scale memory encodes various sources of multimodal world knowledge (e.g. image-text pairs, question answering pairs, knowledge graph triplets, etc) via a unified encoder. The retriever finds the most relevant knowledge entries in the memory, and the generator fuses the retrieved knowledge with the input query to produce the output. A key novelty in our approach is that the memory, encoder, retriever and generator are all pre-trained end-to-end on a massive amount of data. Furthermore, our approach can use a diverse set of multimodal knowledge sources, which is shown to result in significant gains. We show that REVEAL achieves state-of-the-art results on visual question answering and image captioning.

translated by 谷歌翻译

ChromaCorrect: Prescription Correction in Virtual Reality Headsets through Perceptual Guidance

Ahmet Güzel , Jeanne Beyazian , Praneeth Chakravarthula , Kaan Akşit

分类：机器学习

2022-12-08

A large portion of today's world population suffer from vision impairments and wear prescription eyeglasses. However, eyeglasses causes additional bulk and discomfort when used with augmented and virtual reality headsets, thereby negatively impacting the viewer's visual experience. In this work, we remedy the usage of prescription eyeglasses in Virtual Reality (VR) headsets by shifting the optical complexity completely into software and propose a prescription-aware rendering approach for providing sharper and immersive VR imagery. To this end, we develop a differentiable display and visual perception model encapsulating display-specific parameters, color and visual acuity of human visual system and the user-specific refractive errors. Using this differentiable visual perception model, we optimize the rendered imagery in the display using stochastic gradient-descent solvers. This way, we provide prescription glasses-free sharper images for a person with vision impairments. We evaluate our approach on various displays, including desktops and VR headsets, and show significant quality and contrast improvements for users with vision impairments.

translated by 谷歌翻译

Real Time Incremental Image Mosaicking Without Use of Any Camera Parameter

Suleyman Melih Portakal , Ahmet Alp Kindiroglu , Mahiye Uluyagmur Ozturk

分类：计算机视觉

2022-12-05

Over the past decade, there has been a significant increase in the use of Unmanned Aerial Vehicles (UAVs) to support a wide variety of missions, such as remote surveillance, vehicle tracking, and object detection. For problems involving processing of areas larger than a single image, the mosaicking of UAV imagery is a necessary step. Real-time image mosaicking is used for missions that requires fast response like search and rescue missions. It typically requires information from additional sensors, such as Global Position System (GPS) and Inertial Measurement Unit (IMU), to facilitate direct orientation, or 3D reconstruction approaches to recover the camera poses. This paper proposes a UAV-based system for real-time creation of incremental mosaics which does not require either direct or indirect camera parameters such as orientation information. Inspired by previous approaches, in the mosaicking process, feature extraction from images, matching of similar key points between images, finding homography matrix to warp and align images, and blending images to obtain mosaics better looking, plays important roles in the achievement of the high quality result. Edge detection is used in the blending step as a novel approach. Experimental results show that real-time incremental image mosaicking process can be completed satisfactorily and without need for any additional camera parameters.

translated by 谷歌翻译