Smart Paper Notes

Point2Cyl: Reverse Engineering 3D Objects from Point Clouds to Extrusion Cylinders

Mikaela Angelina Uy , Yen-yu Chang , Minhyuk Sung , Purvi Goel , Joseph Lambourne , Tolga Birdal , Leonidas Guibas

Category: Computer Vision

2021-12-17

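We propose Point2Cyl, a supervised network that transforms a raw 3D point cloud into a set of extrusion cylinders. Reverse engineering from raw geometry to a CAD model is an essential task for enabling manipulation of 3D data in shape editing software, thereby expanding its use in many downstream applications. In particular, CAD models formed as a sequence of extrusion cylinders (a 2D sketch plus an extrusion axis and range) and their boolean combinations are not only widely used in the CAD community and software but also offer great shape expressivity compared with a limited set of primitive types (e.g., planes, spheres, and cylinders). In this work, we introduce a neural network that solves the extrusion cylinder decomposition problem by first learning underlying geometric proxies. Precisely, our approach first predicts per-point segmentation, base/barrel labels, and normals, then estimates the underlying extrusion parameters in differentiable and closed-form formulations. Our experiments show that our approach achieves the best performance on two recent CAD datasets, Fusion Gallery and DeepCAD, and we further showcase our approach on reverse engineering and editing.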

translated by Google Translate

Interactive Segmentation of Radiance Fields

Rahul Goel , Dhawal Sirikonda , Saurabh Saini , PJ Narayanan

Category: Computer Vision

2022-12-27

Radiance Fields (RF) are popular to represent casually-captured scenes for new view generation and have been used for applications beyond it. Understanding and manipulating scenes represented as RFs have to naturally follow to facilitate mixed reality on personal spaces. Semantic segmentation of objects in the 3D scene is an important step for that. Prior segmentation efforts using feature distillation show promise but don't scale to complex objects with diverse appearance. We present a framework to interactively segment objects with fine structure. Nearest neighbor feature matching identifies high-confidence regions of the objects using distilled features. Bilateral filtering in a joint spatio-semantic space grows the region to recover accurate segmentation. We show state-of-the-art results of segmenting objects from RFs and compositing them to another scene, changing appearance, etc., moving closer to rich scene manipulation and understanding. Project Page: https://rahul-goel.github.io/isrf/

Human-Guided Fair Classification for Natural Language Processing

Florian E. Dorner , Momchil Peychev , Nikola Konstantinov , Naman Goel , Elliott Ash , Martin Vechev

Categories: Natural Language Processing | Artificial Intelligence | Machine Learning

2022-12-20

Text classifiers have promising applications in high-stake tasks such as resume screening and content moderation. These classifiers must be fair and avoid discriminatory decisions by being invariant to perturbations of sensitive attributes such as gender or ethnicity. However, there is a gap between human intuition about these perturbations and the formal similarity specifications capturing them. While existing research has started to address this gap, current methods are based on hardcoded word replacements, resulting in specifications with limited expressivity or ones that fail to fully align with human intuition (e.g., in cases of asymmetric counterfactuals). This work proposes novel methods for bridging this gap by discovering expressive and intuitive individual fairness specifications. We show how to leverage unsupervised style transfer and GPT-3's zero-shot capabilities to automatically generate expressive candidate pairs of semantically similar sentences that differ along sensitive attributes. We then validate the generated pairs via an extensive crowdsourcing study, which confirms that a lot of these pairs align with human intuition about fairness in the context of toxicity classification. Finally, we show how limited amounts of human feedback can be leveraged to learn a similarity specification that can be used to train downstream fairness-aware models.

StyleTRF: Stylizing Tensorial Radiance Fields

Rahul Goel , Sirikonda Dhawal , Saurabh Saini , P. J. Narayanan

Category: Computer Vision

2022-12-19

Stylized view generation of scenes captured casually using a camera has received much attention recently. The geometry and appearance of the scene are typically captured as neural point sets or neural radiance fields in the previous work. An image stylization method is used to stylize the captured appearance by training its network jointly or iteratively with the structure capture network. The state-of-the-art SNeRF method trains the NeRF and stylization network in an alternating manner. These methods have high training time and require joint optimization. In this work, we present StyleTRF, a compact, quick-to-optimize strategy for stylized view generation using TensoRF. The appearance part is fine-tuned using sparse stylized priors of a few views rendered using the TensoRF representation for a few iterations. Our method thus effectively decouples style-adaption from view capture and is much faster than the previous methods. We show state-of-the-art results on several scenes used for this purpose.

Metaheuristic for Hub-Spoke Facility Location Problem: Application to Indian E-commerce Industry

Aakash Sachdeva , Bhupinder Singh , Rahul Prasad , Nakshatra Goel , Ronit Mondal , Jatin Munjal , Abhishek Bhatnagar , Manjeet Dahiya

Category: Machine Learning

2022-12-16

The Indian e-commerce industry has evolved over the last decade and is expected to grow over the next few years. The focus has now shifted to turnaround time (TAT) due to the emergence of many third-party logistics providers and higher customer expectations. The key consideration for delivery providers is to balance their overall operating costs while meeting the TAT promised to their customers. E-commerce delivery partners operate through a network of facilities whose strategic locations help to run the operations efficiently. In this work, we identify the locations of hubs throughout the country and their corresponding mapping with the distribution centers. The objective is to minimize the total network costs with TAT adherence. We use a Genetic Algorithm and leverage business constraints to reduce the solution search space and hence the solution time. The results indicate an improvement of 9.73% in TAT compliance compared with the current scenario.

Reinforcement Learning for Agile Active Target Sensing with a UAV

Harsh Goel , Laura Jarin Lipschitz , Saurav Agarwal , Sandeep Manjanna , Vijay Kumar

Categories: Robotics | Artificial Intelligence

2022-12-16

Active target sensing is the task of discovering and classifying an unknown number of targets in an environment and is critical in search-and-rescue missions. This paper develops a deep reinforcement learning approach to plan informative trajectories that increase the likelihood for an uncrewed aerial vehicle (UAV) to discover missing targets. Our approach efficiently (1) explores the environment to discover new targets, (2) exploits its current belief of the target states and incorporates inaccurate sensor models for high-fidelity classification, and (3) generates dynamically feasible trajectories for an agile UAV by employing a motion primitive library. Extensive simulations on randomly generated environments show that our approach is more efficient in discovering and classifying targets than several other baselines. A unique characteristic of our approach, in contrast to heuristic informative path planning approaches, is that it is robust to varying amounts of deviations of the prior belief from the true target distribution, thereby alleviating the challenge of designing heuristics specific to the application conditions.

DAMP: Doubly Aligned Multilingual Parser for Task-Oriented Dialogue

William Held , Christopher Hidey , Fei Liu , Eric Zhu , Rahul Goel , Diyi Yang , Rushin Shah

Categories: Natural Language Processing | Machine Learning

2022-12-15

Modern virtual assistants use internal semantic parsing engines to convert user utterances to actionable commands. However, prior work has demonstrated that semantic parsing is a difficult multilingual transfer task with low transfer efficiency compared to other tasks. In global markets such as India and Latin America, this is a critical issue as switching between languages is prevalent for bilingual users. In this work, we dramatically improve the zero-shot performance of a multilingual and codeswitched semantic parsing system using two stages of multilingual alignment. First, we show that contrastive alignment pretraining improves both English performance and transfer efficiency. We then introduce a constrained optimization approach for hyperparameter-free adversarial alignment during finetuning. Our Doubly Aligned Multilingual Parser (DAMP) improves mBERT transfer performance by 3x, 6x, and 81x on the Spanglish, Hinglish, and Multilingual Task Oriented Parsing benchmarks respectively and outperforms XLM-R and mT5-Large using 3.2x fewer parameters.

Deep Learning for Brain Age Estimation: A Systematic Review

M. Tanveer , M. A. Ganaie , Iman Beheshti , Tripti Goel , Nehal Ahmad , Kuan-Ting Lai , Kaizhu Huang , Yu-Dong Zhang , Javier Del Ser , Chin-Teng Lin

Categories: Artificial Intelligence | Computer Vision | Machine Learning

2022-12-07

Over the years, machine learning models have been successfully employed on neuroimaging data for accurately predicting brain age. Deviations from the healthy brain aging pattern are associated with accelerated brain aging and brain abnormalities. Hence, efficient and accurate diagnosis techniques are required for eliciting accurate brain age estimations. Several contributions have been reported in the past for this purpose, resorting to different data-driven modeling methods. Recently, deep neural networks (also referred to as deep learning) have become prevalent in manifold neuroimaging studies, including brain age estimation. In this review, we offer a comprehensive analysis of the literature related to the adoption of deep learning for brain age estimation with neuroimaging data. We detail and analyze different deep learning architectures used for this application, pausing at research works published to date quantitatively exploring their application. We also examine different brain age estimation frameworks, comparatively exposing their advantages and weaknesses. Finally, the review concludes with an outlook towards future directions that should be followed by prospective studies. The ultimate goal of this paper is to establish a common and informed reference for newcomers and experienced researchers willing to approach brain age estimation by using deep learning models.

A Contextual Bandit Approach for Learning to Plan in Environments with Probabilistic Goal Configurations

Sohan Rudra , Saksham Goel , Anirban Santara , Claudio Gentile , Laurent Perron , Fei Xia , Vikas Sindhwani , Carolina Parada , Gaurav Aggarwal

Categories: Robotics | Machine Learning

2022-11-29

Object-goal navigation (Object-nav) entails searching, recognizing and navigating to a target object. Object-nav has been extensively studied by the Embodied-AI community, but most solutions are restricted to considering static objects (e.g., television, fridge, etc.). We propose a modular framework for object-nav that is able to efficiently search indoor environments for not just static objects but also movable objects (e.g., fruits, glasses, phones, etc.) that frequently change their positions due to human intervention. Our contextual-bandit agent efficiently explores the environment by showing optimism in the face of uncertainty and learns a model of the likelihood of spotting different objects from each navigable location. The likelihoods are used as rewards in a weighted minimum latency solver to deduce a trajectory for the robot. We evaluate our algorithms in two simulated environments and a real-world setting to demonstrate high sample efficiency and reliability.

Quantitative Susceptibility Mapping in Cognitive Decline: A Review of Technical Aspects and Applications

Shradha Verma , Tripti Goel , M Tanveer

Category: Artificial Intelligence

2022-11-09

In the human brain, essential iron molecules for proper neurological functioning exist in transferrin (tf) and ferritin (Fe3) forms. However, an abnormal increase in iron manifests as iron overload, which reacts with hydrogen peroxide. This reaction generates hydroxyl radicals and iron in higher oxidation states. Further, this reaction causes tissue damage or cognitive decline in the brain and also leads to neurodegenerative diseases. The susceptibility difference due to iron overload within the volume of interest (VOI) is responsible for field perturbation in MRI and can help in estimating neural disorders. The quantitative susceptibility mapping (QSM) technique can estimate susceptibility alteration and assist in quantifying the local tissue susceptibility differences. It has attracted many researchers and clinicians to diagnose and detect neural disorders such as Parkinson's, Alzheimer's, multiple sclerosis, and aging. The paper presents a systematic review illustrating QSM fundamentals and its processing steps, including phase unwrapping, background field removal, and susceptibility inversion. Using QSM, the present work delivers novel predictive biomarkers for various neural disorders. It can strengthen new researchers' fundamental knowledge and provide insight into its applicability for cognitive decline disclosure. The paper discusses the future scope of QSM processing stages and their applications in identifying new biomarkers for neural disorders.
