In the Earth's magnetosphere, there are fewer than a dozen dedicated probes beyond low-Earth orbit making in-situ observations at any given time. As a result, our understanding of its global structure and evolution, and of the mechanisms behind its main activity processes, magnetic storms and substorms, remains limited. New Artificial Intelligence (AI) methods, including machine learning, data mining, and data assimilation, as well as new AI-enabled missions, will need to be developed to meet this Sparse Data challenge.
Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.
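As an illustration of how such an openly released checkpoint might be used for few-shot prompting, here is a minimal sketch with the Hugging Face transformers library; the smaller bigscience/bloom-560m variant and the example prompt are assumptions made purely to keep the sketch lightweight.

```python
# Minimal sketch: few-shot prompting with an open BLOOM checkpoint via Hugging Face
# transformers. The bloom-560m variant is assumed here to keep memory modest;
# the full 176B model is published as "bigscience/bloom".
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "bigscience/bloom-560m"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

prompt = "Translate English to French:\nsea otter => loutre de mer\ncheese =>"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=5)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```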
IceCube, a cubic-kilometer array of optical sensors for detecting atmospheric and astrophysical neutrinos between 1 GeV and 1 PeV, is deployed 1.45 km to 2.45 km below the surface of the Antarctic ice sheet. The classification and reconstruction of events from the in-ice detectors play a central role in IceCube data analyses. Reconstructing and classifying events is challenging due to the detector geometry, the inhomogeneous scattering and absorption of light in the ice, and, below 100 GeV, the relatively low number of signal photons produced per event. To address this challenge, IceCube events can be represented as point cloud graphs, with a graph neural network (GNN) serving as the classification and reconstruction method. The GNN can distinguish neutrino events from cosmic-ray backgrounds, classify different neutrino event types, and reconstruct the deposited energy, direction, and interaction vertex. Based on simulation, we provide a comparison in the 1-100 GeV energy range to the state-of-the-art maximum likelihood techniques used in current IceCube analyses, including the effects of known systematic uncertainties. For neutrino event classification, the GNN improves the signal efficiency by 18% at a fixed false positive rate (FPR) compared to current IceCube methods. Alternatively, the GNN reduces the FPR by more than a factor of 8 (to below half a percent) at a fixed signal efficiency. For the reconstruction of energy, direction, and interaction vertex, the resolution improves on average by 13%-20% compared to current maximum likelihood techniques. When run on a GPU, the GNN can process IceCube events at a rate close to the median IceCube trigger rate of 2.7 kHz, which opens the possibility of using low-energy neutrinos in online searches for transient events.
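To make the point-cloud-graph idea concrete, the following is a minimal sketch using PyTorch Geometric; the node features (position, time, charge), the k-nearest-neighbour graph construction, and the layer choices are assumptions for illustration, not the architecture used in the IceCube analysis.

```python
# Illustrative sketch: an event as a point-cloud graph classified by a GNN.
# Node features and the k-NN graph construction are assumptions, not the paper's setup.
import torch
from torch_geometric.data import Data
from torch_geometric.nn import GCNConv, global_mean_pool, knn_graph

class EventClassifier(torch.nn.Module):
    def __init__(self, n_features=5, hidden=64, n_classes=2):
        super().__init__()
        self.conv1 = GCNConv(n_features, hidden)
        self.conv2 = GCNConv(hidden, hidden)
        self.head = torch.nn.Linear(hidden, n_classes)

    def forward(self, data):
        h = self.conv1(data.x, data.edge_index).relu()
        h = self.conv2(h, data.edge_index).relu()
        h = global_mean_pool(h, data.batch)   # one embedding per event
        return self.head(h)                   # e.g. neutrino vs. cosmic-ray background

# One simulated event: each sensor hit is a node with (x, y, z, time, charge);
# edges connect each hit to its k nearest neighbours in space.
hits = torch.randn(120, 5)
edge_index = knn_graph(hits[:, :3], k=8)
event = Data(x=hits, edge_index=edge_index,
             batch=torch.zeros(120, dtype=torch.long))
logits = EventClassifier()(event)
```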
Reducing methane emissions is essential for mitigating global warming. To attribute methane emissions to their sources, a comprehensive dataset of methane source infrastructure is necessary. Recent advances in deep learning applied to remotely sensed imagery have the potential to identify the locations and characteristics of methane sources, but there is a lack of publicly available data that would enable machine learning researchers and practitioners to build automated mapping approaches. To help fill this gap, we construct a multi-sensor dataset called METER-ML containing 86,625 georeferenced NAIP, Sentinel-1, and Sentinel-2 images in the U.S. labeled for the presence of methane source facilities, including concentrated animal feeding operations, coal mines, landfills, natural gas processing plants, oil refineries and petroleum terminals, and wastewater treatment plants. We experiment with a variety of models that leverage different spatial resolutions, spatial footprints, image products, and spectral bands. We find that our best model performs well at identifying concentrated animal feeding operations and achieves an area under the precision-recall curve of 0.821 for oil refineries and petroleum terminals on an expert-labeled test set, suggesting the potential for large-scale mapping. We make METER-ML freely available at https://stanfordmlgroup.github.io/projects/meter-ml/ to support future work on automated methane source mapping.
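One way such multi-sensor inputs could be combined for per-image facility classification is sketched below; the band counts, late-fusion design, and tiny backbone are assumptions for illustration, not the models evaluated on METER-ML.

```python
# Minimal sketch of late fusion over multiple sensors for facility classification.
# Band counts per sensor and the encoder architecture are assumptions.
import torch
import torch.nn as nn

class MultiSensorClassifier(nn.Module):
    def __init__(self, n_classes=7):
        super().__init__()
        # One lightweight encoder per sensor (NAIP: 4 bands, Sentinel-2: 12, Sentinel-1: 2).
        self.encoders = nn.ModuleDict({
            name: nn.Sequential(
                nn.Conv2d(bands, 32, 3, padding=1), nn.ReLU(),
                nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            )
            for name, bands in {"naip": 4, "s2": 12, "s1": 2}.items()
        })
        self.head = nn.Linear(32 * 3, n_classes)  # late fusion by concatenation

    def forward(self, inputs):
        feats = [self.encoders[k](inputs[k]) for k in ("naip", "s2", "s1")]
        return self.head(torch.cat(feats, dim=1))  # per-facility-type logits

model = MultiSensorClassifier()
batch = {"naip": torch.randn(2, 4, 72, 72),
         "s2": torch.randn(2, 12, 72, 72),
         "s1": torch.randn(2, 2, 72, 72)}
logits = model(batch)
```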
This paper analyzes the joint-space walking mechanisms and redundancies that deliver functional gait outcomes. Biomechanical measures were analyzed for two healthy male adults who participated in a multi-factorial study and walked during three sessions. Both participants adopted different intra- and inter-personal compensatory strategies (e.g., vaulting, hip hiking) across walking conditions and exhibited notable alterations in gait patterns while keeping task-space (functional) gait parameters invariant. They also preferred a variety of asymmetric step lengths, yet maintained consistent, symmetric step lengths and an invariant cadence during free walking. The results demonstrate the importance of individualized approaches and the need for a paradigm shift from functional (task-space) to joint-space gait analysis, both in assessing (a)typical gaits and in delivering human-centered human-robot interaction.
This study presents a broad perspective on hybrid process modeling and optimization that combines scientific knowledge and data analytics in bioprocessing and chemical engineering through science-guided machine learning (SGML) approaches. We divide these approaches into two major categories: the first refers to cases where data-based ML models complement and improve the predictive accuracy of first-principles, science-based models, and the second corresponds to cases where scientific knowledge helps make ML models more scientifically consistent. We carry out a detailed review of the scientific and engineering literature related to hybrid SGML approaches and propose a systematic classification of hybrid SGML models. For applying ML to improve science-based models, we present the subcategories of direct serial and parallel hybrid modeling and their combinations, inverse modeling, reduced-order modeling, quantification of uncertainty in the process, and even discovery of the governing equations of the process. For applying scientific principles to improve ML models, we discuss the subcategories of science-guided design, learning, and refinement. For each subcategory, we identify its requirements, advantages, and limitations, as well as its published and potential application areas in bioprocessing and chemical engineering.
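As a concrete illustration of the parallel hybrid (residual) subcategory, here is a minimal sketch in which a first-principles kinetic model provides a baseline prediction and an ML model learns the residual; the Monod kinetics, synthetic data, and random forest choice are assumptions, not an implementation from the reviewed literature.

```python
# Minimal sketch of parallel hybrid (residual) modeling: science-based baseline + ML residual.
import numpy as np
from sklearn.ensemble import RandomForestRegressor

def first_principles_rate(substrate, mu_max=0.5, Ks=0.2):
    """Science-based growth-rate model (Monod kinetics)."""
    return mu_max * substrate / (Ks + substrate)

# Observed rates deviate from the idealized model (unmodeled effects, noise).
rng = np.random.default_rng(0)
S = rng.uniform(0.01, 5.0, size=(200, 1))
observed = (first_principles_rate(S[:, 0]) * (1 + 0.2 * np.sin(3 * S[:, 0]))
            + rng.normal(0, 0.01, 200))

# Parallel hybrid: the ML model learns the residual; the final prediction is the sum.
residual_model = RandomForestRegressor(n_estimators=100, random_state=0)
residual_model.fit(S, observed - first_principles_rate(S[:, 0]))

def hybrid_predict(substrate):
    substrate = np.atleast_2d(substrate).reshape(-1, 1)
    return first_principles_rate(substrate[:, 0]) + residual_model.predict(substrate)
```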
As various city agencies and mobility operators navigate toward innovative mobility solutions, there is a need for strategic flexibility in well-timed investment decisions in the design and timing of mobility service regions, cast here as "real options" (RO). This problem becomes increasingly challenging with multiple interacting RO in such investments. We propose a scalable machine learning based RO framework for the multi-period sequential service region design and timing problem for mobility-on-demand services, framed as a Markov decision process with non-stationary stochastic variables. A value function approximation policy from the literature uses multi-option least squares Monte Carlo simulation to obtain a policy value for a set of interdependent investment decisions as deferral options (CR policy). The goal is to determine the optimal selection and timing of a set of zones to include in a service region. However, prior work required explicit enumeration of all possible sequences of investments. To address the combinatorial complexity of such enumeration, we propose a new variant, a "deep" RO policy that uses an efficient recurrent neural network (RNN) based ML method (CR-RNN policy) to sample sequences and forgo the need for enumeration, making network design and timing policy tractable for large-scale implementation. Experiments on multiple service region scenarios in New York City (NYC) show that the proposed policy substantially reduces the overall computational cost (a time reduction for RO evaluation of more than 90% of the total investment sequences is achieved), with zero to near-zero gap compared to the benchmark. A case study of sequential service region design for the expansion of MoD services in Brooklyn, NYC shows that using the CR-RNN policy to determine the optimal RO investment strategy yields similar performance (within 0.5% of the CR policy value) with significantly reduced computation time (about 5.4 times faster).
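The sequence-sampling idea can be illustrated with a minimal sketch: a recurrent policy emits candidate investment orders over zones instead of enumerating all of them. The network size, start token, and masking scheme below are assumptions, not the CR-RNN policy as implemented in the paper.

```python
# Illustrative sketch: a GRU-based policy samples investment sequences over candidate zones.
import torch
import torch.nn as nn

class SequenceSampler(nn.Module):
    def __init__(self, n_zones, hidden=64):
        super().__init__()
        self.n_zones = n_zones
        self.embed = nn.Embedding(n_zones + 1, hidden)     # +1 for a "start" token
        self.rnn = nn.GRU(hidden, hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_zones)

    def sample(self, horizon):
        seq, state = [], None
        token = torch.tensor([[self.n_zones]])             # start token
        chosen = torch.zeros(self.n_zones, dtype=torch.bool)
        for _ in range(horizon):
            h, state = self.rnn(self.embed(token), state)
            logits = self.out(h[:, -1]).masked_fill(chosen, float("-inf"))
            zone = torch.distributions.Categorical(logits=logits).sample()
            chosen[zone] = True                             # never pick the same zone twice
            seq.append(int(zone))
            token = zone.unsqueeze(0)
        return seq                                          # one candidate investment order

sampler = SequenceSampler(n_zones=10)
candidate_sequence = sampler.sample(horizon=4)
```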
As the number of distributed services (or microservices) of cloud-native applications grows, resource management becomes a challenging task. These applications tend to be user-facing and latency-sensitive, and our goal is to continuously minimize the amount of CPU resources allocated while still satisfying the application latency SLO. Although previous efforts have proposed simple heuristics and sophisticated ML-based techniques, we believe that a practical resource manager should accurately scale CPU resources for diverse applications with minimal human effort and operational overhead. To this end, we ask: can we systematically break resource management down into subproblems solvable by practical policies? Based on the notion of a CPU-throttle-based performance target, we decouple the mechanisms of SLO feedback and resource control, and implement a two-level framework -- Autothrottle. It combines a lightweight learned controller at the global level with agile per-microservice controllers at the local level. We evaluate Autothrottle on three microservice applications, with both short-term and 21-day production workload traces. Empirical results show that Autothrottle achieves superior CPU core savings of up to 26.21% over the best-performing baselines across applications, while maintaining the latency SLO.
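A minimal sketch of the local, per-microservice side of such a design is shown below: a feedback controller nudges a service's CPU limit toward a throttle-ratio target handed down by a global controller. The proportional gain, bounds, and the way the throttle ratio is measured are assumptions, not Autothrottle's actual controllers.

```python
# Minimal sketch of a per-service feedback controller driven by a throttle-ratio target:
# it grows or shrinks the CPU limit until the observed fraction of throttled periods
# matches the target. Gain, bounds, and the measurement hook are assumptions.

def adjust_cpu_limit(current_limit_cores: float,
                     observed_throttle_ratio: float,
                     target_throttle_ratio: float,
                     gain: float = 0.5,
                     min_cores: float = 0.1,
                     max_cores: float = 8.0) -> float:
    """Proportional controller on the throttle-target error."""
    error = observed_throttle_ratio - target_throttle_ratio
    new_limit = current_limit_cores * (1.0 + gain * error)
    return max(min_cores, min(max_cores, new_limit))

# The service was throttled in 30% of periods but the target is 10%,
# so the controller grants it more CPU.
print(adjust_cpu_limit(2.0, observed_throttle_ratio=0.30, target_throttle_ratio=0.10))
```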
The issue of left before treatment complete (LBTC) patients is common in emergency departments (EDs). This issue represents a medico-legal risk and may cause a revenue loss. Thus, understanding the factors that cause patients to leave before treatment is complete is vital to mitigate and potentially eliminate these adverse effects. This paper proposes a framework for studying the factors that affect LBTC outcomes in EDs. The framework integrates machine learning, metaheuristic optimization, and model interpretation techniques. Metaheuristic optimization is used for hyperparameter optimization--one of the main challenges of machine learning model development. Three metaheuristic optimization algorithms are employed for optimizing the parameters of extreme gradient boosting (XGB): simulated annealing (SA), adaptive simulated annealing (ASA), and adaptive tabu simulated annealing (ATSA). The optimized XGB models are used to predict the LBTC outcomes for patients under treatment in the ED. The designed algorithms are trained and tested using four data groups resulting from the feature selection phase. The model with the best predictive performance is interpreted using the SHapley Additive exPlanations (SHAP) method. The findings show that ATSA-XGB outperformed the other model configurations with an accuracy, area under the curve (AUC), sensitivity, specificity, and F1-score of 86.61%, 87.50%, 85.71%, 87.51%, and 86.60%, respectively. The degree and direction of the effect of each feature were determined and explained using the SHAP method.
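The tuning-plus-explanation pipeline can be sketched as follows: simulated annealing searches a small XGBoost hyperparameter space and SHAP explains the tuned model. The search space, cooling schedule, and synthetic data are assumptions, not the paper's configuration.

```python
# Minimal sketch: simulated annealing over XGBoost hyperparameters, then SHAP attributions.
import math
import random

import shap
from sklearn.datasets import make_classification
from sklearn.model_selection import cross_val_score
from xgboost import XGBClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

def score(params):
    """Cross-validated AUC for one hyperparameter configuration."""
    return cross_val_score(XGBClassifier(**params), X, y, cv=3, scoring="roc_auc").mean()

def neighbour(params):
    """Randomly perturb one configuration to get a nearby candidate."""
    p = dict(params)
    p["max_depth"] = min(10, max(2, p["max_depth"] + random.choice([-1, 1])))
    p["learning_rate"] = min(0.5, max(0.01, p["learning_rate"] * random.uniform(0.7, 1.3)))
    return p

current = {"max_depth": 4, "learning_rate": 0.1, "n_estimators": 100}
cur_score = best_score = score(current)
best, T = dict(current), 1.0
for _ in range(20):                                   # annealing loop
    cand = neighbour(current)
    cand_score = score(cand)
    if cand_score > cur_score or random.random() < math.exp((cand_score - cur_score) / T):
        current, cur_score = cand, cand_score         # accept (possibly worse) candidate
        if cur_score > best_score:
            best, best_score = dict(current), cur_score
    T *= 0.9                                          # geometric cooling

final = XGBClassifier(**best).fit(X, y)               # refit the best configuration
shap_values = shap.TreeExplainer(final).shap_values(X)  # per-feature attributions
```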
IoT sensors, especially video cameras, are ubiquitously deployed around the world to perform a variety of computer vision tasks in several verticals including retail, healthcare, safety and security, transportation, and manufacturing. To amortize their high deployment effort and cost, it is desirable to perform multiple video analytics tasks, which we refer to as Analytical Units (AUs), on the video feed coming out of every camera. In this paper, we first show that in a multi-AU setting, changing the camera setting has a disproportionate impact on the performance of different AUs. In particular, the optimal setting for one AU may severely degrade the performance of another AU, and the impact on different AUs varies as environmental conditions change. We then present Elixir, a system that enhances the video stream quality for multiple analytics on a video stream. Elixir leverages Multi-Objective Reinforcement Learning (MORL), where the RL agent caters to the objectives from different AUs and adjusts the camera settings to simultaneously enhance the performance of all AUs. To define the multiple objectives in MORL, we develop new AU-specific quality estimator values for each individual AU. We evaluate Elixir through real-world experiments on a testbed with three cameras deployed next to each other (overlooking a large enterprise parking lot), running Elixir and two baseline approaches, respectively. Elixir correctly detects 7.1% (22,068) and 5.0% (15,731) more cars, 94% (551) and 72% (478) more faces, and 670.4% (4,975) and 158.6% (3,507) more persons than the default-setting and time-sharing approaches, respectively. It also detects 115 license plates, far more than the time-sharing approach (7) and the default setting (0).
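The core selection step can be illustrated with a simplified, scalarized version of the multi-objective idea: per-AU quality estimates are combined into one score and the agent picks the camera setting that balances all AUs. The toy quality estimators, weights, and epsilon-greedy loop below are assumptions, not Elixir's AU-specific estimators or its MORL agent.

```python
# Simplified, scalarized illustration of multi-objective camera-setting selection.
import random

CAMERA_SETTINGS = [{"brightness": b, "contrast": c}
                   for b in (40, 50, 60) for c in (40, 50, 60)]

def au_quality_estimates(setting):
    """Stand-in for per-AU quality estimators (car, face, person detection)."""
    return {
        "cars": 1.0 - abs(setting["brightness"] - 50) / 50,
        "faces": 1.0 - abs(setting["contrast"] - 60) / 60,
        "persons": 1.0 - abs(setting["brightness"] - 45) / 50,
    }

def scalarized_reward(setting, weights=(("cars", 1.0), ("faces", 1.0), ("persons", 1.0))):
    """Combine per-AU qualities into one score so no single AU dominates."""
    q = au_quality_estimates(setting)
    return sum(w * q[au] for au, w in weights)

def choose_setting(epsilon=0.1):
    """Epsilon-greedy pick over candidate settings."""
    if random.random() < epsilon:
        return random.choice(CAMERA_SETTINGS)
    return max(CAMERA_SETTINGS, key=scalarized_reward)

print(choose_setting())
```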