智能论文笔记

CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search

Hengrui Cai , Wenbin Lu , Rachel Marceau West , Devan V. Mehrotra , Lingkang Huang

分类： (统计)机器学习 | 机器学习

2021-10-11

个性化医学是针对患者特征量身定制的医学范式，是医疗保健中越来越有吸引力的领域。个性化医学的一个重要目标是根据基线协变量鉴定患者的亚组，而与其他比较治疗相比，从目标治疗中受益更多。当前的大多数亚组识别方法仅着重于获得具有增强治疗效果的亚组，而无需注意亚组大小。但是，临床上有意义的亚组学习方法应确定可以从更好的治疗中受益的患者数量的最大数量。在本文中，我们提出了一项最佳的亚组选择规则（SSR），该规则最大化选定的患者的数量，同时，达到了预先指定的临床意义上有意义的平均结果，例如平均治疗效果。我们基于描述结果中的处理 - 果膜相互作用的对比函数，得出了最佳SSR的两种等效理论形式。我们进一步提出了一个受约束的策略树搜索算法（资本），以在可解释的决策树类中找到最佳SSR。所提出的方法是灵活的，可以处理多种限制因素，以惩罚具有负面治疗效果的患者，并使用受限的平均生存时间作为临床上有趣的平均结果来解决事件数据的时间。进行了广泛的模拟，比较研究和实际数据应用，以证明我们方法的有效性和实用性。

translated by 谷歌翻译

CODA: Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome

Hengrui Cai , Wenbin Lu , Rui Song

分类： (统计)机器学习

2021-04-21

我们考虑在具有多个可用的多个辅助来源的主要兴趣样本中最佳决策问题。感兴趣的结果是有限的，因为它仅在主要样本中观察到。实际上，这种多个数据源可能属于异质研究，因此不能直接组合。本文提出了一种新的框架来处理异构研究，并通过新的校准最佳决策（CODA）方法同时解决有限的结果，通过利用多种数据来源的常见中间结果来解决。具体地，CODA允许跨不同样品的基线协变量具有均匀或异质的分布。在温和和可测试的假设下，不同样本中的中间结果的条件方法等于基线协变量和治疗信息，我们表明，条件平均结果的提议CODA估计是渐近正常的和更有效的，而不是使用主要样品。此外，由于速率双重稳健性，可以使用简单的插件方法轻松获得CODA估计器的方差。对模拟数据集的广泛实验显示了使用CoDa的经验有效性和提高效率，然后是与来自Eicu的辅助数据的主要样本是MIMIC-III数据集的真实应用程序。

translated by 谷歌翻译

Estimating heterogeneous treatment effects with right-censored data via causal survival forests

Yifan Cui , Michael R. Kosorok , Erik Sverdrup , Stefan Wager , Ruoqing Zhu

分类：机器学习 | (统计)机器学习

2020-01-27

基于森林的方法最近在非参数治疗效应估计中获得了普及。在这一工作方面，我们引入了因果生存森林，可用于在可能右估计结果的生存和观察环境中估计异质治疗效果。我们的方法依赖于正交估计方程来在不满意的情况下对审查和选择效果进行鲁棒性调整。在我们的实验中，我们发现相对于许多基线的表现良好的方法。

translated by 谷歌翻译

Jump Interval-Learning for Individualized Decision Making

Hengrui Cai , Chengchun Shi , Rui Song , Wenbin Lu

分类：机器学习 | (统计)机器学习

2021-11-17

个性化决定规则（IDR）是一个决定函数，可根据他/她观察到的特征分配给定的治疗。文献中的大多数现有工作考虑使用二进制或有限的许多治疗方案的设置。在本文中，我们专注于连续治疗设定，并提出跳跃间隔 - 学习，开发一个最大化预期结果的个性化间隔值决定规则（I2DR）。与推荐单一治疗的IDRS不同，所提出的I2DR为每个人产生了一系列治疗方案，使其在实践中实施更加灵活。为了获得最佳I2DR，我们的跳跃间隔学习方法估计通过跳转惩罚回归给予治疗和协变量的结果的条件平均值，并基于估计的结果回归函数来衍生相应的最佳I2DR。允许回归线是用于清晰的解释或深神经网络的线性，以模拟复杂的处理 - 协调会相互作用。为了实现跳跃间隔学习，我们开发了一种基于动态编程的搜索算法，其有效计算结果回归函数。当结果回归函数是处理空间的分段或连续功能时，建立所得I2DR的统计特性。我们进一步制定了一个程序，以推断（估计）最佳政策下的平均结果。进行广泛的模拟和对华法林研究的真实数据应用，以证明所提出的I2DR的经验有效性。

translated by 谷歌翻译

Evaluating Treatment Prioritization Rules via Rank-Weighted Average Treatment Effects

Steve Yadlowsky , Scott Fleming , Nigam Shah , Emma Brunskill , Stefan Wager

分类： (统计)机器学习

2021-11-15

有许多可用于选择优先考虑治疗的可用方法，包括基于治疗效果估计，风险评分和手工制作规则的遵循申请。我们将秩加权平均治疗效应（RATY）指标作为一种简单常见的指标系列，用于比较水平竞争范围的治疗优先级规则。对于如何获得优先级规则，率是不可知的，并且仅根据他们在识别受益于治疗中受益的单位的方式进行评估。我们定义了一系列速率估算器，并证明了一个中央限位定理，可以在各种随机和观测研究环境中实现渐近精确的推断。我们为使用自主置信区间的使用提供了理由，以及用于测试关于治疗效果中的异质性的假设的框架，与优先级规则相关。我们对速率的定义嵌套了许多现有度量，包括QINI系数，以及我们的分析直接产生了这些指标的推论方法。我们展示了我们从个性化医学和营销的示例中的方法。在医疗环境中，使用来自Sprint和Accor-BP随机对照试验的数据，我们发现没有明显的证据证明异质治疗效果。另一方面，在大量的营销审判中，我们在一些数字广告活动的治疗效果中发现了具有的强大证据，并证明了如何使用率如何比较优先考虑估计风险的目标规则与估计治疗效益优先考虑的目标规则。

translated by 谷歌翻译

Hybrid Censored Quantile Regression Forest to Assess the Heterogeneous Effects

Huichen Zhu , Yifei Sun , Ying Wei

分类： (统计)机器学习

2022-12-12

In many applications, heterogeneous treatment effects on a censored response variable are of primary interest, and it is natural to evaluate the effects at different quantiles (e.g., median). The large number of potential effect modifiers, the unknown structure of the treatment effects, and the presence of right censoring pose significant challenges. In this paper, we develop a hybrid forest approach called Hybrid Censored Quantile Regression Forest (HCQRF) to assess the heterogeneous effects varying with high-dimensional variables. The hybrid estimation approach takes advantage of the random forests and the censored quantile regression. We propose a doubly-weighted estimation procedure that consists of a redistribution-of-mass weight to handle censoring and an adaptive nearest neighbor weight derived from the forest to handle high-dimensional effect functions. We propose a variable importance decomposition to measure the impact of a variable on the treatment effect function. Extensive simulation studies demonstrate the efficacy and stability of HCQRF. The result of the simulation study also convinces us of the effectiveness of the variable importance decomposition. We apply HCQRF to a clinical trial of colorectal cancer. We achieve insightful estimations of the treatment effect and meaningful variable importance results. The result of the variable importance also confirms the necessity of the decomposition.

translated by 谷歌翻译

Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding

Zhengling Qi , Rui Miao , Xiaoke Zhang

分类：机器学习 | (统计)机器学习

2021-05-03

数据驱动的个性化决策最近收到了增加的研究兴趣。大多数现有的方法都取决于没有无法衡量的混杂的假设，不幸的是，在实践中，尤其是在观察性研究中无法确保这种混杂。在最近提出的近端因果推理的推动下，我们开发了几种近端学习方法，以估算未衡量的混杂的最佳个性化治疗方案（ITR）。特别是，我们为不同类别的ITR建立了几个识别结果，这表现出了做出不可测试的假设的风险与决策的价值函数改善之间的权衡。基于这些结果，我们提出了几种基于分类的方法来找到各种限制的课堂最佳ITR并发展其理论属性。通过广泛的仿真研究和一项真实的数据应用，我们提出的方法的数值性能具有吸引力。

translated by 谷歌翻译

Bayesian Semiparametric Model for Sequential Treatment Decisions with Informative Timing

Arman Oganisian , Kelly D. Getz , Todd A. Alonzo , Richard Aplenc , Jason A. Roy

分类：机器学习 | (统计)机器学习

2022-11-29

We develop a Bayesian semi-parametric model for the estimating the impact of dynamic treatment rules on survival among patients diagnosed with pediatric acute myeloid leukemia (AML). The data consist of a subset of patients enrolled in the phase III AAML1031 clinical trial in which patients move through a sequence of four treatment courses. At each course, they undergo treatment that may or may not include anthracyclines (ACT). While ACT is known to be effective at treating AML, it is also cardiotoxic and can lead to early death for some patients. Our task is to estimate the potential survival probability under hypothetical dynamic ACT treatment strategies, but there are several impediments. First, since ACT was not randomized in the trial, its effect on survival is confounded over time. Second, subjects initiate the next course depending on when they recover from the previous course, making timing potentially informative of subsequent treatment and survival. Third, patients may die or drop out before ever completing the full treatment sequence. We develop a generative Bayesian semi-parametric model based on Gamma Process priors to address these complexities. At each treatment course, the model captures subjects' transition to subsequent treatment or death in continuous time under a given rule. A g-computation procedure is used to compute a posterior over potential survival probability that is adjusted for time-varying confounding. Using this approach, we conduct posterior inference for the efficacy of hypothetical treatment rules that dynamically modify ACT based on evolving cardiac function.

translated by 谷歌翻译

Tree-based Subgroup Discovery In Electronic Health Records: Heterogeneity of Treatment Effects for DTG-containing Therapies

Jiabei Yang , Ann W. Mwangi , Rami Kantor , Issa J. Dahabreh , Monicah Nyambura , Allison Delong , Joseph W. Hogan , Jon A. Steingrimsson

分类： (统计)机器学习

2022-08-30

电子健康记录（EHR）可获得的丰富纵向个体水平数据可用于检查治疗效果异质性。但是，使用EHR数据估算治疗效果提出了几个挑战，包括时变的混杂，重复和时间不一致的协变量测量，治疗分配和结果以及由于辍学导致的损失。在这里，我们开发了纵向数据（SDLD）算法的亚组发现，该算法是一种基于树的算法，用于使用纵向相互作用树算法结合使用纵向相互作用的一般数据驱动的方法，与纵向驱动的方法与纵向驱动的方法结合使用纵向相互作用，以发现具有异质治疗效果的亚组，并进行纵向研究。目标最大似然估计。我们将算法应用于EHR数据，以发现患有人免疫缺陷病毒（HIV）的人群的亚组，他们在接受非Dolutegravir抗逆转录病毒疗法（ART）接受非Dolutegravir抗逆转录病毒疗法（艺术）时的体重增加风险较高。

translated by 谷歌翻译

HTML版本

A Tree-based Model Averaging Approach for Personalized Treatment Effect Estimation from Heterogeneous Data Sources

Xiaoqing Tan , Chung-Chou H. Chang , Ling Zhou , Lu Tang

分类： (统计)机器学习 | 机器学习

2021-03-10

由于样本量有限，可以准确估计研究地点（例如医院）中的个性化治疗效果。此外，隐私考虑和缺乏资源阻止站点利用其他站点的主题级数据。我们提出了一种基于树的模型平均方法，以通过利用从其他潜在异质部位得出的模型来提高目标部位条件平均治疗效果（CATE）的估计精度，而无需共享主题级数据。据我们的最佳知识，没有建立的模型平均分布式数据的方法，重点是改善治疗效果的估计。具体而言，在分布式数据网络下，我们的框架提供了一个基于CATE估算器的基于可解释的树的合奏，该集合可以跨研究站点加入模型，同时通过站点分区积极地对数据源中的异质性进行建模。通过对氧疗法对医院存活率的因果影响的现实研究证明了这种方法的表现，并得到了全面的模拟结果的支持。

translated by 谷歌翻译

Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings

Hengrui Cai , Chengchun Shi , Rui Song , Wenbin Lu

分类： (统计)机器学习 | 机器学习

2020-10-29

我们认为离政策在连续处理设置，如个性化的剂量调查评价（OPE）。在OPE，一个目标来估算下使用不同的决策规则产生的历史数据的新的治疗决策规则中的平均结果。离散处理设置上OPE焦点大多数现有的作品。为了应对持续的治疗，我们开发使用OPE深跳学习一种新的估计方法。我们的方法在于在使用深离散化，通过利用深度学习和多尺度变化点检测自适应离散化治疗领域的主要成分。这使我们能够应用在离散处理现有OPE方法来处理连续治疗。我们的方法是通过理论计算结果，模拟和实际应用程序，以华法林给药进一步合理的。

translated by 谷歌翻译

Statistically Efficient Advantage Learning for Offline Reinforcement Learning in Infinite Horizons

Chengchun Shi , Shikai Luo , Yuan Le , Hongtu Zhu , Rui Song

分类： (统计)机器学习 | 机器学习

2022-02-26

我们考虑在离线域中的强化学习（RL）方法，没有其他在线数据收集，例如移动健康应用程序。计算机科学文献中的大多数现有策略优化算法都是在易于收集或模拟的在线设置中开发的。通过预采用的离线数据集，它们对移动健康应用程序的概括尚不清楚。本文的目的是开发一个新颖的优势学习框架，以便有效地使用预采用的数据进行策略优化。所提出的方法采用由任何现有的最新RL算法计算的最佳Q-估计器作为输入，并输出一项新策略，其价值比基于初始Q-得出的策略更快地收敛速度。估计器。进行广泛的数值实验以支持我们的理论发现。我们提出的方法的Python实现可在https://github.com/leyuanheart/seal上获得。

translated by 谷歌翻译

Policy learning with asymmetric utilities

Eli Ben-Michael , Kosuke Imai , Zhichao Jiang

分类： (统计)机器学习 | 机器学习

2022-06-21

即使在医学和公共政策等高风险设置中，数据驱动的决策也起着重要作用。从观察到的数据中学习最佳策略需要仔细制定效用函数，该功能的期望值在人群中最大化。尽管研究人员通常使用仅依赖于观察到的结果的实用程序，但在许多情况下，决策者的效用函数的特征是所有动作下的潜在结果集合。例如，``不伤害''的希波克拉底原则意味着，在未经治疗的情况下生存的患者导致死亡的成本大于预防救生治疗的成本。我们考虑使用此形式的不对称效用功能的最佳政策学习。我们表明，不对称公用事业会导致无法识别的社会福利功能，因此我们首先部分识别它。利用统计决策理论，我们通过最大程度地减少相对于替代政策的最大遗憾来得出最小的决策规则。我们表明，可以通过解决中间分类问题从观察到的数据中学习最小值决策规则。我们还确定，此过程的有限样本遗憾是由这些中间分类器的错误分类率界定的。我们将此概念框架和方法应用于有关是否使用肺高血压患者是否使用正确的心脏导管插入术的决定。

translated by 谷歌翻译

Continuous Treatment Recommendation with Deep Survival Dose Response Function

Jie Zhu , Blanca Gallego

分类： (统计)机器学习 | 人工智能 | 机器学习

2021-08-24

我们提出了一种通用公式，用于具有临床生存数据的设置中连续治疗建议问题，我们称之为深层生存剂量反应函数（DEEPSDRF）。也就是说，我们认为学习条件平均剂量反应（CADR）的问题仅来自历史数据，在历史数据中，观察到的因素（混杂因素）都会影响观察到的治疗和事件时间结果。DEEPSDRF的估计治疗效果使我们能够开发出针对选择偏差的校正的推荐算法。我们比较了基于随机搜索和强化学习的两种推荐方法，并在患者结局方面发现了相似的表现。我们在广泛的仿真研究和EICU研究所（ERI）数据库中测试了DeepSDRF和相应的推荐剂。据我们所知，这是首次使用因果模型来解决医疗环境中观察数据的连续治疗效果。

translated by 谷歌翻译

Quantile Off-Policy Evaluation via Deep Conditional Generative Learning

Yang Xu , Chengchun Shi , Shikai Luo , Lan Wang , Rui Song

分类： (统计)机器学习 | 机器学习

2022-12-29

Off-Policy evaluation (OPE) is concerned with evaluating a new target policy using offline data generated by a potentially different behavior policy. It is critical in a number of sequential decision making problems ranging from healthcare to technology industries. Most of the work in existing literature is focused on evaluating the mean outcome of a given policy, and ignores the variability of the outcome. However, in a variety of applications, criteria other than the mean may be more sensible. For example, when the reward distribution is skewed and asymmetric, quantile-based metrics are often preferred for their robustness. In this paper, we propose a doubly-robust inference procedure for quantile OPE in sequential decision making and study its asymptotic properties. In particular, we propose utilizing state-of-the-art deep conditional generative learning methods to handle parameter-dependent nuisance function estimation. We demonstrate the advantages of this proposed estimator through both simulations and a real-world dataset from a short-video platform. In particular, we find that our proposed estimator outperforms classical OPE estimators for the mean in settings with heavy-tailed reward distributions.

translated by 谷歌翻译

Relative Sparsity for Medical Decision Problems

Samuel J. Weisenthal , Sally W. Thurston , Ashkan Ertefaie

分类：机器学习

2022-11-29

Existing statistical methods can be used to estimate a policy, or a mapping from covariates to decisions, which can then instruct decision makers. There is great interest in using such data-driven policies in healthcare. In healthcare, however, it is often important to explain to the healthcare provider, and to the patient, how a new policy differs from the current standard of care. This end is facilitated if one can pinpoint the aspects (i.e., parameters) of the policy that change most when moving from the standard of care to the new, suggested policy. To this end, we adapt ideas from Trust Region Policy Optimization. In our work, however, unlike in Trust Region Policy Optimization, the difference between the suggested policy and standard of care is required to be sparse, aiding with interpretability. In particular, we trade off between maximizing expected reward and minimizing the $L_1$ norm divergence between the parameters of the two policies. This yields "relative sparsity," where, as a function of a tuning parameter, $\lambda$, we can approximately control the number of parameters in our suggested policy that differ from their counterparts in the standard of care. We develop our methodology for the observational data setting. We propose a problem-specific criterion for selecting $\lambda$, perform simulations, and illustrate our method with a real, observational healthcare dataset, deriving a policy that is easy to explain in the context of the current standard of care. Our work promotes the adoption of data-driven decision aids, which have great potential to improve health outcomes.

translated by 谷歌翻译

Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment

Eli Ben-Michael , D. James Greiner , Kosuke Imai , Zhichao Jiang

分类： (统计)机器学习 | 机器学习

2021-09-22

在当今的社会中，算法建议和决策已经变得普遍存在。其中许多和其他数据驱动的政策，特别是在公共政策领域，基于已知的确定性规则，以确保其透明度和可解释性。例如，算法预审风险评估，即作为我们的激励申请，提供相对简单，确定性的分类分数和建议，以帮助法官发出释放决策。我们如何根据现有的确定性政策使用数据，并学习新的和更好的策略？不幸的是，策略学习的先前方法不适用，因为它们需要现有的政策是随机而非确定性的。我们开发了一种强大的优化方法，部分地识别策略的预期效用，然后通过最小化最坏情况后悔找到最佳策略。由此产生的政策是保守的，但具有统计安全保障，允许政策制定者限制产生比现有政策更糟糕的结果的可能性。我们将这种方法扩展到人类借助算法建议作出决策的共同和重要的环境。最后，我们将提议的方法应用于预审风险评估工具的独特现场实验。我们推出了新的分类和推荐规则，以保留现有仪器的透明度和可解释性，同时可能以较低的成本导致更好的整体结果。

translated by 谷歌翻译

Modelling hetegeneous treatment effects by quantitle local polynomial decision tree and forest

Lai Xinglin

分类： (统计)机器学习

2021-11-30

为了进一步开发异构治疗效果的统计推理问题，本文在Breiman（2001）随机林树（RFT）和Wager等人的情况下建立了使用古典的优秀统计属性来参数化非参数问题的（2018）因果树。oLs和基于协变量分数的局部线性间隔的划分，同时保留随机林树木，具有可构造的置信区间和渐近常数特性的优势[athey和Imbens（2016），efron（2014），赌第等（2014年）\ citep {wagert2014Asymptotic}，我们根据固定规则提出了一个决策树，根据固定规则与本地样本的多项式估计相结合，我们称之为临时局部线性因果树（QLPRT）和林（QLPRF）。

translated by 谷歌翻译

Increasing the efficiency of randomized trial estimates via linear adjustment for a prognostic score

Alejandro Schuler , David Walsh , Diana Hall , Jon Walsh , Charles Fisher

分类： (统计)机器学习 | 机器学习

2020-12-17

估算随机实验的因果效应是临床研究的核心。降低这些分析中的统计不确定性是统计学家的重要目标。注册管理机构，事先审判和健康记录构成了对患者的历史数据汇编，其在可能是可利用至此的患者下的历史数据。但是，大多数历史借贷方法通过牺牲严格的I型错误率控制来达到方差的减少。在这里，我们建议使用利用线性协变调整的历史数据来提高试验分析的效率而不会产生偏见。具体而言，我们在历史数据上培训预后模型，然后使用线性回归估计治疗效果，同时调整试验受试者预测结果（其预后分数）。我们证明，在某些条件下，这种预后调整程序在大类估算仪中获得了最低差异。当不符合这些条件时，预后的协变量调整仍然比原始协变量调整更有效，并且效率的增益与上述预后模型的预测准确性的衡量标准成正比，与原始协变量的线性关系的预测准确性。我们展示了使用模拟的方法和阿尔茨海默病的临床试验的再分析，并观察平均平均误差的有意义减少和估计方差。最后，我们提供了一种简化的渐近方差公式，使得能够计算这些收益的功率计算。在使用预后模型的预后模型中，可以实现10％和30％的样品尺寸减少。

translated by 谷歌翻译

Meta-analysis of individualized treatment rules via sign-coherency

Jay Jojo Cheng , Jared D. Huling , Guanhua Chen

分类： (统计)机器学习 | 机器学习

2022-11-28

Medical treatments tailored to a patient's baseline characteristics hold the potential of improving patient outcomes while reducing negative side effects. Learning individualized treatment rules (ITRs) often requires aggregation of multiple datasets(sites); however, current ITR methodology does not take between-site heterogeneity into account, which can hurt model generalizability when deploying back to each site. To address this problem, we develop a method for individual-level meta-analysis of ITRs, which jointly learns site-specific ITRs while borrowing information about feature sign-coherency via a scientifically-motivated directionality principle. We also develop an adaptive procedure for model tuning, using information criteria tailored to the ITR learning problem. We study the proposed methods through numerical experiments to understand their performance under different levels of between-site heterogeneity and apply the methodology to estimate ITRs in a large multi-center database of electronic health records. This work extends several popular methodologies for estimating ITRs (A-learning, weighted learning) to the multiple-sites setting.

translated by 谷歌翻译