智能论文笔记

Heterogeneous Synthetic Learner for Panel Data

Ye Shen , Runzhe Wan , Hengrui Cai , Rui Song

分类： (统计)机器学习 | 机器学习

2022-12-30

In the new era of personalization, learning the heterogeneous treatment effect (HTE) becomes an inevitable trend with numerous applications. Yet, most existing HTE estimation methods focus on independently and identically distributed observations and cannot handle the non-stationarity and temporal dependency in the common panel data setting. The treatment evaluators developed for panel data, on the other hand, typically ignore the individualized information. To fill the gap, in this paper, we initialize the study of HTE estimation in panel data. Under different assumptions for HTE identifiability, we propose the corresponding heterogeneous one-side and two-side synthetic learner, namely H1SL and H2SL, by leveraging the state-of-the-art HTE estimator for non-panel data and generalizing the synthetic control method that allows flexible data generating process. We establish the convergence rates of the proposed estimators. The superior performance of the proposed methods over existing ones is demonstrated by extensive numerical studies.

translated by 谷歌翻译

Synthetic learner: model-free inference on treatments over time

Davide Viviano , Jelena Bradic

分类：机器学习 | (统计)机器学习

2019-04-02

了解特定待遇或政策与许多感兴趣领域有关的影响，从政治经济学，营销到医疗保健。在本文中，我们开发了一种非参数算法，用于在合成控制的背景下检测随着时间的流逝的治疗作用。该方法基于许多算法的反事实预测，而不必假设该算法正确捕获模型。我们介绍了一种推论程序来检测治疗效果，并表明测试程序对于固定，β混合过程渐近有效，而无需对所考虑的一组基础算法施加任何限制。我们讨论了平均治疗效果估计的一致性保证，并为提出的方法提供了遗憾的界限。算法类别可能包括随机森林，套索或任何其他机器学习估计器。数值研究和应用说明了该方法的优势。

translated by 谷歌翻译

Predictor Selection for Synthetic Controls

Jaume Vives-i-Bastida

分类： (统计)机器学习

2022-03-22

Synthetic control methods often rely on matching pre-treatment characteristics (called predictors) of the treated unit. The choice of predictors and how they are weighted plays a key role in the performance and interpretability of synthetic control estimators. This paper proposes the use of a sparse synthetic control procedure that penalizes the number of predictors used in generating the counterfactual to select the most important predictors. We derive, in a linear factor model framework, a new model selection consistency result and show that the penalized procedure has a faster mean squared error convergence rate. Through a simulation study, we then show that the sparse synthetic control achieves lower bias and has better post-treatment performance than the un-penalized synthetic control. Finally, we apply the method to revisit the study of the passage of Proposition 99 in California in an augmented setting with a large number of predictors available.

translated by 谷歌翻译

Do We Exploit all Information for Counterfactual Analysis? Benefits of Factor Models and Idiosyncratic Correction

Jianqing Fan , Ricardo P. Masini , Marcelo C. Medeiros

分类：机器学习 | (统计)机器学习

2020-11-08

最佳定价，即确定最大限度地提高给定产品的利润或收入的价格水平，是零售业的重要任务。要选择这样的数量，请先估计产品需求的价格弹性。由于混淆效果和价格内限性，回归方法通常无法恢复这些弹性。因此，通常需要随机实验。然而，例如，弹性可以是高度异构的，这取决于商店的位置。随着随机化经常发生在市级，标准差异差异方法也可能失败。可能的解决方案是基于根据从人工对照构成的治疗方法测量处理对单个（或仅几个）处理单元的影响的方法。例如，对于治疗组中的每个城市，可以从未处理的位置构成反事实。在本文中，我们应用了一种新的高维统计方法，以衡量价格变化对巴西主要零售商的日常销售的影响。所提出的方法结合了主成分（因子）和稀疏回归，导致一种称为因子调整的正规化方法的方法（\ TextTt {FarmTraTeat}）。数据包括每日五种不同产品的日常销售和价格，超过400多名市。审议的产品属于\ emph {甜蜜和糖果}类别和实验已经在2016年和2017年进行。我们的结果证实了高度异质性的假设，从而产生了与独特的市政当局的不同定价策略。

translated by 谷歌翻译

Optimal Experimental Design for Staggered Rollouts

Ruoxuan Xiong , Susan Athey , Mohsen Bayati , Guido Imbens

分类： (统计)机器学习

2019-11-09

在本文中，我们研究了在一组单位上进行的设计实验的问题，例如在线市场中的用户或用户组，以多个时间段，例如数周或数月。这些实验特别有助于研究对当前和未来结果具有因果影响的治疗（瞬时和滞后的影响）。设计问题涉及在实验之前或期间选择每个单元的治疗时间，以便最精确地估计瞬间和滞后的效果，实验后。这种治疗决策的优化可以通过降低其样本尺寸要求，直接最小化实验的机会成本。优化是我们提供近最优解的NP-Hard整数程序，当时在开始时进行设计决策（固定样本大小设计）。接下来，我们研究允许在实验期间进行适应性决策的顺序实验，并且还可能早期停止实验，进一步降低其成本。然而，这些实验的顺序性质使设计阶段和估计阶段复杂化。我们提出了一种新的算法，PGAE，通过自适应地制造治疗决策，估算治疗效果和绘制有效的实验后推理来解决这些挑战。 PGAE将来自贝叶斯统计，动态编程和样品分裂的思想结合起来。使用来自多个域的真实数据集的合成实验，我们证明了与基准相比，我们的固定样本尺寸和顺序实验的提出解决方案将实验的机会成本降低了50％和70％。

translated by 谷歌翻译

Treatment Effect Estimation from Observational Network Data using Augmented Inverse Probability Weighting and Machine Learning

Corinne Emmenegger , Meta-Lina Spohn , Peter Bühlmann

分类： (统计)机器学习

2022-06-29

治疗效应估计的因果推理方法通常假设独立的实验单位。但是，由于实验单元可能会相互作用，因此这种假设通常值得怀疑。我们开发了增强的反可能性加权（AIPW），以估计和推断因果治疗对依赖观察数据的影响。我们的框架涵盖了网络中相互作用的单位引起的溢出效应的非常普遍的案例。我们使用插件机学习来估计无限维的滋扰成分，导致一致的治疗效应估计器以参数速率收敛，渐近地遵循高斯分布。

translated by 谷歌翻译

Estimating heterogeneous treatment effects with right-censored data via causal survival forests

Yifan Cui , Michael R. Kosorok , Erik Sverdrup , Stefan Wager , Ruoqing Zhu

分类：机器学习 | (统计)机器学习

2020-01-27

基于森林的方法最近在非参数治疗效应估计中获得了普及。在这一工作方面，我们引入了因果生存森林，可用于在可能右估计结果的生存和观察环境中估计异质治疗效果。我们的方法依赖于正交估计方程来在不满意的情况下对审查和选择效果进行鲁棒性调整。在我们的实验中，我们发现相对于许多基线的表现良好的方法。

translated by 谷歌翻译

Counterfactual Risk Assessments under Unmeasured Confounding

Ashesh Rambachan , Amanda Coston , Edward Kennedy

分类：机器学习

2022-12-19

Statistical risk assessments inform consequential decisions such as pretrial release in criminal justice, and loan approvals in consumer finance. Such risk assessments make counterfactual predictions, predicting the likelihood of an outcome under a proposed decision (e.g., what would happen if we approved this loan?). A central challenge, however, is that there may have been unmeasured confounders that jointly affected past decisions and outcomes in the historical data. This paper proposes a tractable mean outcome sensitivity model that bounds the extent to which unmeasured confounders could affect outcomes on average. The mean outcome sensitivity model partially identifies the conditional likelihood of the outcome under the proposed decision, popular predictive performance metrics (e.g., accuracy, calibration, TPR, FPR), and commonly-used predictive disparities. We derive their sharp identified sets, and we then solve three tasks that are essential to deploying statistical risk assessments in high-stakes settings. First, we propose a doubly-robust learning procedure for the bounds on the conditional likelihood of the outcome under the proposed decision. Second, we translate our estimated bounds on the conditional likelihood of the outcome under the proposed decision into a robust, plug-in decision-making policy. Third, we develop doubly-robust estimators of the bounds on the predictive performance of an existing risk assessment.

translated by 谷歌翻译

Estimating Individual Treatment Effects using Non-Parametric Regression Models: a Review

Alberto Caron , Gianluca Baio , Ioanna Manolopoulou

分类：机器学习 | (统计)机器学习

2020-09-14

大型观察数据越来越多地提供健康，经济和社会科学等学科，研究人员对因果问题而不是预测感兴趣。在本文中，从旨在调查参与学校膳食计划对健康指标的实证研究，研究了使用非参数回归的方法估算异质治疗效果的问题。首先，我们介绍了与观察或非完全随机数据进行因果推断相关的设置和相关的问题，以及如何在统计学习工具的帮助下解决这些问题。然后，我们审查并制定现有最先进的框架的统一分类，允许通过非参数回归模型来估算单个治疗效果。在介绍模型选择问题的简要概述后，我们说明了一些关于三种不同模拟研究的方法的性能。我们通过展示一些关于学校膳食计划数据的实证分析的一些方法的使用来结束。

translated by 谷歌翻译

On Model Identification and Out-of-Sample Prediction of Principal Component Regression: Applications to Synthetic Controls

Anish Agarwal , Devavrat Shah , Dennis Shen

分类：机器学习 | (统计)机器学习

2020-10-27

我们在具有固定设计的高维错误设置中分析主组件回归（PCR）。在适当的条件下，我们表明PCR始终以最小$ \ ell_2 $ -norm识别唯一模型，并且是最小的最佳模型。这些结果使我们能够建立非质子化的样本外预测，以确保提高最著名的速率。在我们的分析中，我们在样本外协变量之间引入了天然的线性代数条件，这使我们能够避免分布假设。我们的模拟说明了即使在协变量转移的情况下，这种条件对于概括的重要性。作为副产品，我们的结果还导致了合成控制文献的新结果，这是政策评估的主要方法。特别是，我们的minimax结果表明，在众多变体中，基于PCR的方法具有吸引力。据我们所知，我们对固定设计设置的预测保证在高维错误和合成控制文献中都是难以捉摸的。

translated by 谷歌翻译

Evaluating Treatment Prioritization Rules via Rank-Weighted Average Treatment Effects

Steve Yadlowsky , Scott Fleming , Nigam Shah , Emma Brunskill , Stefan Wager

分类： (统计)机器学习

2021-11-15

有许多可用于选择优先考虑治疗的可用方法，包括基于治疗效果估计，风险评分和手工制作规则的遵循申请。我们将秩加权平均治疗效应（RATY）指标作为一种简单常见的指标系列，用于比较水平竞争范围的治疗优先级规则。对于如何获得优先级规则，率是不可知的，并且仅根据他们在识别受益于治疗中受益的单位的方式进行评估。我们定义了一系列速率估算器，并证明了一个中央限位定理，可以在各种随机和观测研究环境中实现渐近精确的推断。我们为使用自主置信区间的使用提供了理由，以及用于测试关于治疗效果中的异质性的假设的框架，与优先级规则相关。我们对速率的定义嵌套了许多现有度量，包括QINI系数，以及我们的分析直接产生了这些指标的推论方法。我们展示了我们从个性化医学和营销的示例中的方法。在医疗环境中，使用来自Sprint和Accor-BP随机对照试验的数据，我们发现没有明显的证据证明异质治疗效果。另一方面，在大量的营销审判中，我们在一些数字广告活动的治疗效果中发现了具有的强大证据，并证明了如何使用率如何比较优先考虑估计风险的目标规则与估计治疗效益优先考虑的目标规则。

translated by 谷歌翻译

CODA: Calibrated Optimal Decision Making with Multiple Data Sources and Limited Outcome

Hengrui Cai , Wenbin Lu , Rui Song

分类： (统计)机器学习

2021-04-21

我们考虑在具有多个可用的多个辅助来源的主要兴趣样本中最佳决策问题。感兴趣的结果是有限的，因为它仅在主要样本中观察到。实际上，这种多个数据源可能属于异质研究，因此不能直接组合。本文提出了一种新的框架来处理异构研究，并通过新的校准最佳决策（CODA）方法同时解决有限的结果，通过利用多种数据来源的常见中间结果来解决。具体地，CODA允许跨不同样品的基线协变量具有均匀或异质的分布。在温和和可测试的假设下，不同样本中的中间结果的条件方法等于基线协变量和治疗信息，我们表明，条件平均结果的提议CODA估计是渐近正常的和更有效的，而不是使用主要样品。此外，由于速率双重稳健性，可以使用简单的插件方法轻松获得CODA估计器的方差。对模拟数据集的广泛实验显示了使用CoDa的经验有效性和提高效率，然后是与来自Eicu的辅助数据的主要样本是MIMIC-III数据集的真实应用程序。

translated by 谷歌翻译

Forecasting Algorithms for Causal Inference with Panel Data

Jacob Goldin , Julian Nyarko , Justin Young

分类： (统计)机器学习

2022-08-06

使用面板数据进行因果推断是社会科学研究的核心挑战。预测方法的进步可以通过更准确地预测未发生治疗的治疗单元的反事实演变来促进这项任务。在本文中，我们借鉴了新开发的时间序列预测（N-Beats算法）的深度神经体系结构。我们通过合并控制单元的领先值来预测处理后的处理单元的“合成”未经处理的版本，从传统的时间序列应用程序中调整了此方法。我们将从此方法得出的估计量称为合成器，发现它在一系列设置中的传统双向固定效果和合成控制方法显着优于传统的双向固定效果和合成控制方法。我们还发现，相对于最新的面板估计方法，例如矩阵完成和差异中的合成差异，合成器具有可比性或更准确的性能。我们的结果强调了如何利用预测文献的进步来改善面板设置的因果推断。

translated by 谷歌翻译

Estimating Heterogeneous Bounds for Treatment Effects under Sample Selection and Non-response

Phillip Heiler

分类： (统计)机器学习

2022-09-09

在本文中，我们提出了一种非参数估计的方法，并推断了一般样本选择模型中因果效应参数的异质界限，初始治疗可能会影响干预后结果是否观察到。可观察到的协变量可能会混淆治疗选择，而观察结果和不可观察的结果可能会混淆。该方法提供条件效应界限作为策略相关的预处理变量的功能。它允许对身份不明的条件效应曲线进行有效的统计推断。我们使用灵活的半参数脱偏机学习方法，该方法可以适应柔性功能形式和治疗，选择和结果过程之间的高维混杂变量。还提供了易于验证的高级条件，以进行估计和错误指定的鲁棒推理保证。

translated by 谷歌翻译

Synthetic Design: An Optimization Approach to Experimental Design with Synthetic Controls

Nick Doudchenko , Khashayar Khosravi , Jean Pouget-Abadie , Sebastien Lahaie , Miles Lubin , Vahab Mirrokni , Jann Spiess , Guido Imbens

分类： (统计)机器学习

2021-12-01

我们研究了具有预处理结果数据的实验研究的最佳设计。估计平均处理效果是治疗和控制单元的加权平均结果之间的差异。许多常用的方法符合该配方，包括差分估计器和各种合成控制技术。我们提出了几种方法，用于结合重量选择一组处理的单位。观察问题的NP硬度，我们介绍了混合整数编程配方，可选择处理和控制集和单位权重。我们证明，这些提出的方法导致定性不同的实验单元进行治疗。我们根据美国劳动统计局的公开数据使用模拟，这些数据在与随机试验等简单和常用的替代品相比时，表现出平均平方误差和统计功率的改进。

translated by 谷歌翻译

Estimating individual treatment effects under unobserved confounding using binary instruments

Dennis Frauen , Stefan Feuerriegel

分类：机器学习 | (统计)机器学习

2022-08-17

观察数据中估算单个治疗效果（ITE）在许多领域，例如个性化医学等领域。但是，实际上，治疗分配通常被未观察到的变量混淆，因此引入了偏见。消除偏见的一种补救措施是使用仪器变量（IVS）。此类环境在医学中广泛存在（例如，将合规性用作二进制IV的试验）。在本文中，我们提出了一个新颖的，可靠的机器学习框架，称为MRIV，用于使用二进制IV估算ITES，从而产生无偏见的ITE估计器。与以前的二进制IV的工作不同，我们的框架通过伪结果回归直接估算了ITE。（1）我们提供了一个理论分析，我们表明我们的框架产生了多重稳定的收敛速率：即使几个滋扰估计器的收敛缓慢，我们的ITE估计器也会达到快速收敛。（2）我们进一步表明，我们的框架渐近地优于最先进的插件IV方法，以进行ITE估计。（3）我们以理论结果为基础，并提出了一种使用二进制IVS的ITE估算的定制的，称为MRIV-NET的深度神经网络结构。在各种计算实验中，我们从经验上证明了我们的MRIV-NET实现最先进的性能。据我们所知，我们的MRIV是第一个机器学习框架，用于估算显示出倍增功能的二进制IV设置。

translated by 谷歌翻译

High-dimensional Inference for Dynamic Treatment Effects

Jelena Bradic , Weijie Ji , Yuqian Zhang

分类：机器学习 | (统计)机器学习

2021-10-10

本文提出了在多阶段实验的背景下的异质治疗效应的置信区间结构，以$ N $样品和高维，$ D $，混淆。我们的重点是$ d \ gg n $的情况，但获得的结果也适用于低维病例。我们展示了正则化估计的偏差，在高维变焦空间中不可避免，具有简单的双重稳固分数。通过这种方式，不需要额外的偏差，并且我们获得root $ N $推理结果，同时允许治疗和协变量的多级相互依赖性。记忆财产也没有假设;治疗可能取决于所有先前的治疗作业以及以前的所有多阶段混淆。我们的结果依赖于潜在依赖的某些稀疏假设。我们发现具有动态处理的强大推理所需的新产品率条件。

translated by 谷歌翻译

Estimation and Inference on Heterogeneous Treatment Effects in High-Dimensional Dynamic Panels under Weak Dependence

Vira Semenova , Matt Goldman , Victor Chernozhukov , Matt Taddy

分类： (统计)机器学习

2017-12-28

This paper provides estimation and inference methods for a conditional average treatment effects (CATE) characterized by a high-dimensional parameter in both homogeneous cross-sectional and unit-heterogeneous dynamic panel data settings. In our leading example, we model CATE by interacting the base treatment variable with explanatory variables. The first step of our procedure is orthogonalization, where we partial out the controls and unit effects from the outcome and the base treatment and take the cross-fitted residuals. This step uses a novel generic cross-fitting method we design for weakly dependent time series and panel data. This method "leaves out the neighbors" when fitting nuisance components, and we theoretically power it by using Strassen's coupling. As a result, we can rely on any modern machine learning method in the first step, provided it learns the residuals well enough. Second, we construct an orthogonal (or residual) learner of CATE -- the Lasso CATE -- that regresses the outcome residual on the vector of interactions of the residualized treatment with explanatory variables. If the complexity of CATE function is simpler than that of the first-stage regression, the orthogonal learner converges faster than the single-stage regression-based learner. Third, we perform simultaneous inference on parameters of the CATE function using debiasing. We also can use ordinary least squares in the last two steps when CATE is low-dimensional. In heterogeneous panel data settings, we model the unobserved unit heterogeneity as a weakly sparse deviation from Mundlak (1978)'s model of correlated unit effects as a linear function of time-invariant covariates and make use of L1-penalization to estimate these models. We demonstrate our methods by estimating price elasticities of groceries based on scanner data. We note that our results are new even for the cross-sectional (i.i.d) case.

translated by 谷歌翻译

Policy design in experiments with unknown interference

Davide Viviano

分类：机器学习

2020-11-16

本文提出了一种估计溢出效应存在福利最大化政策的实验设计。我考虑一个设置在其中组织成一个有限数量的大型群集，并在每个群集中以不观察到的方式交互。作为第一种贡献，我介绍了一个单波实验，以估计治疗概率的变化的边际效应，以考虑到溢出率，并测试政策最优性。该设计在群集中独立地随机化处理，并诱导局部扰动到对簇成对的治疗概率。使用估计的边际效应，我构建了对定期治疗分配规则最大化福利的实际测试，并且我表征了其渐近性质。该想法是，研究人员应报告对福利最大化政策的边际效应和测试的估计：边际效应表明福利改善的方向，并提供了关于是否值得进行额外实验以估计估计福利改善的证据治疗分配。作为第二种贡献，我设计了多波实验来估计治疗分配规则并最大化福利。我获得了小型样本保证，最大可获得的福利和估计政策（遗憾）评估的福利之间的差异。这种保证的必要性是，遗憾在迭代和集群的数量中线性会聚到零。校准在信息扩散和现金转移方案上校准的模拟表明，该方法导致了显着的福利改进。

translated by 谷歌翻译

Orthogonal Series Estimation for the Ratio of Conditional Expectation Functions

Kazuhiko Shinoda , Takahiro Hoshino

分类： (统计)机器学习

2022-12-26

In various fields of data science, researchers are often interested in estimating the ratio of conditional expectation functions (CEFR). Specifically in causal inference problems, it is sometimes natural to consider ratio-based treatment effects, such as odds ratios and hazard ratios, and even difference-based treatment effects are identified as CEFR in some empirically relevant settings. This chapter develops the general framework for estimation and inference on CEFR, which allows the use of flexible machine learning for infinite-dimensional nuisance parameters. In the first stage of the framework, the orthogonal signals are constructed using debiased machine learning techniques to mitigate the negative impacts of the regularization bias in the nuisance estimates on the target estimates. The signals are then combined with a novel series estimator tailored for CEFR. We derive the pointwise and uniform asymptotic results for estimation and inference on CEFR, including the validity of the Gaussian bootstrap, and provide low-level sufficient conditions to apply the proposed framework to some specific examples. We demonstrate the finite-sample performance of the series estimator constructed under the proposed framework by numerical simulations. Finally, we apply the proposed method to estimate the causal effect of the 401(k) program on household assets.

translated by 谷歌翻译