Smart Paper Notes

Covariate-Balancing-Aware Interpretable Deep Learning models for Treatment Effect Estimation

Kan Chen, Qishuo Yin, Qi Long

Categories: Machine Learning (Statistics) | Machine Learning

2022-03-07

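Estimating treatment effects is crucial for many biomedical applications with observational data. In particular, interpretability of the treatment effect matters to many biomedical researchers. In this paper, we first provide a theoretical analysis and derive an upper bound on the bias of average treatment effect (ATE) estimation under the strong ignorability assumption. Derived by exploiting appealing properties of the weighted energy distance, our upper bound is tighter than those reported in the literature. Motivated by the theoretical analysis, we propose a new objective function for estimating the ATE that uses the energy distance balancing score and therefore does not require correct specification of the propensity score model. We also leverage recently developed neural additive models to improve the interpretability of the deep learning models used for potential outcome prediction. We further enhance our proposed model with energy distance balancing score weighted regularization. Semi-synthetic experiments on two benchmark datasets, IHDP and ACIC, demonstrate the advantage of our proposed model over current state-of-the-art methods.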

translated by Google Translate
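
The weighted energy distance mentioned in the abstract above can be illustrated with a small sketch. This is not the paper's objective function; it only computes the standard energy distance 2E|X-Y| - E|X-X'| - E|Y-Y'| for one-dimensional samples with normalized weights. The function name `weighted_energy_distance` and the toy data are assumptions made for illustration.

```python
from itertools import product


def weighted_energy_distance(xs, ys, wx=None, wy=None):
    """Energy distance between two weighted 1-D samples.

    Computes 2*E|X-Y| - E|X-X'| - E|Y-Y'|, with expectations taken under
    the normalized weights wx and wy (uniform weights if omitted).
    """
    wx = [1.0] * len(xs) if wx is None else list(wx)
    wy = [1.0] * len(ys) if wy is None else list(wy)
    total_x, total_y = sum(wx), sum(wy)
    wx = [w / total_x for w in wx]
    wy = [w / total_y for w in wy]

    def mean_abs(a, wa, b, wb):
        # Weighted average of |a_i - b_j| over all pairs (i, j).
        return sum(wi * wj * abs(ai - bj)
                   for (ai, wi), (bj, wj) in product(zip(a, wa), zip(b, wb)))

    return (2.0 * mean_abs(xs, wx, ys, wy)
            - mean_abs(xs, wx, xs, wx)
            - mean_abs(ys, wy, ys, wy))


# Hypothetical covariate values: a treated group and a target population.
treated = [0.0, 1.0]
population = [0.0, 0.0, 0.0, 1.0]

unweighted = weighted_energy_distance(treated, population)
# Up-weighting the first treated unit reproduces the population distribution.
reweighted = weighted_energy_distance(treated, population, wx=[3.0, 1.0])
```

In this toy case the uniform weights leave a positive distance, while the weights `[3.0, 1.0]` make the weighted treated sample match the population and drive the distance to zero, which is the sense in which weights can "balance" covariates without a propensity model.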

Estimating individual treatment effect: generalization bounds and algorithms

Uri Shalit, Fredrik D. Johansson, David Sontag

2016-06-13

There is intense interest in applying machine learning to problems of causal inference in fields such as healthcare, economics and education. In particular, individual-level causal inference has important applications such as precision medicine. We give a new theoretical analysis and family of algorithms for predicting individual treatment effect (ITE) from observational data, under the assumption known as strong ignorability. The algorithms learn a "balanced" representation such that the induced treated and control distributions look similar. We give a novel, simple and intuitive generalization-error bound showing that the expected ITE estimation error of a representation is bounded by a sum of the standard generalization-error of that representation and the distance between the treated and control distributions induced by the representation. We use Integral Probability Metrics to measure distances between distributions, deriving explicit bounds for the Wasserstein and Maximum Mean Discrepancy (MMD) distances. Experiments on real and simulated data show the new algorithms match or outperform the state-of-the-art.
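
As a rough illustration of the distance term in the bound described above, the sketch below computes a biased (V-statistic) estimate of the squared Maximum Mean Discrepancy between treated and control representations using a Gaussian kernel. The function names, the bandwidth value, and the toy vectors are assumptions; the paper's own algorithms and bounds are not reproduced here.

```python
import math


def rbf_kernel(a, b, bandwidth=1.0):
    """Gaussian (RBF) kernel between two vectors given as sequences of floats."""
    sq_dist = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mmd_squared(treated, control, bandwidth=1.0):
    """Biased (V-statistic) estimate of the squared MMD between two samples."""
    def mean_kernel(xs, ys):
        total = sum(rbf_kernel(x, y, bandwidth) for x in xs for y in ys)
        return total / (len(xs) * len(ys))

    return (mean_kernel(treated, treated)
            + mean_kernel(control, control)
            - 2.0 * mean_kernel(treated, control))


# Hypothetical 2-D representations of treated and control units.
treated_repr = [[0.0, 0.0], [1.0, 0.0]]
control_close = [[0.0, 0.0], [1.0, 0.0]]
control_far = [[5.0, 5.0], [6.0, 5.0]]

balanced = mmd_squared(treated_repr, control_close)
imbalanced = mmd_squared(treated_repr, control_far)
```

A representation for which this quantity is small makes the treated and control distributions look similar, which is the "balance" the abstract refers to.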

Learning Representations for Counterfactual Inference

Fredrik D. Johansson, Uri Shalit, David Sontag

2016-05-12

Observational studies are rising in importance due to the widespread accumulation of data in fields such as healthcare, education, employment and ecology. We consider the task of answering counterfactual questions such as, "Would this patient have lower blood sugar had she received a different medication?". We propose a new algorithmic framework for counterfactual inference which brings together ideas from domain adaptation and representation learning. In addition to a theoretical justification, we perform an empirical comparison with previous approaches to causal inference from observational data. Our deep learning algorithm significantly outperforms the previous state-of-the-art.

A Survey of Deep Causal Model

Zongyu Li, Zhenfeng Zhu

Categories: Machine Learning (Statistics) | Machine Learning

2022-09-19

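The concept of causality plays an important role in human cognition. Over the past few decades, causal inference has been well developed in many fields, such as computer science, medicine, economics, and education. With the advancement of deep learning, it has been increasingly used for causal inference on counterfactual data. Typically, deep causal models map the characteristics of covariates to a representation space and then design various objective functions to estimate counterfactual data without bias under different optimization methods. This paper focuses on a survey of deep causal models, and its core contributions are as follows: 1) we provide relevant metrics under multiple treatments and continuous-dose treatments; 2) we give a comprehensive overview of deep causal models from the perspectives of both temporal development and method classification; 3) we contribute a detailed and comprehensive classification and analysis of relevant datasets and source code.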

Estimating Treatment Effects using Neurosymbolic Program Synthesis

Abbavaram Gowtham Reddy, Vineeth N Balasubramanian

Categories: Artificial Intelligence | Machine Learning

2022-11-08

Estimating treatment effects from observational data is a central problem in causal inference. Methods to solve this problem exploit inductive biases and heuristics from causal inference to design multi-head neural network architectures and regularizers. In this work, we propose to use neurosymbolic program synthesis, a data-efficient and interpretable technique, to solve the treatment effect estimation problem. We theoretically show that neurosymbolic programming can solve the treatment effect estimation problem. By designing a Domain Specific Language (DSL) for the treatment effect estimation problem based on the inductive biases used in the literature, we argue that neurosymbolic programming is a better alternative for treatment effect estimation than traditional methods. Our empirical study reveals that our method, which implicitly encodes inductive biases in a DSL, achieves better performance on benchmark datasets than the state-of-the-art methods.

Estimating individual treatment effects under unobserved confounding using binary instruments

Dennis Frauen, Stefan Feuerriegel

Categories: Machine Learning | Machine Learning (Statistics)

2022-08-17

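Estimating individual treatment effects (ITEs) from observational data is relevant in many fields, such as personalized medicine. In practice, however, the treatment assignment is usually confounded by unobserved variables and thus introduces bias. One remedy to remove the bias is the use of instrumental variables (IVs). Such settings are widespread in medicine (e.g., trials where compliance is used as a binary IV). In this paper, we propose a novel, multiply robust machine learning framework, called MRIV, for estimating ITEs using binary IVs, which yields an unbiased ITE estimator. Unlike previous work on binary IVs, our framework estimates the ITE directly via a pseudo-outcome regression. (1) We provide a theoretical analysis in which we show that our framework yields multiply robust convergence rates: our ITE estimator achieves fast convergence even if several nuisance estimators converge slowly. (2) We further show that our framework asymptotically outperforms state-of-the-art plug-in IV methods for ITE estimation. (3) We build upon our theoretical results and propose a tailored deep neural network architecture, called MRIV-Net, for ITE estimation using binary IVs. Across various computational experiments, we demonstrate empirically that our MRIV-Net achieves state-of-the-art performance. To the best of our knowledge, our MRIV is the first multiply robust machine learning framework for estimating ITEs in the binary IV setting.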
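
For background on the binary-IV setting, the sketch below implements the classical Wald estimator, which divides the instrument's effect on the outcome by its effect on treatment uptake. This is not MRIV or MRIV-Net; under the standard IV assumptions it targets a local average effect rather than an individual one. The function name and the toy trial data are assumptions.

```python
def mean(values):
    return sum(values) / len(values)


def wald_estimate(z, a, y):
    """Classical Wald estimator for a binary instrument.

    z: instrument (0/1), a: treatment received (0/1), y: outcome.
    Returns (E[Y|Z=1] - E[Y|Z=0]) / (E[A|Z=1] - E[A|Z=0]).
    """
    y1 = mean([yi for zi, yi in zip(z, y) if zi == 1])
    y0 = mean([yi for zi, yi in zip(z, y) if zi == 0])
    a1 = mean([ai for zi, ai in zip(z, a) if zi == 1])
    a0 = mean([ai for zi, ai in zip(z, a) if zi == 0])
    if a1 == a0:
        raise ValueError("instrument has no effect on treatment uptake")
    return (y1 - y0) / (a1 - a0)


# Hypothetical trial: z is the randomized assignment, a is actual uptake.
z = [0, 0, 0, 0, 1, 1, 1, 1]
a = [0, 0, 0, 1, 0, 1, 1, 1]
y = [1.0, 1.0, 1.0, 3.0, 1.0, 3.0, 3.0, 3.0]

effect = wald_estimate(z, a, y)  # 2.0 for this toy data
```

Here assignment raises uptake from 0.25 to 0.75 and the mean outcome from 1.5 to 2.5, so the ratio is 1.0 / 0.5 = 2.0.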

Deep Treatment-Adaptive Network for Causal Inference

Qian Li, Zhichao Wang, Shaowu Liu, Gang Li, Guandong Xu

Categories: Machine Learning

2021-12-27

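Causal inference can estimate the treatment effect (i.e., the causal effect of a treatment on the outcome), benefiting decision making in various domains. A fundamental challenge in this research is treatment bias in observational data. To increase the validity of observational studies for causal inference, representation-based methods, as the state of the art, have demonstrated superior performance in treatment effect estimation. Most representation-based methods assume that all observed covariates are pre-treatment (i.e., not affected by the treatment) and learn a balanced representation from these observed covariates to estimate the treatment effect. Unfortunately, this assumption is often too strict in practice, as some covariates are changed by intervening on the treatment (i.e., they are post-treatment). A balanced representation learned from the unchanged covariates therefore biases the treatment effect estimate.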

Estimating Individual Treatment Effects using Non-Parametric Regression Models: a Review

Alberto Caron, Gianluca Baio, Ioanna Manolopoulou

Categories: Machine Learning | Machine Learning (Statistics)

2020-09-14

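Large observational data are increasingly available in disciplines such as health, economics, and the social sciences, where researchers are interested in causal questions rather than prediction. In this paper, starting from an empirical study that investigates the effect of participation in school meal programs on health indicators, we examine the problem of estimating heterogeneous treatment effects using non-parametric regression-based methods. First, we introduce the setup and the issues related to conducting causal inference with observational or non-fully randomized data, and how these issues can be tackled with the help of statistical learning tools. Then, we review existing state-of-the-art frameworks that allow individual treatment effects to be estimated via non-parametric regression models and develop a unifying taxonomy of them. After a brief overview of the model selection problem, we illustrate the performance of some of the methods on three different simulated studies. We conclude by demonstrating the use of some of the methods in an empirical analysis of the school meal program data.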

Causal Effect Estimation using Variational Information Bottleneck

Zhenyu Lu, Yurong Cheng, Mingjun Zhong, George Stoian, Ye Yuan, Guoren Wang

Categories: Machine Learning

2021-10-26

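Causal inference aims to estimate the causal effect in a causal relationship when an intervention is applied. Precisely, in a causal model with binary interventions, i.e., control and treatment, the causal effect is simply the difference between the factual and the counterfactual. The difficulty is that the counterfactual must be estimated, so the causal effect can only be an estimate. The key challenge in estimating the counterfactual is to identify the confounders that affect both the outcome and the treatment. A typical approach is to formulate causal inference as a supervised learning problem so that the counterfactual can be predicted. Recent machine learning methods, including linear regression and deep learning models, have been adapted to causal inference. In this paper, we propose a method to estimate causal effects using a variational information bottleneck (CEVIB). The promising point is that VIB is able to naturally distill confounding variables from the data, which enables causal effects to be estimated from observational data. We compare CEVIB with other methods by applying them to three datasets and show that our approach achieves the best performance. We also show experimentally that our method is robust.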

Optimal transport weights for causal inference

Eric Dunipace

Categories: Machine Learning | Machine Learning (Statistics)

2021-09-05

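Weighting methods are a common tool for de-biasing estimates of causal effects. Although there is an increasing number of seemingly disparate methods, many of them can be folded into one unifying regime: causal optimal transport. This new method directly targets distributional balance by minimizing optimal transport distances between treatment and control groups or, more generally, between a source and a target population. Our approach is semiparametrically efficient and model-free, but it can also incorporate moments or any other important functions of the covariates that the researcher wishes to balance. We find that causal optimal transport outperforms competing methods when both the propensity score and outcome models are misspecified, indicating that it is a robust alternative to common weighting methods. Finally, we demonstrate the utility of our method in an external control study examining the effect of misoprostol versus oxytocin for the treatment of post-partum hemorrhage.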

Estimating Potential Outcome Distributions with Collaborating Causal Networks

Tianhui Zhou, William E Carson IV, David Carlson

Categories: Machine Learning (Statistics) | Machine Learning

2021-10-04

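Traditional causal inference approaches use observational study data to estimate the difference between the observed and unobserved outcomes of a potential treatment, known as the conditional average treatment effect (CATE). However, CATE corresponds to a comparison of the first moment alone and may therefore be insufficient to reflect the full picture of treatment effects. As an alternative, estimating the full potential outcome distributions could provide greater insight. However, existing methods for estimating potential outcome distributions often impose restrictive or simplistic assumptions on these distributions. Here, we propose Collaborating Causal Networks (CCN), a novel methodology that goes beyond the estimation of CATE alone by learning the full potential outcome distributions. Estimating outcome distributions via the CCN framework does not require restrictive assumptions about the underlying data-generating process. In addition, CCN facilitates estimation of the utility of each possible treatment and permits individual-specific variation through utility functions. CCN not only extends outcome estimation beyond the traditional risk difference but also enables a more comprehensive decision-making process through the definition of flexible comparisons. Under assumptions commonly made in the causal literature, we show that CCN learns distributions that asymptotically capture the true potential outcome distributions. Furthermore, we propose an adjustment approach that is empirically effective in alleviating sample imbalance between treatment groups in observational data. Finally, we evaluate the performance of CCN in multiple synthetic and semi-synthetic experiments. We demonstrate that CCN learns improved distribution estimates compared with existing Bayesian and deep generative methods, as well as improved decisions with respect to a variety of utility functions.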
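
The point that a first-moment comparison can hide differences between outcome distributions can be seen in a small worked example. The samples, the threshold, and the helper names below are hypothetical and chosen only for illustration; they are unrelated to the CCN implementation.

```python
def mean(values):
    return sum(values) / len(values)


def tail_probability(values, threshold):
    """Fraction of outcomes strictly above `threshold`."""
    return sum(1 for v in values if v > threshold) / len(values)


# Hypothetical potential-outcome samples for one covariate profile.
y_control = [4.0, 5.0, 5.0, 6.0]
y_treated = [0.0, 2.0, 8.0, 10.0]

# Both samples have mean 5.0, so the mean-based effect is zero.
mean_difference = mean(y_treated) - mean(y_control)

# The chance of exceeding 7.0 is 0.5 under treatment and 0.0 under control.
tail_difference = (tail_probability(y_treated, 7.0)
                   - tail_probability(y_control, 7.0))
```

A decision rule based on a utility such as "outcome above 7.0" would distinguish the two treatments even though the difference in means is zero.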

Instrumental Variables in Causal Inference and Machine Learning: A Survey

Anpeng Wu, Kun Kuang, Ruoxuan Xiong, Fei Wu

Categories: Machine Learning | Artificial Intelligence

2022-12-12

Causal inference is the process of using assumptions, study designs, and estimation strategies to draw conclusions about the causal relationships between variables based on data. This allows researchers to better understand the underlying mechanisms at work in complex systems and make more informed decisions. In many settings, we may not fully observe all the confounders that affect both the treatment and outcome variables, complicating the estimation of causal effects. To address this problem, a growing literature in both causal inference and machine learning proposes to use Instrumental Variables (IV). This paper serves as the first effort to systematically and comprehensively introduce and discuss the IV methods and their applications in both causal inference and machine learning. First, we provide the formal definition of IVs and discuss the identification problem of IV regression methods under different assumptions. Second, we categorize the existing work on IV methods into three streams according to the focus of the proposed methods: two-stage least squares with IVs, control function with IVs, and evaluation of IVs. For each stream, we present both the classical causal inference methods and recent developments in the machine learning literature. Then, we introduce a variety of applications of IV methods in real-world scenarios and provide a summary of the available datasets and algorithms. Finally, we summarize the literature, discuss the open problems, and suggest promising future research directions for IV methods and their applications. We also develop a toolkit of the IV methods reviewed in this survey at https://github.com/causal-machine-learning-lab/mliv.
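
Of the three streams named above, two-stage least squares is the simplest to sketch. The code below is a minimal version for a single instrument and a single treatment variable with no additional covariates. It does not use the survey's toolkit; the function names and the toy data (constructed so that the hidden confounder `u` is uncorrelated with the instrument `z` in the sample) are assumptions.

```python
def ols_fit(x, y):
    """Return (intercept, slope) of the least-squares line y ~ x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def two_stage_least_squares(z, x, y):
    """2SLS slope of y on x using a single instrument z."""
    # Stage 1: project the treatment x onto the instrument z.
    intercept, slope = ols_fit(z, x)
    x_hat = [intercept + slope * zi for zi in z]
    # Stage 2: regress the outcome on the projected treatment.
    _, beta = ols_fit(x_hat, y)
    return beta


# Hypothetical data: y = 2*x + 3*u and x = z + u, with u unobserved.
z = [0.0, 0.0, 1.0, 1.0]
u = [-1.0, 1.0, -1.0, 1.0]
x = [zi + ui for zi, ui in zip(z, u)]
y = [2.0 * xi + 3.0 * ui for xi, ui in zip(x, u)]

naive_slope = ols_fit(x, y)[1]               # 4.4, biased by the confounder
iv_slope = two_stage_least_squares(z, x, y)  # 2.0, the structural coefficient
```

The naive regression absorbs the confounder's contribution, while the second stage uses only the variation in `x` that is driven by `z`.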

CausalEGM: a general causal inference framework by encoding generative modeling

Qiao Liu, Zhongren Chen, Wing Hung Wong

Categories: Machine Learning (Statistics) | Machine Learning

2022-12-08

Although understanding and characterizing causal effects have become essential in observational studies, it is challenging when the confounders are high-dimensional. In this article, we develop a general framework CausalEGM for estimating causal effects by encoding generative modeling, which can be applied in both binary and continuous treatment settings. Under the potential outcome framework with unconfoundedness, we establish a bidirectional transformation between the high-dimensional confounders space and a low-dimensional latent space where the density is known (e.g., multivariate normal distribution). Through this, CausalEGM simultaneously decouples the dependencies of confounders on both treatment and outcome and maps the confounders to the low-dimensional latent space. By conditioning on the low-dimensional latent features, CausalEGM can estimate the causal effect for each individual or the average causal effect within a population. Our theoretical analysis shows that the excess risk for CausalEGM can be bounded through empirical process theory. Under an assumption on encoder-decoder networks, the consistency of the estimate can be guaranteed. In a series of experiments, CausalEGM demonstrates superior performance over existing methods for both binary and continuous treatments. Specifically, we find CausalEGM to be substantially more powerful than competing methods in the presence of large sample sizes and high dimensional confounders. The software of CausalEGM is freely available at https://github.com/SUwonglab/CausalEGM.

MALTS: Matching After Learning to Stretch

Harsh Parikh, Cynthia Rudin, Alexander Volfovsky

Categories: Machine Learning

2018-11-18

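We introduce a flexible framework that produces high-quality almost-exact matches for causal inference. Most prior work on matching uses ad hoc distance metrics, which often lead to poor-quality matches, particularly when there are irrelevant covariates. In this work, we learn an interpretable distance metric for matching, which leads to higher-quality matches. The learned distance metric stretches the covariate space according to each covariate's contribution to outcome prediction: this stretching means that mismatches on important covariates carry a larger penalty than mismatches on irrelevant covariates. Our ability to learn flexible distance metrics leads to matches that are useful for estimating conditional average treatment effects.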
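
The effect of stretching the covariate space can be sketched with a per-covariate weighted Euclidean distance. In the sketch below the stretch weights are fixed by hand rather than learned from outcome data, so it shows only the matching step; the function names and the toy units are assumptions and not the MALTS implementation.

```python
import math


def stretched_distance(x, y, stretch):
    """Euclidean distance after scaling each covariate by its stretch weight."""
    return math.sqrt(sum((s * (xi - yi)) ** 2
                         for s, xi, yi in zip(stretch, x, y)))


def nearest_match(unit, candidates, stretch):
    """Index of the candidate closest to `unit` under the stretched distance."""
    return min(range(len(candidates)),
               key=lambda i: stretched_distance(unit, candidates[i], stretch))


# Hypothetical units with two covariates; suppose the second is irrelevant.
treated_unit = [1.0, 0.0]
control_units = [[1.1, 9.0], [3.0, 0.1]]

plain = nearest_match(treated_unit, control_units, stretch=[1.0, 1.0])
stretched = nearest_match(treated_unit, control_units, stretch=[1.0, 0.0])
```

With equal weights the match is driven by the irrelevant second covariate and picks index 1; once that covariate is given zero stretch, the match switches to index 0, which agrees closely on the covariate that matters.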

Causal effect inference with deep latent-variable models

Learning individual-level causal effects from observational data, such as inferring the most effective medication for a specific patient, is a problem of growing importance for policy makers. The most important aspect of inferring causal effects from observational data is the handling of confounders, factors that affect both an intervention and its outcome. A carefully designed observational study attempts to measure all important confounders. However, even if one does not have direct access to all confounders, there may exist noisy and uncertain measurement of proxies for confounders. We build on recent advances in latent variable modeling to simultaneously estimate the unknown latent space summarizing the confounders and the causal effect. Our method is based on Variational Autoencoders (VAE) which follow the causal structure of inference with proxies. We show our method is significantly more robust than existing methods, and matches the state-of-the-art on previous benchmarks focused on individual treatment effects.

Heterogeneous Treatment Effect Estimation using machine learning for Healthcare application: tutorial and benchmark

Yaobin Ling, Pulakesh Upadhyaya, Luyao Chen, Xiaoqian Jiang, Yejin Kim

Categories: Machine Learning

2021-09-27

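Developing new drugs for target diseases is a time-consuming and expensive task, and drug repurposing has become a popular topic in the drug development field. As large amounts of health claims data have become available, many studies have been conducted on such data. Real-world data are noisy and sparse and contain many confounding factors. In addition, many studies have shown that drug effects are heterogeneous across the population. Many advanced machine learning models for estimating heterogeneous treatment effects (HTE) have emerged in recent years and have been applied in the econometrics and machine learning communities. These studies regard medicine and drug development as a main application area, but translational research from HTE methodology to drug development has been limited. We aim to introduce HTE methodology to the healthcare domain and to provide feasibility considerations when applying it, through benchmark experiments on healthcare administrative claims data. We also use the benchmark experiments to show how to interpret and evaluate a model when it is applied to healthcare research. By introducing recent HTE techniques to a broad readership in the biomedical informatics community, we hope to promote the wide adoption of causal inference using machine learning. We also hope to show the feasibility of HTE for personalized drug effectiveness.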

Robust Orthogonal Machine Learning of Treatment Effects

Yiyan Huang, Cheuk Hang Leung, Qi Wu, Xing Yan

Categories: Machine Learning (Statistics) | Machine Learning

2021-03-22

Causal learning is the key to obtaining stable predictions and answering "what if" questions in decision-making. In causal learning, it is central to seek methods to estimate the average treatment effect (ATE) from observational data. Double/Debiased Machine Learning (DML) is one of the prevalent methods to estimate the ATE. However, the DML estimators can suffer from an error-compounding issue and even give extreme estimates when the propensity scores are close to 0 or 1. Previous studies have overcome this issue through some empirical tricks such as propensity score trimming, yet none of the existing works solves it from a theoretical standpoint. In this paper, we propose a Robust Causal Learning (RCL) method to offset the deficiencies of DML estimators. Theoretically, the RCL estimators i) satisfy the (higher-order) orthogonal condition and are as consistent and doubly robust as the DML estimators, and ii) get rid of the error-compounding issue. Empirically, the comprehensive experiments show that: i) the RCL estimators give more stable estimations of the causal parameters than DML; ii) the RCL estimators outperform traditional estimators and their variants when applying different machine learning models on both simulation and benchmark datasets, and a mimic consumer credit dataset generated by WGAN.
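
To make the sensitivity to extreme propensity scores concrete, the sketch below evaluates the standard doubly robust (AIPW) score for the ATE, which is the score commonly used with DML, on given nuisance estimates. It is not the RCL estimator, and it omits cross-fitting and model training; the function name and all numbers are hypothetical.

```python
def aipw_ate(y, t, mu0, mu1, e):
    """Average of the doubly robust (AIPW) score for the ATE.

    y: outcomes, t: binary treatments, mu0/mu1: predicted outcomes under
    control/treatment, e: estimated propensity scores P(T=1 | X).
    """
    scores = []
    for yi, ti, m0, m1, ei in zip(y, t, mu0, mu1, e):
        # Inverse-propensity-weighted residual correction.
        correction = ti * (yi - m1) / ei - (1 - ti) * (yi - m0) / (1.0 - ei)
        scores.append(m1 - m0 + correction)
    return sum(scores) / len(scores)


# Hypothetical nuisance estimates for four units.
y = [3.0, 1.0, 4.0, 2.0]
t = [1, 0, 1, 0]
mu1 = [3.0, 3.0, 3.0, 3.0]
mu0 = [1.0, 1.0, 1.0, 1.0]

stable = aipw_ate(y, t, mu0, mu1, e=[0.5, 0.5, 0.5, 0.5])
# Same data, but the third unit's propensity is estimated as 0.01.
extreme = aipw_ate(y, t, mu0, mu1, e=[0.5, 0.5, 0.01, 0.5])
```

In the second call the third unit's outcome residual of 1.0 is divided by 0.01, so a single unit moves the estimate from 2.0 to about 26.5. This is the kind of amplification that propensity trimming is meant to limit.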

DESCN: Deep Entire Space Cross Networks for Individual Treatment Effect Estimation

Kailiang Zhong, Fengtong Xiao, Yan Ren, Yaorong Liang, Wenqing Yao, Xiaofeng Yang, Ling Cen

Categories: Machine Learning | Artificial Intelligence

2022-07-19

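Causal inference has wide applications in areas such as e-commerce and precision medicine, and its performance depends heavily on accurate estimation of the individual treatment effect (ITE). Conventionally, the ITE is predicted by modeling the treated and control response functions separately in their individual sample spaces. In practice, however, this approach usually encounters two issues: divergent distributions between the treated and control groups caused by treatment bias, and significant sample imbalance in their population sizes. This paper proposes Deep Entire Space Cross Networks (DESCN) to model treatment effects from an end-to-end perspective. DESCN captures the integrated information of the treatment propensity, the response, and the hidden treatment effect in a multi-task learning manner. Our method jointly learns the treatment and response functions in the entire sample space to avoid treatment bias and employs an intermediate pseudo treatment effect prediction network to relieve sample imbalance. Extensive experiments are conducted on a synthetic dataset and a large-scale production dataset from an e-commerce voucher distribution business. The results indicate that DESCN can successfully improve the accuracy of ITE estimation and the uplift ranking performance. A sample of the production dataset and the source code are released to facilitate future research in the community; to the best of our knowledge, this is the first large-scale public biased treatment dataset for causal inference.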

Moderately-Balanced Representation Learning for Treatment Effects with Orthogonality Information

Yiyan Huang, Cheuk Hang Leung, Shumin Ma, Qi Wu, Dongdong Wang, Zhixiang Huang

Categories: Machine Learning | Artificial Intelligence

2022-09-05

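Estimating the average treatment effect (ATE) from observational data is challenging because of selection bias. Existing works mainly tackle this challenge in two ways. Some researchers propose constructing a score function that satisfies the orthogonal condition, which guarantees that the resulting estimator is "orthogonal" and therefore more robust. Others explore representation learning models to achieve a balanced representation between the treated and controlled groups. However, existing studies fail to 1) discriminate between treated and controlled units in the representation space to avoid the over-balancing problem, and 2) fully utilize the "orthogonality information". In this paper, we propose a moderately-balanced representation learning (MBRL) framework based on recent covariate-balancing representation learning methods and orthogonal machine learning theory. The framework protects the representation from being over-balanced via multi-task learning. At the same time, MBRL incorporates the noise orthogonality information in the training and validation stages to achieve better ATE estimation. Comprehensive experiments on benchmark and simulated datasets show the superiority and robustness of our method for treatment effect estimation compared with existing state-of-the-art methods.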

End-to-End Balancing for Causal Continuous Treatment-Effect Estimation

Mohammad Taha Bahadori, Eric Tchetgen Tchetgen, David E. Heckerman

Categories: Machine Learning | Machine Learning (Statistics)

2021-07-27

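We study the problem of observational causal inference with continuous treatments within the framework of inverse propensity score weighting. To obtain stable weights, we design a new algorithm based on entropy balancing that learns weights to directly maximize causal inference accuracy using end-to-end optimization. During optimization, these weights are automatically tuned to the specific dataset and the causal inference algorithm being used. We provide a theoretical analysis demonstrating the consistency of our approach. Using synthetic and real-world data, we show that our algorithm estimates causal effects more accurately than baseline entropy balancing.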
