智能论文笔记

Factorization of the Partial Covariance in Singly-Connected Path Diagrams

Jose M. Peña

分类：机器学习 | (统计)机器学习

2020-02-12

我们通过表示单独连接的路径图来扩展路径分析，两个随机变量的部分协方差根据节点和变量之间的路径中的节点和边缘进行分解。此结果允许我们显示SIMPSON的悖论不能在单个连接的路径图中发生。

translated by 谷歌翻译

Clustering and Structural Robustness in Causal Diagrams

Santtu Tikka , Jouni Helske , Juha Karvanen

分类： (统计)机器学习 | 机器学习

2021-11-08

常用图是表示和可视化因果关系的。对于少量变量，这种方法提供了简洁和清晰的方案的视图。随着下属的变量数量增加，图形方法可能变得不切实际，并且表示的清晰度丢失。变量的聚类是减少因果图大小的自然方式，但如果任意实施，可能会错误地改变因果关系的基本属性。我们定义了一种特定类型的群集，称为Transit Cluster，保证在某些条件下保留因果效应的可识别性属性。我们提供了一种用于在给定图中查找所有传输群集的声音和完整的算法，并演示集群如何简化因果效应的识别。我们还研究了逆问题，其中一个人以群集的图形开始，寻找扩展图，其中因果效应的可识别性属性保持不变。我们表明这种结构稳健性与过境集群密切相关。

translated by 谷歌翻译

On the Representation of Causal Background Knowledge and its Applications in Causal Inference

Zhuangyan Fang , Ruiqi Zhao , Yue Liu , Yangbo He

分类：人工智能 | 机器学习 | (统计)机器学习

2022-07-10

在观察性研究中，经常遇到有关存在或缺乏因果边缘和路径的因果背景知识。由于背景知识而导致的马尔可夫等效dag的子类共享的指向边缘和链接可以由因果关系最大部分定向的无循环图（MPDAG）表示。在本文中，我们首先提供了因果MPDAG的声音和完整的图形表征，并提供了因果MPDAG的最小表示。然后，我们介绍了一种名为Direct Causal子句（DCC）的新颖表示，以统一形式表示所有类型的因果背景知识。使用DCC，我们研究因果背景知识的一致性和等效性，并表明任何因果背景知识集都可以等效地分解为因果MPDAG，以及最小的残留DCC。还提供了多项式时间算法，以检查一致性，等效性并找到分解的MPDAG和残留DCC。最后，有了因果背景知识，我们证明了一个足够且必要的条件来识别因果关系，并且出人意料地发现因果效应的可识别性仅取决于分解的MPDAG。我们还开发了局部IDA型算法，以估计无法识别效应的可能值。模拟表明因果背景知识可以显着提高因果影响的识别性。

translated by 谷歌翻译

Causal inference in statistics: An overview

分类：

This review presents empirical researchers with recent advances in causal inference, and stresses the paradigmatic shifts that must be undertaken in moving from traditional statistical analysis to causal analysis of multivariate data. Special emphasis is placed on the assumptions that underly all causal inferences, the languages used in formulating those assumptions, the conditional nature of all causal and counterfactual claims, and the methods that have been developed for the assessment of such claims. These advances are illustrated using a general theory of causation based on the Structural Causal Model (SCM) described in Pearl (2000a), which subsumes and unifies other approaches to causation, and provides a coherent mathematical foundation for the analysis of causes and counterfactuals. In particular, the paper surveys the development of mathematical tools for inferring (from a combination of data and assumptions) answers to three types of causal queries: (1) queries about the effects of potential interventions, (also called "causal effects" or "policy evaluation") (2) queries about probabilities of counterfactuals, (including assessment of "regret," "attribution" or "causes of effects") and (3) queries about direct and indirect effects (also known as "mediation"). Finally, the paper defines the formal and conceptual relationships between the structural and potential-outcome frameworks and presents tools for a symbiotic analysis that uses the strong features of both.

translated by 谷歌翻译

Foundations of Structural Causal Models with Cycles and Latent Variables

Stephan Bongers , Patrick Forré , Jonas Peters , Joris M. Mooij

分类：人工智能 | 机器学习

2016-11-18

也称为（非参数）结构方程模型（SEMS）的结构因果模型（SCM）被广泛用于因果建模目的。特别是，也称为递归SEM的无循环SCMS，形成了一个研究的SCM的良好的子类，概括了因果贝叶斯网络来允许潜在混淆。在本文中，我们调查了更多普通环境中的SCM，允许存在潜在混杂器和周期。我们展示在存在周期中，无循环SCM的许多方便的性质通常不会持有：它们并不总是有解决方案;它们并不总是诱导独特的观察，介入和反事实分布;边缘化并不总是存在，如果存在边缘模型并不总是尊重潜在的投影;他们并不总是满足马尔可夫财产;他们的图表并不总是与他们的因果语义一致。我们证明，对于SCM一般，这些属性中的每一个都在某些可加工条件下保持。我们的工作概括了SCM的结果，迄今为止仅针对某些特殊情况所知的周期。我们介绍了将循环循环设置扩展到循环设置的简单SCM的类，同时保留了许多方便的无环SCM的性能。用本文，我们的目标是为SCM提供统计因果建模的一般理论的基础。

translated by 谷歌翻译

Feature selection in stratification estimators of causal effects: lessons from potential outcomes, causal diagrams, and structural equations

P. Richard Hahn , Andrew Herren

分类： (统计)机器学习

2022-09-23

估计平均因果效应的理想回归（如果有）是什么？我们在离散协变量的设置中研究了这个问题，从而得出了各种分层估计器的有限样本方差的表达式。这种方法阐明了许多广泛引用的结果的基本统计现象。我们的博览会结合了研究因果效应估计的三种不同的方法论传统的见解：潜在结果，因果图和具有加性误差的结构模型。

translated by 谷歌翻译

The d-separation criterion in Categorical Probability

Tobias Fritz , Andreas Klingler

分类： (统计)机器学习

2022-07-12

D分隔标准通过某些条件独立性检测到关节概率分布与定向无环图的兼容性。在这项工作中，我们通过引入因果模型的分类定义，D分隔的分类概念，并证明了D-Exaration Criterion的抽象版本，从而在分类概率理论的背景下研究了这个问题。这种方法有两个主要好处。首先，分类D分隔是基于拓扑连接的非常直观的标准。其次，我们的结果适用于度量理论概率（具有标准的鲍尔空间），因此提供了与局部和全球马尔可夫属性等效性具有因果关系兼容性的简洁证明。

translated by 谷歌翻译

Optimal structure identification with greedy search

分类：

In this paper we prove the so-called "Meek Conjecture". In particular, we show that if a DAG H is an independence map of another DAG G, then there exists a finite sequence of edge additions and covered edge reversals in G such that (1) after each edge modification H remains an independence map of G and ( 2) after all modifications G = H. As shown by Meek (1997), this result has an important consequence for Bayesian approaches to learning Bayesian networks from data: in the limit of large sample size, there exists a twophase greedy search algorithm that-when applied to a particular sparsely-connected search space-provably identifies a perfect map of the generative distribution if that perfect map is a DAG. We provide a new implementation of the search space, using equivalence classes as states, for which all operators used in the greedy search can be scored efficiently using local functions of the nodes in the domain. Finally, using both synthetic and real-world datasets, we demonstrate that the two-phase greedy approach leads to good solutions when learning with finite sample sizes.

translated by 谷歌翻译

Necessary and sufficient graphical conditions for optimal adjustment sets in causal graphical models with hidden variables

Jakob Runge

分类：机器学习

2021-02-20

解决了选择最佳后门调整集的问题，以解决隐藏和条件变量的图形模型中的因果效应。以前的工作已经定义了实现最小的渐近估计方差，并且在没有隐藏变量的情况下派生的最佳集。对于隐藏变量的情况，可以有设置在没有最佳集合的情况下，并且目前仅导出有限适用性的足够的图形最优标准。在本工作中，最优性的特征在于最大化某个调整信息，该信息允许导出用于存在最佳调整集的必要和足够的图形标准和构造它的定义和算法。此外，如果仅存在有效调整集并且具有比Perkovi {\'C}等所提出的调整集更高（或等于）调整信息，则最佳集是有效的。 [机器学习研究学报，18：1--62,2018]任何图表。结果转化为一类估计的渐近估计差异，其渐近方差遵循某种信息理论关系。数值实验表明，渐近结果也适用于相对较小的样本尺寸，并且最佳调整集或其最小化变体通常也会产生更好的方差，也超出该估计类。令人惊讶的是，在随机创建的设置中，超过90 \％满足最优性条件，指示在许多现实世界场景中也可以保持。代码可用作Python Package \ URL {https://github.com/jakobrunge/tigramite}的一部分。

translated by 谷歌翻译

Learning Invariant Representations under General Interventions on the Response

Kang Du , Yu Xiang

分类：机器学习

2022-08-22

如今，收集来自不同环境的特征和响应对的观察已经变得越来越普遍。结果，由于分布变化，必须将学习的预测变量应用于具有不同分布的数据。一种原则性的方法是采用结构性因果模型来描述培训和测试模型，遵循不变性原则，该原理说响应的条件分布鉴于其预测因素在整个环境中保持不变。但是，当响应干预时，在实际情况下可能会违反该原则。一个自然的问题是，是否仍然可以识别其他形式的不变性来促进在看不见的环境中的预测。为了阐明这种具有挑战性的情况，我们引入了不变的匹配属性（IMP），这是通过附加功能捕获干预措施的明确关系。这导致了一种替代形式的不变性形式，该形式能够对响应进行统一的一般干预措施。我们在离散环境设置和连续环境设置下分析了我们方法的渐近概括误差，在该设置中，通过将其与半磁头变化的系数模型相关联来处理连续情况。我们提出的算法与各种实验环境中的现有方法相比表现出竞争性能。

translated by 谷歌翻译

Semiparametric Inference For Causal Effects In Graphical Models With Hidden Variables

Rohit Bhattacharya , Razieh Nabi , Ilya Shpitser

分类： (统计)机器学习 | 机器学习

2020-03-27

研究了与隐藏变量有关的非循环图（DAG）相关的因果模型中因果效应的识别理论。然而，由于估计它们输出的识别功能的复杂性，因此未耗尽相应的算法。在这项工作中，我们弥合了识别和估算涉及单一治疗和单一结果的人口水平因果效应之间的差距。我们派生了基于功能的估计，在大类隐藏变量DAG中表现出对所识别的效果的双重稳健性，其中治疗满足简单的图形标准;该类包括模型，产生调整和前门功能作为特殊情况。我们还提供必要的和充分条件，其中隐藏变量DAG的统计模型是非分子饱和的，并且意味着对观察到的数据分布没有平等约束。此外，我们推导了一类重要的隐藏变量DAG，这意味着观察到观察到的数据分布等同于完全观察到的DAG等同于（最高的相等约束）。在这些DAG类中，我们推出了实现兴趣目标的半导体效率界限的估计估计值，该估计是治疗满足我们的图形标准的感兴趣的目标。最后，我们提供了一种完整的识别算法，可直接产生基于权重的估计策略，以了解隐藏可变因果模型中的任何可识别效果。

translated by 谷歌翻译

Causal Fairness Analysis

Drago Plecko , Elias Bareinboim

分类：人工智能 | 机器学习 | (统计)机器学习

2022-07-23

基于AI和机器学习的决策系统已在各种现实世界中都使用，包括医疗保健，执法，教育和金融。不再是牵强的，即设想一个未来，自治系统将推动整个业务决策，并且更广泛地支持大规模决策基础设施以解决社会最具挑战性的问题。当人类做出决定时，不公平和歧视的问题普遍存在，并且当使用几乎没有透明度，问责制和公平性的机器做出决定时（或可能会放大）。在本文中，我们介绍了\ textit {Causal公平分析}的框架，目的是填补此差距，即理解，建模，并可能解决决策设置中的公平性问题。我们方法的主要见解是将观察到数据中存在的差异的量化与基本且通常是未观察到的因果机制收集的因果机制的收集，这些机制首先会产生差异，挑战我们称之为因果公平的基本问题分析（FPCFA）。为了解决FPCFA，我们研究了分解差异和公平性的经验度量的问题，将这种变化归因于结构机制和人群的不同单位。我们的努力最终达到了公平地图，这是组织和解释文献中不同标准之间关系的首次系统尝试。最后，我们研究了进行因果公平分析并提出一本公平食谱的最低因果假设，该假设使数据科学家能够评估不同影响和不同治疗的存在。

translated by 谷歌翻译

Causal Discovery in Linear Structural Causal Models with Deterministic Relations

Yuqin Yang , Mohamed Nafea , AmirEmad Ghassami , Negar Kiyavash

分类：机器学习 | 人工智能 | (统计)机器学习

2021-10-30

Linear structural causal models (SCMs)-- in which each observed variable is generated by a subset of the other observed variables as well as a subset of the exogenous sources-- are pervasive in causal inference and casual discovery. However, for the task of causal discovery, existing work almost exclusively focus on the submodel where each observed variable is associated with a distinct source with non-zero variance. This results in the restriction that no observed variable can deterministically depend on other observed variables or latent confounders. In this paper, we extend the results on structure learning by focusing on a subclass of linear SCMs which do not have this property, i.e., models in which observed variables can be causally affected by any subset of the sources, and are allowed to be a deterministic function of other observed variables or latent confounders. This allows for a more realistic modeling of influence or information propagation in systems. We focus on the task of causal discovery form observational data generated from a member of this subclass. We derive a set of necessary and sufficient conditions for unique identifiability of the causal structure. To the best of our knowledge, this is the first work that gives identifiability results for causal discovery under both latent confounding and deterministic relationships. Further, we propose an algorithm for recovering the underlying causal structure when the aforementioned conditions are satisfied. We validate our theoretical results both on synthetic and real datasets.

translated by 谷歌翻译

Efficient Bayesian network structure learning via local Markov boundary search

Ming Gao , Bryon Aragam

分类：人工智能 | 机器学习 | (统计)机器学习

2021-10-12

我们分析了在没有特定分布假设的常规设置中从观察数据的学习中学循环图形模型的复杂性。我们的方法是信息定理，并使用本地马尔可夫边界搜索程序，以便在基础图形模型中递归地构建祖先集。也许令人惊讶的是，我们表明，对于某些图形集合，一个简单的前向贪婪搜索算法（即没有向后修剪阶段）足以学习每个节点的马尔可夫边界。这显着提高了我们在节点的数量中显示的样本复杂性。然后应用这一点以在从文献中概括存在现有条件的新型标识性条件下学习整个图。作为独立利益的问题，我们建立了有限样本的保障，以解决从数据中恢复马尔可夫边界的问题。此外，我们将我们的结果应用于特殊情况的Polytrees，其中假设简化，并提供了多项识别的明确条件，并且在多项式时间中可以识别和可知。我们进一步说明了算法在仿真研究中易于实现的算法的性能。我们的方法是普遍的，用于无需分布假设的离散或连续分布，并且由于这种棚灯对有效地学习来自数据的定向图形模型结构所需的最小假设。

translated by 谷歌翻译

Learning Linear Non-Gaussian Polytree Models

Daniele Tramontano , Anthea Monod , Mathias Drton

分类： (统计)机器学习 | 机器学习

2022-08-13

在图形因果发现的背景下，我们适应了线性非高斯无环模型（Lingams）的多功能框架，以提出新算法以有效地学习polytrees的图形。我们的方法结合了Chow- Liu算法，该算法首先学习了无向树结构，并与新的方案定向边缘。方向方案评估数据生成分布的矩之间的代数关系，并且计算便宜。我们为我们的方法建立了高维的一致性结果，并比较了数值实验中的不同算法版本。

translated by 谷歌翻译

Chow-Liu++: Optimal Prediction-Centric Learning of Tree Ising Models

Enric Boix-Adsera , Guy Bresler , Frederic Koehler

分类：机器学习

2021-06-07

我们考虑从数据学习树结构ising模型的问题，使得使用模型计算的后续预测是准确的。具体而言，我们的目标是学习一个模型，使得小组变量$ S $的后海报$ p（x_i | x_s）$。自推出超过50年以来，有效计算最大似然树的Chow-Liu算法一直是学习树结构图形模型的基准算法。 [BK19]示出了关于以预测的局部总变化损耗的CHOW-LIU算法的样本复杂性的界限。虽然这些结果表明，即使在恢复真正的基础图中也可以学习有用的模型是不可能的，它们的绑定取决于相互作用的最大强度，因此不会达到信息理论的最佳选择。在本文中，我们介绍了一种新的算法，仔细结合了Chow-Liu算法的元素，以便在预测的损失下有效地和最佳地学习树ising模型。我们的算法对模型拼写和对抗损坏具有鲁棒性。相比之下，我们表明庆祝的Chow-Liu算法可以任意次优。

translated by 谷歌翻译

Detecting hidden confounding in observational data using multiple environments

Rickard K. A. Karlsson , Jesse H. Krijthe

分类：机器学习 | (统计)机器学习

2022-05-27

A common assumption in causal inference from observational data is that there is no hidden confounding. Yet it is, in general, impossible to verify the presence of hidden confounding factors from a single dataset. Under the assumption of independent causal mechanisms underlying the data generating process, we demonstrate a way to detect unobserved confounders when having multiple observational datasets coming from different environments. We present a theory for testable conditional independencies that are only absent during hidden confounding and examine cases where we violate its assumptions: degenerate & dependent mechanisms, and faithfulness violations. Additionally, we propose a procedure to test these independencies and study its empirical finite-sample behavior using simulation studies and semi-synthetic data based on a real-world dataset. In most cases, our theory correctly predicts the presence of hidden confounding, particularly when the confounding bias is~large.

translated by 谷歌翻译

Representation of Context-Specific Causal Models with Observational and Interventional Data

Eliana Duarte , Liam Solus

分类： (统计)机器学习

2021-01-22

我们考虑代表代理模型的问题，该模型使用我们称之为CSTREES的阶段树模型的适当子类对离散数据编码离散数据的原因模型。我们表明，可以通过集合表达CSTREE编码的上下文专用信息。由于并非所有阶段树模型都承认此属性，CSTREES是一个子类，可提供特定于上下文的因果信息的透明，直观和紧凑的表示。我们证明了CSTREEES承认全球性马尔可夫属性，它产生了模型等价的图形标准，概括了Verma和珍珠的DAG模型。这些结果延伸到一般介入模型设置，使CSTREES第一族的上下文专用模型允许介入模型等价的特征。我们还为CSTREE的最大似然估计器提供了一种封闭式公式，并使用它来表示贝叶斯信息标准是该模型类的本地一致的分数函数。在模拟和实际数据上分析了CSTHEELE的性能，在那里我们看到与CSTREELE而不是一般上演树的建模不会导致预测精度的显着损失，同时提供了特定于上下文的因果信息的DAG表示。

translated by 谷歌翻译

Causal Inference in medicine and in health policy, a summary

Wenhao Zhang , Ramin Ramezani , Arash Naeim

分类：机器学习

2021-05-10

数据科学任务可以被视为了解数据的感觉或测试关于它的假设。从数据推断的结论可以极大地指导我们做出信息做出决定。大数据使我们能够与机器学习结合执行无数的预测任务，例如鉴定患有某种疾病的高风险患者并采取可预防措施。然而，医疗保健从业者不仅仅是仅仅预测的内容 - 它们也对输入特征和临床结果之间的原因关系感兴趣。了解这些关系将有助于医生治疗患者并有效降低风险。通常通过随机对照试验鉴定因果关系。当科学家和研究人员转向观察研究并试图吸引推论时，这种试验通常是不可行的。然而，观察性研究也可能受到选择和/或混淆偏差的影响，这可能导致错误的因果结论。在本章中，我们将尝试突出传统机器学习和统计方法中可能出现的一些缺点，以分析观察数据，特别是在医疗保健数据分析域中。我们将讨论因果化推理和方法，以发现医疗领域的观测研究原因。此外，我们将展示因果推断在解决某些普通机器学习问题等中的应用，例如缺少数据和模型可运输性。最后，我们将讨论将加强学习与因果关系相结合的可能性，作为反击偏见的一种方式。

translated by 谷歌翻译

Correlation detection in trees for planted graph alignment

Luca Ganassali , Laurent Massoulié , Marc Lelarge

分类：机器学习 | (统计)机器学习

2021-07-15

Motivated by alignment of correlated sparse random graphs, we introduce a hypothesis testing problem of deciding whether or not two random trees are correlated. We obtain sufficient conditions under which this testing is impossible or feasible. We propose MPAlign, a message-passing algorithm for graph alignment inspired by the tree correlation detection problem. We prove MPAlign to succeed in polynomial time at partial alignment whenever tree detection is feasible. As a result our analysis of tree detection reveals new ranges of parameters for which partial alignment of sparse random graphs is feasible in polynomial time. We then conjecture that graph alignment is not feasible in polynomial time when the associated tree detection problem is impossible. If true, this conjecture together with our sufficient conditions on tree detection impossibility would imply the existence of a hard phase for graph alignment, i.e. a parameter range where alignment cannot be done in polynomial time even though it is known to be feasible in non-polynomial time.

translated by 谷歌翻译