2011
DOI: 10.1016/j.neunet.2011.05.017
|View full text |Cite
|
Sign up to set email alerts
|

Estimating exogenous variables in data with more variables than observations

Abstract: Many statistical methods have been proposed to estimate causal models in classical situations with fewer variables than observations. However, modern datasets including gene expression data increase the needs of high-dimensional causal modeling in challenging situations with orders of magnitude more variables than observations. In this paper, we propose a method to find exogenous variables in a linear non-Gaussian causal model, which requires much smaller sample sizes than conventional methods and works even u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2012
2012
2023
2023

Publication Types

Select...
4
3
1

Relationship

4
4

Authors

Journals

citations
Cited by 12 publications
(4 citation statements)
references
References 17 publications
(20 reference statements)
0
4
0
Order By: Relevance
“…Though the Gaussian approximation has been a common approach [19], real-world data could be considered more or less non-Gaussian. In fact, non-Gaussian data appear in many applications including bioinformatics [20,21].…”
Section: Modelmentioning
confidence: 99%
“…Though the Gaussian approximation has been a common approach [19], real-world data could be considered more or less non-Gaussian. In fact, non-Gaussian data appear in many applications including bioinformatics [20,21].…”
Section: Modelmentioning
confidence: 99%
“…The singular Wishart distribution has been studied for instance by Uhlig (1994), Janik & Nowak (2003), Srivastava (2003), Yu & Zhang (2002) and Yu, Ryu & Park (2014). As pointed out by Dudoit, Fridlyand & Speed (2002) and Sogawa, Shimizu & Shimamura (2011), cases where the number of variables, p$$ p $$, exceeds the number of observations frequently occur in gene expression datasets, for instance. Ratnarajah (2005) made use of the complex singular Wishart distribution to address a problem involving multiple‐antenna systems.…”
Section: Introductionmentioning
confidence: 99%
“…A major reason for this disadvantage is that this method explicitly or implicitly assumes the Gaussianity of data, and typically utilizes only the covariance structures of data for estimating causal relations. However, in many applications, it is common for non-Gaussian data to be obtained (Micceri, 1989;Hyvärinen et al, 2001;Smith et al, 2011;Sogawa et al, 2011;Moneta et al, 2013), which means that more information can be contained in the data distribution than in the covariance matrix. Bentler (1983) proposed making use of non-Gaussianity of data for estimating structural equation models, although this had not been extensively studied until recently.…”
Section: Introductionmentioning
confidence: 99%