2017
DOI: 10.1214/17-ejs1229
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale mode identification and data-driven sciences

Abstract: Bump-hunting or mode identification is a fundamental problem that arises in almost every scientific field of data-driven discovery. Surprisingly, very few data modeling tools are available for automatic (not requiring manual case-by-case investigation), objective (not subjective), and nonparametric (not based on restrictive parametric model assumptions) mode discovery, which can scale to large data sets. This article introduces LPMode-an algorithm based on a new theory for detecting multimodality of a probabil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
30
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 23 publications
(30 citation statements)
references
References 43 publications
0
30
0
Order By: Relevance
“…This article provides a pragmatic and comprehensive framework for nonlinear time series modeling that is easier to use, more versatile and has a strong theoretical foundation based on recently developed theory on unified algorithms of data science via LP modeling (Parzen and Mukhopadhyay, 2012, 2013a,b, Mukhopadhyay and Parzen, 2014, Mukhopadhyay, 2016, 2017. Summary and broader implications of the proposed research:…”
Section: Resultsmentioning
confidence: 98%
See 1 more Smart Citation
“…This article provides a pragmatic and comprehensive framework for nonlinear time series modeling that is easier to use, more versatile and has a strong theoretical foundation based on recently developed theory on unified algorithms of data science via LP modeling (Parzen and Mukhopadhyay, 2012, 2013a,b, Mukhopadhyay and Parzen, 2014, Mukhopadhyay, 2016, 2017. Summary and broader implications of the proposed research:…”
Section: Resultsmentioning
confidence: 98%
“…Does the Normal probability distribution provide a good fit to the S&P 500 return data? insight into this question can be gained by looking at the distribution of the random variable U = G(Y ), called comparison density (Parzen, 1997, Mukhopadhyay, 2017 given by:…”
Section: Non-normality Diagnosismentioning
confidence: 99%
“…In the context of signal identification for instance, this implies that the researcher can specify the density function of the events associated to the signal (e.g, a Gaussian bump). In situations where this cannot be done, one possibility is to refer to nonparametric inferential methods (e.g., Chen et al, 2016;Mukhopadhyay, 2017;Algeri, 2019). More work is needed to extend TOHM to discrete regions Θ and provide a formal justification of its validity in non-regular setting, such as the one in Example 3.…”
Section: Discussionmentioning
confidence: 99%
“…In order to understand if G is a good candidate for F, it is convenient to express the relationship among the two in a concise manner. The skew-G density model [9] is a universal representation scheme defined by…”
Section: Lp Approach To Statistical Modellingmentioning
confidence: 99%
“…with mean E[ LP j ] = LP j , variance V ( LP j ) = σ 2 j /n, where σ 2 j = 1 0 (Leg j (u) − LP j ) 2 d(u; G, F)∂ u (see [17]) and covariance Cov( LP j , LP k ) = σ jk /n, with σ 2 jk = 1 0 (Leg j (u) − LP j )(Leg k (u) − LP k )d(u; G, F)∂ u. Further, it can be shown [9] that, when f ≡ g,…”
Section: B Lp Density Estimatementioning
confidence: 99%