2018
DOI: 10.1016/j.jmva.2018.05.006
|View full text |Cite
|
Sign up to set email alerts
|

A model selection approach for multiple sequence segmentation and dimensionality reduction

Abstract: In this paper we consider the problem of segmenting n aligned random sequences of equal length m into a finite number of independent blocks. We propose to use a penalized maximum likelihood criterion to infer simultaneously the number of points of independence as well as the position of each one of these points. We show how to compute the estimator efficiently by means of a dynamic programming algorithm with time complexity O(m 2 n). We also propose another algorithm, called hierarchical algorithm, that provid… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2

Relationship

3
2

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 27 publications
0
4
0
Order By: Relevance
“…Classically, change-point detection refers to the problem of determining the times at which sequential observed data undergoes an abrupt change. In that type of setting, a change-point may refer to changes in mean ( Page 1954 , Tsay 1988 , Keshavarz et al 2018 ), variance ( Chen and Gupta 1997 , Hawkins and Zamba 2005 ), regression slope ( Chow 1960 , Qu and Perron 2007 ), general distributions forms ( Matteson and James 2014 ), or other types of change ( Castro et al 2018 , Leonardi et al 2021 ). Many of these methods have been applied to a wide range of problems such as stream anomaly detection in industry ( Li et al 2018 ), monitoring of sleep stages using EEG/EMG ( Agudelo-España et al 2020 ), identification of cyberattacks on networks ( Tartakovsky et al 2006 ), between many other interesting applications.…”
Section: Introductionmentioning
confidence: 99%
“…Classically, change-point detection refers to the problem of determining the times at which sequential observed data undergoes an abrupt change. In that type of setting, a change-point may refer to changes in mean ( Page 1954 , Tsay 1988 , Keshavarz et al 2018 ), variance ( Chen and Gupta 1997 , Hawkins and Zamba 2005 ), regression slope ( Chow 1960 , Qu and Perron 2007 ), general distributions forms ( Matteson and James 2014 ), or other types of change ( Castro et al 2018 , Leonardi et al 2021 ). Many of these methods have been applied to a wide range of problems such as stream anomaly detection in industry ( Li et al 2018 ), monitoring of sleep stages using EEG/EMG ( Agudelo-España et al 2020 ), identification of cyberattacks on networks ( Tartakovsky et al 2006 ), between many other interesting applications.…”
Section: Introductionmentioning
confidence: 99%
“…These are some of the many existing references that use hypothesis testing to discover or study independence. However, to the best of our knowledge, the estimation of points of independence, as proposed in this work, has not received much attention, aside from the work presented in Castro et al (2018). In the later, the authors consider this problem to detect recombination hotspots in single nucleotide polymorphisms data, assuming that the random vector takes values in A d , where A is a finite alphabet and the observations are independent.…”
Section: Introductionmentioning
confidence: 99%
“…We allow for multiple change points without assuming an a priori fixed, known number. The penalized maximum likelihood approach has also been considered recently in Castro et al (2018); Leonardi et al (2021), but on a different type of change-point problem. There, the approach was introduced for non-parametric discrete distributions in order to detect points of independence on a multidimensional random vector, under independent or non-independent sampling.…”
mentioning
confidence: 99%