volume 13, issue 1, P18-33 2020
DOI: 10.1007/s12561-020-09278-z
View full text
Ning Hao, Yue Selena Niu, Feifei Xiao, Heping Zhang

Abstract: In many applications such as copy number variant (CNV) detection, the goal is to identify short segments on which the observations have different means or medians from the background. Those segments are usually short and hidden in a long sequence, and hence are very challenging to find. We study a super scalable short segment (4S) detection algorithm in this paper. This nonparametric method clusters the locations where the observations exceed a threshold for segment detection. It is computationally efficient …

expand abstract