Abstract. In applications ranging from DNA sequencing through archeological dating to sparse matrix reordering, a recurrent problem is the sequencing of elements in such a way that highly correlated pairs of elements are near each other. That is, given a correlation function f reflecting the desire for each pair of elements to be near each other, find all permutations π with the property that ifThis seriation problem is a generalization of the well-studied consecutive ones problem. We present a spectral algorithm for this problem that has a number of interesting features. Whereas most previous applications of spectral techniques provide only bounds or heuristics, our result is an algorithm that correctly solves a nontrivial combinatorial problem. In addition, spectral methods are being successfully applied as heuristics to a variety of sequencing problems, and our result helps explain and justify these applications.Key words. seriation, consecutive ones property, eigenvector, Fiedler vector, analysis of algorithms AMS subject classifications. 05C15, 15A18, 15A48, 68E15PII. S00975397952857711. Introduction. Many applied computational problems involve ordering a set so that closely coupled elements are placed near each other. This is the underlying problem in such diverse applications as genomic sequencing, sparse matrix envelope reduction, and graph linear arrangement as well as less familiar settings such as archeological dating. In this paper we present a spectral algorithm for this class of problems. Unlike traditional combinatorial methods, our approach uses an eigenvector of a matrix to order the elements. Our main result is that this approach correctly solves an important ordering problem we call the seriation problem which includes the wellknown consecutive ones problem (C1P) [5] as a special case.More formally, we are given a set of n elements to sequence; that is, we wish to bijectively map the elements to the integers 1, . . . , n. We also have a symmetric, real valued correlation function (sometimes called a similarity function) that reflects the desire for elements i and j to be near each other in the sequence. We now wish to find all ways to sequence the elements so that the correlations are consistent; that is, if π is our permutation of elements andAlthough there may be an exponential number of such orderings, they can all be described in a compact data structure known as a PQ-tree [5], which we review in the next section. Not all correlation functions allow for a consistent sequencing. If a consistent ordering is possible we will say the problem is well posed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.