Alejandro Gustavo Vigo Pacheco scite author profile

Computing the matching statistics of patterns with respect to a text is a fundamental task in bioinformatics, but a formidable one when the text is a highly compressed genomic database. Bannai et al. gave an efficient solution for this case, which Rossi et al. recently implemented, but it uses two passes over the patterns and buffers a pointer for each character during the first pass. In this paper, we simplify their solution and make it streaming, at the cost of slowing it down slightly. This means that, first, we can compute the matching statistics of several long patterns (such as whole human chromosomes) in parallel while still using a reasonable amount of RAM; second, we can compute matching statistics online with low latency and thus quickly recognize when a pattern becomes incompressible relative to the database. Our code is available at https://github.com/koeppl/phoni .

show abstract

Grammar-Compressed Indexes with Logarithmic Search Time

Claude¹,

Navarro²,

Pacheco³

2020

Preprint

View full text Add to dashboard Cite

Let a text T [1..n] be the only string generated by a context-free grammar with g (terminal and nonterminal) symbols, and of size G (measured as the sum of the lengths of the right-hand sides of the rules). Such a grammar, called a grammar-compressed representation of T , can be encoded using essentially G lg g bits. We introduce the first grammar-compressed index that uses O(G lg n) bits and can find the occ occurrences of patterns P [1..m] in time O((m 2 + occ) lg G). We implement the index and demonstrate its practicality in comparison with the state of the art, on highly repetitive text collections.

show abstract

Esfuerzos residuales inducidos por abolladuras en tuberías

Ortíz¹,

Riaño²,

Pacheco³

1999

Revista de Ingeniería

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.