2023
DOI: 10.1101/2023.03.09.531845
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

General encoding of canonicalk-mers

Abstract: To index or compare sequences efficiently, often k-mers, i.e., substrings of fixed length k, are used. In order to store them in a table, or to assign them to different tables or threads, k-mers are encoded as integers. One way to ensure an even distribution is to use minimal perfect hashing, i.e., a bijective mapping between all possible sigmak k-mers and the interval [0, sigma k-1], where sigma is the alphabet size. In many applications, e.g., when the reading direction of a DNA-sequence is ambiguous, \emph{… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Publication Types

Select...

Relationship

0
0

Authors

Journals

citations
Cited by 0 publications
references
References 16 publications
(18 reference statements)
0
0
0
Order By: Relevance

No citations

Set email alert for when this publication receives citations?