Fast and efficient short read mapping based on a succinct hash index

Zhang, Haowen; Chan, Yuandong; Fan, Kaichao; Schmidt, Bertil; Liu, Weiguo

doi:10.1186/s12859-018-2094-5

Cited by 20 publications

(22 citation statements)

References 25 publications

(44 reference statements)

“…Hash indexes are efficient with precise values searching queries, but the returned values are not sorted. Hash indexes are optimized for queries that use the equality operator and they support full index scans [39].…”

Section: Figure (mentioning)

confidence: 99%

Simulation and Modeling of Telocytes Behavior in Signaling and Intercellular Communication Processes

Crețoiu, Roateşi, Bica, et al. 2020. IJMS

Background: Telocytes (TCs) are unique interstitial or stromal cells of mesodermal origin, defined by long cellular extensions called telopodes (Tps) which form a network, connecting them to surrounding cells. TCs were previously found around stem and progenitor cells, and were thought to be most likely involved in local tissue metabolic equilibrium and regeneration. The roles of telocytes are still under scientific scrutiny, with existing studies suggesting they possess various functions depending on their location. Methods: Human myometrium biopsies were collected from pregnant and non-pregnant women, telocytes were then investigated in myometrial interstitial cell cultures based on morphological criteria and later prepared for time-lapse microscopy. Semi-analytical and numerical solutions were developed to highlight the geometric characteristics and the behavior of telocytes. Results: Results were gathered in a database which would further allow efficient telocyte tracking and indexing in a content-based image retrieval (CBIR) of digital medical images. Mathematical analysis revealed pivotal information regarding the homogeneity, hardness and resistance of telocytes’ structure. Cellular activity models were monitored in vitro, therefore supporting the creation of databases of telocyte images. Conclusions: The obtained images were analyzed, using segmentation techniques and mathematical models in conjunction with computer simulation, in order to depict TCs behavior in relation to surrounding cells. This paper brings an important contribution to the development of bioinformatics systems by creating software-based telocyte models that could be used both for diagnostic and educational purposes.
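The citation statement for this entry notes that hash indexes suit equality queries but return values unsorted. A minimal Python sketch of that contrast follows; the `records` data and the function names are hypothetical and chosen only for illustration, and a sorted key list with `bisect` stands in for an ordered index.

```python
import bisect

# Hypothetical (key, value) records, deliberately not in key order.
records = [(17, "chr1"), (4, "chr2"), (42, "chrX"), (8, "chr3")]

# Hash index: average constant-time equality lookup, no key ordering.
hash_index = {key: value for key, value in records}

# Ordered index stand-in: a sorted list of keys searched with bisect.
sorted_keys = sorted(key for key, _ in records)


def equality_lookup(key):
    """Return the value stored under key, or None if the key is absent."""
    return hash_index.get(key)


def range_lookup(low, high):
    """Return (key, value) pairs with low <= key <= high, in key order."""
    lo = bisect.bisect_left(sorted_keys, low)
    hi = bisect.bisect_right(sorted_keys, high)
    return [(k, hash_index[k]) for k in sorted_keys[lo:hi]]
```

An equality query only needs the hash index, while the range query depends on the sorted key list, which is the trade-off the statement describes.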

“…We chose an error rate of 10% (which is at most two errors for a read of size 20), and discarded reads with more than 2 errors. Other all-mappers, such as FEM (Zhang et al, 2018), Hobbes (Ahmadi et al, 2011), and BitMapper2 (Cheng et al, 2019), could not be used, because the reads were too short for an edit distance of 2.…”

Section: Results (mentioning)

confidence: 99%

srnaMapper: an optimal mapping tool for sRNA-Seq reads

Zytnicki, Gaspin. 2021. Preprint

Motivation: Sequencing is the key method to study the impact of short RNAs, which include micro RNAs, tRNA-derived RNAs, and piwi-interacting RNAs, among others. The first step to make use of these reads is to map them to a genome. Existing mapping tools have been developed with long RNAs in mind, and, so far, no tool has been conceived for short RNAs. However, short RNAs have several distinctive features which make them different from messenger RNAs: they are shorter (not greater than 200bp), they are often redundant, they can be produced by duplicated loci, and they may be edited at their ends. Results: In this work, we present a new tool, srnaMapper, that maps these reads with all these objectives in mind. We show on two data sets that srnaMapper is more efficient considering computation time and edition error handling: it quickly retrieves all the hits, with an arbitrary number of errors. Availability: srnaMapper source code is available at https://github.com/mzytnicki/srnaMapper. Contact: matthias.zytnicki@inrae.fr
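The citation statement for this entry relates an error rate to a maximum number of edits per read (two errors for a read of size 20). The sketch below works through that arithmetic and a plain dynamic-programming edit distance; it is an illustration only, not the method of srnaMapper or FEM, and the function names are hypothetical.

```python
def max_errors(read_length, error_rate_percent):
    """Largest whole number of errors allowed at the given percentage rate."""
    return (read_length * error_rate_percent) // 100


def edit_distance(a, b):
    """Levenshtein distance computed row by row with dynamic programming."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # match or substitution
        previous = current
    return previous[-1]


def keep_read(read, reference_segment, error_rate_percent=10):
    """Keep a read only if its edit distance stays within the error budget."""
    budget = max_errors(len(read), error_rate_percent)
    return edit_distance(read, reference_segment) <= budget
```

Integer arithmetic is used for the budget so that the result does not depend on floating-point rounding.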

“…Briefly explained, FM-index alignment tools are derived from the Burrows-Wheeler Transform [68], a method to sufficiently compress large amounts of data and find approximate matches of sequences in the reference genome [69]. Hash table-based aligners use the seed-and-extend method in combination with additional alignment algorithms [68, 70, 71].…”

Section: Precautions Of Data Output From Sequencing (mentioning)

confidence: 99%

Next Generation Sequencing Technology in the Clinic and Its Challenges

Vestergaard, Oliveira, Høgdall, et al. 2021. Cancers

Data analysis has become a crucial aspect in clinical oncology to interpret output from next-generation sequencing-based testing. NGS being able to resolve billions of sequencing reactions in a few days has consequently increased the demand for tools to handle and analyze such large data sets. Many tools have been developed since the advent of NGS, featuring their own peculiarities. Increased awareness when interpreting alterations in the genome is therefore of utmost importance, as the same data using different tools can provide diverse outcomes. Hence, it is crucial to evaluate and validate bioinformatic pipelines in clinical settings. Moreover, personalized medicine implies treatment targeting efficacy of biological drugs for specific genomic alterations. Here, we focused on different sequencing technologies, features underlying the genome complexity, and bioinformatic tools that can impact the final annotation. Additionally, we discuss the clinical demand and design for implementing NGS.
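The citation statement for this entry mentions that hash table-based aligners use the seed-and-extend method. A toy Python sketch of that idea is given here under simplifying assumptions: seeds are exact k-mers, and the "extend" step is replaced by a mismatch count over an ungapped window, which is much simpler than the alignment algorithms real mappers use. All names are hypothetical.

```python
from collections import defaultdict


def build_kmer_index(reference, k):
    """Hash table mapping every k-mer of the reference to its start positions."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def map_read(index, reference, read, k, max_mismatches):
    """Seed: look up each k-mer of the read. Verify: count mismatches."""
    candidates = set()
    for offset in range(len(read) - k + 1):
        for pos in index.get(read[offset:offset + k], []):
            start = pos - offset  # implied start of the read on the reference
            if 0 <= start <= len(reference) - len(read):
                candidates.add(start)

    hits = []
    for start in sorted(candidates):
        window = reference[start:start + len(read)]
        mismatches = sum(1 for x, y in zip(read, window) if x != y)
        if mismatches <= max_mismatches:
            hits.append(start)
    return hits
```

For example, with the reference `ACGTACGTTAGC` and `k = 3`, the read `CGTTAG` yields candidate starts 1 and 5 from its seeds; only position 5 survives verification when no mismatches are allowed.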
