2014
DOI: 10.1186/1471-2105-15-311
|View full text |Cite
|
Sign up to set email alerts
|

PLEK: a tool for predicting long non-coding RNAs and messenger RNAs based on an improved k-mer scheme

Abstract: BackgroundHigh-throughput transcriptome sequencing (RNA-seq) technology promises to discover novel protein-coding and non-coding transcripts, particularly the identification of long non-coding RNAs (lncRNAs) from de novo sequencing data. This requires tools that are not restricted by prior gene annotations, genomic sequences and high-quality sequencing.ResultsWe present an alignment-free tool called PLEK (predictor of long non-coding RNAs and messenger RNAs based on an improved k-mer scheme), which uses a comp… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
538
0
5

Year Published

2016
2016
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 630 publications
(569 citation statements)
references
References 58 publications
0
538
0
5
Order By: Relevance
“…SNP detection was conducted with the SAMtools pileup program [30] [32], CNCI [33], CPC [34] and Pfam [35] were applied to assess the protein-coding potential of the above-obtained transcripts and the intersection of the results from these software tools was taken as the final result. In order to identify known lncRNAs, BLASTN tool was used to align lncRNA candidates to ALDB (A DomesticAnimal Long Noncoding RNA Database) [36], a database with a focus on the domestic-animal lncRNAs, with the settings of identity=100%, mismatch=0, E-value<1e-10 and gap_opening=0.…”
Section: Snp Detectionmentioning
confidence: 99%
“…SNP detection was conducted with the SAMtools pileup program [30] [32], CNCI [33], CPC [34] and Pfam [35] were applied to assess the protein-coding potential of the above-obtained transcripts and the intersection of the results from these software tools was taken as the final result. In order to identify known lncRNAs, BLASTN tool was used to align lncRNA candidates to ALDB (A DomesticAnimal Long Noncoding RNA Database) [36], a database with a focus on the domestic-animal lncRNAs, with the settings of identity=100%, mismatch=0, E-value<1e-10 and gap_opening=0.…”
Section: Snp Detectionmentioning
confidence: 99%
“…Kyoto Encyclopedia of Genes and Genomes (KEGG) mapping was used for pathway analysis and KEGG Automatic Annotation Server (KAAS) to assign I. scapularis KEGG orthology (KO) identifiers to the transcripts (Moriya et al, 2007). The protein-coding potential of the transcripts were determined by the Coding Potential Calculator (CPC) package (Kong et al, 2007), Coding-Potential Assessment Tool (CPAT v1.2.2) package (Wang et al, 2013) and Predictor of lncRNAs and mRNAs based on k-mer scheme (PLEK v1.2) package (Li et al, 2014).…”
Section: Transcriptome Annotationmentioning
confidence: 99%
“…K-mer features were found to be somewhat robust to indel sequencing errors (Li et al, 2014); however, the proposed K-mer wildcards must be more robust than K-mer since it introduces wildcards, allowing for additional non-indel nuclear change sequencing errors. This indicates that the proposed features are robust.…”
Section: Robustness To Sequencing Errormentioning
confidence: 99%