Characterizing ncRNAs in human pathogenic protists using high-throughput sequencing technology

Collins, Lesley J.

doi:10.3389/fgene.2011.00096

Cited by 6 publications

(5 citation statements)

References 72 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…We set out to test an intriguing and revolutionary hypothesis, that miRNAs evolved once in the last common ancestor (LCA) of crown eukaryotes and that this ancestor shared miRNAs present today in plants and animals. Although this hypothesis is becoming widely accepted 77, 78, 88–91 our data clearly refute it. Instead, our results show that of the 73 plant and animal miRNAs identified in protists, all fail to meet the criteria required for the identification of miRNAs.…”

Section: Discussioncontrasting

confidence: 46%

“…Some of the studies 27–30 describing miRNAs in protists did not follow the established conventions for naming miRNAs 16, 62, 63. Consequently, authors annotated novel miRNAs using names of already existing animal and plant miRNAs, implying homology, which can readily be misinterpreted 77, 78. For example, Saraiya et al 28 described miR‐2 regulating the expression of 22 variant surface protein genes in Giardia .…”

Section: Discussionmentioning

confidence: 99%

See 1 more Smart Citation

Do miRNAs have a deep evolutionary history?

2012

View full text Add to dashboard Cite

The recent discovery of microRNAs (miRNAs) in unicellular eukaryotes, including miRNAs known previously only from animals or plants, implies that miRNAs have a deep evolutionary history among eukaryotes. This contrasts with the prevailing view that miRNAs evolved convergently in animals and plants. We re-evaluate the evidence and find that none of the 73 plant and animal miRNAs described from protists meet the required criteria for miRNA annotation and, by implication, animals and plants did not acquire any of their respective miRNA genes from the crown ancestor of eukaryotes. Furthermore, of the 159 novel miRNAs previously identified among the seven species of unicellular protists examined, only 28 from the algae Ectocarpus and Chlamydomonas, meet the criteria for miRNA annotation. Therefore, at present only five groups of eukaryotes are known to possess miRNAs, indicating that miRNAs have evolved independently within eukaryotes through exaptation of their shared inherited RNAi machinery.

show abstract

Section: Discussioncontrasting

confidence: 46%

Section: Discussionmentioning

confidence: 99%

Do miRNAs have a deep evolutionary history?

2012

View full text Add to dashboard Cite

show abstract

“…Long non-coding RNAs (lncRNAs) are often defined as transcribed but not translated RNA segments larger than sRNAs (>200 nucleotides) [ 29 ]. lncRNAs affect chromosomal dynamics, the telomeres and structural organization [ 20 , 21 , 23 ].…”

Section: Introductionmentioning

confidence: 99%

The parasite Trichomonas vaginalis expresses thousands of pseudogenes and long non-coding RNAs independently from functional neighbouring genes

et al. 2014

View full text Add to dashboard Cite

BackgroundThe human pathogen Trichomonas vaginalis is a parabasalian flagellate that is estimated to infect 3% of the world’s population annually. With a 160 megabase genome and up to 60,000 genes residing in six chromosomes, the parasite has the largest genome among sequenced protists. Although it is thought that the genome size and unusual large coding capacity is owed to genome duplication events, the exact reason and its consequences are less well studied.ResultsAmong transcriptome data we found thousands of instances, in which reads mapped onto genomic loci not annotated as genes, some reaching up to several kilobases in length. At first sight these appear to represent long non-coding RNAs (lncRNAs), however, about half of these lncRNAs have significant sequence similarities to genomic loci annotated as protein-coding genes. This provides evidence for the transcription of hundreds of pseudogenes in the parasite. Conventional lncRNAs and pseudogenes are expressed in Trichomonas through their own transcription start sites and independently from flanking genes in Trichomonas. Expression of several representative lncRNAs was verified through reverse-transcriptase PCR in different T. vaginalis strains and case studies exclude the use of alternative start codons or stop codon suppression for the genes analysed.ConclusionOur results demonstrate that T. vaginalis expresses thousands of intergenic loci, including numerous transcribed pseudogenes. In contrast to yeast these are expressed independently from neighbouring genes. Our results furthermore illustrate the effect genome duplication events can have on the transcriptome of a protist. The parasite’s genome is in a steady state of changing and we hypothesize that the numerous lncRNAs could offer a large pool for potential innovation from which novel proteins or regulatory RNA units could evolve.Electronic supplementary materialThe online version of this article (doi:10.1186/1471-2164-15-906) contains supplementary material, which is available to authorized users.

show abstract

“…Métodos computacionais para a identificação e classificação de ncRNAs vêm sendo propostos e aprimorados nos últimos anos [63,34,5,95,13]. A tarefa de identificar e classificar ncRNAs é bastante desafiadora, devido a dificuldade de confirmar experimentalmente a função de um ncRNA, pois essa está associada à sua estrutura espacial (estruturas secundária e terciária), o que impede o uso de métodos de predição de genes codificadores de proteínas que usam apenas a informação de sua estrutura primária (sequência de nucleotídeos).…”

Section: Introductionunclassified

“…These computational methods focus on predicting candidates that have to be experimentally confirmed. Identification of ncRNAs have been developed for a variety of organisms [31,13,96,53], with the objective of constructing sets of different classes of ncRNAs. In particular, snoR-NAs [25] are 60 to 300 nt ncRNAs, classified based on their characteristic sequence elements, called boxes, in two main classes: H/ACA box snoRNAs and C/D box snoRNAs.…”

Section: Introductionmentioning

confidence: 99%

Identificação de snoRNAs usando aprendizagem de máquina

Oliveira¹

View full text Add to dashboard Cite

Dedico este trabalho a todos que me ajudaram e me apoiaram nestes 7 anos de UnB. Primeiramente a Deus, pois nunca me deixou na mão, principalmente nestes tempos de mestrado, onde sempre senti sua companhia em todos os momentos. Também dedico a minha família, por todo incentivo a continuar estudando e, por fim, a minha namorada, amigos e professores, especialmente a professora Maria Emília, por sempre acreditar em meu potencial. "Comece fazendo o que é necessário, depois o que é possível, e em breve estarás fazendo o impossível." São Francisco de Assis iv Agradecimentos Agradeço as instituições de ensino que me deram a chance de fazer esta dissertação de mestrado: as universidades de Brasília, de Leipzig e de Freiburg, das quais me deram a chance de ver o mundo com outros olhos, melhorando o meu eu cientista e o meu eu humano. Também agradeço ao grupo de bioinformática da UnB e da Universidade de Freiburg, pela amizade e apoio nestes tempos de pesquisa e muito estudo. Por fim agradeço as pessoas que com pequenos gestos e atitudes me ajudaram a seguir em frente, em especial o Dr. Christian Schulz-Huotari e os professores Fabrizio Costa, Rolf Backofen e Jana Hertel. Obrigado!! v Resumo Métodos de aprendizagem de máquina vêm sendo amplamente usados na identificação e classificação de diferentes famílias de RNAs não-codificadores (ncRNAs). Muitos desses métodos são baseados na aprendizagem supervisionada, onde atributos anteriormente conhecidos, chamados features, são extraídos de uma sequência e usados em um classificador. Nesta dissertação, apresentamos dois métodos para a identificação das duas classes principais de snoRNAs, C/D box e H/ACA box snoRNAs: snoReport 2.0, uma melhoria significativa da primeira versão do snoReport; e o snoRNA-EDeN, um novo método baseado no EDeN, que é um kernel decomposicional de grafos. O snoReport 2.0 é um método que, usando features extraídas de sequências candidatas em genomas, combina predição de estrutura secundária de ncRNAs com Máquina de Vetores de Suporte (Support Vector Machine-SVM), para identificar C/D box e H/ACA box snoRNAs. Seu classificador de H/ACA box snoRNA mostrou um F-score de 93% (uma melhoria de 10% em relação à primeira versão do snoReport), enquanto o classificador de C/D box snoRNA obteve F-score de 94% (melhoria de 14%). Alem disso, ambos os classificadores tiveram todas as medidas de performances acima de 90%. Na fase de validação, o snoReport 2.0 identificou 67,43% dos snoRNAs de vertebrados de ambas as classes. Em Nematóides, o snoReport 2.0 identificou 29,6% dos C/D box snoRNAs e 69% dos H/ACA box snoR-NAs. Para as Drosofilídeas, foram identificados 3,2% dos C/D box snoRNAs e 76,7% dos H/ACA box snoRNAs. Esses resultados mostram que o snoReport 2.0 é eficiente na identificação de snoRNAs em organismos vertebrados, e também para H/ACA box snoRNAs de organismos invertebrados. Por outro lado, em vez de usar features de uma sequência (em geral, difíceis de identificar), uma abordagem recente de aprendizagem de máquina é descrita a seguir. Dada uma região de ...

show abstract

Characterizing ncRNAs in human pathogenic protists using high-throughput sequencing technology

Cited by 6 publications

References 72 publications

Do miRNAs have a deep evolutionary history?

Do miRNAs have a deep evolutionary history?

The parasite Trichomonas vaginalis expresses thousands of pseudogenes and long non-coding RNAs independently from functional neighbouring genes

Identificação de snoRNAs usando aprendizagem de máquina

Contact Info

Product

Resources

About