2018
DOI: 10.1007/s12539-018-0312-5
|View full text |Cite
|
Sign up to set email alerts
|

Bioinformatics Protocols for Quickly Obtaining Large-Scale Data Sets for Phylogenetic Inferences

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
6
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
6
2
1

Relationship

3
6

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 17 publications
0
6
0
Order By: Relevance
“…To find and extract protein encoding segments larger than 100 bp, we have used getorf, using the emboss Docker image available at pegi3s Bioinformatics Docker images project (htttps:pegi3s.github.io/dockerfiles). Then, we selected the protein encoding segments that show similarity with reference S-RNase sequences, using tblastx (Expect value (e) < 0.05), as implemented in SEDA 58 , 59 , using as query Prunus S3-RNase (AJ298312), Malus Sh-RNase (AB032247), and Fragaria putative S-RNase (gi561957436, gi561674690 and gi561985884) 24 . Based on this information, we manually annotated the corresponding genome region to identify the exons of each gene.…”
Section: Methodsmentioning
confidence: 99%
“…To find and extract protein encoding segments larger than 100 bp, we have used getorf, using the emboss Docker image available at pegi3s Bioinformatics Docker images project (htttps:pegi3s.github.io/dockerfiles). Then, we selected the protein encoding segments that show similarity with reference S-RNase sequences, using tblastx (Expect value (e) < 0.05), as implemented in SEDA 58 , 59 , using as query Prunus S3-RNase (AJ298312), Malus Sh-RNase (AB032247), and Fragaria putative S-RNase (gi561957436, gi561674690 and gi561985884) 24 . Based on this information, we manually annotated the corresponding genome region to identify the exons of each gene.…”
Section: Methodsmentioning
confidence: 99%
“…This approach will retrieve nucleotide sequences showing similarity at the amino acid level with the SLF reference sequences beyond the F-box motif. To the obtained dataset we added the predicted CDSs from the annotated genomes, and removed identical sequences, after merging the headers, using SEDA 4 (López-Fernández et al, 2019). As before, sequences were aligned using Muscle and a Bayesian phylogenetic tree was obtained using ADOPS (Reboiro-Jato et al, 2012).…”
Section: Methodsmentioning
confidence: 99%
“…To obtain phylogenies representative of the virilis phylad, we started by downloading all the D. virilis coding sequences (CDS) available at FlyBase ( ftp://ftp.flybase.net/genomes/Drosophila_virilis/current/fasta/ ). We further used SEDA 71 to retrieve only those CDS of genes located on scaffolds anchored to chromosomes (Muller E: scaffolds 12,822; 13,047; 12,855 and 12,954; Muller B: scaffolds 13,246; 12,963 and 12,723 and Muller C: scaffolds 12,823; 10,324; 12,875 and 13,324). When more than one isoform was available for the same gene only the longest one was retrieved.…”
Section: Methodsmentioning
confidence: 99%