Multigene and genomic data sets have become commonplace in the field of phylogenetics, but many existing tools are not designed for such data sets, which often makes the analysis time‐consuming and tedious. Here, we present PhyloSuite, a (cross‐platform, open‐source, stand‐alone Python graphical user interface) user‐friendly workflow desktop platform dedicated to streamlining molecular sequence data management and evolutionary phylogenetics studies. It uses a plugin‐based system that integrates several phylogenetic and bioinformatic tools, thereby streamlining the entire procedure, from data acquisition to phylogenetic tree annotation (in combination with iTOL). It has the following features: (a) point‐and‐click and drag‐and‐drop graphical user interface; (b) a workplace to manage and organize molecular sequence data and results of analyses; (c) GenBank entry extraction and comparative statistics; and (d) a phylogenetic workflow with batch processing capability, comprising sequence alignment (mafft and macse), alignment optimization (trimAl, HmmCleaner and Gblocks), data set concatenation, best partitioning scheme and best evolutionary model selection (PartitionFinder and modelfinder), and phylogenetic inference (MrBayes and iq‐tree). PhyloSuite is designed for both beginners and experienced researchers, allowing the former to quick‐start their way into phylogenetic analysis, and the latter to conduct, store and manage their work in a streamlined way, and spend more time investigating scientific questions instead of wasting it on transferring files from one software program to another.
18Multi-gene and genomic datasets have become commonplace in the field of 19 phylogenetics, but many of the existing tools are not designed for such datasets, 20 which makes the analysis time-consuming and tedious. We therefore present 21 PhyloSuite, a user-friendly workflow desktop platform dedicated to streamlining 22 molecular sequence data management and evolutionary phylogenetics studies. It 23 employs a plugin-based system that integrates a number of useful phylogenetic and 24 bioinformatic tools, thereby streamlining the entire procedure, from data acquisition 25 to phylogenetic tree annotation, with the following features: (i) point-and-click and 26 drag-and-drop graphical user interface, (ii) a workspace to manage and organize 27 molecular sequence data and results of analyses, (iii) GenBank entries extraction and 28 comparative statistics, (iv) a phylogenetic workflow with batch processing capability, 29(v) elaborate bioinformatic analysis for mitochondrial genomes. The aim of 30 PhyloSuite is to enable researchers to spend more time playing with scientific 31 questions, instead of wasting it on conducting standard analyses. The compiled binary 32 of PhyloSuite is available under the GPL license at 33 https://github.com/dongzhang0725/PhyloSuite/releases, implemented in Python and 34 runs on Windows, Mac OSX and Linux. 35 36 37 Advancements in next-generation sequencing technologies (Metzker, 2009) have 38 resulted in a huge increase in the amount of genetic data available through public 39 databases. While this opens a multitude of research possibilities, retrieving and 40 managing such large amounts of data may be difficult and time-consuming for 41 researchers who are not computer-savvy. A standard analytical procedure for 42 phylogenetic analysis is: selecting and downloading GenBank entries, extracting 43 target genes (for multi-gene datasets, such as organelle genomes) and/or mining other 44 data, sequence alignment, alignment optimization, concatenation of alignments (for 45 multi-gene datasets), selection of best-fit partitioning schemes and evolutionary 46 models, phylogeny reconstruction, and finally visualization and annotation of the 47 phylogram. This can be very time-consuming if different programs have to be 48 employed for different steps, especially as they often have different input file format 49 requirements, and sometimes even require manual file tweaking. Therefore, 50 multifunctional, workflow-type software packages are becoming increasingly needed 51 by a broad range of evolutionary biologists (Smith, 2015). Specifically, as single-gene 52 datasets are rapidly being replaced by multi-gene or genomic datasets as a tool of 53 choice for phylogenetic reconstruction (Degnan and Rosenberg, 2009; Rivera-Rivera 54 and Montoya-Burgos, 2016), automated gene extraction from genomic data and batch 55 manipulation in some of the above steps, like alignment, are becoming a necessity. 56 Although there are several tools in existence, designed to streamline this process 57 by incorporat...
BackgroundExternal segmentation and internal proglottization are important evolutionary characters of the Eucestoda. The monozoic caryophyllideans are considered the earliest diverging eucestodes based on partial mitochondrial genes and nuclear rDNA sequences, yet, there are currently no complete mitogenomes available. We have therefore sequenced the complete mitogenomes of three caryophyllideans, as well as the polyzoic Schyzocotyle acheilognathi, explored the phylogenetic relationships of eucestodes and compared the gene arrangements between unsegmented and segmented cestodes.ResultsThe circular mitogenome of Atractolytocestus huronensis was 15,130 bp, the longest sequence of all the available cestodes, 14,620 bp for Khawia sinensis, 14,011 bp for Breviscolex orientalis and 14,046 bp for Schyzocotyle acheilognathi. The A-T content of the three caryophyllideans was found to be lower than any other published mitogenome. Highly repetitive regions were detected among the non-coding regions (NCRs) of the four cestode species. The evolutionary relationship determined between the five orders (Caryophyllidea, Diphyllobothriidea, Bothriocephalidea, Proteocephalidea and Cyclophyllidea) is consistent with that expected from morphology and the large fragments of mtDNA when reconstructed using all 36 genes. Examination of the 54 mitogenomes from these five orders, revealed a unique arrangement for each order except for the Cyclophyllidea which had two types that were identical to that of the Diphyllobothriidea and the Proteocephalidea. When comparing gene order between the unsegmented and segmented cestodes, the segmented cestodes were found to have the lower similarities due to a long distance transposition event. All rearrangement events between the four arrangement categories took place at the junction of rrnS-tRNA Arg (P1) where NCRs are common.ConclusionsHighly repetitive regions are detected among NCRs of the four cestode species. A long distance transposition event is inferred between the unsegmented and segmented cestodes. Gene arrangements of Taeniidae and the rest of the families in the Cyclophyllidea are found be identical to those of the sister order Proteocephalidea and the relatively basal order Diphyllobothriidea, respectively.Electronic supplementary materialThe online version of this article (doi:10.1186/s13071-017-2245-y) contains supplementary material, which is available to authorized users.
BackgroundAs the topologies produced by previous molecular and morphological studies were contradictory and unstable (polytomy), evolutionary relationships within the Diplozoidae family and the Monogenea class (controversial relationships among the Discocotylinea, Microcotylinea and Gastrocotylinea suborders) remain unresolved. Complete mitogenomes carry a relatively large amount of information, sufficient to provide a much higher phylogenetic resolution than traditionally used morphological traits and/or single molecular markers. However, their implementation is hampered by the scarcity of available monogenean mitogenomes. Therefore, we sequenced and characterized mitogenomes belonging to three Diplozoidae family species, and conducted comparative genomic and phylogenomic analyses for the entire Monogenea class.ResultsTaxonomic identification was inconclusive, so two of the species were identified merely to the genus level. The complete mitogenomes of Sindiplozoon sp. and Eudiplozoon sp. are 14,334 bp and 15,239 bp in size, respectively. Paradiplozoon opsariichthydis (15,385 bp) is incomplete: an approximately 2000 bp-long gap within a non-coding region could not be sequenced. Each genome contains the standard 36 genes (atp8 is missing). G + T content and the degree of GC- and AT-skews of these three mitogenome (and their individual elements) were higher than in other monogeneans. nad2, atp6 and nad6 were the most variable PCGs, whereas cox1, nad1 and cytb were the most conserved. Mitochondrial phylogenomics analysis, conducted using concatenated amino acid sequences of all PCGs, indicates that evolutionary relationships of the three genera are: (Eudiplozoon, (Paradiplozoon, Sindiplozoon)); and of the three suborders: (Discocotylinea, (Microcotylinea, Gastrocotylinea)). These intergeneric relationships were also supported by the skewness and principal component analyses.ConclusionsOur results show that nad2, atp6 and nad6 (fast-evolving) would be better candidates than cox1 (slow-evolving) for species identification and population genetics studies in Diplozoidae. Nucleotide bias and codon and amino acid usage patterns of the three diplozoid mitogenomes are more similar to cestodes and trematodes than to other monogenean flatworms. This unusual mutational bias was reflected in disproportionately long branches in the phylogram. Our study offsets the scarcity of molecular data for the subclass Polyopisthocotylea to some extent, and might provide important new insights into the evolutionary history of the three genera and three suborders.Electronic supplementary materialThe online version of this article (10.1186/s12862-018-1249-3) contains supplementary material, which is available to authorized users.
Background Paratetraonchoides inermis (Monogenea: Tetraonchoididae) is a flatworm parasitising the gills of uranoscopid fishes. Its morphological characteristics are ambiguous, and molecular data have never been used to study its phylogenetic relationships, which makes its taxonomic classification controversial. Also, several decades of unsuccessful attempts to resolve the relationships within the Monogenea present a strong indication that morphological datasets may not be robust enough to be used to infer evolutionary histories. As the use of molecular data is currently severely limited by their scarcity, we have sequenced and characterized the complete mitochondrial (mt) genome of P. inermis. To investigate its phylogenetic position, we performed phylogenetic analyses using Bayesian inference and maximum likelihood approaches using concatenated amino acid sequences of all 12 protein-coding genes on a dataset containing all available monogenean mt genomes.ResultsThe circular mt genome of P. inermis (14,654 bp) contains the standard 36 genes: 22 tRNAs, two rRNAs, 12 protein-encoding genes (PCGs; Atp8 is missing) and a major non-coding region (mNCR). All genes are transcribed from the same strand. The A + T content of the whole genome (82.6%), as well as its elements, is the highest reported among the monogeneans thus far. Three tRNA-like cloverleaf structures were found in mNCR. Several results of the phylogenomic analysis are in disagreement with previously proposed relationships: instead of being closely related to the Gyrodactylidea, Tetraonchidea exhibit a phylogenetic affinity with the Dactylogyridea + Capsalidea clade; and the order Capsalidea is neither basal within the subclass Monopisthocotylea, nor groups with the Gyrodactylidea, but instead forms a sister clade with the Dactylogyridea. The mt genome of P. inermis exhibits a unique gene order, with an extensive reorganization of tRNAs. Monogenea exhibit exceptional gene order plasticity within the Neodermata.ConclusionsThis study shows that gene order within monopisthocotylid mt genomes is evolving at uneven rates, which creates misleading evolutionary signals. Furthermore, our results indicate that all previous attempts to resolve the evolutionary history of the Monogenea may have produced at least partially erroneous relationships. This further corroborates the necessity to generate more molecular data for this group of parasitic animals.Electronic supplementary materialThe online version of this article (10.1186/s13071-017-2404-1) contains supplementary material, which is available to authorized users.
BackgroundRecent mitochondrial phylogenomics studies have reported a sister-group relationship of the orders Capsalidea and Dactylogyridea, which is inconsistent with previous morphology- and molecular-based phylogenies. As Dactylogyridea mitochondrial genomes (mitogenomes) are currently represented by only one family, to improve the phylogenetic resolution, we sequenced and characterized two dactylogyridean parasites, Lamellodiscus spari and Lepidotrema longipenis, belonging to a non-represented family Diplectanidae.ResultsThe L. longipenis mitogenome (15,433 bp) contains the standard 36 flatworm mitochondrial genes (atp8 is absent), whereas we failed to detect trnS1, trnC and trnG in L. spari (14,614 bp). Both mitogenomes exhibit unique gene orders (among the Monogenea), with a number of tRNA rearrangements. Both long non-coding regions contain a number of different (partially overlapping) repeat sequences. Intriguingly, these include putative tRNA pseudogenes in a tandem array (17 trnV pseudogenes in L. longipenis, 13 trnY pseudogenes in L. spari). Combined nucleotide diversity, non-synonymous/synonymous substitutions ratio and average sequence identity analyses consistently showed that nad2, nad5 and nad4 were the most variable PCGs, whereas cox1, cox2 and cytb were the most conserved. Phylogenomic analysis showed that the newly sequenced species of the family Diplectanidae formed a sister-group with the Dactylogyridae + Capsalidae clade. Thus Dactylogyridea (represented by the Diplectanidae and Dactylogyridae) was rendered paraphyletic (with high statistical support) by the nested Capsalidea (represented by the Capsalidae) clade.ConclusionsOur results show that nad2, nad5 and nad4 (fast-evolving) would be better candidates than cox1 (slow-evolving) for species identification and population genetics studies in the Diplectanidae. The unique gene order pattern further suggests discontinuous evolution of mitogenomic gene order arrangement in the Class Monogenea. This first report of paraphyly of the Dactylogyridea highlights the need to generate more molecular data for monogenean parasites, in order to be able to clarify their relationships using large datasets, as single-gene markers appear to provide a phylogenetic resolution which is too low for the task.Electronic supplementary materialThe online version of this article (10.1186/s13071-018-3144-6) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.