BackgroundMolecular taxonomy studies and barcoding projects can provide rapid means of detecting cryptic diversity. Nevertheless, the use of molecular data for species delimitation should be undertaken with caution. Especially the single-gene approaches are linked with certain pitfalls for taxonomical inference. In the present study, recent and historical species descriptions based upon morphology were used as primary species hypotheses, which were then evaluated with molecular data (including in type and historical museum material) to form secondary species hypotheses. As an example of cryptic diversity and taxonomic controversy, the European Phoxinus phoxinus species complex was used.ResultsThe results of the revision showed that of the fourteen primary species hypotheses, three were rejected, namely P. ketmaieri, P. likai, and P. apollonicus. For three species (P. strandjae, P. strymonicus, P. morella), further investigation with increased data sampling was suggested, while two primary hypotheses, P. bigerri and P. colchicus, were supported as secondary species hypotheses. Finally, six of the primary species hypotheses (P. phoxinus, P. lumaireul, P. karsticus, P. septimanae, P. marsilii and P. csikii) were well supported by mitochondrial but only limitedly corroborated by nuclear data analysis.ConclusionThe approach has proven useful for revision of species complexes, and the study can serve as an overview of the Phoxinus genus in Europe, as well as a solid basis for further work.Electronic supplementary materialThe online version of this article (doi:10.1186/s12862-017-1032-x) contains supplementary material, which is available to authorized users.
BackgroundThere is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods for diagnosing genetic disease starting from clinical case history and genome sequencing data. DNA samples were obtained from three families with heritable genetic disorders and genomic sequence data were donated by sequencing platform vendors. The challenge was to analyze and interpret these data with the goals of identifying disease-causing variants and reporting the findings in a clinically useful format. Participating contestant groups were solicited broadly, and an independent panel of judges evaluated their performance.ResultsA total of 30 international groups were engaged. The entries reveal a general convergence of practices on most elements of the analysis and interpretation process. However, even given this commonality of approach, only two groups identified the consensus candidate variants in all disease cases, demonstrating a need for consistent fine-tuning of the generally accepted methods. There was greater diversity of the final clinical report content and in the patient consenting process, demonstrating that these areas require additional exploration and standardization.ConclusionsThe CLARITY Challenge provides a comprehensive assessment of current practices for using genome sequencing to diagnose and report genetic diseases. There is remarkable convergence in bioinformatic techniques, but medical interpretation and reporting are areas that require further development by many groups.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.