The mustard family (Brassicaceae) is a scientifically and economically important family, containing the model plant Arabidopsis thaliana and numerous crop species that feed billions worldwide. Despite its relevance, most published family phylogenies are incompletely sampled, generally contain massive polytomies, and/or show incongruent topologies between datasets. Here, we present the most complete Brassicaceae genus-level family phylogenies to date (Brassicaceae Tree of Life, or BrassiToL) based on nuclear (>1,000 genes, almost all 349 genera and 53 tribes) and plastome (60 genes, 79% of the genera, all tribes) data. We found cytonuclear discordance between nuclear and plastome-derived phylogenies, which is likely a result of rampant hybridisation among closely and more distantly related species, and highlight rogue taxa. To evaluate the impact of this rampant hybridisation on the nuclear phylogeny reconstruction, we performed four different sampling routines that increasingly removed variable data and likely paralogs. Our resulting cleaned subset of 297 nuclear genes revealed high support for the tribes, while support for the main lineages remained relatively low. Calibration based on the 20 most clock-like nuclear genes suggests a late Eocene to late Oligocene icehouse origin of the family. Finally, we propose five new or re-established tribes, including the recognition of Arabidopsideae, a monotypic tribe to accommodate Arabidopsis. With a worldwide community of thousands of researchers working on this family, our new, densely sampled family phylogeny will be an indispensable tool to further highlight Brassicaceae as an excellent model family for studies on biodiversity and plant biology.
We provide a quantitative description of the French national herbarium vascular plants collection dataset. Held at the Muséum national d'histoire naturelle, Paris, it currently comprises records for 5,400,000 specimens, representing 90% of the estimated total of specimens. Ninety nine percent of the specimen entries are linked to one or more images and 16% have field-collecting information available. This major botanical collection represents the results of over three centuries of exploration and study. The sources of the collection are global, with a strong representation for France, including overseas territories, and former French colonies. The compilation of this dataset was made possible through numerous national and international projects, the most important of which was linked to the renovation of the herbarium building. The vascular plant collection is actively expanding today, hence the continuous growth exhibited by the dataset, which can be fully accessed through the GBIF portal or the MNHN database portal (available at: https://science.mnhn.fr/institution/mnhn/collection/p/item/search/form). This dataset is a major source of data for systematics, global plants macroecological studies or conservation assessments.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.