embarcadero: Species distribution modelling with Bayesian additive regression trees in R

Carlson, Colin J.

doi:10.1101/774604

Cited by 10 publications

(17 citation statements)

References 29 publications

“…There exists a large suite of algorithms for modelling the distribution of species, but because there is no single 'best' algorithm some authors have reasonably concluded that niche or distribution modelling studies should begin by testing a suite of algorithms for predictive ability under the particular circumstances of the study and choose an algorithm for a particular challenge based on the results of those tests (Qiao et al, 2015). Accordingly, we assessed the relative performance of various categories of SDM algorithms: BIOCLIM (Busby, 1991;Booth et al, 2014), Generalized Linear Models (GLMs, Guisan et al, 2002), MaxLike (Royle, et al, 2012), Random forests (Breiman, 2001), Boosted Regression Trees (Elith et al, 2008), Support Vector Machines (SVMs; Vapnik, 1998), and Bayesian additive regression trees (BART, Carlson, 2020).…”

Section: Modelling Methods (mentioning)

confidence: 99%

“…In computer science, BARTs are used for everything from medical diagnostics to self-driving car algorithms, however they have yet to find widespread application in ecology and in predicting species distributions. Running SDMs with BARTs has recently been greatly facilitated by the development of an R package, 'embarcadero' (Carlson, 2020), including an automated variable selection procedure being highly effective at identifying informative subsets of predictors. Also the package includes methods for generating and plotting partial dependence curves.…”

Section: Analysis of the Environmental Niche Using BARTs (mentioning)

confidence: 99%

Associations Between Habitat Quality and Body Size in the Carpathian-Podolian Land Snail Vestia turgida (Gastropoda, Clausiliidae): Species Distribution Model Selection and Assessment of Performance

Tytar

2021

Species distribution models (SDMs) are generally thought to be good indicators of habitat suitability, and thus of species’ performance. Consequently, SDMs can be validated by checking whether the areas projected to have the greatest habitat quality are occupied by individuals or populations with higher than average fitness. We hypothesized a positive and statistically significant relationship between field-observed body size of the snail V. turgida (Rossmässler, 1836) and modelled habitat suitability, tested this relationship with linear mixed models, and found that indeed, larger individuals tend to occupy high-quality areas, as predicted by the SDMs. However, by testing several SDM algorithms, we found varied levels of performance in terms of expounding this relationship. Marginal R², expressing the variance explained by the fixed terms in the regression models, was adopted as a measure of functional accuracy, and used to rank the SDMs accordingly. In this respect, the Bayesian additive regression trees (BART) algorithm gave the best result, despite the low AUC and TSS. By restricting our analysis to the BART algorithm only, a variety of sets of environmental variables commonly or less used in the construction of SDMs were explored and tested according to their functional accuracy. In this respect, the SDM produced using the ENVIREM data set gave the best result.

“…SDMs were generated by employing Bayesian additive regression trees (BART), a powerful machine learning approach. Running SDMs with BARTs has recently been greatly facilitated by the development of an R package, 'embarcadero' [13], including an automated variable selection procedure being highly effective at identifying informative subsets of predictors. Also the package includes methods for generating and plotting partial dependence curves and visualization called spatial partial dependence plots, which reclassifies predictor rasters based on their partial dependence plots, and show the relative suitability of different regions for an individual covariate.…”

Section: Methods (mentioning)

confidence: 99%

Identifying Environmental Refuges (“Coldspots”) from Infection by Batrachochytrium Dendrobatidis of Amphibians in Eastern Europe

Marushchak

Tytar

Nekrasova

et al. 2021

The 1st International Electronic Conference on Biological Diversity, Ecology and Evolution

Amphibians are the most threatened group of vertebrates. While habitat loss poses the greatest threat to amphibians, a spreading fungal disease caused by Batrachochytrium dendrobatidis (Bd) is seriously affecting an increasing number of species. Although Bd is widely prevalent, there are identifiable heterogeneities in the pathogen's distribution that are linked to environmental parameters. Our objective was to identify conditions that affect the geographic distribution of this pathogen using species distribution models (SDMs), with a special focus on Eastern Europe. SDMs can help identify hotspots for future outbreaks of Bd, but perhaps more importantly identify locations that may be environmental refuges ("coldspots") from infection. In general, climate is considered a major factor driving amphibian disease dynamics, but in particular temperature has received increased attention. Here, 42 environmental raster layers containing data on climate, soil and human impact were used. Mean annual temperature range (or 'continentality') was found to have the strongest constraint on the geographic distribution of this pathogen. Using the partial dependence visualization module in the R package 'embarcadero', a number of corresponding coldspots were identified.

“…In particular, posterior width directly measures model uncertainty (rather than approximating it by permuting training data), and a single model can be run (instead of an ensemble trained on smaller subsets of training data), allowing the model to use the full training dataset all at once.…”

Section: Methods (mentioning)

confidence: 99%

“…This often produces a much more reduced model without going through a stepwise variable selection process, which can be slow and very subject to stochasticity.…”

Section: Methods (mentioning)

confidence: 99%

Optimizing predictive models to prioritize viral discovery in zoonotic reservoirs

Becker

Albery

Sjodin

et al. 2020

Preprint

Self Cite

Coronaviruses are a diverse family of positive-sense, single-stranded RNA viruses, found widely in mammals and birds [1]. They have a broad host range, a high mutation rate, and the largest genomes of any RNA viruses, but they have also evolved mechanisms for RNA proofreading and repair, which help to mitigate the deleterious effects of a high recombination rate acting over a large genome [2]. Consequently, coronaviruses fit the profile of viruses with high zoonotic potential. There are seven human coronaviruses (two in the genus Alphacoronavirus and five in Betacoronavirus), of which three are highly pathogenic in humans: SARS-CoV, SARS-CoV-2, and MERS-CoV. These three are zoonotic and widely agreed to have evolutionary origins in bats [3-6].

Our collective experience with both SARS-CoV and MERS-CoV illustrate the difficulty of tracing specific animal hosts of emerging coronaviruses. During the 2002-2003 SARS epidemic, SARS-CoV was traced to the masked palm civet (Paguma larvata) [7], but the ultimate origin remained unknown for several years. Horseshoe bats (family Rhinolophidae: Rhinolophus) were implicated as reservoir hosts in 2005, but their SARS-like viruses were not identical to circulating human strains [4]. Stronger evidence from 2017 placed the most likely evolutionary origin of SARS-CoV in Rhinolophus ferrumequinum or potentially R. sinicus [8]. Presently, there is even less certainty in the origins of MERS-CoV, although spillover to humans occurs relatively often through contact with dromedary camels (Camelus dromedarius). A virus with 100% nucleotide identity in a ~200 base pair region of the polymerase gene was detected in Taphozous bats (family Emballonuridae) in Saudi Arabia [9]; however, based on spike gene similarity, other sources treat HKU4 virus from Tylonycteris bats (family Vespertilionidae) in China as the closest-related bat virus [10,11]. Several bat coronaviruses have shown close relation to MERS-CoV, with a surprisingly broad geographic distribution from Mexico to China [12,13,14,15].

Coronavirus disease 2019 (COVID-19) is caused by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), a novel virus with presumed evolutionary origins in bats. Although the earliest cases were linked to a wildlife market, contact tracing was limited, and there has been no definitive identification of the wildlife contact that resulted in spillover nor a true "index case." Two bat viruses are closely related to SARS-CoV-2: RaTG13 bat CoV from Rhinolophus affinis (96% identical overall), and RmYN02 bat CoV from Rhinolophus malayanus (97% identical in one gene but only 61% in the receptor-binding domain and with less overall similarity) [6,16]. The divergence time between these bat viruses and human SARS-CoV-2 has been estimated as 40-50 years [17], suggesting that the main host(s) involved in spillover remain unknown. Evidence of viral recombination in pangolins has been proposed but is unresolved [17]. S...
