The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2018
DOI: 10.1029/2018jf004640
|View full text |Cite
|
Sign up to set email alerts
|

Using Machine Learning to Predict Geomorphic Disturbance: The Effects of Sample Size, Sample Prevalence, and Sampling Strategy

Abstract: Advances in data acquisition and statistical methodology have led to growing use of machine‐learning methods to predict geomorphic disturbance events. However, capturing the data required to parameterize these models is challenging because of expense or, more fundamentally, because the phenomenon of interest occurs infrequently. Thus, it is important to understand how the nature of the data used to train predictive models influences their performance. Using a database of cliff failure prediction and associated… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
15
0
1

Year Published

2019
2019
2024
2024

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 26 publications
(16 citation statements)
references
References 58 publications
0
15
0
1
Order By: Relevance
“…However, the random sampling method has some drawbacks. First, this method does not pay attention to the distribution pattern of absence samples, and therefore the absence samples generated are sometimes significantly clustered and do not provide overall information on the entire study area [34,38]. Second, absence samples may be very close to presence locations, resulting in confusion in the model and also increasing an error in the final output [39].…”
Section: Introductionmentioning
confidence: 99%
“…However, the random sampling method has some drawbacks. First, this method does not pay attention to the distribution pattern of absence samples, and therefore the absence samples generated are sometimes significantly clustered and do not provide overall information on the entire study area [34,38]. Second, absence samples may be very close to presence locations, resulting in confusion in the model and also increasing an error in the final output [39].…”
Section: Introductionmentioning
confidence: 99%
“…In this study, a thorough investigation of ANN structures has been carried out for predicting the 28 days CS of FC, which is one of the most important mechanical properties of FC. In addition, it is well-known that the accuracy of the given machine learning algorithm greatly depends on the sampling strategy to construct the model [85,86]. results in [25,47].…”
Section: Discussionmentioning
confidence: 99%
“…In this study, a thorough investigation of ANN structures has been carried out for predicting the 28 days CS of FC, which is one of the most important mechanical properties of FC. In addition, it is well-known that the accuracy of the given machine learning algorithm greatly depends on the sampling strategy to construct the model [85,86]. Therefore, MCS was used in this study to fully analyze the capability of all the C-ANN structures, taking into account such variability of the input space in the training phase of the model.…”
Section: Discussionmentioning
confidence: 99%
“…Current studies have clarified patterns of spatial sensitivity, however temporal forecasts have remained largely empirical [49], [50]. Most ML techniques achieve overall success rates of 75 − 95% [51]. While this may seem very promising, there are issues which remain with data input quality, potential over fitting and inadequate choice of prediction models, introducing unintentional inclusion of redundant or noise variables, and technical limits to predicting only certain types and sizes of the flare event [52], [53], [54].…”
Section: Motivationmentioning
confidence: 99%