Cation exchange capacity (CEC) have diverse applications from soil classification and management to agricultural/environmental simulations. Direct measurement of CEC is difficult. The objective of this study is to investigate the effect of calibration dataset size on accuracy and validity of soil CEC pedotransfer functions (PTFs) derived for selected arid–semi arid soils. Furthermore, the effect of the type of inputs on performance of the PTFs is evaluated. The soil organic carbon content along with either soil sand, silt, and clay fractions or dg are used as input variables. A large dataset (n = 1141) and another independent dataset (n = 205) are applied to develop and evaluate the validity of PTFs, respectively. Multiple linear regression analysis is used to correlate soil CEC with the inputs. To investigate the database‐size effect, random subsets with different sizes (n = 50, 100, 200, 300, 500, 700, 900) from the whole dataset are used for the development of PTFs. The accuracy of the PTFs decreased to some extent and then remained relatively constant, but their reliability increased with increasing the dataset size. Clay has the strongest correlation with CEC (r = 0.41 to 0.59, depending on the dataset size), so that using clay content resulted in more accurate PTFs than dg. The critical value of CEC is found to be about 15–16 cmolc kg−1, below which the PTFs overestimate CEC. The results support the hypothesis that the number of samples have to be large enough (≥500) to develop both accurate and reliable PTFs.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.