Along with the rapid development of science and technology information in contemporary society, massive data has become a common phenomenon in information processing in various industries, and various data quality problems have also followed. Among them, data loss is a common problem. In the process of oilfield production, the dynamic data of production wells is increasing continuously every day, and data missing problems often occur. Aiming at the missing data, this paper proposes an improved data filling algorithm SKNN based on KNN. Based on the KNN algorithm, the algorithm uses the SMOTE algorithm to randomly generate the fill data range, and uses the multi-filling idea to find the weighted average as the fill data. The experiment with the production data of a well area in Daqing oilfield as the sample set verified that the SKNN algorithm could not only fill the data, but also improve the accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.