2021
DOI: 10.1088/1757-899x/1090/1/012053
|View full text |Cite
|
Sign up to set email alerts
|

Data Mining preparation: Process, Techniques and Major Issues in Data Analysis

Abstract: Data preparation is an essential stage in data analysis. Many institutions or companies are interested in converting data into pure forms that can be used for scientific and profit purposes. It helps you set goals regarding system capabilities and features or the benefits your company expects from its investment. This purpose creates an immediate need to review and prepare the data to clean the raw data. In this paper, we highlight the importance of data preparation in data analysis and data extraction techniq… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
14
0
3

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 22 publications
(17 citation statements)
references
References 8 publications
0
14
0
3
Order By: Relevance
“…These techniques, still widely used today are often referred to as the traditional approaches to data integration and serve as core components of important data management projects such as building data warehouses and synchronizing data between applications [40]. The various scenarios of data integration across these numerous projects enable them to be mapped into three fundamental approaches namely consolidation, propagation and federation [41]. Data consolidation involves a wholesale transfer of data from one or more systems to another and is usually applied in business intelligence for centralizing data from multiple systems into a single data warehouse.…”
Section: Multidata Integration Techniquesmentioning
confidence: 99%
“…These techniques, still widely used today are often referred to as the traditional approaches to data integration and serve as core components of important data management projects such as building data warehouses and synchronizing data between applications [40]. The various scenarios of data integration across these numerous projects enable them to be mapped into three fundamental approaches namely consolidation, propagation and federation [41]. Data consolidation involves a wholesale transfer of data from one or more systems to another and is usually applied in business intelligence for centralizing data from multiple systems into a single data warehouse.…”
Section: Multidata Integration Techniquesmentioning
confidence: 99%
“…Sebagai bentuk teknik ekstraksi data, data mining memiliki kemampuan untuk menemukan pola (pattern) yang tersembunyi guna menghasilkan pengetahuan baru dalam suatu kumpulan data yang bahkan abstrak sebelumnya [16], [17]. Secara khusus pendekatan data mining memiliki teknik-teknik tertentu berdasarkan tujuan pemanfaatan ekstraksi data, baik untuk kebutuhan estimasi, prediksi, klasifikasi, pengelompokan, maupun asosiasi [18]. Teknik data mining tidak Langkah pertama yang dilakukan adalah menentukan berapa banyak cluster yang akan dibuat, selanjutnya disebut sebagai nilai k. Tahap selanjutnya adalah menentukan nilai centroid (pusat cluster awal) yang diperoleh dari data set secara acak sejumlah nilai k. Selanjutnya menghitung jarak setiap dataset dengan masing-masing centroid.…”
Section: Data Miningunclassified
“…• Data Mining: To extract useful patterns, techniques and tools must be applied at this point in the process. Classification, clustering, regression, and other techniques are all part of data mining algorithms [47].…”
Section: Figurementioning
confidence: 99%