2014
DOI: 10.1155/2014/646497
|View full text |Cite
|
Sign up to set email alerts
|

Meteorological Data Analysis Using MapReduce

Abstract: In the atmospheric science, the scale of meteorological data is massive and growing rapidly. K-means is a fast and available cluster algorithm which has been used in many fields. However, for the large-scale meteorological data, the traditional K-means algorithm is not capable enough to satisfy the actual application needs efficiently. This paper proposes an improved MK-means algorithm (MK-means) based on MapReduce according to characteristics of large meteorological datasets. The experimental results show tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
7
2
1

Relationship

0
10

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 11 publications
0
8
0
Order By: Relevance
“…Wei Fang, V.S. Sheng, Xue Zhi Wen and Wubin Pan [14] designed and implemented SVM classification technique using Hadoop and MapReduce framework to predict the rainfall from large amount data. They utilized feature selection and reduction algorithm associated with the dataset.…”
Section: Related Workmentioning
confidence: 99%
“…Wei Fang, V.S. Sheng, Xue Zhi Wen and Wubin Pan [14] designed and implemented SVM classification technique using Hadoop and MapReduce framework to predict the rainfall from large amount data. They utilized feature selection and reduction algorithm associated with the dataset.…”
Section: Related Workmentioning
confidence: 99%
“…This is because there will be one entry per block which is 1000x3 (number of blocks x replication factor) and one entry per file which is 1000. Each block or file entry will take about 150bytes [6][7] [8].…”
Section: Small File Problemmentioning
confidence: 99%
“…Fang et al [36] used large-scale meteorological datasets and an MK-means algorithm applied through MapReduce to manage meteorological information stored in high-cost servers over several years. Optimal K-centroids were calculated using the standard distance measure of similarity and the Euclidean distance.…”
Section: Related Workmentioning
confidence: 99%