2018
DOI: 10.14419/ijet.v7i2.21.12456
|View full text |Cite
|
Sign up to set email alerts
|

Effective processing of unstructured data using python in Hadoop map reduce

Abstract: In present scenario, the growing data are naturally unstructured. In this case to handle the wide range of data, is difficult. The proposed paper is to process the unstructured text data effectively in Hadoop map reduce using Python. Apache Hadoop is an open source platform and it widely uses Map Reduce framework. Map Reduce is popular and effective for processing the unstructured data in parallel manner. There are two stages in map reduce, namely transform and repository. Here the input splits into small bloc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 2 publications
0
1
0
Order By: Relevance
“…Research related to the use of map-reduce for preprocessing has been carried out to review the algorithmic aspects of parallel processing [25], Scalable Distributed Data Processing [26]- [28], to Effective processing for unstructured data using python [29]. The proposed research uses the python programming language and parallel processing; however, it uses a different kind of pre-processing and algorithm.…”
Section: Introductionmentioning
confidence: 99%
“…Research related to the use of map-reduce for preprocessing has been carried out to review the algorithmic aspects of parallel processing [25], Scalable Distributed Data Processing [26]- [28], to Effective processing for unstructured data using python [29]. The proposed research uses the python programming language and parallel processing; however, it uses a different kind of pre-processing and algorithm.…”
Section: Introductionmentioning
confidence: 99%