2014 Fifth International Conference on Computing for Geospatial Research and Application 2014
DOI: 10.1109/com.geo.2014.8

An Efficient Technique for Searching Very Large Files with Fuzzy Criteria Using the Pigeonhole Principle

Abstract: Big Data is the new term for the exponential growth of data on the Internet. The importance of Big Data lies not in how large it is, but in what information can be obtained from analyzing it. Such analysis would help many businesses make smarter decisions and reduce time and cost. To carry out such analysis, one needs to search the large files that make up Big Data. Big Data is such a construction where sequential search is prohibitively inefficient, in terms of time and ener…
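
The title names the pigeonhole principle as the basis for the fuzzy search, but the abstract excerpt above is cut off before it describes the construction. The sketch below is therefore not the paper's algorithm. It only illustrates one common way the principle is applied to Hamming-distance search: if two bit strings differ in at most k positions and each is split into k + 1 blocks, at least one block must match exactly, so exact lookups on blocks can replace a sequential scan. The function names (split_blocks, build_index, fuzzy_search), the integer encoding of records, and the in-memory dictionary index are all assumptions made for illustration.

```python
from collections import defaultdict


def split_blocks(code: int, n_bits: int, n_blocks: int) -> list[int]:
    """Split an n_bits-wide integer into n_blocks contiguous bit fields."""
    base, extra = divmod(n_bits, n_blocks)
    blocks, shift = [], 0
    for i in range(n_blocks):
        # The first `extra` blocks are one bit wider so widths sum to n_bits.
        width = base + (1 if i < extra else 0)
        blocks.append((code >> shift) & ((1 << width) - 1))
        shift += width
    return blocks


def build_index(codes: list[int], n_bits: int, max_dist: int) -> dict:
    """Map (block position, block value) to the records holding that block."""
    n_blocks = max_dist + 1  # pigeonhole: k mismatches cannot spoil k + 1 blocks
    index = defaultdict(list)
    for pos, code in enumerate(codes):
        for i, block in enumerate(split_blocks(code, n_bits, n_blocks)):
            index[(i, block)].append(pos)
    return index


def fuzzy_search(query: int, codes: list[int], index: dict,
                 n_bits: int, max_dist: int) -> list[int]:
    """Return positions of records within max_dist bit flips of the query."""
    n_blocks = max_dist + 1
    candidates = set()
    for i, block in enumerate(split_blocks(query, n_bits, n_blocks)):
        candidates.update(index.get((i, block), ()))
    # Candidates share one exact block; verify the full Hamming distance.
    return sorted(p for p in candidates
                  if bin(codes[p] ^ query).count("1") <= max_dist)
```

The index must be built for the same max_dist that is later used in the query, since that value fixes the number of blocks. Only records sharing at least one exact block with the query are verified, which is where the saving over a sequential scan would come from in this illustrative setting.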

Cited by 11 publications (17 citation statements)
References 11 publications
“…Thus, the whole gene can be clustered using our efficient clustering algorithm (Fig.1). Nevertheless, when performing the exact or best match search, the system would offer multiple answers following fuzzy search algorithm that is discussed in [10]. Therefore, combining the result of clustering schemes and assigning the relevance in terms of Hamming metric yields high accuracy and efficiency that are needed to discover mutations, alignment and perform other analysis functionalities.…”
Section: Ensemble Methods and Best Match Search
mentioning
confidence: 99%
“…Clusters are essential components in our classification and prediction methodology due to its ability to discover the connected components of patients [12]. Because fuzziness is one of the most salient features of the "Big Data" concept, underlying relationships can be detected by using Golay code clustering technique.…”
Section: The Structure Of Clusters
mentioning
confidence: 99%
“…Text classification is a major challenge in many domains and fields for researchers. Information retrieval systems [33] and search engine [34,35] applications commonly make use of text classification methods. Extending from these applications, text classification could also be used for applications such as information filtering (e.g., email and text message spam filtering) [36].…”
mentioning
confidence: 99%