Computer Science &Amp; Information Technology ( CS &Amp; IT ) 2014
DOI: 10.5121/csit.2014.4910
|View full text |Cite
|
Sign up to set email alerts
|

An Effective Tokenization Algorithm for Information Retrieval Systems

Abstract: ABSTRACT

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
19
0
2

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(21 citation statements)
references
References 10 publications
0
19
0
2
Order By: Relevance
“…[1] Ontologies are metadata schemas, providing a controlled vocabulary of concepts, each with an explicitly defined and machine process able semantics [1] [7], by defining shared and common domain theories, ontology helps both people and machines to communicate precisely to support the exchange of semantics. Ontology language editors help to build semantic web [3][4] [5]. Hence, the contemptible and efficient construction of domain specific ontology is crucial for the success of many data processing systems.…”
Section: Introductionmentioning
confidence: 99%
“…[1] Ontologies are metadata schemas, providing a controlled vocabulary of concepts, each with an explicitly defined and machine process able semantics [1] [7], by defining shared and common domain theories, ontology helps both people and machines to communicate precisely to support the exchange of semantics. Ontology language editors help to build semantic web [3][4] [5]. Hence, the contemptible and efficient construction of domain specific ontology is crucial for the success of many data processing systems.…”
Section: Introductionmentioning
confidence: 99%
“…The integration results into creation of merged node and best position for newly created will be based on highest values of frequency among participating concept. Precision is a value in the range [0, 1]; the higher the value, the fewer wrong merging computed [5] [11]. Precision is determines as the ration of number of correct found alignment/ matching with total number of alignment found.…”
Section: Proposed Procedures and Examplementioning
confidence: 99%
“…[1] Ontologies are metadata schemas, providing a controlled vocabulary of concepts, each with an explicitly defined and machine process able semantics [1][7], by defining shared and common domain theories, ontology helps both people and machines to communicate precisely to support the exchange of semantics. Ontology language editors help to build semantic web [3][4] [5]. Hence, the contemptible and efficient construction of domain specific ontology is crucial for the success of many data processing systems.…”
Section: Introductionmentioning
confidence: 99%
“…These words, numbers, symbols and other characters distinguished by tokenization called tokens. [4].…”
Section: Introductionmentioning
confidence: 99%
“…Stop word removing is substantial in the preprocessing, it has some advantages like reducing the size of stored data set and it improves the overall efficiency and effectiveness of the analysis system. [4]. The proposed system uses a list of stop words obtained from Onix Text Retrieval Toolkit website [5].…”
Section: Introductionmentioning
confidence: 99%