2011
DOI: 10.1007/s10579-011-9165-9

A survey of methods to ease the development of highly multilingual text mining applications

Abstract: Multilingual text processing is useful because the information content found in different languages is complementary, both regarding facts and opinions. While Information Extraction and other text mining software can, in principle, be developed for many languages, most text analysis tools have only been applied to small sets of languages because the development effort per language is large. Self-training tools obviously alleviate the problem, but even the effort of providing training data and of manually tunin…

Cited by 43 publications
(33 citation statements)
References 14 publications
“…The art of sharing meaningful information with the help of uncommon and meaningless data is truly a good thing. Text mining technique as described by [45] examines the content for extracting the meaningful data which can be used for particular purposes. It looks like text mining that is going to include the overall NLP scheme [46] in its system in order to effectively examine the human language and to structure the unstructured data patterns accordingly.…”
Section: Text Mining Efforts In Resolving Various NLP Issues (mentioning)
confidence: 99%
“…Precision and recall process is used to check and evaluate the relevance of results on the extracted data. In-depth and complete information about the relevant field is required to perform information extraction process to attain more relevant results [19].…”
Section: A. Information Extraction (mentioning)
confidence: 99%
“…There is a close relationship in text mining and information retrieval for textual data. In IR systems, different algorithms are used to track the user's behavior and search relevant data accordingly [19]. Google and Yahoo search engines are using information retrieval system more frequently to extract relevant documents according to a phrase on Web.…”
Section: B. Information Retrieval (mentioning)
confidence: 99%
“…A major disadvantage arises, however, for the end-user wishing to process a genuine multilingual corpus such as news feed. For most languages, efficient components will be lacking [13]. In recent years, machine learning was successfully used to fill gaps when one can find sufficient training data in a language which has enough common properties with the new one [11].…”
Section: Related Work (mentioning)
confidence: 99%