2018
DOI: 10.1007/s12046-018-0828-8
|View full text |Cite
|
Sign up to set email alerts
|

Machine transliteration and transliterated text retrieval: a survey

Abstract: Users of the WWW across the globe are increasing rapidly. According to Internet live stats there are more than 3 billion Internet users worldwide today and the number of non-English native speakers is quite high there. A large proportion of these non-English speakers access the Internet in their native languages but use the Roman script to express themselves through various communication channels like messages and posts. With the advent of Web 2.0, user-generated content is increasing on the Web at a very rapi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(2 citation statements)
references
References 89 publications
0
2
0
Order By: Relevance
“…By design, the silver data provides only a single translation for each English NE. However, multiple translations are often correct, due to the variability of morphology, transliteration, naming conventions and dialects (Prabhakar and Pal, 2018). For example, the English NE "Paul" can be aligned to "Pavel" and "Pavla".…”
Section: Silver Evaluationmentioning
confidence: 99%
“…By design, the silver data provides only a single translation for each English NE. However, multiple translations are often correct, due to the variability of morphology, transliteration, naming conventions and dialects (Prabhakar and Pal, 2018). For example, the English NE "Paul" can be aligned to "Pavel" and "Pavla".…”
Section: Silver Evaluationmentioning
confidence: 99%
“…The content written on Facebook , more than 33% comments are written using phonetic text and more than 38% comments are written using code mixed phonetic text (bilingual) [4]. These text do not follow any standard spelling rules, but are based on the pronunciation of the words [5]. So, the development of phonetic dataset(s) is required for the text mining, opinion mining, information retrieval, feedback analysis, business intelligence, data analytics, etc.…”
Section: Introductionmentioning
confidence: 99%