2016 International Conference on Advances in Computing and Communication Engineering (ICACCE) 2016
DOI: 10.1109/icacce.2016.8073765

Text-based language identification for some of the under-resourced languages of South Africa

Cited by 7 publications (3 citation statements)
References 13 publications

“…Combining signals form other system in addition to acoustics is a good way to further boost the performance of LangID accuracy, such as text-based features, language model features [14,15]. In this work, we have tried both lattice based methods and neural network method to combine text-based semantic features and acoustic features to improve the accuracy of language identification.…”
Section: Related Work (mentioning)
confidence: 99%
“…Multiple papers have proposed hierarchical stacked classifiers (including lexicons) that would for example first classify a piece of text by language group and then by exact language [19,20,9,1]. Some work has also been done on classifying surnames between Tshivenda, Xitsonga and Sepedi [21]. Additionally, data augmentation [22] and adversarial training [23] approaches are potentially very useful to reduce the requirement for data.…”
Section: Introduction (mentioning)
confidence: 99%
“…Language Identification (LID) plays a critical role in the expansive field of computational linguistics, essential for interpreting multilingual texts [5], [6]. This task assumes greater significance in today's interconnected global environment, particularly in linguistically diverse regions such as India, with its array of languages, scripts, and dialects [2], [3].…”
mentioning
confidence: 99%