2018
DOI: 10.1007/978-3-319-77113-7_42
|View full text |Cite
|
Sign up to set email alerts
|

Language Technology for Digital Linguistics: Turning the Linguistic Survey of India into a Rich Source of Linguistic Information

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 5 publications
0
3
0
Order By: Relevance
“…A recent initiative to digitise old linguistic data is the digitisation of the Linguistic Survey of India (Grierson, 1903(Grierson, -1928 under the project South Asia as a linguistic area? Exploring big-data methods in areal and genetic linguistics (Borin et al, 2020(Borin et al, , 2018(Borin et al, , 2014. Using OCR and subsequent information extraction from the text, Borin et al have shown that "old" data still has much to tell for the computational study of typology and comparative linguistics.…”
Section: Text Digitisation and Ocrmentioning
confidence: 99%
“…A recent initiative to digitise old linguistic data is the digitisation of the Linguistic Survey of India (Grierson, 1903(Grierson, -1928 under the project South Asia as a linguistic area? Exploring big-data methods in areal and genetic linguistics (Borin et al, 2020(Borin et al, , 2018(Borin et al, , 2014. Using OCR and subsequent information extraction from the text, Borin et al have shown that "old" data still has much to tell for the computational study of typology and comparative linguistics.…”
Section: Text Digitisation and Ocrmentioning
confidence: 99%
“…Previously, a few experimental techniques and associated systems have been reported for automatic extraction of typological information. In (Borin et al, 2018;Virk et al, 2017), the authors have reported on simple pattern matching and syntactic parsing based systems. The systems have modest accuracy and recall and are very restricted with respect to the number of features they can target.…”
Section: Related Workmentioning
confidence: 99%
“…A small corpus consisting of descriptive grammars of the natural languages spoken in South Asia was reported in (Borin et al, 2018), and a set of documents from that corpus annotated with LingFN frames was reported in (Virk et al, 2019). Annotation of a descriptive grammars with LingFN frames involve identification of lexical units and selection of appropriate linguistic semantic frames and their frame elements.…”
Section: Datamentioning
confidence: 99%