Proceedings of the Forum for Information Retrieval Evaluation on - FIRE '14 2015
DOI: 10.1145/2824864.2824873
|View full text |Cite
|
Sign up to set email alerts
|

Mixed-script query labelling using supervised learning and ad hoc retrieval using sub word indexing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 2 publications
0
6
0
Order By: Relevance
“…Gambäck (2013, 2014) used the language tags of the previous three words with an SVM. Mukherjee et al (2014) used language labels of surrounding words with NB. King et al (2015) used the language probabilities of the previous word to determining weights for languages.…”
Section: Statistics Of Words Van Der Lee and Boschmentioning
confidence: 99%
“…Gambäck (2013, 2014) used the language tags of the previous three words with an SVM. Mukherjee et al (2014) used language labels of surrounding words with NB. King et al (2015) used the language probabilities of the previous word to determining weights for languages.…”
Section: Statistics Of Words Van Der Lee and Boschmentioning
confidence: 99%
“…For the reasons explained in Section 5.2, we are unable to directly compare to the systems that participated in the FIRE 2014 shared task. The best reported F-score results on the deromanization of transliterated search subtask were 7.3% for Bengali (Gupta et al, 2014a) and 30.4% for Hindi (Mukherjee et al, 2014). We attribute the superior results of our system to its ability to handle spelling variations found in romanized codemixed texts.…”
Section: Sequence Predictionmentioning
confidence: 78%
“…They fine-tuned their system for those languages and performed very well in the respective language tracks. Two teams (Asterish [33] and BITS-Lipyantaran [27]) used Google transliteration API for Hindi, and they achieved the highest TF scores. The teams which used machine learning on token-based and n-gram features have higher labeling accuracy than the teams which only relied on dictionaries and rules.…”
Section: Submissionsmentioning
confidence: 99%
“…In Ad hoc@MSIR'14, we received seven runs and we observed that the two runs from BITS-Lipyantran [27] performs best across all the metrics. Table 9 presents the results of the seven runs received.…”
Section: Submissionsmentioning
confidence: 99%