<scp>ParsiNLU</scp>: A Suite of Language Understanding Challenges for Persian

Khashabi, Daniel; Cohan, Arman; Shakeri, Siamak; Hosseini, Pedram; Pezeshkpour, Pouya; Alikhani, Malihe; Aminnaseri, Moin; Bitaab, Marzieh; Brahman, Faeze; Ghazarian, Sarik; Gheini, Mozhdeh; Kabiri, Arman; Mahabagdi, Rabeeh Karimi; Memarrast, Omid; Mosallanezhad, Ahmadreza; Noury, Erfan; Raji, Shahab; Rasooli, Mohammad Sadegh; Sadeghi, Sepideh; Azer, Erfan Sadeqi; Samghabadi, Niloofar Safi; Shafaei, Mahsa; Sheybani, Saber; Tazarv, Ali; Yaghoobzadeh, Yadollah

doi:10.1162/tacl_a_00419

Cited by 8 publications

(6 citation statements)

References 53 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In order to study if ARMAN works well as a language model, we tested our models in Natural Language Understanding (NLU) tasks. According to Khashabi et al (2020), we selected multiple-choice question-answering, textual entailment, sentiment analysis, and question paraphrasing tasks to examine our models' performance on them. For more information about these tasks and datasets, see Appendix A and Khashabi et al (2020).…”

Section: Nlu Resultsmentioning

confidence: 99%

See 1 more Smart Citation

ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization

Salemi¹,

Kebriaei²,

Minaei³

et al. 2021

Preprint

View full text Add to dashboard Cite

ive text summarization is one of the areas influenced by the emergence of pre-trained language models. Current pre-training works in abstractive summarization give more points to the summaries with more words in common with the main text and pay less attention to the semantic similarity between generated sentences and the original document. We propose ARMAN, a Transformer-based encoderdecoder model pre-trained with three novel objectives to address this issue. In ARMAN, salient sentences from a document are selected according to a modified semantic score to be masked and form a pseudo summary. To summarize more accurately and similar to human writing patterns, we applied modified sentence reordering. We evaluated our proposed models on six downstream Persian summarization tasks. Experimental results show that our proposed model achieves state-of-the-art performance on all six summarization tasks measured by ROUGE and BERTScore. Our models also outperform prior works in textual entailment, question paraphrasing, and multiple choice question answering. Finally, we established a human evaluation and show that using the semantic score significantly improves summarization results.

show abstract

Section: Nlu Resultsmentioning

confidence: 99%

“…ParsiNLU (Khashabi et al, 2020) is a collection of NLU tasks for the Persian language including Textual Entailment, Sentiment Analysis, Question Paraphrasing, Multiple Choice Question Answering, and Reading Comprehension tasks. We have fine-tuned our models on most of them to test their performances on NLU tasks.…”

Section: Downstream Datasetsmentioning

confidence: 99%

ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization

Salemi¹,

Kebriaei²,

Minaei³

et al. 2021

Preprint

View full text Add to dashboard Cite

show abstract

“…Hence, in this paper, we create a native QA dataset for the Persian language. Khashabi et al [34] created a Persian QA dataset containing 1300 instances and trained a QA system using this dataset. To the best of our knowledge, currently, there is no native larg-scale QA dataset for answering the Persian questions, neither as a monolingual nor as a cross-lingual dataset.…”

Section: B Other Languagesmentioning

confidence: 99%

“…[7] 2018 English Native 150K+ Wikiqa: A challenge dataset for open-domain question answering [8] 2015 English Native 3K+ MS MARCO: A human generated machine reading comprehension dataset [9] 2016 English Native 100K+ Natural questions: a benchmark for question answering research [10] 2019 English Native 300K+ Quac: Question answering in context [11] 2018 English Native 100K+ Coqa: A conversational question answering challenge [12] 2019 English Native 127K+ Newsqa: A machine comprehension dataset [13] 2016 English Native 100K+ Constructing datasets for multi-hop reading comprehension across documents [15] 2018 English Native, Multi-hop 50K+ Hotpotqa: A dataset for diverse, explainable multi-hop question answering [16] 2018 English Native, Multi-hop 113K+ Repartitioning of the complexwebquestions dataset [17] 2018 English Native, Multi-hop 63K+ R4C: A benchmark for evaluating RC systems to get the right answer for the right reason [18] 2019 English Native, Multi-hop 4K+ Automatic spanish translation of the squad dataset for multilingual question answering [19] 2019 Spanish Translation 100K+ Neural arabic question answering [20] 2019 Arabic Translation 48K+ Semi-supervised training data generation for multilingual question answering [21] 2018 Korean Translation 81K+ Neural learning for question answering in italian [22] 2018 Italian Translation 60K+ SberQuAD-Russian reading comprehension dataset: Description and analysis [24] 2020 Russian Native 50K+ Drcd: a chinese machine reading comprehension dataset [25] 2018 Chinese Native 30K+ Korquad1. 0: Korean qa dataset for machine reading comprehension [26] 2018 Korean Native 70K+ Project PIAF: Building a Native French Question-Answering Dataset [27] 2020 French Native 3K+ Parsinlu: a suite of language understanding challenges for persian [34] 2021 Persian Native 1K+ ParSQuAD: Persian Question Answering Dataset based on Machine Translation of SQuAD 2.0 [33] 2021 Persian Translation 25K, 70K…”

Section: B Other Languagesmentioning

confidence: 99%

“…Content may change prior to final publication. ParSQuAD (manual) [33] Persian ALBERT-FA 48.11% 51.66% ParSQuAD (manual) [33] Persian ParsBERT 46.32% 50.06% ParSQuAD (manual) [33] Persian MBERT 52.86% 56.66% ParSQuAD (automatic) [33] Persian ALBERT-FA 64.71% 67.59% ParSQuAD (automatic) [33] Persian ParsBERT 62.42% 65.26% ParSQuAD (automatic) [33] Persian MBERT 67.73% 70.84% ParsiNLU [34] Persian ParsBERT -40.70% ParsiNLU [34] Persian MBERT -49.70% SQuAD [2] English Bert-base 80.8% 88.5% SQuAD [2] English Bert-large 84.1% 90.9% SQuAD-es [19] Spanish MBERT 48.3% 68.1% SberQuAD [24] Russian BERT 66.6% 84.8% KorQuAD [26] Korean BERT 71.68% 89.76% PIAF [27] French CamemBert -79.64%…”

mentioning

confidence: 99%

See 1 more Smart Citation

PersianQuAD: The Native Question Answering Dataset for the Persian Language

2022

View full text Add to dashboard Cite

Developing Question Answering systems (QA) is one of the main goals in Artificial Intelligence. With the advent of Deep Learning (DL) techniques, QA systems have witnessed significant advances. Although DL performs very well on QA, it requires a considerable amount of annotated data for training. Many annotated datasets have been built for the QA task; most of them are exclusively in English. In order to address the need for a high-quality QA dataset in the Persian language, we present PersianQuAD, the native QA dataset for the Persian language. We create PersianQuAD in four steps: (1) Wikipedia article selection, ( 2) question-answer collection, (3) three-candidates test set preparation, and (4) Data Quality Monitoring. PersianQuAD consists of approximately 20,000 questions and answers made by native annotators on a set of Persian Wikipedia articles. The answer to each question is a segment of the corresponding article text. To better understand PersianQuAD and ensure its representativeness, we analyze PersianQuAD and show it contains questions of varying types and difficulties. We also present three versions of a deep learning-based QA system trained with PersianQuAD. Our best system achieves an F1 score of 82.97% which is comparable to that of QA systems on English SQuAD, made by the Stanford University. This shows that PersianQuAD performs well for training deep-learning-based QA systems. Human performance on PersianQuAD is significantly better (96.49%), demonstrating that PersianQuAD is challenging enough and there is still plenty of room for future improvement. PersianQuAD is freely available and can be downloaded from here. All the QA systems implemented in this paper are also available here.

show abstract

A corpus of Persian literary text

Raji,

Alikhani,

de Melo

et al. 2023

Lang Resources & Evaluation

View full text Add to dashboard Cite

Persian poetry has profoundly affected all periods of Persian literature and the literature of other countries as well. It is a fundamental vehicle for expressing Persian culture and political opinion. This paper presents a corpus of Persian literary text mainly focusing on poetry, covering the ninth to twenty-first century annotated for century and style, with additional partial annotation of rhetorical figures. Our resource is the largest and the most diverse corpus available in Persian literary text, with a particularly broad temporal scope. This allows us to conduct several computational experiments to analyze poetic styles, authors and time periods, as well as context shifts over time, for which we rely both on supervised models and on Persian poetry-specific heuristics. The corpus, the tools, and experiments described in this paper can be used not only for digital humanities studies of Persian literature but also for processing Persian texts in general, as well as in other broader cross-linguistic applications.

show abstract

ParsiNLU: A Suite of Language Understanding Challenges for Persian

Cited by 8 publications

References 53 publications

ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization

ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization

PersianQuAD: The Native Question Answering Dataset for the Persian Language

A corpus of Persian literary text

Contact Info

Product

Resources

About