Mayank Kulkarni scite author profile

Keyphrase Extraction as Sequence Labeling Using Contextualized Embeddings

In this paper, we formulate keyphrase extraction from scholarly articles as a sequence labeling task solved using a BiLSTM-CRF, where the words in the input text are represented using deep contextualized embeddings. We evaluate the proposed architecture using both contextualized and fixed word embedding models on three different benchmark datasets (Inspec, SemEval 2010, SemEval 2017), and compare with existing popular unsupervised and supervised techniques. Our results quantify the benefits of: (a) using contextualized embeddings (e.g. BERT) over fixed word embeddings (e.g. GloVe); (b) using a BiLSTM-CRF architecture with contextualized word embeddings over fine-tuning the contextualized word embedding model directly; and (c) using genre-specific contextualized embeddings (SciBERT). Through error analysis, we also provide some insights into why particular models work better than others. Lastly, we present a case study where we analyze different self-attention layers of the two best models (BERT and SciBERT) to better understand the predictions made by each for the task of keyphrase extraction.
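To make the sequence labeling framing concrete, here is a minimal sketch of how gold keyphrases might be turned into per-token labels before training a tagger such as a BiLSTM-CRF. The B/I/O tag names, the case-insensitive matching, and the helper `bio_labels` are assumptions for illustration; the abstract does not specify the labeling scheme or preprocessing.

```python
from typing import List


def bio_labels(tokens: List[str], keyphrases: List[List[str]]) -> List[str]:
    """Tag each token as B (begins a keyphrase), I (inside one), or O (outside).

    Hypothetical helper: matching is exact on lowercased tokens, and a span is
    only labeled if none of its tokens has been labeled already.
    """
    labels = ["O"] * len(tokens)
    lowered = [t.lower() for t in tokens]
    for phrase in keyphrases:
        target = [w.lower() for w in phrase]
        n = len(target)
        if n == 0:
            continue
        for start in range(len(tokens) - n + 1):
            span_free = all(label == "O" for label in labels[start:start + n])
            if lowered[start:start + n] == target and span_free:
                labels[start] = "B"
                for i in range(start + 1, start + n):
                    labels[i] = "I"
    return labels


tokens = "We extract keyphrases with a BiLSTM-CRF over contextualized embeddings".split()
phrases = [["contextualized", "embeddings"], ["BiLSTM-CRF"]]
labels = bio_labels(tokens, phrases)
for token, label in zip(tokens, labels):
    print(f"{token}\t{label}")
```

A tagger trained on such label sequences predicts one tag per token, and keyphrases are read back off as each B followed by its run of I tags.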

Multi-Domain Named Entity Recognition with Genre-Aware and Agnostic Inference

Wang, Kulkarni, Preoţiuc-Pietro (2020)

Named entity recognition is a key component of many text processing pipelines, and it is thus essential for this component to be robust to different types of input. However, domain transfer of NER models with data from multiple genres has not been widely studied. To this end, we conduct NER experiments in three predictive setups on data from: a) multiple domains; b) multiple domains where the genre label is unknown at inference time; c) domains not encountered in training. We introduce a new architecture tailored to this task by using shared and private domain parameters and multi-task learning. This consistently outperforms all other baseline and competitive methods on all three experimental setups, with differences ranging from +1.95 to +3.11 average F1 across multiple genres when compared to standard approaches. These results illustrate the challenges that need to be taken into account when building real-world NLP applications that are robust to various types of text and the methods that can help, at least partially, alleviate these issues.

Learning Rich Representation of Keyphrases from Text

Kulkarni, Mahata, Arora, et al. (2022)

In this work, we explore how to train task-specific language models aimed at learning rich representations of keyphrases from text documents. We experiment with different masking strategies for pre-training transformer language models (LMs) in discriminative as well as generative settings. In the discriminative setting, we introduce a new pre-training objective, Keyphrase Boundary Infilling with Replacement (KBIR), showing large gains in performance (up to 8.16 points in F1) over SOTA when the LM pre-trained using KBIR is fine-tuned for the task of keyphrase extraction. In the generative setting, we introduce a new pre-training setup for BART, KeyBART, that reproduces the keyphrases related to the input text in the CatSeq format, instead of the denoised original input. This also led to gains in performance (up to 4.33 points in F1@M) over SOTA for keyphrase generation. We also fine-tune the pre-trained language models on named entity recognition (NER), question answering (QA), relation extraction (RE), and abstractive summarization, and achieve performance comparable to the SOTA, showing that learning rich representations of keyphrases is indeed beneficial for many other fundamental NLP tasks.
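As a rough illustration of the generative setup, the sketch below builds a CatSeq-style target (all keyphrases of a document concatenated into one output sequence) and scores a decoded sequence with an exact-match F1 computed over all predicted keyphrases, which is how F1@M is commonly described. The separator string, the function names, and the lack of stemming or other normalization are assumptions; the abstract does not give these details.

```python
from typing import List

SEP = " ; "  # hypothetical separator; the abstract does not specify one


def to_catseq(keyphrases: List[str]) -> str:
    """Concatenate keyphrases into a single target sequence."""
    return SEP.join(keyphrases)


def from_catseq(sequence: str) -> List[str]:
    """Split a generated sequence back into unique keyphrases, keeping order."""
    seen, phrases = set(), []
    for phrase in sequence.split(SEP.strip()):
        phrase = phrase.strip()
        if phrase and phrase not in seen:
            seen.add(phrase)
            phrases.append(phrase)
    return phrases


def f1_over_all_predictions(predicted: List[str], gold: List[str]) -> float:
    """Exact-match F1 between every predicted keyphrase and the gold set."""
    pred_set, gold_set = set(predicted), set(gold)
    correct = len(pred_set & gold_set)
    if correct == 0:
        return 0.0
    precision = correct / len(pred_set)
    recall = correct / len(gold_set)
    return 2 * precision * recall / (precision + recall)


gold = ["keyphrase extraction", "language models"]
target = to_catseq(gold)
decoded = from_catseq("keyphrase extraction ; transformers ; keyphrase extraction")
score = f1_over_all_predictions(decoded, gold)
print(target)
print(decoded, score)
```

In this toy run one of two predictions is correct and one of two gold keyphrases is recovered, so precision, recall, and F1 are all 0.5.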

Learning Rich Representation of Keyphrases from Text

Kulkarni, Mahata, Arora, et al. (2021), preprint

Affect-Based Early Prediction of Player Mental Demand and Engagement for Educational Games

Wiggins, Kulkarni, Min, et al. (2018), AIIDE

Player affect is a central consideration in the design of game-based learning environments. Affective indicators such as facial expressions exhibited during gameplay may support building more robust player models and adaptation modules. In game-based learning, predicting player mental demand and engagement from player affect is a particularly promising approach to helping create more effective gameplay. This paper reports on a predictive player-modeling approach that observes player affect during early interactions with a game-based learning environment and predicts self-reports of mental demand and engagement at the conclusion of gameplay sessions. The findings show that automatically detected facial expressions such as those associated with joy, disgust, sadness, and surprise are significant predictors of players’ self-reported engagement and mental demand at the end of gameplay interactions. The results suggest that it is possible to create affect-based predictive player models that can enable proactively tailored gameplay by anticipating player mental demand and engagement.


Keyphrase Extraction as Sequence Labeling Using Contextualized Embeddings
