Vinitra Swamy scite author profile

Vinitra Swamy

5Publications

9Citation Statements Received

80Citation Statements Given

How they've been cited

How they cite others

Affiliations

Computer Science Laboratory of Lille, SASTRA University

Publications

Order By: Most citations

Interpreting Language Models Through Knowledge Graph Extraction

Swamy¹,

Romanou²,

Jäggi³

2021

Preprint

View full text Add to dashboard Cite

Transformer-based language models trained on large text corpora have enjoyed immense popularity in the natural language processing community and are commonly used as a starting point for downstream tasks. While these models are undeniably useful, it is a challenge to quantify their performance beyond traditional accuracy metrics. In this paper, we compare BERT-based language models through snapshots of acquired knowledge at sequential stages of the training process. Structured relationships from training corpora may be uncovered through querying a masked language model with probing tasks. We present a methodology to unveil a knowledge acquisition timeline by generating knowledge graph extracts from cloze "fill-in-the-blank" statements at various stages of RoBERTa's early training. We extend this analysis to a comparison of pretrained variations of BERT models (DistilBERT, BERT-base, RoBERTa). This work proposes a quantitative framework to compare language models through knowledge graph extraction (GED, Graph2Vec) and showcases a part-of-speech analysis (POSOR) to identify the linguistic strengths of each model variant. Using these metrics, machine learning practitioners can compare models, diagnose their models' behavioral strengths and weaknesses, and identify new targeted datasets to improve model performance.

show abstract

Survey on clustering techniques in Twitter data

Devika

Revathy

et al. 2018

View full text Add to dashboard Cite

Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs

Swamy¹,

Krco²,

Marras³

et al. 2022

Preprint

View full text Add to dashboard Cite

Neural networks are ubiquitous in applied machine learning for education. Their pervasive success in predictive performance comes alongside a severe weakness, the lack of explainability of their decisions, especially relevant in humancentric fields. We implement five state-of-the-art methodologies for explaining black-box machine learning models (LIME, PermutationSHAP, KernelSHAP, DiCE, CEM) and examine the strengths of each approach on the downstream task of student performance prediction for five massive open online courses. Our experiments demonstrate that the families of explainers do not agree with each other on feature importance for the same Bidirectional LSTM models with the same representative set of students. We use Principal Component Analysis, Jensen-Shannon distance, and Spearman's rank-order correlation to quantitatively cross-examine explanations across methods and courses. Furthermore, we validate explainer performance across curriculum-based prerequisite relationships. Our results come to the concerning conclusion that the choice of explainer is an important decision and is in fact paramount to the interpretation of the predictive results, even more so than the course the model is trained on. Source code and models are released at http://github.com/epfl-ml4ed/evaluating-explainers.

show abstract

Deep Knowledge Tracing for Free-Form Student Code Progression

Swamy¹,

Guo²,

Lau³

et al. 2018

View full text Add to dashboard Cite

Trusting the Explainers: Teacher Validation of Explainable Artificial Intelligence for Course Design

Swamy¹,

Du²,

Marras³

et al. 2022

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Vinitra Swamy

Interpreting Language Models Through Knowledge Graph Extraction

Survey on clustering techniques in Twitter data

Evaluating the Explainers: Black-Box Explainable Machine Learning for Student Success Prediction in MOOCs

Deep Knowledge Tracing for Free-Form Student Code Progression

Trusting the Explainers: Teacher Validation of Explainable Artificial Intelligence for Course Design

Contact Info

Product

Resources

About