Most empirical evaluations of active learning approaches in the literature have focused on a single classifier and a single performance measure. We present an extensive empirical evaluation of common active learning baselines using two probabilistic classifiers and several performance measures on a number of large datasets. In addition to providing important practical advice, our findings highlight the importance of choices that are often overlooked in active learning experiments. For example, we find that model selection is as important as devising an active learning approach, and that relying on a single classifier and a single performance measure can lead to unexpected and unwarranted conclusions. Active learning should generally improve a model's ability to distinguish between instances of different classes, yet our findings show that the improvements it provides for one performance measure often come at the expense of another. We present several such results, raise questions, guide users and researchers toward better alternatives, caution against unforeseen side effects of active learning, and suggest directions for future research.
This article presents a profile-based authorship analysis method that first categorizes texts according to social and conceptual characteristics of their authors (e.g., sex and political ideology) and then combines these profiles for two authorship analysis tasks: (1) determining whether pairs of texts share an author, without a set of candidate authors, and (2) clustering texts according to characteristics of their authors in order to characterize the types of individuals represented in the data set. On the first task, the method outperforms Burrows' Delta by a wide margin on short texts and by a small margin on long texts; the second task has no comparable benchmark among existing methods. The data set used to evaluate the method consists of speeches from the US House and Senate from 1995 to 2013, comprising both a large number of texts (42,000 in the test sets) and a large number of speakers (over 800). The article shows that this approach to authorship analysis is more accurate than existing approaches on a data set with hundreds of authors. Further, the profile-based method makes new types of analysis possible by examining types of individuals as well as specific individuals.