Gantavya Bhatt scite author profile

Gantavya Bhatt

4 Publications

11 Citation Statements Received

148 Citation Statements Given


Affiliations

University of Washington, Seattle University

Publications


How much complexity does an RNN architecture need to learn syntax-sensitive dependencies?

Bhatt, Bansal, Singh et al. 2020


Long short-term memory (LSTM) networks and their variants are capable of encapsulating long-range dependencies, which is evident from their performance on a variety of linguistic tasks. On the other hand, simple recurrent networks (SRNs), which appear more biologically grounded in terms of synaptic connections, have generally been less successful at capturing long-range dependencies as well as the loci of grammatical errors in an unsupervised setting. In this paper, we seek to develop models that bridge the gap between biological plausibility and linguistic competence. We propose a new architecture, the Decay RNN, which incorporates the decaying nature of neuronal activations and models the excitatory and inhibitory connections in a population of neurons. Besides its biological inspiration, our model also shows competitive performance relative to LSTMs on subject-verb agreement, sentence grammaticality, and language modeling tasks. These results provide some pointers towards probing the nature of the inductive biases required for RNN architectures to model linguistic phenomena successfully.
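The abstract names two ingredients: hidden activations that decay over time, and recurrent connections split into excitatory and inhibitory ones. The following is a minimal sketch of how such an update could look, not the paper's actual formulation. The function name `decay_rnn_step`, the `alpha` decay factor, the `signs` vector, and the use of `tanh` are all assumptions made for illustration.

```python
import math


def decay_rnn_step(h_prev, x, W, U, b, signs, alpha):
    """One hypothetical decay-style recurrent update (illustrative only).

    h_prev : previous hidden state, list of floats
    x      : input vector, list of floats
    W      : hidden-to-hidden weights, W[i][j] connects unit j to unit i
    U      : input-to-hidden weights, U[i][k] connects input k to unit i
    b      : bias per hidden unit
    signs  : +1.0 (excitatory) or -1.0 (inhibitory) per presynaptic unit
    alpha  : assumed decay factor in [0, 1]; larger keeps more of the old state
    """
    h_new = []
    for i in range(len(h_prev)):
        # Each presynaptic unit j contributes with a fixed sign: the weight
        # magnitude is taken from W and the sign from signs[j].
        recurrent = sum(
            abs(W[i][j]) * signs[j] * h_prev[j] for j in range(len(h_prev))
        )
        feedforward = sum(U[i][k] * x[k] for k in range(len(x)))
        candidate = math.tanh(recurrent + feedforward + b[i])
        # The old activation decays by alpha and is mixed with the candidate.
        h_new.append(alpha * h_prev[i] + (1.0 - alpha) * candidate)
    return h_new
```

With `alpha = 1.0` the state never changes, and with `alpha = 0.0` the update reduces to a plain recurrent step with sign-constrained weights, so `alpha` interpolates between the two in this sketch.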


Systematic Generalization in Neural Networks-based Multivariate Time Series Forecasting Models

Bansal, Bhatt, Malhotra et al. 2021


Matryoshka Representations for Adaptive Deployment

Kusupati, Bhatt, Wallingford et al. 2022

Preprint


Learned representations are a central component in modern ML systems, serving a multitude of downstream tasks. When training such representations, it is often the case that computational and statistical constraints for each downstream task are unknown. In this context, rigid fixed-capacity representations can be either over- or under-accommodating to the task at hand. This leads us to ask: can we design a flexible representation that can adapt to multiple downstream tasks with varying computational resources? Our main contribution is Matryoshka Representation Learning (MRL), which encodes information at different granularities and allows a single embedding to adapt to the computational constraints of downstream tasks. MRL minimally modifies existing representation learning pipelines and imposes no additional cost during inference and deployment. MRL learns coarse-to-fine representations that are at least as accurate and rich as independently trained low-dimensional representations. The flexibility within the learned Matryoshka Representations offers: (a) up to 14× smaller embedding size for ImageNet-1K classification at the same level of accuracy; (b) up to 14× real-world speed-ups for large-scale retrieval on ImageNet-1K and 4K; and (c) up to 2% accuracy improvements for long-tail few-shot classification, all while being as robust as the original representations. Finally, we show that MRL extends seamlessly to web-scale datasets (ImageNet, JFT) across various modalities: vision (ViT, ResNet), vision + language (ALIGN), and language (BERT). MRL code and pretrained models are open-sourced at https://github.com/RAIVNLab/MRL.

* Equal contribution; AK led the project with extensive support from GB and AR for experimentation. Preprint. Under review.
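To make the idea of a single embedding that adapts to different compute budgets concrete, here is a minimal sketch of using only a prefix of an embedding at retrieval time. It assumes, consistent with the coarse-to-fine description in the abstract, that the leading coordinates form a usable lower-dimensional representation. The helper names (`truncate_embedding`, `nearest`) are hypothetical and are not taken from the MRL codebase.

```python
import math


def truncate_embedding(embedding, dim):
    """Keep the first `dim` coordinates and L2-normalize the result."""
    if not 1 <= dim <= len(embedding):
        raise ValueError("dim must be between 1 and len(embedding)")
    prefix = embedding[:dim]
    norm = math.sqrt(sum(v * v for v in prefix))
    if norm == 0.0:
        return list(prefix)  # avoid division by zero for an all-zero prefix
    return [v / norm for v in prefix]


def nearest(query, database, dim):
    """Index of the database item most similar to `query`,
    comparing only the first `dim` coordinates (cosine similarity)."""
    q = truncate_embedding(query, dim)
    best_index, best_score = -1, -math.inf
    for i, item in enumerate(database):
        d = truncate_embedding(item, dim)
        score = sum(a * b for a, b in zip(q, d))
        if score > best_score:
            best_index, best_score = i, score
    return best_index


if __name__ == "__main__":
    database = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
    query = [0.9, 0.1, 0.3, 0.0]
    # Cheap search with 2 dimensions, fuller search with all 4.
    print(nearest(query, database, dim=2), nearest(query, database, dim=4))
```

A smaller `dim` reduces the cost of each comparison. How much accuracy is retained at a given `dim` depends on how the embedding was trained, which this sketch does not cover.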


How much complexity does an RNN architecture need to learn syntax-sensitive dependencies?

Bhatt, Bansal, Singh et al. 2020

Preprint


