Kowshik Bhowmik scite author profile

Kowshik Bhowmik

3 Publications

0 Citation Statements Received

0 Citation Statements Given


Affiliations

Computational Sciences (United States), University of Cincinnati, College of Wooster

Publications


Leveraging Vector Space Similarity for Learning Cross-Lingual Word Embeddings: A Systematic Review

Bhowmik

Ralescu

2021

Digital


This article presents a systematic literature review on quantifying the proximity between independently trained monolingual word embedding spaces. A search was carried out in the broader context of inducing bilingual lexicons from cross-lingual word embeddings, especially for low-resource languages. The returned articles were then classified. Cross-lingual word embeddings have drawn the attention of researchers in the field of natural language processing (NLP). Although existing methods have yielded satisfactory results for resource-rich languages and languages related to them, some researchers have pointed out that the same is not true for low-resource and distant languages. In this paper, we report the research on methods proposed to provide better representation for low-resource and distant languages in the cross-lingual word embedding space.


Clustering of Monolingual Embedding Spaces

Bhowmik

Ralescu

2023

Digital


Suboptimal performance of cross-lingual word embeddings for distant and low-resource languages calls into question the isomorphic assumption integral to the mapping-based methods of obtaining such embeddings. This paper investigates the comparative impact of typological relationship and corpus size on the isomorphism between monolingual embedding spaces. To that end, two clustering algorithms were applied to three sets of pairwise degrees of isomorphisms. It is also the goal of the paper to determine the combination of the isomorphism measure and clustering algorithm that best captures the typological relationship among the chosen set of languages. Of the three measures investigated, Relational Similarity seemed to capture best the typological information of the languages encoded in their respective embedding spaces. These language clusters can help us identify, without any pre-existing knowledge about the real-world linguistic relationships shared among a group of languages, the related higher-resource languages of low-resource languages. The presence of such languages in the cross-lingual embedding space can help improve the performance of low-resource languages in a cross-lingual embedding space.


Bridging the Resource Gap in Cross-Lingual Embedding Space

Bhowmik

Ralescu

2023



