Nhu Khoa Nguyen scite author profile

Nhu Khoa Nguyen

5 Publications

4 Citation Statements Received

115 Citation Statements Given


Affiliations

University of La Rochelle

Publications


Assessing the impact of OCR noise on multilingual event detection over digitised documents

et al. 2022


HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers.


L3i_LBPAM at the FinSim-2 task: Learning Financial Semantic Similarities with Siamese Transformers

Nguyen, Boroş, Lejeune et al. 2021


In this paper, we present the methods we proposed for the FinSim-2 Shared Task 2021 on Learning Semantic Similarities for the Financial Domain. The task evaluates the classification of financial terms into their corresponding top-level concepts (also known as hypernyms), which were extracted from an external ontology. We approached the task as a semantic textual similarity problem. By relying on a siamese network with pre-trained language model encoders, we derived semantically meaningful term embeddings, then computed and ranked the similarity scores between them. Additionally, we report the results of different baselines in which the task is tackled as a multi-class classification problem. The proposed methods outperformed our baselines and demonstrated the robustness of models based on a textual-similarity siamese network.

CCS Concepts: Computing methodologies → Lexical semantics; Neural networks.
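The ranking step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the three-dimensional vectors below are made-up stand-ins for the embeddings a siamese transformer encoder would produce, and the label names and the `rank_hypernyms` helper are hypothetical, chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_hypernyms(term_vec, label_vecs):
    """Return (label, score) pairs sorted from most to least similar."""
    scored = [(label, cosine(term_vec, vec)) for label, vec in label_vecs.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Assumption: toy embeddings. A real system would encode each term and each
# candidate concept with the same pre-trained language model encoder.
LABEL_VECS = {
    "Bonds": [0.9, 0.1, 0.0],
    "Funds": [0.1, 0.9, 0.1],
    "Swap": [0.0, 0.2, 0.9],
}
term_vec = [0.8, 0.3, 0.1]  # hypothetical embedding of one financial term

ranking = rank_hypernyms(term_vec, LABEL_VECS)
for label, score in ranking:
    print(f"{label}: {score:.3f}")
```

Returning the full ranked list rather than a single label reflects the abstract's framing of the task as similarity ranking instead of multi-class classification.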


Utilizing Keywords Evolution in Context for Emerging Trend Detection in Scientific Publications

Nguyen, Boroş, Lejeune et al. 2022


Contextualizing Emerging Trends in Financial News Articles

Nguyen, Delahaut, Boroş et al. 2023

Preprint


Identifying and exploring emerging trends in news is becoming more essential than ever, with many changes occurring around the world due to the global health crisis. However, most recent research has focused on detecting trends in social media, benefiting from social features (e.g. likes and retweets on Twitter) that help the task because they can be used to measure the engagement and diffusion rate of content. Formal text data, unlike short social media posts, comes in a longer, less restricted writing format and is thus more challenging. In this paper, we focus on emerging trend detection in financial news articles about Microsoft, collected before and during the start of the COVID-19 pandemic (July 2019 to July 2020). We make the dataset accessible and propose a strong baseline (Contextual Leap2Trend) for exploring the dynamics of similarities between pairs of keywords based on topic modeling and term frequency. Finally, we evaluate against a gold standard (Google Trends) and present noteworthy real-world scenarios regarding the influence of the pandemic on Microsoft.


Transformer-based Methods with #Entities for Detecting Emergency Events on Social Media

Boroş, Nguyen, Lejeune et al. 2022

