Yusuf Arslan scite author profile

Yusuf Arslan

5 Publications

50 Citation Statements Received

52 Citation Statements Given

Affiliations

University of Luxembourg, Middle East Technical University

Publications

Semi-automatic construction of a domain ontology for wind energy using Wikipedia articles

Küçük, Arslan. 2014. Renewable Energy.

Domain ontologies are important information sources for knowledge-based systems. Yet, building domain ontologies from scratch is known to be a very labor-intensive process. In this study, we present our semi-automatic approach to building an ontology for the domain of wind energy which is an important type of renewable energy with a growing share in electricity generation all over the world. Related Wikipedia articles are first processed in an automated manner to determine the basic concepts of the domain together with their properties and next the concepts, properties, and relationships are organized to arrive at the ultimate ontology. We also provide pointers to other engineering ontologies which could be utilized together with the proposed wind energy ontology in addition to its prospective application areas. The current study is significant as, to the best of our knowledge, it proposes the first considerably wide-coverage ontology for the wind energy domain and the ontology is built through a semi-automatic process which makes use of the related Web resources, thereby reducing the overall cost of the ontology building process.

A Comparison of Pre-Trained Language Models for Multi-Class Text Classification in the Financial Domain

Arslan, Allix, Veiber, et al. 2021.

Neural networks for language modeling have been proven effective on several sub-tasks of natural language processing. Training deep language models, however, is time-consuming and computationally intensive. Pre-trained language models such as BERT are thus appealing since (1) they yielded state-of-the-art performance, and (2) they offload practitioners from the burden of preparing the adequate resources (time, hardware, and data) to train models. Nevertheless, because pre-trained models are generic, they may underperform on specific domains. In this study, we investigate the case of multi-class text classification, a task that is relatively less studied in the literature evaluating pre-trained language models. Our work is further placed under the industrial settings of the financial domain. We thus leverage generic benchmark datasets from the literature and two proprietary datasets from our partners in the financial technological industry. After highlighting a challenge for generic pre-trained models (BERT, DistilBERT, RoBERTa, XLNet, XLM) to classify a portion of the financial document dataset, we investigate the intuition that a specialized pre-trained model for financial documents, such as FinBERT, should be leveraged. Nevertheless, our experiments show that the FinBERT model, even with an adapted vocabulary, does not lead to improvements compared to the generic BERT models.

CCS Concepts: Applied computing → Text processing.

Twitter Sentiment Analysis Experiments Using Word Embeddings on Datasets of Various Scales

Arslan, Küçük, Birtürk. 2018.

Real-time Lexicon-based sentiment analysis experiments on Twitter with a mild (more information, less data) approach

Arslan, Birtürk, Djumabaev, et al. 2017.

On the Suitability of SHAP Explanations for Refining Classifications

Arslan, Lebichot, Allix, et al. 2022.
