2023
DOI: 10.3390/app13042272
|View full text |Cite
|
Sign up to set email alerts
|

A Powerful Predicting Model for Financial Statement Fraud Based on Optimized XGBoost Ensemble Learning Technique

Abstract: This study aims to develop a better Financial Statement Fraud (FSF) detection model by utilizing data from publicly available financial statements of firms in the MENA region. We develop an FSF model using a powerful ensemble technique, the XGBoost (eXtreme Gradient Boosting) algorithm, that helps to identify fraud in a set of sample companies drawn from the Middle East and North Africa (MENA) region. The issue of class imbalance in the dataset is addressed by applying the Synthetic Minority Oversampling Techn… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(6 citation statements)
references
References 53 publications
0
3
0
Order By: Relevance
“…ML and DL models can be used parallelly to build a model with improved capabilities, as in Hybrid Learning (HL). They can also be combined, and the output of one model is the input for the second, and that is called Ensembled Learning (EL) as in [49] [29]. Accordingly, the training process of CADM uses an HL approach for enhanced performance.…”
Section: Big Data In Accounting and Auditingmentioning
confidence: 99%
“…ML and DL models can be used parallelly to build a model with improved capabilities, as in Hybrid Learning (HL). They can also be combined, and the output of one model is the input for the second, and that is called Ensembled Learning (EL) as in [49] [29]. Accordingly, the training process of CADM uses an HL approach for enhanced performance.…”
Section: Big Data In Accounting and Auditingmentioning
confidence: 99%
“…These papers collectively contribute to the field of data-driven applications and predictive modeling across various domains, like a Decision Support System for the Marketing domain [11]. They offer an advanced model for detecting financial statement fraud using XGBoost [12], introduce innovative neural network models for stock price prediction [13], analyze factors influencing tourist offer prices [14], develop predictive models for healthcare patient influx [15], and propose intelligent decision forest models for customer churn prediction in the telecom industry [16]. They also address customer churn prediction in noncontractual B2B settings [17], improve legal judgment prediction through graph neural networks [18], enhance car sales forecasts using online sentiment data and deep learning [19], introduce a reinforcement learning framework for options trading [20], and predict the charge of a legal case using a novel graph convolutional network [21].…”
Section: Category 2: Marketing and Business Decision Supportmentioning
confidence: 99%
“…The models with the Gini and Entr suffixes indicate the measures used to determine how a decision tree node splits. These measures are referred to as the Gini index and Entropy, which is a measure of the purity of the split [42]. XGBoost is an extreme gradient-boosted tree with an ensemble algorithm, with each tree boosting misclassified attributes of the previous tree [43].…”
Section: Details Of the Selected Automl Toolsmentioning
confidence: 99%