2023
DOI: 10.3390/sym15122185
|View full text |Cite
|
Sign up to set email alerts
|

A Novel Approach for Data Feature Weighting Using Correlation Coefficients and Min–Max Normalization

Mohammed Shantal,
Zalinda Othman,
Azuraliza Abu Bakar

Abstract: In the realm of data analysis and machine learning, achieving an optimal balance of feature importance, known as feature weighting, plays a pivotal role, especially when considering the nuanced interplay between the symmetry of data distribution and the need to assign differential weights to individual features. Also, avoiding the dominance of large-scale traits is essential in data preparation. This step makes choosing an effective normalization approach one of the most challenging aspects of machine learning… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(3 citation statements)
references
References 53 publications
0
1
0
Order By: Relevance
“…The samples are normalized by [0, 1] to remove the impact of various data outlines and avoid masking the data that significantly affects the dependent variable. Equation ( 4) provides the data normalization formula [39].…”
Section: Methods and Investigated Vehiclesmentioning
confidence: 99%
“…The samples are normalized by [0, 1] to remove the impact of various data outlines and avoid masking the data that significantly affects the dependent variable. Equation ( 4) provides the data normalization formula [39].…”
Section: Methods and Investigated Vehiclesmentioning
confidence: 99%
“…Since these parameters are different in the ranges, the training process of the ML-based prediction model may be deviated. In previous studies [35,36], it has been noted that parameter normalization can significantly increase the performance of the training process of the ML estimation model. Although different ranges can be possible for normalizing the input data, [0.1 0.9] has been served in several studies which maximize the efficiency of the training process [37].…”
Section: Data Processmentioning
confidence: 99%
“…In the formula, Z denotes the calculated value of the Min-Max normalization of the current environmental variable, X i denotes the current environmental variable at the time of operation, X min denotes the minimum value of the current environmental variable at the time of operation, and X max denotes the maximum value of the current environmental variable at the time of operation [11].…”
Section: Deviation Normalizationmentioning
confidence: 99%