Artificial speech detection using image-based features and random forest classifier

Tan, Choon Beng; Hijazi, Mohd Hanafi Ahmad; Kok, Frazier; Mohamad, Mohd Saberi; Nohuddin, Puteri N. E.

doi:10.11591/ijai.v11.i1.pp161-172

Cited by 3 publications

(3 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Artificial speech detection techniques have witnessed significant evolution, falling into three primary categories: classic machine learning, end-to-end learning, and hybrid methodologies. Classic machine learning entails the manual crafting and extraction of predetermined features from data samples, which are then subject to separate classification modules [4,5]. In contrast, end-to-end learning orchestrates the automatic and joint identification and learning of all data sample features to determine their class labels.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

A Comparative Evaluation on Data Transformation Approach for Artificial Speech Detection

Tan,

Ahmad Hijazi

2024

ITM Web Conf.

Self Cite

View full text Add to dashboard Cite

The rise of voice biometrics has transformed user authentication and offered enhanced security and convenience while phasing out less secure methods. Despite these advancements, Automatic Speaker Verification (ASV) systems remain vulnerable to spoofing, particularly with artificial speech generated swiftly using advanced speech synthesis and voice conversion algorithms. A recent data transformation technique achieved an impressive Equal Error Rate (EER) of 1.42% on the ASVspoof 2019 Logical Access Dataset. While this approach predominantly relies on Support Vector Machine (SVM) as the backend classifier for artificial speech detection, it is vital to explore a broader range of classifiers to enhance resilience. This paper addresses this research gap by systematically assessing classifier efficacy in artificial speech detection. The objectives are twofold: first, to evaluate various classifiers, not limited to SVM, and identify those best suited for artificial speech detection; second, to compare this approach's performance with existing methods. The evaluation demonstrated SVM-Polynomial as the top-performing classifier, surpassing the end-to-end learning approach. This work contributes to a deeper understanding of classifier efficacy and equips researchers and practitioners with a diversified toolkit for building robust ASV spoofing detection systems.

show abstract

Section: Related Workmentioning

confidence: 99%

“…RF was considered in this evaluation due to its demonstrated strong performance in the ASVspoof 2015 challenge [4]. RF is an ensemble learning model that employs decision trees for classification and regression.…”

Section: Classifiersmentioning

confidence: 99%

A Comparative Evaluation on Data Transformation Approach for Artificial Speech Detection

Tan,

Ahmad Hijazi

2024

ITM Web Conf.

Self Cite

View full text Add to dashboard Cite

show abstract

“…Tan et al [19], showed a method for engineer image-based features when used with a RF classifier to identify artificial speech by use of data transformation techniques. The two goals are as follows: i) from the mel-frequency cepstral coefficients representation of the speech signal, extract image-based features; and ii) compare the effectiveness of using RF and the extracted features with the existing approaches to determine the authenticity of voices.…”

Section: Literature Surveymentioning

confidence: 99%

Alzheimer image registration using hybrid random forest and deep regression network algorithm

Siddabathuni,

Palanivel,

Lakshmi Narasimha Murthy

2024

IJEECS

View full text Add to dashboard Cite

<span>Image registration involves superimposing images (two or more) of similar background obtained at various periods of time, at different angles, and/or with various detectors. Geometrical alignment of two scans, reference image as well as capture image. The current dissimilarity between images is because of distinct image conditions. Image registration is difficult step in image analysis works on change detection, image fusion as well as <br /> multi-channel images recovery to obtain concluded data from integration of different sources. In this analysis image registration using hybrid random forest (RF) and deep regression network algorithm for magnetic resonance imaging (MRI) applications is implemented. The Alzheimer’s disease neuroimaging initiative (ADNI) database provided by the dataset utilised in this implementation. From results it can observe that compared with individual random of forest, Hybrid RF and deep regression network algorithm improves the accuracy, precision and F1-score in effective way.</span>

show abstract

A Hybrid Classification Approach For Artificial Speech Detection

Tan,

Ahmad Hijazi,

Ellyza Nohuddin

2023

2023 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)

View full text Add to dashboard Cite

Artificial speech detection using image-based features and random forest classifier

Cited by 3 publications

References 28 publications

A Comparative Evaluation on Data Transformation Approach for Artificial Speech Detection

A Comparative Evaluation on Data Transformation Approach for Artificial Speech Detection

Alzheimer image registration using hybrid random forest and deep regression network algorithm

A Hybrid Classification Approach For Artificial Speech Detection

Contact Info

Product

Resources

About