2023
DOI: 10.32604/cmc.2023.039528
|View full text |Cite
|
Sign up to set email alerts
|

Text Extraction with Optimal Bi-LSTM

Bahera H. Nayef,
Siti Norul Huda Sheikh Abdullah,
Rossilawati Sulaiman
et al.

Abstract: Text extraction from images using the traditional techniques of image collecting, and pattern recognition using machine learning consume time due to the amount of extracted features from the images. Deep Neural Networks introduce effective solutions to extract text features from images using a few techniques and the ability to train large datasets of images with significant results. This study proposes using Dual Maxpooling and concatenating convolution Neural Networks (CNN) layers with the activation function… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 28 publications
(33 reference statements)
0
0
0
Order By: Relevance
“…In this study, we first compute the audio's Short-Time Fourier Transform (STFT) and then extract various features such as spectral centroid, spectrogram, mel-frequency cepstral coefficients (MFCC), chromagram, and pitch-related features. Then a 312-dimensional feature vector will be utilized for further audio processing and analysis [17][18][19]. BiLSTM model architecture diagram are depicted in Figure 2.…”
Section: Speech Feature Extraction Methods Based On Bilstmmentioning
confidence: 99%
“…In this study, we first compute the audio's Short-Time Fourier Transform (STFT) and then extract various features such as spectral centroid, spectrogram, mel-frequency cepstral coefficients (MFCC), chromagram, and pitch-related features. Then a 312-dimensional feature vector will be utilized for further audio processing and analysis [17][18][19]. BiLSTM model architecture diagram are depicted in Figure 2.…”
Section: Speech Feature Extraction Methods Based On Bilstmmentioning
confidence: 99%