Deep Learning-Based Classification of Spoken English Digits
Algorithm 3: The DFNN Classification Model.
(1) procedure Deep Feedforward Neural Network Classifier ⊳ The input contains the STFT features of each audio sample, while the target contains the audio class label
(2) Read the dataset using the Librosa library
(3) Extract STFT features from each audio sample
(4) One-hot encode the audio class labels to produce the target vectors
(5) Split the dataset into training and testing sets, with the STFT features as the input and the audio class as the target label
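The data preparation steps above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the helper names (extract_stft_features, one_hot, train_test_split), the STFT parameters (n_fft=512, hop_length=256), the time-averaging of the magnitude spectrogram, the ten digit classes, and the 80/20 split are all assumptions that the listing does not specify.

```python
import numpy as np


def extract_stft_features(path, n_fft=512, hop_length=256):
    """Load one audio file and return a fixed-length STFT feature vector.

    Averaging the magnitude spectrogram over time is an assumption made
    here so that clips of different lengths give vectors of equal size.
    """
    import librosa  # imported lazily so the helpers below work without it

    signal, _sr = librosa.load(path, sr=None)  # keep the native sampling rate
    spectrum = np.abs(librosa.stft(signal, n_fft=n_fft, hop_length=hop_length))
    return spectrum.mean(axis=1)  # shape: (1 + n_fft // 2,)


def one_hot(labels, num_classes=10):
    """Map integer digit labels (assumed 0-9) to one-hot rows."""
    labels = np.asarray(labels, dtype=int)
    encoded = np.zeros((labels.size, num_classes), dtype=np.float32)
    encoded[np.arange(labels.size), labels] = 1.0
    return encoded


def train_test_split(features, targets, test_fraction=0.2, seed=0):
    """Shuffle the samples and split them into training and testing sets."""
    features = np.asarray(features)
    targets = np.asarray(targets)
    order = np.random.default_rng(seed).permutation(len(features))
    n_test = int(round(len(features) * test_fraction))
    test_idx, train_idx = order[:n_test], order[n_test:]
    return (features[train_idx], features[test_idx],
            targets[train_idx], targets[test_idx])
```

In use, one would stack the vectors returned by extract_stft_features for every file into a feature matrix, pass the digit labels through one_hot, and hand both arrays to train_test_split before training the network.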