Research Article

Design of Political Online Teaching Based on Artificial Speech Recognition and Deep Learning

Table 1

CTC-CNN baseline acoustic model parameters.

Network layerParameter

InputLyer300 dimensional harmony
conv2d_Lyer133 convolution channels, convolution kernel 4 × 4, step size 3 × 3, activation function
conv2d_Lyer233 convolution channels, convolution kernel 4 × 4, step size 3 × 3, activation function
max-poolng2dMaximum pooling 3 × 3
conv2d_Lyer365 convolution channels, convolution kernel 4 × 4, step size 3 × 3, activation function
conv2d_Lyer465 convolution channels, convolution kernel 4 × 4, step size 3 × 3, activation function
max-poolng2dMaximum pooling 3 × 3
conv2d_Lyer5129 convolution channels, convolution kernel 4 × 4, step size 3 × 3, activation function
conv2d_Lyer6129 convolution channels, convolution kernel 4 × 4, step size 3 × 3, activation function
max-poolng2dMaximum pooling 3 × 3
ReshapFeature map transformation output 300 × 3300
FC_Lyer1Number of neurons 129, activation function
FC_Lyer2The number of neurons is 1425, the activation function
SoftmxActivation output matrix, the dimension is 300 × 1425
CTCProbability matrix, the length of the sonogram, the length of the label sequence