Abstract

Understanding the question is the key step in a question answering system. This paper therefore designs a multi-attention-layer model that extracts omitted features from a memory module and organises them reasonably with GRU units. By combining the results of different attention layers through non-linear operations, the model avoids extracting only a linear combination of the memory module. The method integrates the steps of question understanding, including the classification of question words, part-of-speech tagging of the question sentence, syntactic analysis and identification of the question sentence pattern, classification of the question, and identification of the question centre. Experimental results show that this method improves the accuracy of question understanding.

1. Introduction

A question answering (QA) system can better meet users' retrieval needs and find the answers they need faster, and it is currently receiving a great deal of attention [1]. QA systems generally consist of three main components: question understanding, information retrieval and answer extraction [2]. Question understanding, also known as sentence analysis, is a key step in a question answering system, and its accuracy has a significant impact on how questions are answered. Question understanding analyses the question information, including the type of question, the centre of the question, the focus of the question, the constraints, the expected level of detail of the answer, and whether the answer is unique [3].

At present, questions are generally categorised by their question words, such as person, place, time or quantity [4], after which keywords are extracted and expanded. Although classifying questions by question word is easy, the resulting understanding of the question is relatively shallow, which reduces accuracy [5]. A number of studies have therefore combined syntactic analysis with the classification of interrogative sentences. In [6], question classes were extracted from syntactic structures based on Chinese dependency grammar, and in [7], phrase-structure syntax trees were truncated to form syntactic fragments; both studies achieved good results.

In this paper, based on the characteristics of context-free phrase grammar, the phrase-structure syntax tree is analysed to obtain the syntactic structure of the question sentence. The similarity between this syntactic structure and each syntactic pattern is then calculated, and the syntactic pattern of the question is identified according to the similarity, after which the question is understood [8]. Syntactic analysis of interrogative sentences is thus combined with recognition of interrogative sentence patterns to achieve question understanding.

Entity recognition refers to the identification of entities with specific meanings in natural text and the annotation of their location and type [9]. As related research has advanced, its research objects have begun to shift from general knowledge text to special fields in industry, such as the recognition of proprietary organisation names in telecommunications [10], and its scope of application is gradually expanding [11]. Early approaches were rule-based or machine-learning based. Reference [12] chose multi-layer perceptron and LSTM models to extract entities in medical records; [13] used LSTM models to recognise entities in the medical field; [14] used a CNN-based model to identify entities in combat documents. Reference [15] constructed a CRF-based product alias recognition model for manually annotated ancient Fangzhi materials; [16] recognised place names in the Zuo Shi Zhuan of the Spring and Autumn Period based on CRFs and MEMMs, used The State Language [17] as a validation corpus, and found that CRFs outperformed MEMMs; [18] used a BiLSTM-CNN-CRF model for entity extraction as the basis for constructing a Chinese historical knowledge graph.

3. Methods and Steps for Understanding Interrogative Sentences

In automated question answering systems, only ad hoc questions are generally studied. The distinctive feature of ad hoc questions is the presence of question words, which are closely related to the type of question asked and to its answer; because of this, most current question understanding methods use template matching to classify questions based on question word phrases. This approach faces several difficulties in Chinese. (1) The same question can be asked in several different ways, and the position of the question word is quite arbitrary. (2) Some interrogative phrases in Chinese do not clearly indicate the type of question, and sentences with the same interrogative phrase may not have the same question type; for example, "What is the reason for …?" is classified as a "what" question by its question word, i.e., asking about a thing, but the sentence actually asks about the cause of something. (3) Because of the polysemy of Chinese, some sentences contain question words used for other purposes and do not ask about anything at all, e.g., "He does not know everything."

In Chinese special interrogative sentences, the structure is characterised by interrogative phrases and certain special words, as well as by the order in which these words appear in the sentence and other constituent features. The characteristic words of an interrogative sentence include its interrogative phrase and the special words related to its type. When a person understands a question, he or she first identifies the characteristic words in the question, then classifies the question according to these words, the sentence pattern and the possible response pattern, and in turn identifies the subject of the question. Therefore, to understand a question accurately, it is necessary to analyse the sentence syntactically and even semantically, starting from the characteristic words and sentence patterns in the question. By identifying the syntactic patterns of interrogative sentences, it is possible to derive a classification of the question and to identify its centre [19].

Therefore, the analysis of interrogative sentences in this paper consists of three steps: classification of question words, segmentation and part-of-speech tagging of the interrogative sentence, and identification of the interrogative sentence pattern together with question classification.

3.1. Classification of Question Words

A group of interrogative phrases with the same meaning and usage forms a type of interrogative.

3.2. Classification of Issues

There is a correspondence between interrogative sentence patterns and question types. An interrogative pattern may correspond to one or more question types, and a question type may correspond to multiple interrogative patterns. Question classification requires identifying the question type to which an interrogative sentence belongs, as shown in Table 1.
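As a purely hypothetical illustration of this many-to-many correspondence (the pattern and type names below are invented for illustration and are not the contents of Table 1), the mapping can be stored and queried as follows:

```python
# Hypothetical mapping between interrogative sentence patterns and question types;
# names are illustrative only, not those used in Table 1.
PATTERN_TO_TYPES = {
    "WHERE_PATTERN":       ["LOCATION"],
    "WHO_PATTERN":         ["PERSON", "ORGANIZATION"],
    "HOW_MANY_PATTERN":    ["QUANTITY"],
    "WHAT_REASON_PATTERN": ["REASON"],   # "What is the reason for ...?" asks for a cause
}

def candidate_question_types(pattern: str) -> list[str]:
    """Return the question types associated with a recognised sentence pattern."""
    return PATTERN_TO_TYPES.get(pattern, ["UNKNOWN"])

print(candidate_question_types("WHO_PATTERN"))  # ['PERSON', 'ORGANIZATION']
```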

4. Combined Neural Network Model

This paper uses a combined neural network model with four components: an input module, a memory module, an extraction module and an output module. For an input interrogative structure $s = (x_1, \dots, x_n)$, where $x_i$ contains the word at position $i$ and its lexical (part-of-speech) property, the aim of this paper is to identify whether the interrogative structure around position $i$ is omitted. In the first three modules, the same operations are performed on the words and on the lexical properties of the interrogative structure (Figure 1), and the results of both are combined in the output module. The first three modules are presented below from the word perspective.

4.1. Input Modules

Using an unsupervised learning method such as Skip-gram [20], a word vector lookup table $L \in \mathbb{R}^{d \times |V|}$ is obtained, where $d$ is the dimension of the word vectors and $|V|$ is the size of the word vocabulary. In the input module, $L$ is queried with the input sequence to obtain the corresponding word embedding sequence $(e_1, \dots, e_n)$, where $e_i \in \mathbb{R}^d$ is the embedding of the word in $x_i$. If the current word does not exist in the lookup table, it is represented by the vector of the special symbol "UNK," which is randomly initialized.
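A minimal sketch of this lookup, assuming a PyTorch embedding layer and a hypothetical vocabulary dictionary `word2id` with a reserved "UNK" index (the names and dimensions are illustrative, not the paper's):

```python
import torch
import torch.nn as nn

# Hypothetical vocabulary; index 0 is reserved for the "UNK" symbol.
word2id = {"UNK": 0, "what": 1, "is": 2, "the": 3, "reason": 4}
d = 100                                    # word vector dimension (illustrative)
embedding = nn.Embedding(len(word2id), d)  # rows are randomly initialised here; in the
                                           # paper they would be loaded from Skip-gram

def encode(words):
    """Map a word sequence to its embedding sequence e_1 ... e_n."""
    ids = torch.tensor([word2id.get(w, word2id["UNK"]) for w in words])
    return embedding(ids)                  # shape: (n, d)

e = encode(["what", "is", "the", "cause"])  # "cause" is out of vocabulary -> UNK
print(e.shape)                              # torch.Size([4, 100])
```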

4.2. Memory Modules

A Bi-LSTM was chosen to obtain deep semantic and grammatical information about the question structure, with a two-layer Bi-LSTM for lexical (part-of-speech) inputs and a single-layer Bi-LSTM for word inputs.

The word embedding sequence is used as the initial input to the Bi-LSTM, which encodes it to obtain an abstract representation of the question structure. For the word vector $e_i$, the forward LSTM of the first layer produces the representation $\overrightarrow{h}_i^{1}$ and the backward LSTM produces $\overleftarrow{h}_i^{1}$. If a total of $N$ Bi-LSTM layers are used, this results in a memory module $M = (m_1, \dots, m_n)$ with memory segments $m_i = [\overrightarrow{h}_i^{N}; \overleftarrow{h}_i^{N}]$, where $[\cdot\,;\cdot]$ denotes the concatenation of vectors, i.e., the outputs of the $N$-th forward and backward LSTM layers are concatenated.
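A minimal sketch of the word-side memory module under these definitions, assuming PyTorch's bidirectional LSTM (hidden size and layer count are illustrative):

```python
import torch
import torch.nn as nn

class MemoryModule(nn.Module):
    """Encodes the embedding sequence into memory segments m_i = [h_fwd_i ; h_bwd_i]."""
    def __init__(self, d=100, hidden=128, num_layers=1):
        super().__init__()
        self.bilstm = nn.LSTM(input_size=d, hidden_size=hidden,
                              num_layers=num_layers, bidirectional=True,
                              batch_first=True)

    def forward(self, e):                 # e: (batch, n, d)
        m, _ = self.bilstm(e)             # m: (batch, n, 2*hidden); the last layer's
        return m                          # forward and backward outputs, concatenated

memory = MemoryModule()
m = memory(torch.randn(1, 4, 100))        # e.g. a 4-word question
print(m.shape)                            # torch.Size([1, 4, 256])
```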

4.3. Extraction Module

The extraction module consists of two steps: first, the omitted features of the question structure are extracted from the memory module; second, the omitted features are organized in a rational way. Specifically, two extraction modes are adopted: a GRU-based multi-attention mechanism and Max-pooling.

Multiple omitted features are extracted from the memory module by the multiple attention layers and organised using GRU units. It is generally believed that as the number of layers in a network increases, the model becomes more capable of capturing abstract feature information [21], which is more useful for recognising omitted question structures. Multiple attention layers allow different positions of the input to be attended to in different layers [22]. Using non-linear operations to combine the results of different attention layers prevents the extraction from being only a linear combination of the memory module [23]. The structure of this extraction module is shown in Figure 2.

The input to attention layer $t$ consists of the memory segment $m_j$, the previous hidden state $h^{t-1}$ of the GRU, the word vector $q$ of the question sentence, and the relative-position distance $d_j$ of word $j$ in the question sentence. The question word vector and the previous GRU hidden state guide the calculation of the score of each memory segment in the current attention layer, which helps to extract information related to the omission of the question structure. The relative-position distance helps to capture the syntactic and semantic association of word $j$ with "of" [24].

The attention score of each memory segment and the result of attention layer $t$ are first calculated as

$$g_j^{t} = W_a\,[m_j;\, h^{t-1};\, q;\, d_j] + b_a,\qquad \alpha_j^{t} = \frac{\exp(g_j^{t})}{\sum_{k}\exp(g_k^{t})},\qquad r^{t} = \sum_{j}\alpha_j^{t}\, m_j,$$

where $W_a$ is the weight parameter shared by all attention layers and $b_a$ is the bias term. Based on the experience of [25], the number of attention layers is set to 3.

A GRU unit is then applied between the attention layers; its final output organises all the omitted features in a rational way, and this feature vector is used as the output of the extraction module.
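A minimal sketch of this GRU-based multi-attention extraction under the formulation above (the scoring function, dimensions and the way the question vector and position distances are fed in are assumptions for illustration, not the paper's exact design):

```python
import torch
import torch.nn as nn

class GRUMultiAttention(nn.Module):
    """T attention layers over the memory segments, chained by a GRU cell."""
    def __init__(self, mem_dim=256, q_dim=100, pos_dim=1, hidden=128, T=3):
        super().__init__()
        self.T = T
        self.hidden = hidden
        # One scoring layer shared by all attention layers (W_a, b_a).
        self.score = nn.Linear(mem_dim + hidden + q_dim + pos_dim, 1)
        self.gru = nn.GRUCell(mem_dim, hidden)

    def forward(self, m, q, dist):         # m: (n, mem_dim), q: (q_dim,), dist: (n, pos_dim)
        n = m.size(0)
        h = torch.zeros(self.hidden)       # initial GRU hidden state h^0
        for _ in range(self.T):
            # Score each memory segment given [m_j; h^{t-1}; q; d_j].
            feats = torch.cat([m, h.expand(n, -1), q.expand(n, -1), dist], dim=-1)
            alpha = torch.softmax(self.score(feats).squeeze(-1), dim=0)   # (n,)
            r = (alpha.unsqueeze(-1) * m).sum(dim=0)                      # r^t: (mem_dim,)
            h = self.gru(r.unsqueeze(0), h.unsqueeze(0)).squeeze(0)       # non-linear update
        return h                           # output of the extraction module

extract = GRUMultiAttention()
out = extract(torch.randn(4, 256), torch.randn(100), torch.randn(4, 1))
print(out.shape)                           # torch.Size([128])
```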

The most important omitted features are extracted using Max-pooling, whose extraction structure is shown in Figure 3. Max-pooling is an element-level operation [26] and is calculated as

$$p_k = \max_{1 \le j \le n} m_{j,k},$$

where $p_k$ is the $k$-th element of the pooled feature and $m_{j,k}$ is the $k$-th element of memory segment $m_j$.

Since the recognition of semantically omitted question structures is a binary classification task and the vector extracted by Max-pooling has a large dimension, we add a feedforward neural network (FNN) layer and use its result as the output of the Max-pooling extraction module.
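A sketch of this Max-pooling branch under the same assumed dimensions (the FNN size is illustrative):

```python
import torch
import torch.nn as nn

class MaxPoolExtractor(nn.Module):
    """Element-wise max over memory segments, followed by a small FNN."""
    def __init__(self, mem_dim=256, out_dim=128):
        super().__init__()
        self.fnn = nn.Sequential(nn.Linear(mem_dim, out_dim), nn.ReLU())

    def forward(self, m):                  # m: (n, mem_dim)
        p, _ = m.max(dim=0)                # p_k = max_j m_{j,k}  -> (mem_dim,)
        return self.fnn(p)                 # reduced omitted-feature vector

pool = MaxPoolExtractor()
print(pool(torch.randn(4, 256)).shape)     # torch.Size([128])
```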

4.4. Output Modules

In the output module, we concatenate the word-based and lexeme-based omitted features of the question structure, $o_w$ and $o_p$, and feed them into a softmax layer to obtain the final classification probability:

$$P(y = i \mid d) = \mathrm{softmax}\big(W_o\,[o_w;\, o_p] + b_o\big)_i,$$

where $W_o$ and $b_o$ are the parameters of the classification layer. The model uses a cross-entropy loss function:

$$\mathcal{L} = -\sum_{d \in D} \log P\big(y = y(d) \mid d\big),$$

where $C$ is the set of interrogative structure types over which the softmax is computed, $D$ is the training data set, $P(y = i \mid d)$ is the probability that sample $d$ belongs to type $i \in C$ obtained from the model in this paper, and $y(d)$ is the label of sample $d$. We use back-propagation to compute the gradients of the parameters, the Adam algorithm for neural network optimisation, and dropout for regularisation [27].
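A sketch of the output module and one training step consistent with the loss above (the feature dimensions, dropout rate, learning rate and the toy batch are illustrative assumptions):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class OutputModule(nn.Module):
    """Concatenates word-based and lexeme-based omitted features and classifies them."""
    def __init__(self, word_dim=128, lex_dim=128, num_classes=2, p_drop=0.5):
        super().__init__()
        self.dropout = nn.Dropout(p_drop)
        self.classifier = nn.Linear(word_dim + lex_dim, num_classes)  # W_o, b_o

    def forward(self, o_w, o_p):                    # omitted features from both views
        z = self.dropout(torch.cat([o_w, o_p], dim=-1))
        return self.classifier(z)                   # logits; softmax is applied in the loss

model = OutputModule()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

# One training step on a toy batch (features and labels are random placeholders).
o_w, o_p = torch.randn(8, 128), torch.randn(8, 128)
labels = torch.randint(0, 2, (8,))
logits = model(o_w, o_p)
loss = F.cross_entropy(logits, labels)              # cross-entropy over softmax outputs
optimizer.zero_grad()
loss.backward()                                      # back-propagation
optimizer.step()                                     # Adam update
```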

5. Experimental Results and Analysis

In the experiment, the author used Baidu to retrieve a total of 8,000 results for various question phrases, and then selected 1,000 of these sentences as the test set after manually removing repetitive and advertising sentences. The results obtained with the method described in this paper are shown in Table 2. Test 1 is the result of testing the full set of test questions. The results of Test 2 were obtained after eliminating incorrectly parsed questions, because the syntactic analysis program used in the test was not very accurate and had a large impact on the results. The main causes of incorrect analysis in Test 2 are still related to syntactic analysis.

The accuracy of the test is not particularly high because the question set comes from real sentences that include non-questioning uses of question words; nevertheless, a comparison with a question classification method based only on question words (Table 3) shows that the proposed method is an improvement. The main reason for incorrect question understanding is still syntactic analysis errors, but the results obtained are satisfactory. In addition, analysis of the misunderstood sentences revealed that 79 questions were incorrectly understood, i.e., 7.9% of all test sentences and 75.96% of the incorrectly analysed test sentences.

Table 4 shows the experimental results compared with the baseline methods, where ACC is the accuracy over all question structures, nonelided-F1 is the F1 value for nonelided question structures, and elided-F1 is the F1 value for elided question structures. It can be seen that the recognition performance of the CRF grows with the size of the corpus, and that the GRU improves the recognition of elided question structures while seriously degrading the recognition of nonelided ones, whereas the proposed model outperforms both the CRF and the GRU [28].

In the following, the role of each module in the model is analysed through comparative experiments. The network structures are detailed in Table 5, where M1, M2 and M3 are the three settings of the memory module; L1 and L2 denote the use of single-layer and double-layer bidirectional LSTM networks in the memory module, respectively; None denotes that no extraction module is used and that the forward and backward memory segments at the question position are directly concatenated as the omitted features of the question structure and fed into the output layer; Att denotes that the multiple attention layers are replaced by a single attention layer, since the result of an attention layer is essentially a linear combination of the memory module, and without non-linear operations between multiple attention layers the final result would still be such a linear combination.

Figure 4 shows that all models classify question structures effectively, with accuracy above 97.5%. Figure 5 shows that all models identify nonelided question structures effectively, with F1 values exceeding 98.5%. Figure 6 shows that as the corpus grows, the amounts of data for elided and nonelided question structures gradually become balanced, and all models become better at recognising elided question structures, while their performance on nonelided question structures is not significantly affected.

In this paper, we focus on the results of recognising elided question structures. The following experiments are based on Data3, as Figure 6 shows that all models perform best on Data3. Table 5 shows the results of comparison experiments with different memory module settings; it can be seen that M3 + Max-pooling outperforms M2 + Max-pooling. That is, from the word perspective, words already contain both syntactic and semantic information and may not require more complex models that are more inclined to learn semantic information, such as the two-layer Bi-LSTM.

6. Conclusions

In this paper, the question comprehension method is combined with syntactic analysis of interrogative sentences, and the accuracy of question comprehension is improved. Semantic analysis has not been addressed, and the non-questioning use of question words has not been analysed further. Therefore, further work should improve the accuracy of question understanding by taking the non-questioning use of question words into account, and should raise the analysis and understanding of question sentences to a higher and deeper level by incorporating semantic analysis.

Data Availability

The data underlying the results presented in the study are available within the manuscript.

Disclosure

The author confirms that the content of the manuscript has not been published or submitted for publication elsewhere.

Conflicts of Interest

The author declares no conflicts of interest.