Abstract
With the rapid development of computer technology, people have begun to combine virtual reality and other technologies to achieve scientific sports auxiliary training, to get rid of the state of traditional sports training purely relying on experience. The article proposes a deep learning BP neural network human body posture recognition algorithm and briefly introduces the human body motion posture. The purpose of this research was to use the powerful data processing, mining, and analysis functions of deep learning to train the massive data generated in competitive sports training and apply it to competitive sports training. It is committed to promoting the accuracy and analysis of competitive sports training. Refinement provides technical guidance for athletes’ training, promotes the scientific and informatized development of competitive sports training in China, and provides some reference methods for the research and application of deep learning in competitive sports training. The article’s research results show that (1) taking a rural area as an example, we recorded the exercise postures of rural athletes in five different states: static, upstairs, downstairs, walking, and running. Comparing the recognition rate and training time of The BP neural network algorithm, ABC-BP algorithm, AFS-BP algorithm, and ABC-AFS-BP algorithm, it can be found that in terms of recognition rate, ABC-AFS-BP algorithm, AFS-BP algorithm, and ABC-BP algorithm are better than traditional BP algorithm. Among them, the recognition rate of the ABC-AFS-BP algorithm is higher than that of the ABC-BP algorithm, but it takes slightly more time than the ABC-BP algorithm. In terms of training time, the ABC-BP algorithm takes less time, but the accuracy is lower than the ABC-AFS-BP algorithm; the ABC-AFS-BP algorithm has a greater improvement in time consumption than the AFS-BP model and can guarantee the recognition rate and accuracy, and the error rate curves of the four algorithms show that after 500 iterations of the training part, the iteration error value of the ABC-AFS-BP algorithm is the smallest. (2) We evaluated sports postures of athletes from a certain rural team and concluded that bad postures will have a certain impact on the body. Among them, more than 85% of athletes in football and basketball have pelvic rotation. The problem is that football players have reached 90% of the test sample. 60% of football players and basketball players have the problem of collapsed foot. The main problem of aerobic athletes is flat back and collapsed foot. More than 90% of badminton players have high and low shoulder problems, and more than 80% of them have neck problems, which is a very serious body posture problem. (3) Detecting the flexibility experiment of the BP posture detection algorithm, compared with the traditional motion posture recording method, we tested from the three aspects of recording motion accuracy, missed detection rate, and recording time. The result shows BP posture detection. The missing detection rate of the algorithm is low, basically maintained at about 2.0, the accuracy of recording actions is relatively high, generally maintained above 98%, and the highest is 99.15%, and the recording time is short, maintained at 3–4 minutes; comparing traditional posture detection with the BP attitude detection algorithm, the missed detection rate of the algorithm is relatively high, kept at 4–6, the action accuracy is lower than that of the BP attitude detection algorithm, generally kept at about 95%, and the recording time is kept at 5–6 minutes. The posture detection algorithm is more efficient.
1. Introduction
Moving target recognition and analysis is an important research direction in the field of computer vision, which is widely used in our lives, such as intelligent robots, video surveillance, medical education, sports competitions, national defense security, and other fields. Sports applications are also very smart. With the continuous development of computer technology, based on deep learning methods, a sports video key gesture extraction system can be designed to extract the key gestures of athletes in competitive sports training videos to assist athletes and coaches in training reasonably and efficiently and avoid complexity. The interference of the environment on the extraction improves the inaccurate situation of traditional coaches in judging and extracting key gestures based on experience. Target detection and tracking based on deep learning, first, is the problem of target detection, that is, determining the position of the target object in the image or scene, which is generally determined by the bounding box of the object. In response to such problems, the literature [1] proposed that RCNN adopts the method of selecting the domain to obtain the local candidate regions where the detection target may exist in the image, and then, these candidate regions are input into the convolutional neural network to obtain their characteristics, and the classifier is connected to feature map, whether the corresponding area belongs to the target to be detected is judged, and finally, regression is performed on the calibration frame to correct the position of the prediction frame, but RCNN has the problem of repeated calculation. Literature [2] introduced the spatial pyramid pooling layer into CNN and proposed SPP-Net, which reduces the limitation of the CNN network on the size of the input image and improves the accuracy. Literature [3] proposed to use inertial sensors to collect information about human motion and apply the collected information to analyze and recognize human motion. Literature [4] discussed the theoretical framework for studying the coordination strategy of standing posture. The framework consists of a musculoskeletal model of the lower limbs of the human body in the sagittal plane and a technology that geometrically visualizes how the constraints inside and outside the body affect movement. Literature [5] designed an action recognition method based on depth images. The algorithm projects the depth image in three projection planes, extracts the Gabor features from the three projection images, and uses these features to train the extreme learning machine classifier. The calculation efficiency of the algorithm is high, but the performance of recognition of small amplitude actions is not ideal. Literature [6] proposed a time-series deep belief network that can complete online human action recognition. This model solves the problem that the current deep belief network model can only recognize static images, but the training process of this model takes a long time to process, which affects the application performance of the algorithm for large-scale data sets, in addition to the time efficiency issue for large-scale data sets. Literature [7] used asymmetrical system deviation to model human movement information, and the algorithm introduces a posture labeling mechanism to further improve the recognition performance of small movements. To meet the recognition of large-scale data sets and small-scale actions at the same time, a human action recognition algorithm based on multifeature fusion and motion information is designed. Literature [8] learned manual features and deep learning features, and the manual features adopted an improved dense trajectory. Literature [9] used a convolutional neural network based on motion information. High recognition accuracy is achieved for small-range actions, but the amount of features that need to be analyzed is large, and it is difficult to apply to large-scale data sets. Literature [10] used an inverted double pendulum to simulate the human body in an upright posture. The human body posture is a more complicated problem in biomechanics. The model proposed in this article can effectively distinguish the different parameters of the body and effectively control the body’s movement posture. Literature [11] analyzed the experimental data, established a human body model, and conducted a simulation experiment on the OSG platform. When a person is exercising, there will be an angle between the limbs and the body. The accelerator is fixed on the arm, and the arm and torso can be calculated in real time. In recent years, athlete performance analysis has received widespread attention as a new anti-doping auxiliary judgment strategy, but many researchers have not had a clear understanding of this strategy in theory and practice. Literature [12] conducted in-depth research on performance analysis through literature review and inductive research methods. The literature takes some well-known athletes as examples and analyzes the results. The results show that whether there are athletes using doping can be quickly distinguished from the performance of the athletes, which can improve the efficiency of sports excitement detection. The purpose of the literature [13] is to determine which morphological indicators and how to affect postural stability and control through special tests. The results of the literature [14] can be used to enhance the digital human modeling motion generated for human motion simulation. Literature [15] analyzed the EMG response and joint movement of the legs in subjects who stood on a sinusoidal treadmill with their eyes closed.
2. Theory Introduction
2.1. The Status Quo of Rural Sports Development
At present, China’s rural sports venues are few and of low quality, and some areas even lack suitable venues for villagers’ activities. Rural sports venues and infrastructure are generally in poor condition; some villages have stadiums, but they have been idle for too long and no one is used. Even crops have already been planted, and the overall situation is not optimistic. This also causes farmers to watch TV as their main activity during leisure time, so conditions are not allowed to a certain extent, which limits the enthusiasm of villagers to participate in sports. Therefore, the relevant departments must take measures to solve these problems. The villagers do not have a deep understanding of the problem of sports and cultural exercises, and it is generally young people who participate in sports. Relevant governments have taken measures to solve these problems, but despite this, rural sports activities are still a weak link in China’s sports industry. The increasing sports health requirements in rural areas and the serious shortage of public sports venues in rural areas have restricted rural sports.
2.2. Classification of Human Motion Posture
The posture of the human body movement is the form displayed by the human body in the process of daily life activities. Whether a person’s body is healthy or not is closely related to the posture of the human body movement, so the posture of the human body movement has been widely concerned by the majority of researchers. The increase in user needs and people’s requirements for the quality-of-life force researchers to speed up the research on human body movement postures. These studies are of great significance to the timely discovery of the human body’s health [16]. The basic forms displayed by the human body in the process of daily life activities include lying, standing upright, sitting, walking, and running. In some complicated situations, it may be due to some irresistible external factors, or some injury or illness, etc., which may cause people to fall. Under normal circumstances, people call the most basic postures displayed by the human body in the process of daily life activities, including lying, standing upright, sitting, walking, and running, as ADL. In the process of life activities, some basic activities have to be carried out to take care of themselves and their families, protect the family and social environment, or participate in some social activities. These activities of daily life are basic life postures that human beings must always repeat to survive and live in this society. Each posture can be subdivided into many types, such as standing posture, which can be subdivided into standing upright, standing tilted forward, and standing tilted backward; lying flat includes lying on your back, lying on the left, lying on the right, and lying on the left and right sides. Sitting posture can be subdivided into sitting upright, leaning forward when sitting, leaning backward when sitting, and so on. Like walking, running is also affected by the surrounding environment, road conditions, and the physical condition of the runner. Running can also be subdivided into even running, slow running, and fast running. In some special circumstances, due to some complicated reasons, it may be due to some irresistible external factors, or some injuries or illnesses, etc., which may cause people to fall. The fall mentioned here refers to the uncontrolled fall of the body to the ground or to another plane that is not controlled by humans, occurs suddenly without preparation, or falls to another plane [17]. The World Health Organization has classified “falls” (including falls), as shown in Table 1.
2.3. Human Body Motion Posture Detection Algorithm
To carry out a three-dimensional simulation of the athlete’s movement posture to adjust the training method and improve the training level, this study studies the application of unlabeled 3D pose estimation method in training assistance system [18, 19]. While athletes interact with the computer through limb movement, it is difficult to establish a human body posture database because there is no corresponding marking technology support to provide a robust observation method for the computer. Therefore, this article adopts a partial estimation method, which divides the athlete’s overall posture into multiple partial postures, and estimates the posture of each part separately. The algorithm block diagram is shown in Figure 1.

3. Method
3.1. Deep Learning Features Based on Motion Information
Deep learning has been widely used in recent years [20, 21]. The time template can extract all the motion sequences of an image frame, and the difference between the video frames is used to calculate the motion information between the frames. The calculation formula of the time template is as follows [19]:
Among them, represents the number of video frames, represents the motion information of the th frame, represents the weight value of the th frame [22], and the value range of the weight value is [0, 255]. Formula (1) is transformed to get the following:
A fuzzy membership function is replaced to get the following:
The appropriate is selected, to enhance the saliency of the temporal motion information in the time template [23]; the model diagram of is shown in Figure 2.

Four are set to , defined by the following formulas:
3.2. Human Motion Gesture Recognition
The motion detection of the human body is carried out by detecting the confidence map of the joint points of the human body and the affinity domain of the human joint points of a two-dimensional vector [24], and the calculation formula is as follows:
Each loss function is weighted, and the loss function is as follows:
Among them, and are the actual labeled values, and the final objective function is as follows:
3.2.1. Confidence Graph Detection of Nodes
First, the confidence map of the th person is , and is the true value of the th joint point for the th person, and the predicted value at the point is expressed as a Gaussian function [25]:
The maximum operation is performed:
As shown in Figure 3, represent the real position of the th person’s joint point and joint point on the limb , and the vector field [26] at is as follows:

Here,
The unit vector representing the direction of the limbs satisfies the formula:
Limb length is calculated as follows:
Mean processing is calculated as follows:
The link reliability of the affinity domain along the line segment between two points is calculated as follows:
Limb prediction results are as follows:
3.2.2. Human Motion Gesture Recognition
For the training data set , where each sample includes -dimensional feature, namely , the class label set has categories, namely [27]; according to Bayes’ theorem:
According to the total probability formula:
can be converted to:
The naive Bayes classification model is as follows:
It is converted to:
4. Simulation Experiment and Data Analysis
4.1. Data Preprocessing
The article proposes a human body gesture recognition algorithm based on BP neural network, as well as the modified AEC-BP, ABC-BP, and ABC-AEC-BP algorithms. AEC-BP regards continuous video frames as a box and uses a three-dimensional convolution kernel for convolution. Through this structure, action information can be captured; ABC-BP uses single frame data and optical flow data, thus capturing motion information; ABC-AEC-BP is a convolutional neural network using two data streams for video behavior recognition. The video is divided into a static frame data stream and an inter-frame dynamic data stream. The static frame data stream can use single frame data, the inter-frame dynamic data stream uses optical flow data, and each data use a deep convolutional neural network for feature extraction. The BP algorithm is used to record the body’s action decomposition during walking and running, as shown in Figures 4 and 5.


Taking a certain rural area as an example, we recorded the sports postures of rural athletes in five different states: stationary, going upstairs, going downstairs, walking, and running. The recognition results are as follows:(a)AFC-BP algorithm in Table 2(b)ABC-BP algorithm in Table 3(c)ABC-AFC-BP algorithm in Table 4
Comparing the recognition rate and training time of the BP neural network algorithm, ABC-BP algorithm, AFS-BP algorithm, and ABC-AFS-BP algorithm, it can be seen from Table 5 that in terms of recognition rate, ABC-AFS-BP algorithm, AFS-BP algorithm, and ABC-BP algorithm are better than the traditional BP algorithm. Among them, the recognition rate of ABC-AFS-BP algorithm is higher than that of ABC-BP algorithm, but it takes slightly more time than the ABC-BP algorithm. In terms of training time, the ABC-BP algorithm takes less time, but the accuracy is lower than the ABC-AFS-BP algorithm. The error curves of the four algorithms are shown in Figure 7.


Through the comparison of Figure 7, it can be found that after 500 iterations of the training part, the iteration error value of the ABC-AFS-BP algorithm is the smallest, reaching the set accuracy, and the local searchability is strong, followed by BP neural network. The algorithm has the largest iterative error value, which is much different from the set accuracy, and the searchability is poor, which proves that the error is smaller when the ABC-AFS-BP algorithm neural network is used for data training and the training effect is better. At the same time, the ABC-AFS-BP algorithm and the ABC-BP algorithm curve decline faster, and a smaller error value is reached after fewer iterations. The simulation results confirm that the ABC-AFS-BP algorithm has better stability. The flexibility, fast convergence speed, and small error can largely avoid the problem of the BP neural network falling into local extreme value.
4.2. Simulation Experiment
We conducted a posture assessment of sports athletes from a certain team in rural areas. Because different sports have different power points, we selected football and basketball that consume more energy, as well as aerobics and badminton that consume relatively less energy. These 4 sports are taken as examples to study the effects of bad posture on the body in sports. We selected a total of 200 athletes. The experiment selected athletes whose gender, age, height, and weight were roughly the same, ignoring the influence of factors such as age, gender, and years of exercise on the athletes. The specific situation is shown in Table 6.
From the statistical data in Figure 8, we can conclude that bad postures in sports will have a certain impact on the athletes’ bodies. Among them, more than 85% of athletes in football and basketball have the problem of pelvic rotation. The problem is more with football players; in 90% of the test sample, 60% of football players and basketball players have collapsed arches. Football players even reach 90% of the test sample, 77% of football players and 37% of basketball players have O-legs.

From the data in Figure 9, we can conclude that the main problems of aerobic athletes are flat backs and collapsed arches. More than 90% of badminton athletes have high and low shoulder problems, and more than 80% have neck problems. Serious body posture problems and wrong exercise posture can cause damage to the body, so in the daily training process, you should pay attention to these problems.

4.3. Comparative Experiment
To test the flexibility of the BP posture detection algorithm proposed in the article, we have compared it with the traditional motion posture recording method. We tested the accuracy of recorded motion, missed detection rate, and recording time. The experimental results are shown in Table 7 and Table 8.
From the data in Figures 10 and 11, we can conclude that the missed detection rate of the BP attitude detection algorithm is low, basically maintained at about 2.0, and the accuracy of recording actions is relatively high, generally maintained above 98%, with a maximum of 99.15%. The recording time is shorter, keeping it at 3-4 minutes; the missed detection rate of the traditional attitude detection algorithm is relatively high compared with the BP attitude detection algorithm, which is kept at 4–6, and the accuracy of action is lower than that of the BP attitude detection algorithm. It is generally maintained at about 95%, and the recording time is maintained at 5-6 minutes. The BP detection algorithm is faster and more efficient.


5. Conclusion
The integrated development of urban and rural sports is the basic basis for the development of sports in China. Urban sports should promote the citizenization of migrant workers with its inclusiveness, and rural sports should attract urban residents to the countryside with its ecological nature, to meet the different needs of urban and rural residents for a better life. At this stage, focusing on creating rural ecological sports, promoting the two-way flow of urban and rural sports resources, then promoting the integrated development of urban and rural sports, and contributing to rural revitalization and the construction of new urbanization are the historical responsibility given to us by the new era. The model proposed in the article uses the powerful data processing, mining, and analysis functions of deep learning to train the massive data generated in competitive sports training and apply it to competitive sports training. It is committed to promoting the accuracy and analysis of competitive sports training. Refinement provides technical guidance for athletes’ training, promotes the scientific and information development of competitive sports training in China, and provides some reference methods for the research and application of deep learning in competitive sports training. The human body poses that estimation has a good application prospect in sports analysis, but the truly appropriate matching method still needs to be adjusted according to specific data sets and specific applications. In addition, how to effectively use these data after getting the key points of the human body is also a question that requires in-depth thinking. The article only applies basic mathematical operations to get some simple metrics, which is relatively rough. There can be two directions for further research: one is to continue to improve the accuracy of key point extraction, the other is to fully tap the value of the obtained data, make full use of information to explain the actual needs of sports, and help athletes better improve their training.
Data Availability
The experimental data used to support the findings of this study are available from the corresponding author upon request.
Conflicts of Interest
The authors declare that they have no conflicts of interest regarding this work.
Acknowledgments
This work was sponsored in part by the Research Project of Humanities and Social Sciences of Chongqing Municipal Education Commission (20SKGH273) and the Research Project of Higher Education’s Teaching Reform of Chongqing College of Humanities, Science and Technology (18CRKXJJG25).