Abstract

Background and Objective. Adenovirus pneumonia is a severe disease in children. Constructing a prognostic model contributes to individualized treatment of children with adenovirus pneumonia. Thus, a machine learning model was constructed in this study to explore the clinical and baseline characteristics of pneumonia and predict the type of pneumonia. Methods. Children with bacterial pneumonia (41 cases) and adenovirus pneumonia (179 cases) hospitalized in Tianjin Children’s Hospital from January to October 2020 were selected. The differences in baseline and clinical characteristics between children with two pneumonias, respectively, were compared via the chi-square test and Wilcox test. The Least Absolute Shrinkage and Selection Operator (LASSO) model was applied to screen the pneumonia type-related characteristics. Patients were randomly divided into the training set (n = 154) and test set (n = 66). The logistic model was constructed using the screened characteristics in the training set to predict whether the cases are bacterial pneumonia or adenovirus pneumonia. Finally, the model was validated by receiver operating characteristic (ROC) curve and area under curve (AUC) in the test set. Results. The age (), hospital stay (), tonsil condition (), interleukin-6 (IL-6; ), and lactate dehydrogenase (LDH; ) between children with bacterial pneumonia and adenovirus pneumonia were significantly different. Sex, tonsil condition, age, hospital stay, r-glutamyltransferase (r-GT), and LDH levels were the factors associated with the types of pneumonia. Compared with bacterial pneumonia, children with adenovirus pneumonia were younger (OR = 0.207, 95% CI: 0.041–0.475), with longer hospital stay (OR = 7.974, 95% CI: 2.626–74.354) and higher LDH expression level (OR = 1.025, 95% CI: 1.006–1.060). 92.4% types of pneumonia were correctly predicted, and the AUC value of the model was 0.981. Conclusion. The LDH level was the associated factor to predict the types of pneumonia. Adenovirus pneumonia was associated with earlier age and longer hospital stay than bacterial pneumonia. The established model can well predict the types of pneumonia in children and provide clinical basis for guiding the individualized treatment of children.

1. Introduction

As a severe viral pneumonia, adenovirus pneumonia is mostly seen in children aged from 6 months to 2 years [1]. At the beginning of the disease, children often have persistent fever. After 3–5 days, children patients are likely to present dyspnea and systemic poison, some of whom suffered from diarrhea, vomiting, and even severe abdominal distension symptoms, while a small number of them suffered from conjunctival congestion and tonsil secretions [2]. Children at young age, with underlying disease or recurrent respiratory tract infection, are prone to severe case [3]. In addition, human adenovirus load and fever time are risk factors for pneumonia severity [4].

Lactate dehydrogenase (LDH) is an important enzyme involved in the anaerobic metabolism and ubiquitously present in almost all cells in the body, which responds to tissue damage in a nonspecific manner [5]. Serum LDH level may elevate due to hemolysis, cancer, and human immunodeficiency virus infection [6]. According to its characteristics, many studies reported the role of LDH in assessing the clinical severity of coronavirus disease 2019 (COVID-19) [7]. Moreover, the inhibition of LDH can reduce the conversion of glucose to lactate, thus reversing the Warburg effect to deprive cancer cells of their ability to survive in the hypoxic tumor microenvironment [8].

Least absolute shrinkage and selection operator (LASSO) is an effective embedded method of feature selection, which minimizes the sum of squares of residuals and adds regularized penalty term (L1) to estimate coefficients and screen variables simultaneously [9]. The model is a ridge regression model when the penalty term is L2, and elastic net integrates the penalty terms L1 and L2. K. Aheto et al. [10] applied three models to predict malaria prevalence, and these models screen 11, 15, and 13 characteristics out of 15 characteristics, respectively, with LASSO showing the smallest prediction error.

We constructed the prognostic model based on machine learning to evaluate the risk factors for adenovirus pneumonia and then validated the accuracy of this model.

2. Materials and Methods

2.1. Objects

This study selected 41 children with bacterial pneumonia and 179 children with adenovirus pneumonia who were hospitalized in Tianjin Children’s Hospital from January to October 2020. The exclusion criteria were as follows: children were less than 14 years old, and children were diagnosed with adenovirus pneumonia according to Guidelines for Diagnosis and Treatment of Adenovirus Pneumonia in Children (2019 Version) [2]. The study was approved by the Ethics Committee of Tianjin Children’s Hospital (L2021-19). As the study was a retrospective study and no clear information of patients appeared in the study, the informed consent was waived.

2.2. Data Collection

The baseline characteristics of children were collected, including age, sex, hospital stay, tonsil enlargement, conjunctival congestion, alanine aminotransferase (ALT), aspartate aminotransferase (AST), r-glutamyltransferase (r-GT), interleukin-6 (IL-6), LDH, CD4 T cell proportion, and CD8 T cell proportion.

2.3. Statistical Methods

The chi-square test was utilized to analyze the differences of enumeration data. The Shapiro–Wilk test was applied to detect the normality of measurement data. The t-test was chosen if the data satisfied the normal distribution, and the Wilcox test was employed or otherwise. The differences in LDH levels between children with adenovirus and bacterial pneumonia were compared via boxplots. The LASSO model and logistic model were utilized by the R software “glmnet” package. ROC curve drawing and AUC value calculation were based on the “plotROC” package and “pROC” package. value < 0.05 was considered statistically significant (Figure 1).

3. Results

3.1. Analysis of Differences

The differences of enumeration data between two groups, including sex, tonsil condition, and conjunctiva congestion, were analyzed by the chi-square test. No significant differences were found in the age and conjunctiva congestion (, ), while the differences in tonsil condition were statistically significant (). Children with adenovirus pneumonia were susceptible to tonsil enlargement.

Shapiro–Wilk confirmed that all measurement data did not fit the normal distribution. Thus, the Wilcox test was applied to compare the differences between the groups, as given in Table 1. The age (), hospital stay (), tonsil condition (), IL-6 (), and LDH () between two groups were significantly different. Children with adenovirus pneumonia were younger and had longer hospital stay and higher IL-6 and LDH levels. The results of boxplot (Figure 2) indicated that LDH level in children with adenovirus pneumonia was higher.

3.2. Construction of the LASSO Model

The LASSO model (Figure 3(a)) was constructed to analyze the characteristics that affected pneumonia types. 10-fold cross-validation was applied to select the corresponding model parameters as the classification error was minimum. In this model, the classification effect of the model was optimal when 6 characteristics were reserved (Figure 3(b), Table 2), that is, the types of pneumonia were related to sex, tonsil condition, age, hospital stay, r-GT, and LDH levels.

3.3. Analysis of the Logistic Regression Model

Children were randomized to the training set (n = 154) and test set (n = 66). The binary logistic model was constructed based on the 6 screened characteristics in the training set to predict the classification effect (Table 3). Age (), hospital stay (), and LDH () were statistically significant. According to odds ratio (OR), children with adenovirus pneumonia were younger (OR = 0.207, 95% CI: 0.041–0.475), with longer hospital stay (OR = 7.974, 95% CI: 2.626–74.354) and higher LDH level (OR = 1.025, 95% CI: 1.006–1.060), compared to bacterial pneumonia.

3.4. Model Assessment

The model was validated in the test set. According to the results of logistic regression, the type of pneumonia was predicted with 0.5 as the boundary. It was predicted as adenovirus pneumonia when the probability value of adenovirus pneumonia was bigger than 0.5; otherwise, it was bacterial pneumonia. The probability of correct classification of the model was 92.4%. ROC of the model was drawn (Figure 4), and AUC was 0.981, indicating that the model had high accuracy.

4. Discussion

The relevant characteristics of pneumonia types were analyzed in this study. Differential analysis confirmed that the age, hospital stay, tonsil condition, IL-6, and LDH levels were significantly different between children with bacterial and adenovirus pneumonias. LASSO was applied to screen the characteristics associated with pneumonia types, including sex, tonsil condition, age, hospital stay, r-GT, and LDH levels. The constructed logistic regression model showed that age, hospital stay, and LDH levels were statistically significant. Children with adenovirus pneumonia were younger and had longer hospital stay compared with children with bacterial pneumonia. AUC value of the model was 0.981, and the model had high accuracy. Yang et al. screened predictors for adenovirus pneumonia applying LASSO regression and developed nomogram, followed by the evaluation of nomogram discrimination via ROC curves, whose AUC value is 0.79 [11]. Whereas, the prediction model constructed in our study had higher accuracy, and we subsequently explored the correlation between LDH level and adenovirus pneumonia in children.

LDH level can reflect lung conditions [12], and serum LDH elevates in patients with severe adenovirus tract infection [13]. In this study, the significance of LDH level was confirmed in the process of statistical analysis, characteristics screening, and regression model construction. The results indicated that LDH was the relevant factor of adenovirus pneumonia, and children with adenovirus pneumonia had higher LDH expression, which was consistent with the results of Liu et al. [14]. In addition to indicating the existence of viral pneumonia, LDH level can also reflect the severity of pneumonia [15]. Liu et al. [14] and Wu et al. [16] demonstrated that LDH level was higher in patients with severe adenovirus tract infection.

Age and hospital stay were different in this study, and children with adenovirus pneumonia were younger and had longer hospital stay. The characteristics of young age of adenovirus patients were reflected in many studies. The Guideline of Diagnosis and Treatment of Adenovirus in Children pointed out that adenovirus pneumonia is mostly seen in children aged from 6 months to 5 years [2]. Infants under 6 months have immune ability due to maternal antibodies and develop their own antibodies as they grow, reducing the risk of adenovirus infection [17]. The longer hospital stay in children with adenovirus pneumonia may result from its severity. Adenovirus pneumonia has high mortality [18] and leads to multisystem complications [19]. Kuo et al. [20] found no difference in age and hospital stay between children with adenovirus and nonadenovirus, and the different results may be explained by the small number of samples (n = 48) included in their study.

The CD4/CD8 ratio varies with the immune dysregulation caused by viral infection [21]. But no significant difference was found between children with adenovirus and bacterial pneumonia in CD4, CD8, and CD4/CD8 levels. Mei et al. [22] studied children with viral pneumonia and found that children with viral pneumonia have higher CD4 and CD4/CD8 levels and lower CD8 levels compared to normal children.

In conclusion, it was found that LDH level might be a predictor of adenovirus pneumonia in children, which provides an idea to distinguish adenovirus pneumonia from bacterial pneumonia. Meanwhile, the analyses of age, hospital stay, tonsil condition, and IL-6 reveal the clinical characteristics of adenovirus pneumonia. However, only one model was used to screen the relevant factors of adenovirus pneumonia, and it will be more reliable to apply multiple models and compare the results to screen characteristics. The inclusion of more pneumonia types in future studies may enable LDH data to be more clinically useful.

Data Availability

No data were used to support this study.

Ethical Approval

This study was conducted in accordance with the Helsinki Declaration II and was approved by the IEC of Tianjin Children’s Hospital (L2021-19).

Written informed consent was obtained from individual or guardian participants.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

Yuqiao Liu conceptualized, involved in data curation, and wrote the original draft of the manuscript. Yang Shen developed methodology, visualized, and investigated the study. Botao Wei reviewed and edited the article.