Abstract

Introduction. Lung cancer ranks first among malignant tumors worldwide and is a leading cause of cancer-related mortality in both men and women. Combining tumor marker testing is a strategy to screen individuals at high risk of pulmonary cancer and minimize pulmonary cancer mortality. Therefore, tumor marker screening is crucial. In this study, we analyzed combinations of tumor markers for lung cancer screening using receiver operating characteristic (ROC) curve analysis. Methods. A retrospective descriptive study was conducted on patients diagnosed with lung cancer, as well as healthy and benign lung diseases, using data from the China Huludao Central Hospital database between January 2016 and July 2022. The t-test and ROC curve were utilized to assess the effectiveness of individual tumor marker and the combination of multiple tumor markers. Tumor markers are molecular products metabolized and secreted by tumor tissues, characterized by cells or body fluids. They serve as indicators of tumor stage and grading, monitor treatment response, and predict recurrence. Results. In this study, 267 healthy participants, 385 patients with benign lesions, and 296 patients with lung cancer underwent tumor marker screening. The sensitivity of five tumor markers—CEA, CYFRA21-1, NSE, pro-GRP, and CA125—was found to be <55%. This study revealed that a single tumor marker had limited value in lung cancer screening. However, combining two or more markers yielded varying area under the curves (AUC), with no significant impact on screening accuracy. The combination of CEA + CA125 demonstrated the highest accuracy for lung cancer screening in healthy participants. At a cutoff of 0.447 for CEA + CA125, the combination showed a sensitivity of 0.676 and specificity of 0.846 for lung cancer screening. Conversely, for patients with benign lung lesions, the optimal combination was CEA + NSE, with a cutoff of 0.393, yielding a sensitivity of 0.645 and specificity of 0.766 for lung cancer screening. Conclusion. The five tumor markers—CEA, CA125, CY211, NSE, GRP—show promising results in screening healthy individuals and patients with lung cancer. However, only CEA, NSE, and GRP effectively differentiate patients with benign lung lesions from those with lung cancer. A single tumor marker has limited utility in detecting and screening for lung cancer and should be combined with other tumor markers. CEA + CA125 emerges as a superior tumor marker for distinguishing healthy individuals from those with lung cancer, whereas the CEA + NSE combination is more effective in identifying tumor markers in patients with benign lung lesions and lung cancer.

1. Background

Lung cancer ranks as among the most prevalent malignant tumors worldwide, with an annual incidence of approximately 1.82 million cases and 386,000 deaths, predominantly affecting men. The causes of lung cancer are multifactorial, including environmental factors such as air pollution, smoking, genetic predisposition, chronic lung diseases, and occupational exposures [1]. Due to its high incidence and poor prognosis, pulmonary carcinoma is a significant contributor to cancer-related mortality globally, posing a growing disease burden in many countries [2]. In China alone, lung cancer accounts for about 787,000 annually [3], making it the most prevalent cancer and a leading cause of cancer-related deaths, with a 12.16% increase in incidence over the past decade [4]. Efforts to address the escalating burden of cancer in China face challenges due to the rapid aging of the population and cumulative exposure to risk factors [5]. If detected at an early stage, surgical resection of lung cancer offers a favorable prognosis, with a 5-year survival rate of 70%–90% for small localized tumors. However, most patients (approximately 75%) are diagnosed with advanced disease, and despite recent advancements in oncological treatment, survival rates remain low [6]. Early detection through screening is crucial for improving the survival rate of patients with lung cancer. Although various screening methods exist, early detection remains challenging [7].

Therefore, the timely identification of pulmonary cancer is imperative. Screening tests for lung cancer play a crucial role in identifying patients at an early stage, facilitating potentially curative local resection. Tumor markers present in clinical specimens hold promise for reducing lung cancer mortality. Furthermore, in cases where cancer is diagnosed at an advanced stage, sensitive molecular markers may can aid in clarifying the diagnosis without necessitating additional invasive diagnostic procedures. Early diagnosis and confirmation of tumor biomarkers can improve the outcomes of lung cancer treatment. The precise early detection of tumor biomarkers can enhance the effectiveness of pulmonary cancer diagnosis and lung cancer detection. Simultaneously, early screening can reduce the potential damage to individuals who are less likely to develop lung cancer [8]. The measurement or identification of tumor markers, which are molecules that occur in the blood and tissues and are related to cancer, or are the result of a host response to cancer, is highly valuable for clinical diagnosis and patient management [9]. Neurocyte-specific enolase (NSE), carcinoembryonic antigen (CEA), cytokeratin fragment (CYFRA21-1), pro-gastrin-releasing peptide (pro-GRP), and carbohydrate antigen 125 (CA125) are routine markers recommended by the National Academy of Clinical Biochemistry (NACB) guidelines for laboratory medicine in lung cancer. However, these five indicators were not systematically assessed in the same cohort [10].

CEA is a serum glycoprotein commonly used as a marker for lung, breast, and colorectal cancers. Normally absent in healthy adults, CEA exists as a ligand for L-selectin and E-selectin. Increased CEA concentrations are observed in smokers and individuals with nonneoplastic illnesses [11]. CA125 is a mucus glycoprotein primarily associated with ovarian cancer [12], but it is also utilized in monitoring cancer progression and has emerged as a primary biomarker screening candidate [13]. Serum CA 125 concentration correlates with TNM stage, histological type, and survival rate in patients with lung cancer [14]. CYFRA21-1, a cytokeratin 19 soluble fragment, is a newly discovered monoclonal antibody with comparative diagnostic value in detecting nonsmall cell lung cancer (NSCLC), particularly in patients with squamous cell carcinoma (SCC) [15]. Serum CYFRA21-1 shows promise as a biomarker for differentiating small cell lung cancer (SCLC) from NSCLC [16]. NSE, also known as enolase 2 (ENO2), is a member of the enolase family expressed by mammalian neurons and neuroendocrine cells [17]. NSE is already utilized as a biomarker in the differential diagnosis of SCLC. Additionally, NSE holds particular prognostic value for predicting outcomes in patients with severe craniocerebral trauma [18]. NSE is a well-established marker, and its serum levels are utilized to support the initial diagnosis of SCLC. Numerous studies have demonstrated that NSE exhibits high diagnostic power in patients with SCLC [19]. Gastrin-releasing peptide (GRP) was first isolated from the stomach of pigs in 1978 and has since been found to be distributed in various organs and tissues, including the fetal lung and lung cancer, with good sensitivity and specificity [20]. In healthy adult participants, GRP levels increase with age, and smokers typically exhibit higher concentrations compared with nonsmokers. Additionally, GRP levels rise rapidly in patients with SCLC during advanced stages of the disease [21]. Biomarkers of lung pose challenges in terms of sensitivity and specificity, hindering their effectiveness in detection and diagnosis [22]. Due to the sensitivity and specificity of individuals biomarkers, various biomarker methods have been developed for cancer detection instead of relying solely on a single biomarker approach [23].

Most studies have primarily focused the test results of individual tumor markers in lung cancer screening, with limited investigation into the combined detection of tumor markers [24]. Conversely, this study examined the efficacy of combined detection using various tumor markers, which is critical for selecting optimal markers for lung cancer screening [25]. By using receiver operating characteristic (ROC) curves, the cutoff values for marker combinations can be adjusted to achieve a balance between sensitivity and specificity. However, in other studies, fixed cutoff values were utilized, resulting in sensitivity and specificity levels that may not always meet the requirements for effective screening [26].

Nevertheless, screening for various tumor markers may pose potential risks, including a higher rate of false positives, decreased specificity, and other hazards [27]. Therefore, although cotesting with tumor markers has been acknowledged as an effective diagnostic aid for tumors, it cannot be assumed that utilizing more markers leads to greater diagnostic efficiency [28]. To date, no organization has thoroughly examined which tumor markers should be used in conjunction with screening, both in healthy individuals and those with benign lung disease. In this study, we utilized a combination of ROC curves and binary logistic regression analysis to identify the optimal combination of markers suitable for detecting pulmonary carcinoma.

2. Materials and Methods

2.1. Patients

This case–control study was conducted at Huludao Central Hospital, Huludao city, Liaoning province, between January 2016 and July 2022. The study included all patients who underwent physical examinations for lung cancer at the hospital during the 6-year period. We also included 296 patients with lung cancer, comprising 149 males (50.33%) and 147 females (49.66%), with a mean age of 62.93 ± 11.12 years. Additionally, 384 patients with benign pulmonary lesions were included, with a mean patient age of 67.09 ± 14.28 years, including 188 males (48.9%) and 196 females (51.0%). Benign lung lesions encompassed conditions such as pneumonia, lung infection, chronic obstructive pulmonary disease, and bronchial asthma. Pulmonary samples were obtained from 267 healthy controls, of which 131 were male (49.0%) and 136 were female (51.0%), with a mean age of 58.82 ± 12.79 years. All selected participants provided informed consent, and the study received approval from the Institutional Review Board of the hospital.

2.2. Methods

This retrospective descriptive research utilized available clinical data to summarize and analyze retrospectively [29]. The advantages of this approach include requiring fewer human resources, minimal funding requirements, noninterference with normal clinical work, and a short research period. In this retrospective cross-sectional study, data were collected from individuals who had undergone lung cancer screening or treatment at our hospital over the past 6 years, including 267 healthy individuals, 384 patients with benign lung lesions, and 296 patients with lung cancer. These data were collated and analyzed retrospectively, and the selection of tumor markers for lung cancer screening was analyzed using ROC curves and binary logistic regression to identify for the most effective tumor marker combinations. All participants were administered 4 mL of fasting venous blood, centrifuged at 3,000 r/min for 10 min. The levels of CEA, CYFRA21-1, SCC-Ag, pro-GRP, and CA125 were measured using a chemiluminescence instrument, strictly following the manufacturer’s instructions. The following reference ranges were applied: CEA > 5 ng/mL, CA125 > 35 U/mL, CYFRA21-1 > 3.3 ng/mL, pro-GRP > 65.7 pg/ml, and NSE > 16.3 ng/mL.

SPSS.21 software (IBM Corp. Released 2012. IBM SPSS Statistics for Windows, Version 21.0. Armonk, NY: IBM Corp.) and MedCalc® Statistical Software version 19.6.4 (MedCalc Software Ltd., Ostend, Belgium; https://www.medcalc.org; 2021) were employed for statistical analysis of the data in this study. Continuous variables were expressed as mean ± standard deviation (x ± s), and comparisons of changes in tumor marker concentrations between groups were performed using t-test or chi-square analysis. Significant differences were considered when . The predictive probability of lung cancer diagnosis by combining tumor markers was assessed using binary logistic regression, and ROC curves were generated with the predictive probabilities as variables. The area under curve (AUC) was calculated as the area enclosed by the coordinate axis under the ROC curve and was utilized to evaluate the statistically significance of each marker [30]. The ROC curve is a graphical representation of true positive rate versus false positive rate (FPR) for various threshold settings. The ROC curve is a function of sensitivity and FPR, playing a central role in assessing the diagnostic power of a test to differentiate between the true state of a participant, determine the optimal cutoff value, and compare two alternative diagnostic tasks when each task is performed on the same participant [31]. ROC curves are extensively employed in clinical epidemiology to assess the diagnostic efficacy of serum markers and differentiate between diseased and healthy participants. ROC curves were employed to assess the diagnostic efficacy of serum markers and classify the tests between diseased and healthy participants, a method widely used in clinical epidemiology. When the AUC is >0.7 it indicates that the marker has good diagnostic value. An AUC ranging from 0.5 to 0.7 indicated an average diagnosis. However, when the area was <0.5, the diagnosis was not statistically significant [32].

3. Results

Implications of single tumor markers for lung cancer screening. A t-test analysis of the data from all tests in this study (Table 1) revealed that the concentrations of all five tumor markers, including CEA (), CA125 (), NSE (), GRP (), and CY211 (), were dramatically different when lung cancer tumor marker testing was performed in healthy populations versus lung cancer patients. Conversely, when comparing participants with benign lung disease with those with pulmonary carcinoma, no statistically significant variation was noted in the concentrations of the two tumor markers, CA125 () and CY211 (). In contrast, the concentrations of the tumor markers CEA (), NSE (), and GRP () were significantly higher in patients with pulmonary cancer than in those with benign lung disease. This study also compared the ages of healthy individuals, patients with benign lung lesions, and patients with lung cancer and found that healthy individuals tended to be younger than patients with lung cancer (), whereas patients with benign lung lesions tended to be older than patients with lung cancer (). As shown in Figure 1, ROC curves were constructed with the five tumor markers as test variables for the healthy group and patients with benign lung cancer, and the AUC was calculated. The diagnostic efficacies of various tumor markers calculated using the ROC curve are displayed in Table 2. In a comparison between the healthy population and patients with lung cancer, CEA, CA125, and CY211 had good diagnostic effects in lung cancer screening (AUC > 0.7), whereas NSE and GRP had limited diagnostic value for pulmonary cancer (AUC < 0.7). For patients with benign lung lesions, CA125 and CY211 had no application in lung cancer screening (AUC < 0.5), and the diagnostic importance of the remaining three markers remained weak (AUC < 0.7).

Comparison of the sensitivity, specificity, Jorden index, and AUC of combined tumor markers in lung cancer screening in a healthy population. ROC curves were established by analyzing tumor marker pulmonary cancer screening in healthy individuals using multivariate binary logistic regression. Table 3 illustrates the lung cancer screening performed using the biomarkers in healthy individuals, whereas the corresponding ROC curves are displayed in Figure 2. The results revealed that 10 permutations (CEA + CA125, CEA + CA125 + NSE, CEA + CA125 + CY211, CEA + CA125 + GRP, CEA + NSE + GRP, CEA + CA125 + CY211 + NSE, CEA + CA125 + CF21 + GRP, CA125 + CY211 + NSE + GRP, CEA + CA125 + NSE + GRP, and CEA + CA125 + CFA21 + NSE + GRP) exhibited significantly higher AUCs than the other combinations, with negligible differences observed among their AUCs. No obvious discrepancy was found between the minimal AUC (CEA + CA125; marked with ▴) and the maximal AUC (CEA + CA125 + CY 211 + NSE + GRP; marked with ★) among these 10 combinations (Z = 1.684, ). The sensitivity () and negative predictive value () of CEA + CA125 were lower than those of CEA + CA125 + CY211 + NSE + GRP, whereas the specificity () and positive predictive value () were significantly higher. The Jorden index for CEA + CA125 was also lower than that for CEA + CA125 + CY211 + NSE + GRP.

The ROC curves of the combined pulmonary carcinoma screening markers in patients with benign lung disease are depicted in Figure 3. The combinations of multiple markers employed for the evaluation of lung cancer detection are shown in Table 4. Because CA125 and CY211 did not contribute significantly to differentiating benign lung lesions from lung cancer, these two tumor markers were not included in our analysis. As the results indicated, two combinations (CEA + NSE and CEA + NSE + GRP) exhibited an AUC > 0.7, and no obvious difference was noted between the comparison of AUC of CEA + NSE (marked with ▴) and CEA + NSE + GRP (marked with ★; Z = 1.019, ). The sensitivity of CEA + NSE was lower than that of CEA + NSE + GRP (), but its specificity was higher than that of CEA + NSE + GRP (). Moreover, the differences in the PPV and NPV were not significant ( and 0.163), with the latter having a higher Jorden index than the former.

The study examined the impact of age, sex, and various other factors on the selection of tumor markers for lung cancer. According to the World Health Organization (WHO), individuals aged >65 years are considered older adults [33]. In this investigation, participants were categorized based on their age, and the cutoff value was 65 years. Each group was further subdivided into by sex. Finally, patients with pulmonary cancer or benign lung disease and healthy participants were classified into 12 panels. Individual subgroup sizes are displayed in Table 5. To investigate the effects of seven factors, including NSE, GRP, CA125, CY211, CEA, age, and sex, on lung cancer screening status, we performed a multivariate binary logistic regression analysis with these seven factors as independent variables and lung cancer screening status as the dependent variable. For this study, age = 1 for individuals aged <65 years and age = 2 for those aged ≥65 years.

The logistic regression model included sex, age, CEA, CYFRA21-1, NSE, pro-GRP, and CA125 as independent variables, with the presence or absence of the disease as the dependent variable coded as 0 for no and 1 for yes. Binary logistic regression revealed that in healthy participants, CEA (), CA125 (), GRP (), and age () were significant factors affecting lung cancer screening. However, sex (), CY211 (), and NSE () levels did not significantly influence lung cancer screening results in healthy participants. Conversely, for patients with benign lung lesions, CEA (), NSE (), GRP (), and age () were influential factors in lung cancer screening, whereas sex (), CA125 (), and CY211 () did not significantly impact the screening of patients with benign lung lesions. Further details are presented in Tables 6 and 7.

To analyze the screening of tumor markers for lung cancer diagnosis, we divided healthy participants and patients with benign lung lesions into age-based cohorts and analyzed their ROC curves. The AUCs of various combinations of tumor markers for lung cancer diagnosis in different populations are shown in Table 8. The calculated ROC curve is presented in Figure 4. We calculated the AUC values based on each group, identified the top three combinations (marked in bold), and compared their AUC values to determine the optimal tumor marker combination for lung cancer screening in different groups (a1, a2, a3, d1, d2, and d3). In healthy participants, the CEA + CA125 combination displayed the largest AUC value among all combinations, regardless of age, and CEA + CA125 is evidently the optimal tumor marker combination for lung cancer screening among healthy participants. Conversely, CEA + NSE is the optimal combination for screening for lung cancer in patients with benign lung diseases. Age is crucial for both the differentiation of healthy individuals from those with lung cancer, and the differentiation of patients with benign lung disease from those with lung cancer.

The cutoff values for the optimal marker combinations for lung cancer screening and their corresponding sensitivity and specificity, along with the results of pulmonary cancer screening utilizing the combinations of CEA + CA125 and CEA + NSE in healthy participants and patients with benign lung diseases are outlined in Table 9. The sensitivity of each combination (SEN) and its corresponding specificity (SPE) were measured using the maximum Jorden index. In healthy individuals, CEA + CA125 displayed a sensitivity of 0.676 and specificity of 0.846 for lung cancer at a cutoff value of 0.447. Conversely, for patients with benign lung disease, CEA + NSE demonstrated a sensitivity of 0.645 and a specificity of 0.766 for lung cancer at a cutoff value of 0.393.

Comparison with other lung cancer tumor marker selection studies reveals that this study employed multivariate binary logistic regression analysis and ROC curves to select tumor marker combinations for lung cancer screening, a method informed by previous research. Unlike other studies where threshold values for tumor markers were derived from reagent company recommendations without ROC curve analysis, this study rigorously determined cutoff values using ROC curves. This study identified CEA + CA125 as the most effective combination for lung cancer screening in healthy individuals, with a sensitivity of 0.676 and specificity of 0.846. Compared with studies without ROC curve analysis, this approach resulted in significantly higher sensitivity (), whereas specificity remained relatively unchanged (). Similarly, for patients with benign lung lesions, CEA + NSE emerged as the optimal tumor marker combination, with a sensitivity of 0.6453 and specificity of 0.7662. Compared with nonROC curve methods, this approach led to significantly higher sensitivity () but lower specificity (). All findings are shown in Table 10.

4. Discussion

Lung cancer stands as the most common malignant tumor in China and carries the highest mortality rate among all malignant tumors [34]. Therefore, conducting lung cancer screening is crucial. To the best of our knowledge, this is the first screening study to explore tumor marker binding between healthy patients and those with pulmonary cancer, as well as between patients with benign lung lesions and those with lung cancer.

Although the selection of tumor markers for lung cancer has remained stable over the past 10 years, some changes have occurred. The 2011 guidelines recommended the use of CEA, NSE, CYFRA21-1, and SCC [35]. The primary lung cancer markers recommended in the 2015 lung cancer guidelines include CEA, NSE, CYFRA21-1, pro-GRP, and SCC [36]. In the 2018 lung cancer guidelines, the recommended tumor markers did not change [37].

In this study, we aimed to assess the efficacy of tumor markers in detecting lung cancer. CEA, CY211, CA125, NSE, and GRP are relevant tumor markers for lung cancer [38]. These markers are automated and suitable for large-scale lung cancer detection [39]. To identify the optimal combination for pulmonary cancer screening, ROC curves were constructed, and the diagnostic value was evaluated using the AUC. Logistic regression was employed to consolidate various markers into a single variable for ROC curve analysis, and ROC curves were subsequently generated based on these probabilities.

Five tumor markers, namely CEA, CY211, CA125, NSE, and GRP, exhibited significance in distinguishing healthy patients from those with lung cancer. Conversely, CEA, NSE, and GRP demonstrated significance in discriminating benign lung lesions from lung cancer, whereas CA125 and CY211 did not. CEA, CA125, and CY211 were effective in differentiating healthy participants from those with lung cancer (AUC > 0.7), whereas NSE and GRP were not (AUC < 0.7). However, none of the tumor markers were effective in differentiating benign lung disease from lung cancer (AUC < 0.7). These findings indicate that the detection of a single tumor marker alone is insufficient for the diagnosis of lung cancer.

Therefore, this study utilized combinations of tumor markers for lung cancer screening. To explore the diagnostic efficacy of each combination, the results were analyzed using binary logistic regression and ROC curve analyses. In the healthy population, the AUC increased with an increase in tumor markers. However, this does not imply that more tumor markers in the mix are necessarily better. Sensitivity and specificity are known to be inversely related, as sensitivity increases, specificity tends to decrease. Among the 10 combinations selected with an AUC > 0.820, only the CEA + CA125 combination comprised two markers, where the remaining combinations consisted of various markers. Typically, when no significant difference exists in the diagnostic value between a larger number of combinations and a smaller number of combinations for lung cancer screening, a smaller number of combinations should be selected for testing [40]. This study compared the AUC values between different tumor markers in a healthy population using MedCalc software and found no significant differences between CEA + CA125 and other combinations, aligning with previous research findings. Furthermore, the sensitivity and PPV of CEA + CA125 were found to be the highest among all combinations in this study. Consequently, we concluded that dual tumor markers are superior to multiple tumor markers for lung cancer screening in healthy populations. CEA + CA125, in particular, emerged as the most effective of combination among all tumor marker combinations and should be prioritized for use. We applied a similar approach for lung cancer screening in patients with benign lung lesions. Our analysis revealed only two combinations with an AUC > 0.7: CEA + NSE and CEA + NSE + GRP. Notably, no significant difference was observed in the AUC between these two combinations, whereas the specificity of CEA + NSE was higher than that of CEA + NSE + GRP. Therefore, we concluded that CEA + NSE represents the optimal combination for the selection of tumor markers for lung cancer screening in patients with benign lung lesions. Other studies have corroborated the effectiveness of the CEA + CA125 combination in screening for lung cancer among healthy individuals. To the best of our knowledge, this is the first study to examine the combination of tumor markers for screening lung cancer in patients with benign lung disease.

Next, we investigated the tumor marker combinations using multivariate binary logistic regression and ROC curves. We analyzed the diagnostic efficacy of the markers across different age and sex groups. Our investigation revealed studies that used binary logistic regression to demonstrate the value of CEA, CA125, pro-GRP, and age in distinguishing between healthy patients and those with lung cancer. In contrast, CEA, NSE, pro-GRP, and age were deemed valuable in the diagnosis of patients with benign pulmonary lesions. Given our finding that dual tumor markers outperformed multiple tumor markers in lung cancer screening, we focused our ROC curve analysis solely on individual and paired markers. Notably, regardless of sex and age, CEA + CA125 emerged as the optimal choice for screening healthy individuals. When the cutoff point of CEA + CA125 was set at 0.447, this combination exhibited sensitivities of 0.676 and 0.846, respectively. In patients with benign lung lesions, age and sex did not affect the combination choice, with CEA + NSE proving to be the most efficacious option. With a cutoff value of 0.393 for CEA + NSE, the combination displayed a sensitivity of 0.645 and specificity of 0.766 for lung carcinoma screening. Importantly, age and sex had no impact on the selection of tumor marker combinations, whether in screening healthy individuals and patients with lung cancer or in screening benign lung lesions versus lung cancer. Compared with other studies that did not use ROC curve analysis, our study demonstrated a remarkable enhancement in sensitivity while maintaining a relatively stable specificity when screening for pulmonary carcinoma in healthy individuals. However, in patients with benign lung lesions, the sensitivity was significantly higher, whereas the specificity was significantly lower. One significant advantage of our study is that the ROC curve can be employed to adjust the cutoff values of tumor markers to balance sensitivity and specificity. In contrast, previous studies often used fixed cutoff values, making it challenging to meet the required sensitivity and specificity criteria.

The focus of our study differs from that of other lung cancer screening studies. In other lung cancer screenings, the discussion focused on the sensitivity and specificity of the marker combinations without specifying the tumor markers selected for combination and lung cancer screening. Consequently, researchers may experience uncertainty when selecting tumor marker combinations. This study seeks to address this gap by estimating the diagnostic value through the comparison of correlations between different combinations of tumor markers and clinical symptoms.

This study employed a cross-sectional approach, which has inherent limitations in establishing causal relationships. Future research should prioritize longitudinal exploration. Longitudinal research methods can help estimate the longitudinal trajectory characteristics of changes in individuals over time [41]. By extracting individual-specific longitudinal features, longitudinal prediction models containing more information have been proposed. These models enable clinicians to make more accurate predictions by leveraging repeated measurements of tumor markers. A longitudinal model can facilitate dynamic predictions by considering the time dependance of extracted features. For targeted patients, the predicted risk can be continuously updated over time to reflect the potential changes in prognosis.

In this study, five serological markers were used to identify the optimal combination for lung cancer screening across various groups. The study also assessed the optimal combination cutoff values, sensitivity, and specificity through a combination of ROC curves and logistic regression analysis. Our findings indicate that using dual markers outperforms the use of multiple markers for serological tumor marker screening. The combination of ROC curves and logistic regression analysis proves to be effective in identifying neoplastic markers. Furthermore, this study suggests that the method developed herein may have promising applications for tumors beyond lung cancer and other non-neoplastic diseases.

5. Conclusion

The five tumor markers, CEA, CA125, CY211, NSE, and GRP, demonstrated validity in screening both healthy individuals and patients with pulmonary carcinoma. However, only CEA, NSE, and GRP levels were effective in distinguishing patients with benign lung lesions from those with lung cancer. Conversely, single tumor markers showed limited effectiveness in lung cancer screening, highlighting the necessity of combined tumor marker testing. Specifically, CEA + CA 125 proved effective in identifying lung cancer in healthy patients, whereas CEA + NSE was effective in identifying benign lung lesions in patients with lung cancer.

Data Availability

Data used to support the findings of this study are available from the corresponding author upon request.

Additional Points

Limitations. Although this study holds significance in detecting cancer at an early stage, aiding in the identification of cancer in asymptomatic individuals, aiding in the identification of cancer in asymptomatic individuals, and facilitating earlier diagnosis, there are several limitations to consider. First, due to the cross-sectional survey design employed in this study, establishing causal relationships between the variables was not possible. Therefore, future research utilizing a longitudinal design will be necessary to establish the causal relationships between these variables. Second, this study was conducted solely in a city in Liaoning province, located in the northeastern region of the country. Consequently, caution should be exercised when generalizing the present results to other regions with different conditions until the findings are replicated in diverse settings. Nevertheless, we did not further classify patients with lung cancer. Several studies have found that serum tumor markers are helpful in the differential diagnosis of pathological lung cancer types [42]. We believe that after diagnosing suspected cancer using serology and imaging, tissue biopsy should be performed to confirm the histological classification of lung cancer based on morphology and special stains [43]. Single serological markers are of little value for predicting the histological subtypes of lung cancer [44]. Finally, biological variability existed among individual patient samples and a wide range of biomarker concentrations across all comparison patients. Inferring diagnostic patterns unique to lung cancer status is a challenge; therefore, standardization of tumor marker analysis must be addressed. Future research will focus on the histopathological classification of lung cancer in patients using tumor markers. This will aid clinicians in treating lung cancer based on changes in tumor marker concentrations.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Authors’ Contributions

JiaChen Lu, YingYing Yu, YaoHui Yi, and Ling Zhou contributed equally to this study.