Page 107 - Read Online
P. 107

Li et al. Cancer Drug Resist. 2025;8:31                                          Page 13 of 26





               Score = -0.0016 × ExpCD227 + 0.0106 × ExpRAB33A7 - 0.0008 × ExpCCDC37 - 0.08257 × ExpGYPE7 +
               0.0063 × ExpTREM17 - 0.1717 × ExpNFE27 - 0.19027 × ExpCEP295NL7 + 0.1462 × ExpHORMAD17 +
               0.1186 × ExpVSTM17 + 0.00057 × ExpMUC5B7.


               Based on the model, a risk score was assigned to the samples, and all samples were subsequently divided into
               high- and low-risk groups. The KM survival curves showed a significant difference in prognosis between the
               two groups (log-rank test P <​ 0.0001; Figure 5B). To validate the predictive performance of the model, we
               plotted the ROC curves to predict 1-, 3-, and 5-year survival rates. The results demonstrated that the model
               achieved an AUC of 1 for 1-year survival prediction [Figure 5C], 0.854 for 3-year survival [Figure 5D], and
               0.889 for 5-year survival [Figure 5E].

               To further validate the generalizability and robustness of the model, three external datasets, namely
               GSE46602, GSE70769, and GSE116918, were used. The model was applied to each dataset for risk scoring,
               survival analysis, and the KM curve plotting. The results showed significant differences in the survival time
               between the high- and low-risk groups in the GSE70769 [Figure 5F], GSE116918 [Figure 5G], and GSE46602
               [Figure 5H] datasets, with log-rank test P-values of 0.037, 0.0011, and 0.047, respectively. These findings
               provide strong evidence that the model exhibited good generalizability, stability, and robustness.


               Biological feature analysis between high- and low-risk groups
               At the transcriptome level, differential gene expression analysis was performed to compare the groups, with
               the high-risk group used as the control. After statistical screening, 1,796 genes demonstrated significant
               differential expression [Supplementary Table 7], with 1,587 genes upregulated and 209 genes downregulated
               [Figure 6A]. The expression heatmap [Figure 6B] indicated that the most significantly differentially
               expressed genes exhibited higher expression in the comparative group, with only a small number showing
               higher expression in the high-risk group. The GO and KEGG enrichment analyses were performed to
               elucidate biological functions. The KEGG enrichment analysis results [Figure 6C, Supplementary Table 8]
               showed that the top five significantly enriched pathways were primarily linked to muscle cells, including the
               calcium signaling pathway, neuroactive ligand-receptor interaction, dilated cardiomyopathy, hypertrophic
               cardiomyopathy, and adrenergic signaling in cardiomyocytes. The GO enrichment analysis results
               [Figure 6D, Supplementary Table 9] indicated that the top five significantly enriched GO terms were related
               to muscle systems, including sarcomere, contractile fiber, myofibril, muscle contraction, and muscle system
               processes. These findings suggest that the primary biological distinctions between the two groups are related
               to muscle cells or the muscular system.

               The analysis of genomic mutations in the high- and low-risk groups showed that the former group exhibited
               a higher gene mutation rate [Figure 7A], which was characterized by TP53 mutations, whereas SPOP
               mutations were predominant in the low-risk group [Figure 7B]. These findings reflected a higher level of
               malignancy in the high-risk group. The TMB is a crucial indicator of the effectiveness of immunotherapy.
               Therefore, we compared the TMB levels between the two groups [Figure 7C]. The results showed that the
               high-risk group had a slightly higher TMB, although this difference was not statistically significant (P = 0.19).
               Further analysis was conducted to determine any differences in the prognosis. Using TMB levels, all samples
               were reclassified into high- and low-TMB groups. The KM survival curves suggested that the low-TMB
               group exhibited a more favorable prognosis than the high-TMB group [Figure 7D]; however, this difference
               was not statistically significant (P = 0.12), indicating that TMB alone cannot serve as an independent
               predictor of prognosis in patients with PCa. To underscore the effectiveness of the prognostic prediction
               model constructed in this study, a combined analysis was performed, incorporating model scores (i.e., high-
               and low-risk groups) and TMB levels (i.e., high- and low-TMB groups). Four-stratified KM survival curves
               were plotted [Figure 7E], and the results showed a significant overall difference in the prognosis (P <​ 0.0001).


                                                           100
   102   103   104   105   106   107   108   109   110   111   112