Every year, more than 35,000 American men die of prostate cancer, most of them not from the tumor that was first detected in their prostate, but from a transformed, therapy-resistant descendant that has shed its original identity and acquired the properties of a stem cell. A new study from Roswell Park Comprehensive Cancer Center now offers a quantitative measure of that transformation, a score derived from a tumor's gene expression profile that rises in lockstep with disease severity and, in patients with metastatic disease, predicts survival with a hazard ratio of 4.7.

The work, led by Xiaozhuo Liu and Dean Tang, draws on 3,102 RNA-sequencing samples and 84,081 microarray profiles spread across 26 distinct cohorts, covering every clinically meaningful stage of prostate cancer from normal glandular tissue to metastatic castration-resistant disease. The central finding is both conceptually clean and clinically provocative: as prostate cancer progresses, it becomes measurably more stem-like, and that stem-like quality is the single best molecular predictor of how badly things will go.

The study's analytical framework. Panel B maps all 26 datasets onto the prostate cancer progression spectrum, from normal tissue through primary tumors of increasing Gleason grade to metastatic...
The study's analytical framework. Panel B maps all 26 datasets onto the prostate cancer progression spectrum, from normal tissue through primary tumors of increasing Gleason grade to metastatic castration-resistant disease. Panels C and D show the computational pipelines for the Stemness Index and AR-A score.

The Problem With Gleason

For decades, the Gleason score has been the clinical gold standard for grading prostate cancer. A pathologist examines the architectural pattern of prostatic glands under a microscope and assigns a number that reflects how far the tissue has drifted from normal. The system works reasonably well for primary tumors, but it breaks down precisely where it is needed most: in treatment-failed tumors and distant metastases, which have largely lost the glandular structures the score depends on.

The deeper limitation is conceptual. Gleason captures a snapshot of differentiation at one moment, in one biopsy, assessed by one pathologist. It does not quantify the underlying biological process driving that loss of differentiation, and it cannot be applied uniformly across the full arc of disease progression. What the field has lacked is a molecular ruler that works at every stage, from the first biopsy to the final metastatic lesion.

The Roswell Park team's approach draws on a concept from developmental biology: that cancer progression recapitulates, in distorted form, the process by which stem cells give rise to differentiated tissues. As tumors become more aggressive, they shed the gene expression programs of mature, specialized cells and reactivate programs associated with embryonic and adult stem cells. This 'stemness' is not merely a metaphor. It can be measured directly from a tumor's transcriptome using a Stemness Index (mRNAsi), a score that computes the Spearman correlation between a sample's gene expression profile and a stem cell-like reference signature.

Scale of the analysis
26 cohorts · 3,102 RNA-seq samples · 84,081 microarray profiles · Full disease spectrum from normal prostate to mCRPC

Building the Ruler

Two complementary scores anchor the analysis. The Stemness Index measures oncogenic dedifferentiation: higher scores indicate a more primitive, stem-like transcriptional state. The canonical Androgen Receptor Activity score (AR-A) measures the opposite: how actively the tumor is running the normal differentiation program of prostatic epithelium, quantified as the combined Z-score of a validated 10-gene signature of known AR targets.

Stemness Index (mRNAsi)Computed as the Spearman correlation between a sample's transcriptome and a stem cell-derived weight vector, then linearly scaled to a 0-to-1 range. Higher values indicate greater oncogenic dedifferentiation. Applied uniformly across all 26 cohorts.
AR-A ScoreZ-score normalized sum of expression values for 10 experimentally validated androgen receptor target genes (Bluemn 2017 signature). Calculated within individual datasets to avoid cross-cohort batch effects. Measures pro-differentiation, canonical AR signaling.

A third score, the MYC signaling activity (MYC-sig), was calculated using the same Z-score approach applied to a validated set of MYC target genes. Finally, the team derived a novel 12-gene 'PCa-Stem signature' by identifying genes consistently upregulated (fold change of 2 or greater, false discovery rate below 0.05) in the top third of Stemness-ranked tumors in both primary prostate cancer (TCGA-PRAD) and metastatic castration-resistant prostate cancer (SU2C 2019) cohorts simultaneously. The overlap of those two gene lists produced the signature.

Survival analyses used Kaplan-Meier estimation with log-rank testing, and multivariable Cox proportional hazards models adjusted for age, Gleason score, and tumor stage. Monotonic trends across ordered disease stages were assessed with the Jonckheere-Terpstra trend test. Genomic associations between Stemness and specific alterations, including MYC amplification and RB1 deletion, were drawn from cBioPortal data and tested with Fisher's exact test.

A Tale of Two Trajectories

The first finding is counterintuitive. In matched pairs of primary tumor and adjacent benign tissue from three independent cohorts (TCGA-PRAD, Wyatt 2014, Long 2020), both AR-A and Stemness are elevated in the tumor compared to normal tissue. Early prostate cancer initiation is characterized by a concordant rise in both scores. This makes biological sense: the androgen receptor is the master regulator of prostatic epithelial differentiation, and its activation in early tumors appears to drive not only luminal differentiation but also, paradoxically, a stem-like transcriptional state.

That concordance does not last. As primary tumors progress spontaneously to higher Gleason scores, the two trajectories split. Stemness continues to climb. AR-A begins to fall. The Jonckheere-Terpstra trend test confirms both trends are monotonic and statistically robust across the Gleason spectrum. The positive correlation between the two scores, which sits at a Pearson's r of 0.32 in primary tumors from the Bolis 2021 integrated cohort, weakens progressively with each increment in tumor grade and disappears entirely in the highest-grade primary cancers.

In matched tumor-normal pairs from three independent cohorts (A-C), both AR-A and Stemness are elevated in primary prostate cancer compared to adjacent benign tissue. Neoadjuvant androgen...
In matched tumor-normal pairs from three independent cohorts (A-C), both AR-A and Stemness are elevated in primary prostate cancer compared to adjacent benign tissue. Neoadjuvant androgen deprivation therapy (nADT) concordantly reduces both scores (D-F), suggesting that AR signaling drives stemness at this early stage.

From Primary Tumor to Lethal Metastasis

The full picture emerges when the analysis is extended across the entire disease spectrum. Stemness rises continuously from normal prostate tissue through treatment-naive primary tumors, through tumors that have been exposed to androgen deprivation therapy, and into metastatic castration-resistant prostate cancer (mCRPC). The endpoint is striking: mCRPC carries the highest Stemness scores of any clinical sample type in the dataset, despite the fact that AR messenger RNA levels are actually elevated in mCRPC compared to primary tumors. The tumor has amplified the gene but lost the ability to use it for differentiation.

AR-A tells the opposite story. It peaks in primary tumors and reaches its nadir in mCRPC. The correlation between the two scores, positive at r = 0.32 in primary disease, flips to a negative r = -0.15 (p = 0.005) in mCRPC. This is not a subtle statistical signal. It represents a fundamental rewiring of the relationship between differentiation and stemness as the disease evolves.

The pattern holds across independent data sources. In the Decipher GRID database of 82,470 prospectively collected biopsy samples, Stemness increases monotonically across NCCN risk groups from very low to very high risk. In the Taylor 2010 microarray dataset, metastatic prostate cancer carries the highest Stemness of any clinical stage. In genetically engineered mouse models, the most aggressive triple-knockout tumors (lacking Pten, Rb1, and Trp53) show the highest Stemness and lowest AR-A, while the indolent single-knockout tumors show the reverse.

One finding warrants particular attention. Tumor-adjacent benign tissues, which are routinely used as 'normal' controls in cancer studies, carry substantially higher Stemness scores than truly normal prostate tissue from cancer-free organ donors in the GTEx database. These tissues are not normal. Their transcriptomes have shifted toward a cancer-like state, a finding consistent with prior observations that field cancerization extends beyond the visible tumor margin.

In treatment-naive primary prostate cancer, Stemness and AR-A diverge with increasing Gleason score. The positive correlation between the two metrics (panel E) weakens and disappears in high-grade...
In treatment-naive primary prostate cancer, Stemness and AR-A diverge with increasing Gleason score. The positive correlation between the two metrics (panel E) weakens and disappears in high-grade primary tumors (panel G), where the regression slope approaches zero.
Across the full prostate cancer spectrum, Stemness rises continuously (panel F) while AR-A follows a bell curve, peaking in primary tumors and reaching its nadir in mCRPC. The correlation between...
Across the full prostate cancer spectrum, Stemness rises continuously (panel F) while AR-A follows a bell curve, peaking in primary tumors and reaching its nadir in mCRPC. The correlation between the two metrics flips from positive in primary disease (r = 0.32) to negative in mCRPC (r = -0.15, panel G).
'mCRPC, despite markedly upregulating AR mRNA, showed the lowest AR-A but highest Stemness, indicating a shift from a positive correlation between AR-A and Stemness in primary cancer to a negative association in mCRPC.'— Liu, Cortes et al., 2025

Predicting Who Dies

A biological score is only clinically useful if it predicts outcomes. High Stemness is consistently associated with worse overall survival across every cohort where survival data were available. In the Spratt 2017 cohort of 855 radical prostatectomy specimens, the log-rank p-value is 0.001. In the Tosoian 2020 cohort of 405 high-risk localized tumors, it is 0.014. In the CHAARTED 2021 cohort of 160 metastatic hormone-sensitive tumors, it is 0.006.

The 12-gene PCa-Stem signature performs even more sharply. In primary prostate cancer, high signature scores predict worse progression-free survival with a multivariable-adjusted hazard ratio of 1.636 (95% CI 1.014 to 2.64), independent of age, Gleason score, and stage. In mCRPC, the effect is larger: patients with high PCa-Stem signature scores have a multivariable-adjusted hazard ratio for overall survival of 4.712 (95% CI 2.011 to 11.04, p less than 0.001). That is a nearly fivefold difference in the risk of death, after accounting for standard clinical variables.

The 12 genes in the signature are predominantly involved in mitosis and cell-cycle progression, consistent with the finding that Stemness-high tumors are enriched for a 31-gene cell-cycle progression signature. Eleven of the 12 genes show statistically significant correlation with disease progression scores across the Bolis 2021 transcriptome atlas. The Human Protein Atlas independently identifies these same 11 genes as unfavorable prognostic markers in prostate cancer and several other malignancies, raising the possibility that the PCa-Stem signature may have prognostic utility beyond prostate cancer.

High Stemness is consistently associated with worse overall survival across three independent clinical cohorts: Spratt 2017 (p = 0.001), Tosoian 2020 (p = 0.014), and CHAARTED 2021 (p = 0.006)....
High Stemness is consistently associated with worse overall survival across three independent clinical cohorts: Spratt 2017 (p = 0.001), Tosoian 2020 (p = 0.014), and CHAARTED 2021 (p = 0.006). The CHAARTED cohort, which enrolled metastatic hormone-sensitive patients, carries the highest mean Stemness of the three.
The 12-gene PCa-Stem signature (panel D) is derived from genes consistently upregulated in high-Stemness tumors in both primary and metastatic disease. It predicts worse progression-free survival...
The 12-gene PCa-Stem signature (panel D) is derived from genes consistently upregulated in high-Stemness tumors in both primary and metastatic disease. It predicts worse progression-free survival in primary PCa (HR 1.636, panel T) and worse overall survival in mCRPC (HR 4.712, panel V), independent of standard clinicopathologic variables.
PCa-Stem Signature: Survival Impact
Primary PCa: HR 1.636 (95% CI 1.014–2.64) for progression-free survival
mCRPC: HR 4.712 (95% CI 2.011–11.04) for overall survival
Both adjusted for age, Gleason score, and stage

The Genomic Drivers of Stemness

High Stemness is not merely a transcriptional state. It is accompanied by measurable genomic instability. Stemness-high tumors carry a greater fraction of altered genome and a higher tumor mutation burden than Stemness-low tumors at both the primary and metastatic stages, despite no difference in patient age between the groups.

The specific genomic events associated with high Stemness differ by disease stage. In primary prostate cancer, MYC amplification is the most prominent genomic correlate of high Stemness. The prevalence of MYC amplification rises from 2% in low-grade (Gleason 6) primary tumors to 12% in high-grade (Gleason 9/10) tumors (p = 0.006). Stemness-high primary tumors are also enriched for PTEN deletion and FOXA1 mutation. In mCRPC, RB1 deletion is the genomic alteration most significantly associated with elevated Stemness. AR amplification, which occurs in 49% of mCRPC cases, is exclusively treatment-induced and absent from treatment-naive primary tumors.

The MYC story is developed in detail in the final section of the paper. MYC messenger RNA and MYC signaling activity both increase in primary tumors compared to matched benign tissue, continue to rise with Gleason grade, and reach their highest levels in mCRPC. Critically, neoadjuvant androgen deprivation therapy, which reduces both AR-A and Stemness in primary tumors, has no discernible effect on MYC activity. The Pearson correlation between MYC signaling activity and Stemness in mCRPC is 0.76, a tight positive relationship that holds across every stage of disease examined.

High-Stemness tumors carry greater genomic instability (panels A-B) and are enriched for specific driver alterations: MYC amplification in primary disease and RB1 deletion in mCRPC (panels D-F)....
High-Stemness tumors carry greater genomic instability (panels A-B) and are enriched for specific driver alterations: MYC amplification in primary disease and RB1 deletion in mCRPC (panels D-F). AR amplification, present in 49% of mCRPC cases, is exclusively treatment-induced.
MYC mRNA and MYC signaling activity increase with disease progression (panels A-E) and are positively correlated with Stemness at every stage (panels F-H). In mCRPC, the Pearson correlation...
MYC mRNA and MYC signaling activity increase with disease progression (panels A-E) and are positively correlated with Stemness at every stage (panels F-H). In mCRPC, the Pearson correlation between MYC activity and Stemness reaches r = 0.76, supporting MYC as a primary driver of the high-stemness state in advanced disease.

A Two-Phase Model of Stemness

The data support a two-phase model of how stemness is acquired and maintained during prostate cancer progression. In the first phase, covering early tumor initiation and low-grade primary disease, AR signaling drives both differentiation and stemness simultaneously. The two are positively correlated. Neoadjuvant hormone therapy, by suppressing AR-A, reduces Stemness as a consequence. Loss of PTEN and RB1, present in 9% and 13% of Gleason 6 tumors respectively, may contribute additional stemness at this stage.

In the second phase, covering high-grade primary tumors and all forms of advanced disease, the coupling between AR-A and Stemness breaks down. AR-A declines as tumors lose differentiated glandular architecture. Stemness continues to rise, now driven by accumulating genomic alterations, particularly MYC amplification and increased MYC signaling activity. Because MYC activity is insensitive to androgen deprivation, this second-phase stemness persists and grows even as AR-targeted therapies suppress canonical AR signaling. By the time a tumor reaches the mCRPC state, it has achieved maximum stemness through a mechanism that is largely independent of the pathway that all current standard-of-care therapies target.

A schematic summary of the study's central model: Stemness (red) rises continuously across the prostate cancer spectrum while canonical AR-A (blue) follows a bell curve, peaking in primary tumors...
A schematic summary of the study's central model: Stemness (red) rises continuously across the prostate cancer spectrum while canonical AR-A (blue) follows a bell curve, peaking in primary tumors and declining in advanced disease. The proposed drivers of each phase are indicated, with MYC amplification and genomic instability taking over from AR signaling in advanced disease.
'Increased Stemness quantitatively measures PCa aggressiveness, plasticity and progression, and represents a poor prognosticator for patient outcomes.'— Liu, Cortes et al., 2025

What the Score Does Not Capture

The study's primary methodological limitation is one the authors acknowledge directly: the Stemness Index was not developed for prostate cancer. It was derived from a pan-cancer analysis using embryonic stem cell reference signatures, and its sensitivity for distinguishing subtle differences within a single disease stage has not been fully validated. Whether it can reliably separate, say, Gleason 7 favorable-intermediate from Gleason 7 unfavorable-intermediate tumors is an open question.

The bulk RNA-sequencing data used throughout the analysis presents a second constraint. A bulk transcriptome reflects the average gene expression of all cells in a biopsy, including stromal fibroblasts, immune infiltrates, and endothelial cells alongside the cancer epithelium. A high Stemness score in a bulk sample could reflect genuine dedifferentiation of cancer cells, or it could partly reflect a shift in the cellular composition of the tumor microenvironment toward more stromal or immune cell types that happen to express stem-like genes. Single-cell RNA-sequencing analysis, or the application of alternative stemness pipelines such as CytoTRACE that are designed to work at single-cell resolution, would help disentangle these contributions.

The role of noncanonical AR signaling in mCRPC also remains incompletely addressed. AR amplification occurs in 49% of mCRPC cases and AR mutation in 11%, both exclusively treatment-induced. These alterations drive castration-resistant AR activity through mechanisms distinct from the canonical differentiation program measured by the AR-A score. Whether this noncanonical AR activity contributes to the high Stemness of mCRPC, or whether it is orthogonal to it, is a question the current analysis cannot answer. Targeted inhibitor studies or AR-specific perturbation experiments in mCRPC models would be needed to resolve this.

A Ruler for Aggressiveness

The practical value of this work lies in its potential to stratify patients more precisely than current tools allow. The Gleason score and PSA level, the two workhorses of prostate cancer risk assessment, capture different aspects of tumor biology but neither directly measures the stem-like state that appears to drive lethality. A Stemness score, or the 12-gene PCa-Stem signature derived here, could in principle be applied to any tumor biopsy with gene expression data, including the microarray-based Decipher Genomic Classifier already in clinical use for 82,000 patients in the GRID database.

The mechanistic findings point toward a specific therapeutic vulnerability. If MYC amplification and signaling are the primary drivers of stemness in advanced prostate cancer, then MYC-targeting strategies, which have historically been difficult to develop but are now an active area of drug discovery, could in principle reduce stemness and restore sensitivity to AR-targeted therapies. The observation that neoadjuvant hormone therapy reduces Stemness in primary tumors but leaves MYC activity untouched suggests that combining AR pathway inhibition with MYC inhibition early in the disease course, before the second-phase genomic drivers take hold, might prevent the emergence of the high-stemness mCRPC state altogether.

That hypothesis remains to be tested. What this study provides is the quantitative framework to ask the question rigorously, and a set of validated scores that can track the answer across the full arc of disease.