Clustering cancers by shared transcriptional risk reveals novel targets for cancer therapy

Gao, Hua; Baylis, Richard A.; Luo, Lingfeng; Kojima, Yoko; Bell, Caitlin F.; Ross, Elsie G.; Wang, Fudi; Leeper, Nicholas J.

doi:10.1186/s12943-022-01592-y

Letter to the Editor
Open access
Published: 18 May 2022

Clustering cancers by shared transcriptional risk reveals novel targets for cancer therapy

Hua Gao ORCID: orcid.org/0000-0002-1029-1476^1,2,
Richard A. Baylis¹,
Lingfeng Luo^1,2,
Yoko Kojima^1,2,
Caitlin F. Bell^2,3,
Elsie G. Ross¹,
Fudi Wang^1,2 &
…
Nicholas J. Leeper^1,2,3

Molecular Cancer volume 21, Article number: 116 (2022) Cite this article

2644 Accesses
2 Citations
2 Altmetric
Metrics details

Main text

The pursuit of targeted cancer therapies has greatly benefitted from the existence of large transcriptomic datasets, such as The Cancer Genome Atlas (TCGA), which have enabled the correlation of intra-tumoral gene expression with patient survival. Here, we use pathway enrichment data to identify three distinct groups of cancers characterized by cluster-specific biology and diverging mortality rates. To explore the clinical actionability of these findings, we leveraged the drug prediction algorithm, OCTAD [1] to: (1) determine whether any promising investigational drugs can reverse these detrimental gene expression patterns; and (2) ascertain whether any FDA-approved drugs could be repurposed to improve cluster-specific cancer outcomes.

To perform these studies, tumor tissue mRNA-Seq data, patients’ demographic information, and survival status for 27 individual cancer types from the TCGA [2] (updated through May 2021) were used to compute a survival analysis for each gene and each cancer type. We then used each gene’s correlation with patient outcomes to perform a GSEA [3] hallmark pathway enrichment analysis [4], allowing us to understand how each of the pathways correlated with patient survival for each cancer type. Cancers were then clustered using a shared nearest neighbor modularity optimization with the Seurat package [5], as described in the online Methods.

We next used transcriptomic data from the drug disturbance dataset, LINCS [6] (which has disturbance expression data from 71 cell lines treated with 12,442 compounds) to screen for drugs that could reverse the deleterious expression profile associated with each cancer cluster. These data were used to calculate a Kolmogorov-Smirnov statistic [1] to predict the reversal ability of each compound for each cancer type. Briefly, if a compound completely reversed the risk-associated genes (e.g. the mortality-linked (hazard ratio > 1) genes clustered at the downregulated tail of the disturbance expression distribution, as detailed in the online Methods), the reversal score would approach the minimum value of − 1. The cluster-level effect was then defined as the aggregate of the most significant reversal effects for all cancer types belonging to a given cancer cluster. Two strategies were subsequently used to validate the predicted drugs. First, cancer cell lines were treated with the compound predicted to have the strongest cluster-specific beneficial effect, and then submitted for RNA sequencing. The observed reversal score was calculated as the weighted sum of the differential expression of the top 200 detrimental genes, with the survival risk as the weight. Second, a pharmacovigilance study [7] was performed for the FDA-approved drug predicted to have the strongest cluster-specific benefit (restricted to drugs prescribed to > 1000 patients in the Stanford Hospitals). Specifically, 1:5 propensity score matched cohorts (matched on demographics, smoking status, comorbid conditions, procedures, and therapeutics in the 6 months leading up to enrollment) treated with or without the drug of interest were evaluated for cluster-specific cancer incidence within 5 years.

In contrast to prior reports that focused on a cell-of-origin pattern [2], our pathway-based transcriptomic-survival analysis (Table S2) identified three cancer clusters (Fig. 1A) which had no discernable connection between the cancers that clustered together (e.g., cellular origin, organ system, sex-specific cancers). For example, rectal adenocarcinoma and colon adenocarcinoma were clustered in different groups.

One cluster, which included glioblastoma multiforme and breast cancer, was dominated by inflammatory pathways (Fig. 1B), including cytokine and complement cascades. This suggested that dysregulation of these pathways was associated with worse patient outcomes for these cancers and that targeting these pathways may be uniquely beneficial for these cancers. The second cluster, which included acute myeloid leukemia and hepatocellular carcinoma, was enriched in metabolic pathways like fatty acid metabolism and glycolysis. The remaining cancers, including melanoma and colon adenocarcinoma, were enriched in proliferative pathways, like the G2M checkpoint. Interestingly, when plotted on Kaplan-Meier curves, the metabolic cluster had significantly worse survival (Fig. 1C; hazard ratio (HR) 1.33 vs. inflammatory cancers, P < 0.001; HR 1.66 vs. proliferative cancers, P < 0.001).

To investigate the clinical relevance of these findings, we then applied the in silico drug repurposing pipeline outlined above. This approach identified numerous preclinical and FDA-approved compounds predicted to effectively reverse the high-risk transcriptional signatures associated with each cancer cluster. Proof-of-principle testing was performed for the top preclinical compound (Table S3), AZ-628 (an experimental Raf inhibitor), and the top FDA-approved drug, clopidogrel (a widely-prescribed antiplatelet medicine). As predicted, 0.1 μM AZ-628 selectively reversed the expression of survival risk genes in vitro in an inflammatory cancer cell line (Fig. 1D). Similarly, clopidogrel use (75 mg/day) amongst ‘real-world’ patients was associated with a specific reduction in the incidence of inflammatory cancers, but had no effect on other cancer types (Fig. 1E, HR 0.72, P < 0.001).

Conclusions

To date, no study has integrated gene-to-survival correlation data to simultaneously identify similarities between cancers and determine which pathways are most important for patient outcomes. While prior multi-omics efforts identified a cell-of-origin pattern across diverse cancer types [2], our analyses demonstrated that malignancies may be better distinguished according to dysregulation of key inflammatory, metabolic, or proliferative pathways. This approach allowed groups of cancers to be stratified by mortality risk, and revealed important biologic similarities that could provide novel mechanistic insights. From a translational perspective, these efforts also identified novel targets that could provide survival benefit for certain cancer types, but may need to be avoided for others. While prospective validation studies are required, this proof-of-principle study shows the potential of integrating tumor transcriptomics and patient survival data to identify important patterns between cancers and predict targets for future cancer therapeutics.

Availability of data and materials

Original RNA-Seq data are available in the NCBI BioProject PRJNA807725.

Abbreviations

TCGA:: The Cancer Genome Atlas
OCTAD:: Open Cancer TherApeutic Discovery
GSEA:: Gene Set Enrichment Analysis
LINCS:: The Library of Integrated Network-Based Cellular Signatures
HR:: Hazard ratio
ACC:: Adrenocortical carcinoma
BLCA:: Bladder Urothelial Carcinoma
BRCA:: Breast invasive carcinoma
CESC:: Cervical squamous cell carcinoma
CHOL:: Cholangiocarcinoma
COAD:: Colon adenocarcinoma
ESCA:: Esophageal carcinoma
GBM:: Glioblastoma multiforme
HNSC:: Head and Neck squamous cell carcinoma
KIRC:: Kidney renal clear cell carcinoma
KIRP:: Kidney renal papillary cell carcinoma
LAML:: Acute Myeloid Leukemia
LGG:: Brain Lower Grade Glioma
LIHC:: Liver hepatocellular carcinoma
LUAD:: Lung adenocarcinoma
LUSC:: Lung squamous cell carcinoma
MESO:: Mesothelioma
OV:: Ovarian serous cystadenocarcinoma
PAAD:: Pancreatic adenocarcinoma
READ:: Rectum adenocarcinoma
SARC:: Sarcoma
SKCM:: Skin Cutaneous Melanoma
STAD:: Stomach adenocarcinoma
THCA:: Thyroid carcinoma
UCEC:: Uterine Corpus Endometrial Carcinoma
UCS:: Uterine Carcinosarcoma
UVM:: Uveal Melanoma

References

Zeng B, Glicksberg BS, Newbury P, Chekalin E, Xing J, Liu K, et al. OCTAD: an open workspace for virtually screening therapeutics targeting precise cancer patient groups using gene expression features. Nat Protoc. 2021;16:728–53.
Article CAS Google Scholar
Hoadley KA, Yau C, Hinoue T, Wolf DM, Lazar AJ, Drill E, et al. Cell-of-origin patterns dominate the molecular classification of 10,000 tumors from 33 types of Cancer. Cell. 2018;173:291–304.e6.
Article CAS Google Scholar
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci. 2005;102:15545–50.
Article CAS Google Scholar
Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. The. Innovation. 2021;2:100141.
PubMed PubMed Central Google Scholar
Hao Y, Hao S, Andersen-Nissen E, Mauck WM, Zheng S, Butler A, et al. Integrated analysis of multimodal single-cell data. Cell. 2021;184:3573–3587.e29.
Article CAS Google Scholar
Stathias V, Turner J, Koleti A, Vidovic D, Cooper D, Fazel-Najafabadi M, et al. LINCS Data Portal 2.0: next generation access point for perturbation-response signatures. Nucleic Acids Res. 2020;48:D431–9.
Article CAS Google Scholar
Datta S, Posada J, Olson G, Li W, O’Reilly C, Balraj D, et al. A new paradigm for accelerating clinical data science at Stanford Medicine. arXiv. 2020:200310534 Available from: http://arxiv.org/abs/2003.10534. Cited 2022 Feb 9.

Download references

Acknowledgements

This research used data and services provided by STARR, “STAnford medicine Research data Repository,” a clinical data warehouse containing live Epic data from Stanford Health Care, the Stanford Children’s Hospital, the University Healthcare Alliance and Packard Children’s Health Alliance clinics and other auxiliary data from Hospital applications such as radiology PACS. STARR platform is developed and operated by Stanford Medicine Research IT team and is made possible by Stanford School of Medicine Research Office.

Funding

This study was supported by the National Institutes of Health (R35 HL144475 to N.J.L) and the American Heart Association (19EIA34770065 to N.J.L.).

Author information

Authors and Affiliations

Division of Vascular Surgery, Department of Surgery, Stanford University School of Medicine, Stanford, CA, 94305, USA
Hua Gao, Richard A. Baylis, Lingfeng Luo, Yoko Kojima, Elsie G. Ross, Fudi Wang & Nicholas J. Leeper
Stanford Cardiovascular Institute, Stanford University, Stanford, CA, 94305, USA
Hua Gao, Lingfeng Luo, Yoko Kojima, Caitlin F. Bell, Fudi Wang & Nicholas J. Leeper
Division of Cardiovascular Medicine, Department of Medicine, Stanford University School of Medicine, Biomedical Innovations Building, 240 Pasteur Drive, #3654, Stanford, CA, 94305, USA
Caitlin F. Bell & Nicholas J. Leeper

Authors

Hua Gao
View author publications
You can also search for this author in PubMed Google Scholar
Richard A. Baylis
View author publications
You can also search for this author in PubMed Google Scholar
Lingfeng Luo
View author publications
You can also search for this author in PubMed Google Scholar
Yoko Kojima
View author publications
You can also search for this author in PubMed Google Scholar
Caitlin F. Bell
View author publications
You can also search for this author in PubMed Google Scholar
Elsie G. Ross
View author publications
You can also search for this author in PubMed Google Scholar
Fudi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Nicholas J. Leeper
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

NJL, RAB and HG designed this study and drafted the manuscript. HG conducted the analysis. LL and YK performed the in vitro validation experiments. HG, CFB, EGR and FW designed the pharmacovigilance study. All authors revised this manuscript and approved the final version.

Corresponding author

Correspondence to Nicholas J. Leeper.

Ethics declarations

Ethics approval and consent to participate

The pharmacovigilance study in this study used data and services provided by STARR. The STARR-OMOP-deid database is a pre-IRB direct SQL access to de-identified dataset, in OMOP Common Data Model.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Additional file 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Gao, H., Baylis, R.A., Luo, L. et al. Clustering cancers by shared transcriptional risk reveals novel targets for cancer therapy. Mol Cancer 21, 116 (2022). https://doi.org/10.1186/s12943-022-01592-y

Download citation

Received: 22 February 2022
Accepted: 10 May 2022
Published: 18 May 2022
DOI: https://doi.org/10.1186/s12943-022-01592-y

Clustering cancers by shared transcriptional risk reveals novel targets for cancer therapy

Main text

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1.

Additional file 2.

Rights and permissions

About this article

Cite this article

Share this article

Molecular Cancer

Contact us