Logo image
Multimodal integration of real world genomic and clinical data for the prediction of brain metastasis development in breast cancer
Conference poster   Peer reviewed

Multimodal integration of real world genomic and clinical data for the prediction of brain metastasis development in breast cancer

A. Safonov, D. Smith, S. Nandakumar, L. Boe, E. Ferraro, J Shen, K. Tsai, I. Khatri, R. Kumar, J. An, …
Clinical cancer research, v 32(4_Supplement), PS2-08-05
17 Feb 2026

Abstract

Background: Despite advances in metastatic breast cancer (MBC) treatment, brain metastases (BM) continue to pose a major challenge, driving significant morbidity and poor prognosis. In the absence of consensus screening strategies, BMs are typically diagnosed only after neurological sequelae have already developed. This underscores the urgent need for predictive tools to identify patients at risk for BM before clinical symptoms arise. Methods: This study included 3901 patients who underwent sequencing of primary tumor (n = 1874) or non-brain metastasis of samples (n = 2027) with MSK-IMPACT, a custom tumor-normal next generation sequencing assay. We developed lasso machine-learning models to predict onset and timing of BM in two different scenarios. First, we predicted time from to BM diagnosis from the initial biopsy confirming metastatic breast cancer. Second, we developed an “early-stage” model capturing time from curative breast surgery to BM onset. Both models exclusively incorporated clinicopathologic and genomic data available at time of initial metastatic disease and initial surgery, respectively. We developed validative gradient boosting machine (GBM) machine learning and Fine-Gray models, allowing for consideration of variable interactions and competing risks, respectively. Results: Our cohort included 529 BM events over a median follow-up of 50 mos. In metastatic samples, the lasso-based ML model effectively stratified the cohort into high, intermediate and low-risk groups: the high-risk group (25% of the cohort) exhibited a hazard ratio (HR) of 10.1 (95% C.I 6.83 - 17.23) while the intermediate group (50% of the cohort) exhibited an HR of 3.65 (95% C.I 2.30 - 5.78) relative to the low-risk group. The model was driven by several genomic features (functional variants in TP53, RB1, PTEN, MDM2) as well as several clinical features (low ER and PR percentage, number of metastatic sites, metastasis to lung, adrenal or distant lymph node, and pre-menopausal status). In early-stage samples, the ML model stratified the cohort into two groups, with the high-risk group exhibiting an HR of 5.7 (95% C.I 3.83 - 8.53) relative to the low-risk group. Feature importance in the early-stage setting was driven by stage, ER/PR status as well as TP53 mutation status. The predictive performance of these models, along with the relevance of these key variables, was preserved in the GBM and Fine-Gray models.

Metrics

1 Record Views

Details

Logo image