Educational content on VJOncology is intended for healthcare professionals only. By visiting this website and accessing this information you confirm that you are a healthcare professional.

Share this video  

ESMO 2025 | AI-generated synthetic cohorts for metastatic breast cancer research

Eddy Saad, MD, Beth Israel Deaconess Medical Center, Boston, MA, explores a study analyzing the use of AI-generated synthetic real-world data (sRWD) for metastatic breast cancer. The research assessed methods including classification and regression trees (CART) to replicate real-world baseline characteristics and progression-free survival outcomes. CART models showed the highest fidelity while maintaining acceptable patient privacy, highlighting the potential of sRWD to accelerate collaborative research and serve as external comparators in clinical trials. This interview took place at the European Society for Medical Oncology (ESMO) 2025 Congress in Berlin, Germany.

These works are owned by Magdalen Medical Publishing (MMP) and are protected by copyright laws and treaties around the world. All rights are reserved.

Transcript

This is basically an effort that started a couple of years ago. We were looking into ways that could really facilitate data sharing between multiple stakeholders. And we’re looking specifically at real-world data. So the reason for that is that given the recent acceleration in the approvals for drugs and oncology, we really have like this vicious cycle of unmet needs and exponential increase in the demand for data...

This is basically an effort that started a couple of years ago. We were looking into ways that could really facilitate data sharing between multiple stakeholders. And we’re looking specifically at real-world data. So the reason for that is that given the recent acceleration in the approvals for drugs and oncology, we really have like this vicious cycle of unmet needs and exponential increase in the demand for data. And while clinical trials are great because they are the gold standard, they really require a lot of resources that oftentimes we do not have. So our aim was to try to get real-world data, but to try to bypass some of the regulatory challenges that come with these. And this is how we wanted to implement artificial intelligence models to try and take the real-world data and derive from it a synthetic version that replicates the same statistical patterns without compromising patients’ privacy. And this is exactly what we show. We show that a specific AI model that we call classification and regression trees is really able to do just that. And then we offer a framework of how this model can be integrated in a sort of a platform where this data could be continuously ingested, processed by AI, and then used to derive these synthetic data sets. Using these synthetic data sets, we are able to model the clinical outcomes for patients in the real world, and we can use that to predict clinical outcomes for a specific real patient that comes into our clinic.

This transcript is AI-generated. While we strive for accuracy, please verify this copy with the video.

Read more...