Skip to Main Content

COVID-19 is an emerging, rapidly evolving situation.

What people with cancer should know:

Get the latest public health information from CDC:

Get the latest research information from NIH:

Principal Investigator
Andreas Maier
FAU Erlangen Nürnberg
Position Title
Professor, Head of the Pattern Recognition Lab
About this CDAS Project
PLCO (Learn more about this study)
Project ID
Initial CDAS Request Approval
Jan 27, 2020
Development of a machine learning algorithm for generating realistic synthetic electronic healthcare records (Prosate and Breast)
Machine learning brings new opportunities to the healthcare industries: it can improve the sensitivity of detection of diseases, add more value to treatment decisions and help to personalize the treatment. However, to be able to utilize machine learning sufficient amount of patient data is required. Still, due to many legal, privacy and security regulations, real persons' data is mostly not accessible to researchers.
In the project PLCO-529, the algorithm for the creating of the realistic Lung Cancer Patients is created. Unfortunately, not all the time the sufficient amount of data about the specific cancer is available. The goal of this project is to validate whenever the algorithm created in PLCO-529 can be used to create realistic healthcare records, for patients with other types of cancer (for example for Prostate and Breast cancer). This project also aims to validate whenever it is possible to use a combination of the data (for example Lung and Prostate) to create realistic healthcare records if not enough patient data is available.

- Implement an algorithm to gather causal dependencies between patient characteristics, diagnostic & therapeutic procedures, and patient outcomes (complications, survival)
- Validation of the patient generator against real patient population from available data
- Integration of domain transfer between Lung- Prostate - Breast cancer


Daniel Stromer FAU Erlangen Nürnberg
Oliver Haas FAU Erlangen Nürnberg