Privacy-preserving generation of synthetic patient data
In the project we analyze privacy and anonymity and ways of retaining these features. Next we analyze machine and deep learning methods capable of generating new data samples, and finally existing solutions of privacy preserving generation of synthetic patient data. After the analysis a proposal of our own solution is presented. This method is later evaluated from the point of privacy and usability in real situations and research. Results are compared with analyzed and existing ones. In the conclusion, we discuss the results of the proposed method, its pros, cons, and possible improvements.
- Create a method of patient data generation from real data
- Preserve underlying correlations and characteristics of the data
- Preserve privacy and anonymity of the patients and prevent re-identification
- Use newly generated data for machine and deep learning research
- Combat patient data scarcity in academic and commercial areas
Ing. Milan Unger, PhD. - Siemens Healthcare