Early Prediction of Pancreatic Cancer Using Data Analysis Techniques in IBM SPSS Modeler
1. Acquire and analyze a relevant set of data, including factors associated with pancreatic cancer
2. Preprocess and clean the data with IBM SPSS Modeler (missing value treatment, discretization, normalization).
3. Explore relationships between variables to discover relevant patterns using graphical and statistical tools.
4. Build predictive models with algorithms such as: decision trees (C&R Tree, CHAID), Neural networks, Logistic regression and
SVM
5. Evaluate the performance of each model using cross-validation techniques.
6. Extract clinical rules and interpretations useful for medical decision making.
7. To propose a clinical decision support tool based on the trained model.
Iñigo Monedero Goicoechea: tenure-track professor at University of Seville