Computerized Early Diagnosis of Lung Cancer
This study will be conducted using four analytics solutions:
1. IBM Software Analytics Solution (SPSS Modeler).
2. Microsoft Software Analytics Solution (Microsoft SQL Server, Azure Machine Learning, and Power BI).
3. Open Source Analytics Solution (Python, Jupyter, MySQL, MySQL Workbench, Kettle/Spoon, Tableau, and Weka).
4. Big Data Analytics Solution (AWS, Jupyter, PySpark, and Spark).
The study will model the data with four classification methods:
1. IF-THEN rules
2. Decision trees
3. Bayesian classifiers
4. Neural networks
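To make two of the listed methods concrete, the following is a minimal sketch in plain Python of an IF-THEN rule classifier and a categorical naive Bayes classifier. Everything in it is an assumption made for illustration: the feature names (`smoker`, `cough`, `age_over_60`), the labels (`refer`, `routine`), the rules, and the six toy records are hypothetical and are not taken from the study's data or from any clinical guideline.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy records for illustration only (NOT real patient data).
TOY_RECORDS = [
    ({"smoker": "yes", "cough": "yes", "age_over_60": "yes"}, "refer"),
    ({"smoker": "yes", "cough": "yes", "age_over_60": "no"}, "refer"),
    ({"smoker": "no", "cough": "yes", "age_over_60": "yes"}, "refer"),
    ({"smoker": "no", "cough": "no", "age_over_60": "no"}, "routine"),
    ({"smoker": "no", "cough": "no", "age_over_60": "yes"}, "routine"),
    ({"smoker": "yes", "cough": "no", "age_over_60": "no"}, "routine"),
]


def rule_classifier(record):
    """Ordered IF-THEN rules: the first rule whose condition holds decides."""
    # Hypothetical rules, not clinical criteria.
    if record["smoker"] == "yes" and record["cough"] == "yes":
        return "refer"
    if record["age_over_60"] == "yes" and record["cough"] == "yes":
        return "refer"
    return "routine"  # default rule when nothing else fires


def train_naive_bayes(records):
    """Count class frequencies and per-class feature-value frequencies."""
    class_counts = Counter(label for _, label in records)
    value_counts = defaultdict(Counter)  # (label, feature) -> value counts
    feature_values = defaultdict(set)    # feature -> set of observed values
    for features, label in records:
        for name, value in features.items():
            value_counts[(label, name)][value] += 1
            feature_values[name].add(value)
    return class_counts, value_counts, feature_values


def predict_naive_bayes(model, record):
    """Return the class with the highest log-posterior (Laplace smoothing)."""
    class_counts, value_counts, feature_values = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, count in class_counts.items():
        score = math.log(count / total)  # log prior
        for name, value in record.items():
            numerator = value_counts[(label, name)][value] + 1
            denominator = count + len(feature_values[name])
            score += math.log(numerator / denominator)  # log likelihood
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train_naive_bayes(TOY_RECORDS)
new_case = {"smoker": "yes", "cough": "yes", "age_over_60": "yes"}
print(rule_classifier(new_case), predict_naive_bayes(model, new_case))
```

The rule classifier encodes knowledge by hand, whereas the Bayesian classifier estimates it from labelled records. In the study itself, the listed tools (for example SPSS Modeler, Weka, or Spark) would presumably supply these algorithms rather than hand-written code.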
The study aims to:
- Diagnose lung cancer at the screening stage, before severe symptoms appear.
- Potentially increase the proportion of lung cancer patients who are diagnosed early.
Project supervisor: David Sundaram