AI-Driven Analyses of Morphological and Phenotypic Profiles Across Cancers
Aim 1: Develop AI-based pipelines to extract high-dimensional morphological features from histopathology images.
We will build quantitative image analysis workflows that leverage advanced AI techniques, including convolutional neural networks and vision-based foundation models, to process whole-slide pathology images. These approaches will generate comprehensive morphological representations that capture subtle tissue and cellular patterns not easily discernible by traditional methods.
Aim 2: Integrate histopathology images with clinical information using multimodal approaches (e.g., vision–language models) to capture clinically relevant context.
We will align histopathology image features with textual data from pathology reports and structured clinical variables. By applying multimodal AI frameworks such as vision–language models, we will connect visual patterns with semantic descriptions of tumor grade, morphology, and disease state, thereby enhancing interpretability and linking image-derived features directly to clinical context.
Aim 3: Correlate image- and text-derived features with clinical phenotypes, including tumor stage, demographics, treatment response, and survival outcomes.
We will systematically test associations between multimodal features and key patient characteristics across the PLCO cohort. These analyses will enable the identification of signatures predictive of disease progression and survival. The integration of multimodal features with survival analyses will highlight clinically actionable biomarkers of prognosis.
Aim 4: Construct pan-cancer predictive models that harmonize morphology, reports, and phenotype data across the four cancer types in the PLCO study.
We will build integrative models that unify histopathology, pathology reports, and patient-level phenotype information across lung, colorectal, ovarian, and prostate cancers in the PLCO dataset. These models will identify shared pan-cancer signatures while preserving subtype-specific associations, producing robust predictive tools that generalize across cancer types and advance the interpretability of morphology–phenotype relationships.
JUNHAN ZHAO University of Chicago