Skip to Main Content
An official website of the United States government

Efficient risk-based collection of biospecimens in cohort studies: Designing a prospective study of diagnostic performance for multicancer detection tests.

Authors

Ramos MLF, Chaturvedi AK, Graubard BI, Katki HA

Affiliations

  • Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Department of Health and Human Services, Bethesda, Maryland.

Abstract

In cohort studies, it can be infeasible to collect specimens on an entire cohort. For example, to estimate sensitivity of multiple Multi-Cancer Detection (MCD) assays, we desire an extra 80mL of cell-free DNA (cfDNA) blood, but this much extra blood is too expensive for us to collect on everyone. We propose a novel epidemiologic study design that efficiently oversamples those at highest baseline disease risk from whom to collect specimens, to increase the number of future cases with cfDNA blood collection. The variance reduction ratio from our risk-based subsample versus a simple random (sub)sample (SRS) depends primarily on the ratio of risk model sensitivity to the fraction of the cohort selected for specimen collection subject to constraining the risk model specificity. In a simulation where we chose 34% of Prostate, Lung, Colorectal, and Ovarian Screening Trial cohort at highest risk of lung cancer for cfDNA blood collection, we could enrich the number of lung cancers 2.42-fold and the standard deviation of lung-cancer MCD sensitivity was 31-33% reduced versus SRS. Risk-based collection of specimens on a subsample of the cohort could be a feasible and efficient approach to collecting extra specimens for molecular epidemiology.

Publication Details

PubMed ID
38965750

Digital Object Identifier
10.1093/aje/kwae139

Publication
Am J Epidemiol. 2024 Jul 4

Related CDAS Studies Related CDAS Studies