Skip to Main Content

COVID-19 is an emerging, rapidly evolving situation.

What people with cancer should know:

Get the latest public health information from CDC:

Get the latest research information from NIH:

The following PLCO Breast dataset(s) are available for delivery on CDAS. For each dataset, a Data Dictionary that describes the data is publicly available. In order to obtain the actual data in SAS or CSV format, you must begin a data-only request. Data will be delivered once the project is approved and data transfer agreements are completed.

To learn more about the Breast data collected as part of the study, visit the PLCO Data Collected documentation page.

Datasets and Data Dictionaries

Data Dictionary
(PDF - 494.0 KB)
1. The Breast dataset is a comprehensive dataset that contains nearly all the PLCO study data available for breast cancer incidence and mortality analyses. For many women the trial documents multiple breast cancers, however, this file only has data on the earliest breast cancer diagnosed in the trial. The dataset contains one record for each of the approximately 78,000 women in the PLCO trial.
Data Dictionary
(PDF - 194.9 KB)
2. The Breast Secondary dataset contains data available for additional breast cancers documented in the trial and collected on the Breast Cancer Supplement form. The dataset contains one record for each of the approximately 78,000 women in the PLCO trial.

User Guides and Other Files

User Guides are intended to serve as a guide to using the data contained in these datasets.

PLCO User Guide
(PDF - 360.0 KB)

Data-Collection Forms

The following forms were used to collect data that is now available in the datasets listed above. They are provided in PDF format.

Baseline Questionnaire - Female
BQF3 - Scanned (1.7 MB)
Breast Cancer Supplemental Form
BCS - Scanned (317.6 KB)
Other Cancer Form
OCF - Scanned (405.5 KB)
Annual Study Update Form
ASU - Scanned (82.2 KB)