Skip to Main Content

An official website of the United States government

Principal Investigator
Name
Keyvan Farahani
Degrees
PhD
Institution
National Cancer Institute
Position Title
Program Director
Email
About this CDAS Project
Study
NLST (Learn more about this study)
Project ID
NLST-780
Initial CDAS Request Approval
Apr 12, 2021
Title
Distribution of NLST imaging and limited clinical data through NCI public image repositories
Summary
In March 2021, NCI/DCP (Dr. Lori Minasian and Dr. Paul Pinsky) agreed to make the NLST image collection and limited clinical data (herein referred to as NLST data) publicly available through NCI image repositories, TCIA and IDC (Imaging Data Commons). The aims of this proposal are to (1) gain IDC access to the NLST data, and (2) coordinate the public release of this data through TCIA and IDC, tentatively planned for September 2021. We will pursue these aims in coordination with IMS, various NCI divisions and programs (DCP, CIP, and CBIIT), the National Biomedical Imaging Archive (NBIA), TCIA, and IDC. At completion of this project we will provide the research community with unrestricted access to the NLST data. Coordination and synchronization of the public release will provide the user with a choice to download the NLST data from TCIA or view the imaging data on the cloud based IDC, create data cohorts and process them in public cloud resources, including the NCI Cancer Cloud Resources, as components of the NCI Cancer Research Data Commons (CRDC). While, based on the current requirements, there will be egress charges for downloading data from the cloud, users of IDC will be able to take advantage of developing and advanced cloud compute tools and infrastructure. They will be able to combine and correlated imaging data with other data types (e.g., pathology, genomics, clinical) when available and they may choose to share their results through CRDC.
Furthermore, while limited clinical data will be made publicly available through NCI imaging repositories, users interested in additional clinical data will be directed, on both TCIA and IDC, to submit their requests through CDAS for approval.
Aims

(1) Gain IDC access to the NLST data
(2) Coordinate the public release of this data through TCIA and IDC
(3) Develop a data descriptor manuscript describing the attributes of the NLST data and access systems. The manuscript will be submitted at the time of public release of the data.

Collaborators

Keyvan Farahani (NCI)
Ulrike Wagner (FNLCR)
John Freymann (FNLCR)
Justin Kirby (FNLCR)
David Clunie (Pixelmed)
Bill Clifford (Institute for Systems Biology)
Bill Longabaugh (Institute for Systems Biology)
Andrey Fedorov (Harvard University)
Scott Gustafson (Ellumen Inc.)
Tracy Nolan (University of Arkansas)
Jeremy Jarosz (University of Arkansas)
Lawrence Tarbox (University of Arkansas)
William Bennett (University of Arkansas)
Kirk Smith (University of Arkansas)
Fred Prior (University of Arkansas)