GCD Datasets in Progress

The Datasets in Progress table lists known global coherent datasets (GCDs) that are not yet sufficiently complete for release. For datasets covering more than one tissue, the approximate number of individuals refers to the tissue with the greatest number of individuals; not all tissues may have this degree of coverage.

To add to or update this list, please contact repdata@sagebase.org.

| Dataset Name | Tumor/Tissue Type | Species | Disease | Approx. No. of Individuals | Investigator | Institution | Reference (PMID) | Description |
|---|---|---|---|---|---|---|---|---|
| Human_Cancer_Ovarian_TCGA | Ovarian | Human | Cancer | 387 | TCGA | TCGA | tcga.cancer.gov | |
| Human_Cancer_Lung_TCGA | Lung | Human | Cancer | 500 | TCGA | TCGA | tcga.cancer.gov | |
| Human_Cancer_HCC_NCI | HCC | Human | Cancer | 200 | Snorri Thorgeirsson | NCI | 19366792 | |
| Human_Cancer_Lung_Canary | Lung | Human | Cancer | | | Canary | | |
| Human_Cancer_Pancreatic_ICGC | Pancreatic | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Ovarian_ICGC | Ovarian | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Gastric_ICGC | Gastric | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Breast(triple negative)_ICGC | Breast (triple negative) | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Breast(HER2)_ICGC | Breast (HER2) | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_HCC (viral)_ICGC | HCC (viral) | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_HCC (alcohol)_ICGC | HCC (alcohol) | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Brain(pediatric)_ICGC | Brain (pediatric) | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Oral_ICGC | Oral | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_CLL_ICGC | CLL | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Glioblastoma_ICGC | Glioblastoma | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Lung_ICGC | Lung | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_AML_ICGC | AML | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Colon_ICGC | Colon | Human | Cancer | 500 | ICGC | ICGC | | See Descriptions |
| Human_Cancer_Lung_ACRG | Lung | Human | Cancer | 2,000 | ACRG | ACRG | | See Descriptions |
| Human_Cancer_Gastric_ACRG | Gastric | Human | Cancer | 2,000 | ACRG | ACRG | | See Descriptions |
| Human_Cancer_Myeloma_Sage | Myeloma | Human | Cancer | 300 | Stephen Friend | Sage | | See Descriptions |
| Human_Cancer_Lung_Sage | Lung | Human | Cancer | 300 | Stephen Friend | Sage | | See Descriptions |
| Human_Cancer_Ovarian_Sage | Ovarian | Human | Cancer | 300 | Stephen Friend | Sage | | See Descriptions |
| Human_Cancer_AML_Sage | AML | Human | Cancer | 300 | Stephen Friend | Sage | | See Descriptions |
| Human_Cancer_Breast_Sage | Breast | Human | Cancer | 300 | Stephen Friend | Sage | | See Descriptions |
| Human_Cancer_Medulloblastoma_JHSM | Medulloblastoma | Human | Cancer | 47 | Bert Vogelstein | JHSM | | See Descriptions |
| Human_Cancer_Pancreas_JHSM | Pancreas | Human | Cancer | 47 | Bert Vogelstein | JHSM | | See Descriptions |
| Human_Cancer_Breast_NKI | Breast | Human | Cancer | 1,000 | Rene Bernards | NKI | | See Descriptions |
| Human_Cancer_Colon_HKU | Colon | Human | Cancer | 400 | Suet-Yi Leung | HKU | | See Descriptions |

Descriptions:

Human_Cancer_Pancreatic_ICGC
Human_Cancer_Ovarian_ICGC
Human_Cancer_Gastric_ICGC
Human_Cancer_Breast(triple negative)_ICGC
Human_Cancer_Breast(HER2)_ICGC
Human_Cancer_HCC (viral)_ICGC
Human_Cancer_HCC (alcohol)_ICGC
Human_Cancer_Brain(pediatric)_ICGC
Human_Cancer_Oral_ICGC
Human_Cancer_CLL_ICGC
Human_Cancer_Glioblastoma_ICGC
Human_Cancer_Lung_ICGC
Human_Cancer_AML_ICGC
Human_Cancer_Colon_ICGC

Each of these cohorts consists of prospectively collected, matched tumor and adjacent normal tissue from 500 patients with a particular cancer diagnosis. Data include full-genome DNA sequence, DNA methylation, mRNA expression profiling, and miRNA. Clinical outcomes are available. See www.icgc.org.

Human_Cancer_Lung_ACRG
Human_Cancer_Gastric_ACRG

Eli Lilly and Company, Merck, and Pfizer Inc. have formed the Asian Cancer Research Group, Inc. (ACRG), an independent, not-for-profit company established to accelerate research and ultimately improve treatment for patients affected by the most commonly diagnosed cancers in Asia. Over the next two years, ACRG has committed to creating one of the most extensive pharmacogenomic cancer databases known to date. This database will be composed of data from approximately 2,000 tissue samples from patients with lung and gastric cancer. It will be made publicly available to researchers and, over time, further populated with clinical data from a longitudinal analysis of patients. Comparison of the contrasting genomic signatures of these cancers could inform new approaches to treatment.

Human_Cancer_Myeloma_Sage
Human_Cancer_Lung_Sage
Human_Cancer_Ovarian_Sage
Human_Cancer_AML_Sage
Human_Cancer_Breast_Sage

Clinical outcomes with responder/non-responder status for standard-of-care therapies. Partly retrospective, partly prospective. Patient-driven. At minimum, full exome sequencing.

Human_Cancer_Medulloblastoma_JHSM
Comprehensive data on DNA and mRNA (copy number, expression, sequence) from 47 medulloblastoma samples, including sequence for more than 20,000 genes. Clinical outcomes available.

Human_Cancer_Pancreas_JHSM
Comprehensive data on DNA and mRNA (copy number, expression, sequence) from 47 pancreatic cancer samples, including sequence for more than 20,000 genes. Clinical outcomes available.

Human_Cancer_Breast_NKI
A cohort of 1,000 individuals with invasive breast cancer from the Netherlands. Comprehensive genomic data are being generated from DNA and mRNA (copy number, expression, sequence). Clinical outcome data available (median follow-up 8 years).

Human_Cancer_Colon_HKU
A cohort of 400 individuals with colon carcinoma, with RNA profiling (already generated), DNA genotyping (planned), and miRNA profiling (planned). Clinical outcomes available.



View Sage Available and Transition Datasets