github link

search-icon
Find the data you need

Search the multi-organism collection of genome wide gene expression data obtained from publicly available sources like GEO, ArrayExpress, and SRA. The data has been processed uniformly and normalized using a set of standardized pipelines curated by the Childhood Cancer Data Lab (CCDL).

dataset-icon
Create custom datasets

Build and download custom datasets tailored to your needs including gene expression matrices and sample metadata.

You can use refine.bio datasets for preliminary assessment of biological signals and to accelerate validation of your research findings.

Differential Expression Analysis
Learn how you can do differential expression analysis with refine.bio datasets.
Pathway Analysis
Learn how you can use refine.bio data to identify pathways that are active in your biological condition of interest.
Use your data alongside refine.bio data
We make our transcriptome indices and our reference distributions used for quantile normalization available to make your own data more comparable to refine.bio data.
refine.bio Compendia
refine.bio compendia are collections of samples that have been processed and packaged for broad and flexible use.
Explore the docs
Learn about how we source and process data and other downstream analyses you can do with refine.bio data.
Sign Up for Updates
Be the first to know about new features, compendia releases, and more!