A library of open statistical genetics data.

Remotely hosted, queryable from Python and R.

Fast

Careful partitioning and state-of-the-art file formats mean only the data you need gets downloaded.

Free

An open community resource. All datasets available under their original licenses.

Easy

No file management or custom data structures. Use the data frame formats you already know.

pip install bedrock-bio
install.packages("bedrockbio")
import bedrock

gwas = bedrock.load("gwas/ukbb_height")
gwas.filter(gwas.pval < 5e-8).to_polars()
library(bedrockbio)

gwas <- bedrock_load("gwas/ukbb_height")
gwas |> filter(pval < 5e-8) |> collect()
gwas/ukbb_* UK Biobank GWAS summary statistics
eqtl/gtex_v8 GTEx v8 eQTL results
ref/gnomad_v4 gnomAD v4 allele frequencies
ld/1kg_eur 1000 Genomes LD reference panels
annot/vep_* Pre-computed variant annotations

More datasets in development. See the full catalog or request a dataset.