A library of open statistical genetics data.
Remotely hosted, queryable from Python and R.
Fast
Careful partitioning and state-of-the-art file formats mean only the data you need gets downloaded.
Free
An open community resource. All datasets available under their original licenses.
Easy
No file management or custom data structures. Use the data frame formats you already know.
Install
pip install bedrock-bio
install.packages("bedrockbio")
Usage
import bedrock gwas = bedrock.load("gwas/ukbb_height") gwas.filter(gwas.pval < 5e-8).to_polars()
library(bedrockbio) gwas <- bedrock_load("gwas/ukbb_height") gwas |> filter(pval < 5e-8) |> collect()
Available Datasets
gwas/ukbb_*
UK Biobank GWAS summary statistics
eqtl/gtex_v8
GTEx v8 eQTL results
ref/gnomad_v4
gnomAD v4 allele frequencies
ld/1kg_eur
1000 Genomes LD reference panels
annot/vep_*
Pre-computed variant annotations
More datasets in development. See the full catalog or request a dataset.