gnomAD v4/v4.1 in Google BigQuery?

Is there a roadmap for bringing gnomAD v4 or v4.1 to Google BigQuery’s public datasets?

Thanks for reaching out. We do not have plans to load gnomAD data into BigQuery. Out of curiosity, could you explain the benefit of having data hosted in BigQuery over our current API and downloadable files?

BigQuery enables large-scale genomic queries with SQL, parallel processing, and easy integration with other data (e.g., clinical, phenotypic). Many researchers prefer it for:

  • Speed and scale (especially with huge cohorts)
  • Cost-effective querying (versus spinning up cloud compute for custom processing)
  • Integration into Google Cloud-based pipelines (e.g., Terra, Vertex AI)

On a few occasions, the gnomAD APIs have been offline for 3-4 hours, and we have had to rerun the Terra pipelines.