Dear gnomAD team,
I noticed that gnomAD v4 provides age-related information, for example:
##INFO=<ID=age_hist_het_bin_freq,Number=A,Type=String,Description="Histogram of ages of heterozygous individuals; bin edges are: 30.0|35.0|40.0|45.0|50.0|55.0|60.0|65.0|70.0|75.0|80.0; total number of individuals of any genotype bin: 1332|1401|1642|2949|4283|3601|3282|2909|1955|1202">
##INFO=<ID=age_hist_hom_bin_freq,Number=A,Type=String,Description="Histogram of ages of homozygous alternate individuals; bin edges are: 30.0|35.0|40.0|45.0|50.0|55.0|60.0|65.0|70.0|75.0|80.0; total number of individuals of any genotype bin: 1332|1401|1642|2949|4283|3601|3282|2909|1955|1202">
The total number of individuals in each age bin is provided in these annotations. Although the exact AN per bin is not specified, the maximum possible AN for each bin can be inferred as twice the total number of individuals.
However, when I analyzed variants across different age groups, I found that some variants have counts exceeding the maximum possible number of individuals per bin. For example, for chr10:96318482:A>G, the following INFO is shown:
...;age_hist_hom_bin_freq=1404|1573|1873|3189|4517|3871|3613|3152|2133|1271;...
In fact, every one of these ten values exceeds the corresponding total in the header:
1332|1401|1642|2949|4283|3601|3282|2909|1955|1202
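The bin-by-bin comparison can be sketched roughly as follows. This is only a minimal Python sketch: the helper names `parse_bins` and `bins_exceeding` are made up for illustration and are not part of any gnomAD tooling, and the two strings are copied from the header and the variant quoted above.

```python
def parse_bins(value: str) -> list[int]:
    """Split a pipe-delimited histogram string into integer bin counts."""
    return [int(x) for x in value.split("|")]


def bins_exceeding(observed: list[int], totals: list[int]) -> list[tuple[int, int, int]]:
    """Return (bin_index, observed, total) for every bin where observed > total."""
    if len(observed) != len(totals):
        raise ValueError("histograms must have the same number of bins")
    return [
        (i, obs, tot)
        for i, (obs, tot) in enumerate(zip(observed, totals))
        if obs > tot
    ]


# Totals quoted in the header Description ("individuals of any genotype").
header_totals = parse_bins("1332|1401|1642|2949|4283|3601|3282|2909|1955|1202")

# age_hist_hom_bin_freq reported for chr10:96318482:A>G.
variant_hom = parse_bins("1404|1573|1873|3189|4517|3871|3613|3152|2133|1271")

exceeding = bins_exceeding(variant_hom, header_totals)
for i, obs, tot in exceeding:
    print(f"bin {i}: observed {obs} > header total {tot}")
```

Running this on the values above lists each bin in which the per-variant count is larger than the header total.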
This discrepancy appears in many variants. Could you please clarify whether it results from a technical or annotation issue, or whether I am misinterpreting the meaning of these INFO fields?
Thank you very much for your help!
Best regards,
Juse