“For information on constraint, see The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020). Descriptions of the fields in these files can be found in the Supplementary Dataset 11 section on pages 74-77 of the Supplementary Information.”
However, I couldn’t locate pages 74-77 in Supplementary Dataset 11. From what I’ve downloaded, it seems that Dataset 11 is the Gene constraint file itself, not the documentation.
Could you please help clarify what I might be missing?
I came across the following note in the Supplementary Information:
“Note that this file contains all transcripts: for gene-based analyses, the file should be filtered to canonical transcripts (canonical == true), and LOEUF decile bin (oe_lof_upper_bin) recomputed for each gene.”
Could you please clarify how I can filter the file to include only the canonical transcripts? The note implies there is a column named canonical to filter on, but I couldn’t find it in the file I downloaded.
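For context, this is roughly what I expected to be able to do. It is only a sketch of my reading of the note, not a confirmed procedure: the column names canonical and oe_lof_upper_bin come from the note itself, while oe_lof_upper (the LOEUF value), the rank-based binning into deciles 0-9, the two helper functions, and the sample rows are all my own assumptions for illustration.

```python
import csv
import io
import math

# Made-up rows standing in for the real tab-separated file (hypothetical values).
SAMPLE = "\n".join("\t".join(fields) for fields in [
    ("gene", "transcript", "canonical", "oe_lof_upper", "oe_lof_upper_bin"),
    ("GENE_A", "TX1", "true", "0.20", "0"),
    ("GENE_A", "TX2", "false", "0.90", "4"),
    ("GENE_B", "TX3", "true", "1.50", "8"),
    ("GENE_C", "TX4", "true", "NA", "NA"),
])


def filter_canonical(rows, column="canonical"):
    """Keep rows whose canonical flag is 'true' (case-insensitive)."""
    if rows and column not in rows[0]:
        raise KeyError(f"no '{column}' column in this file")
    return [row for row in rows if row[column].strip().lower() == "true"]


def recompute_decile_bins(rows, value_col="oe_lof_upper",
                          bin_col="oe_lof_upper_bin"):
    """Overwrite bin_col in place with rank-based decile bins 0-9.

    Rows with a missing or non-numeric value get the bin 'NA'.
    Assumption: deciles are assigned by rank after sorting ascending.
    """
    scored = []
    for row in rows:
        try:
            value = float(row[value_col])
        except ValueError:
            value = math.nan
        if math.isnan(value):
            row[bin_col] = "NA"
        else:
            scored.append((value, row))
    scored.sort(key=lambda pair: pair[0])
    for rank, (_, row) in enumerate(scored):
        row[bin_col] = str(rank * 10 // len(scored))
    return rows


rows = list(csv.DictReader(io.StringIO(SAMPLE), delimiter="\t"))
canonical_rows = recompute_decile_bins(filter_canonical(rows))
for row in canonical_rows:
    print(row["gene"], row["transcript"], row["oe_lof_upper_bin"])
```

With the real file, the in-memory sample would be replaced by the downloaded table. If filter_canonical raises KeyError there, that would confirm the file I have lacks the column the note refers to.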