I am getting this error from Arrow when trying to convert the VCFs into Parquet files:
Arrow(InvalidArgumentError("Column 'info_AF' is declared as non-nullable but contains null values"))
I’d like to just verify whether the declaration of the info_AF
in your VCFs and the data in those columns are as they should be with regards to the VCF spec, and this is more a general problem with Arrow. Which makes me wonder whether it would be possible to offer your data as Parquet in addition to VCF and Hail?