This directory contains the array data submission from GenoGraphic project. The original data is under raw_data directory. File raw_data/NG_hg38_rs_pos.new.csv, provided by the submitter, is a position reference file for all rs that are present in the data set. However, upon evaluation, even though majority of sites are on GRCh38, there are some GRCh37 sites. The aggregated data (on GRCh38), in vcf format, are under aggregated_data/natgeo_vcf_with_freq. The aggregated file contains individual genotypes and aggregated allele frequencies for a global population that contains all individuals in the submission. VCF positions, reference alleles are taken from NG_hg38_rs_pos.new.csv, and validated against the reference genome.