Scalable VCF format

Scalable VCF improves storage, scalability

The default Variant Call Format (VCF) scales poorly: a 150,000-genome VCF requires 900 tebibytes (TiB). VCF densely represents both reference genotypes and allele-indexed arrays, which duplicates data unnecessarily and produces very large files. The Scalable Variant Call Representation (SVCR) reduces file sizes by ensuring that they scale linearly with the number of samples. It is also lossless and mergeable, which allows N + 1 and N + K incremental joint calling.
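
To make the scaling argument concrete, the following is a toy model and not SVCR's actual encoding. It assumes a cohort in which every sample carries one shared variant plus a few private ones, and it compares the number of genotype entries in a dense site-by-sample matrix with a sparse layout in which each sample stores only its own non-reference calls (how reference genotypes are stored is ignored here). All function names are hypothetical and chosen for illustration. The `pl_length` helper shows why allele-indexed arrays grow quickly: a diploid genotype-indexed field has one entry per unordered pair of alleles.

```python
from typing import List, Set


def pl_length(n_alleles: int) -> int:
    """Diploid genotype count (one array entry each) for n_alleles alleles,
    reference allele included."""
    return n_alleles * (n_alleles + 1) // 2


def dense_entries(sample_variants: List[Set[int]]) -> int:
    """Entries in a dense matrix: every sample gets a genotype at every
    site that is variant in at least one sample."""
    sites = set().union(*sample_variants)
    return len(sites) * len(sample_variants)


def sparse_entries(sample_variants: List[Set[int]]) -> int:
    """Entries when each sample stores only its own non-reference calls
    (toy assumption; reference genotypes are not counted)."""
    return sum(len(variants) for variants in sample_variants)


def make_cohort(n_samples: int, private_per_sample: int = 3) -> List[Set[int]]:
    """Hypothetical cohort: site 0 is shared, all other sites are private."""
    cohort = []
    for i in range(n_samples):
        start = 1 + i * private_per_sample
        private = set(range(start, start + private_per_sample))
        cohort.append({0} | private)
    return cohort


if __name__ == "__main__":
    for n in (4, 8, 16):
        cohort = make_cohort(n)
        print(n, dense_entries(cohort), sparse_entries(cohort))
    print(pl_length(2), pl_length(10))
```

In this toy cohort, doubling the number of samples doubles the sparse count but roughly quadruples the dense count, because each new sample both adds a column and introduces new sites that every existing sample must then fill in. Adding one sample to the sparse layout only appends that sample's own entries, which is the property that makes N + 1 style merging cheap in this simplified picture.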