Convert VCF file's Variants Coordinates

June 9, 2014

You've all faced the case where you downloaded from the 1000 Genome Project, TCGA or got from a colleague a VCF file with coordinates from a different reference genome than the one you are using.

This could represent an issue especially if you are building a database, comparing variants or when using GATK.

VCF-RefChange is a Python script solve the problem of the conversion of a VCF variants coordinates from one reference genome version to another.

It relies on the output of NCBI's BLAST, it works better with small sequences (ie. human mithochondrial dna).

If you'd like to use the script with larger files (Whole exome or genome), we reccomend that you split the VCF file by chromosome or smaller chunks.

Comments