Last Updated: February 25, 2016
·
3.627K
· arrogantrobot

Convert 23andme raw data to VCF

Use this tool to convert your raw 23andme data into a VCF file, then run it through snpEff, or whatever other tools you like: https://github.com/arrogantrobot/23andme2vcf

3 Responses
Add your response

When I tried this the script went into an infinite loop with the error message

Use of uninitialized value $dataline in split at 23andme2vcf.pl line 165, <GEN1> line 960613.
Use of uninitialized value $chr in string eq at 23andme2vcf.pl line 166, <GEN1> line 960613.
Use of uninitialized value $pos in numeric gt (>) at 23andme2vcf.pl line 170, <GEN1> line 960613.
Use of uninitialized value $data
line in scalar chomp at 23andme2vcf.pl line 164, <GEN1> line 960613.
Use of uninitialized value $data_line in split at 23andme2vcf.pl line 165, <GEN1> line 960613.
Use of uninitialized value $chr in string eq at 23andme2vcf.pl line 166, <GEN1> line 960613.
Use of uninitialized value $pos in numeric gt (>) at 23andme2vcf.pl line 170, <GEN1> line 960613.

This happened with either my data set or the dataset that was supplied in the folder.

Rich

over 1 year ago ·

Same for me. Prints endless "uninitialized values" and took more than a week, until I ended up killing the script.

over 1 year ago ·

I have recently pushed a fix for the missing reference position issues. I have also added a means for improving the reference included with the project. When you run your data, if any sites in your data are not included in the reference, the chromosome and position will be noted in a file named "sitesnotin_reference.txt". There is no personally identifying information in this, just genomic positions. If you'd like me to update the included reference to add the reference bases for these sites, just get that list to me, and I will update the included reference.

over 1 year ago ·