Have questions? Visit https://www.reddit.com/r/SNPedia


From SNPedia


23andMe is a private biotech company based in the USA.

The Ancestry only product produces the same Promethease report as the Ancestry + Health product

You can download your raw data, but these steps may be helpful if you're on an ipad or iphone.

There is also a developer API.

In 2007, they began offering direct-to-consumer genetic testing. For version 1 and version 2 of their service they used a customized Illumina Hap550+ array. As of November 24, 2010, their version 3 service is based on a customized Illumina OmniExpress+ array.

August 2017 they introduced a new v5 array, based on the illumina GSA chip. There appears to have been an issue where during the the first 24 hours, many users raw data included only 200k snp, instead of the expected 600k. After that was corrected promethease is seeing way too many warnings of heterozygosity for serious pathogenic BRCA2 snps. see https://www.reddit.com/r/promethease/comments/6stg3t/scary_false_23andme_brca2_miscalls_on_new_v5_chip/ and https://www.23andmeforums.com/discussion/comment/89971/

A comparison of the platform's result for User:Lilly Mendel

23andMe customers may be interested in using Promethease to learn more about their raw dna.

Example Promethease reports for this company

 PlatformRaw data availableSummary
Aaron Vollrath23andMe v2falseMale on 23andMe
Almelina23andMe v2falseFemale with cancer 2x
Blainebettinger23andMe v2
FTDNA Family Finder
true23andMe v2 & v3, FTDNA Family Finder, Ancestry.com
23andMe v2
Visualized family snps; whole family 23andMe genotypes
Ronald Bailey23andMe v223andMe data from the science correspondent of Reason magazine

Here is a list of all of the SNPedia snps on the customized chip

Older posts (from 2007) related to the (extensive) press coverage received by 23andMe:

You can watch the founders explain their plan during this youtube video.

Blog coverage has been extensive, you will find more information at bbgm

According to a story in Fortune, Warren Buffet and Jimmy Buffet submitted DNA tests for 23andMe. Approximately March 2007. Fortune Magazine June 11 2007

NY Times biography of Anne Wojcicki

NY Times coverage of 23andMe

a well written analysis of 23andMed from bioarraynews (login req)

Bio-IT World has announced the three keynote speakers for its sixth annual Bio-IT World Conference & Expo, to be held in Boston April 28-30, 2008).

The keynoters will be Linda Avey, co-founder of 23andMe; Joshua Boger, president & CEO of Vertex Pharmaceuticals; and John Reynders, CIO, Johnson & Johnson Life Sciences Division.

Linda Avey is the co-founder of 23andMe, the Bay Area consumer genomics start-up. She previously worked in sales and business development for Affymetrix and Perlegen Sciences. Prior to that, Avey had stints at Spotfire and Applied Biosystems. Avey founded 23andMe with Anne Wojcicki, wife of Google co-founder Sergey Brin (Google is also an investor in 23andMe). Partnering with Illumina, 23andMe will shortly unveil a consumer genotyping service offering individuals whole-genome analysis to learn about genealogy, ancestry, and medical issues. Avey will speak on Wednesday morning, April 30, 2008.

Various news sites reported on Oct-4-2007 23andMe had raised an additional $9 million in funding. investor include Google, Mohr Davidow Ventures, New Enterprise Associates, and Genentech.

genomeboy found a few new details on 23andMe and Navigenics

photos from Flickr..


Customers can now download their raw data. Developers should use the API.

If you look here: https://customercare.23andme.com/hc/en-us/articles/219202567-What-do-I-need-to-know-about-transitioning-to-the-new-23andMe-

and then click on the link for 'Locked Reports', you'll go to here: https://customercare.23andme.com/hc/en-us/articles/219202567-What-do-I-need-to-know-about-transitioning-to-the-new-23andMe-#locked_report

which eventually leads to the following:

To opt in to a locked report, click the name of the report and read general information about the condition. The report will tell you how to make your results visible. Once you have opted in to the report, it will shuffle itself, alphabetically, within the appropriate list. Note that once you opted in and unlocked a locked report, it cannot be re-locked.

The raw data is delivered in a compressed ZIP file (about 5 MB). Sample raw data is available from SNPedia, for example for Mikolaj_Habryn, (download ).

 unzip -t genome_Mikolaj_Habryn_20080522154706.zip
 Archive:  genome_Mikolaj_Habryn_20080522154706.zip
     testing: genome_Mikolaj_Habryn_20080522154706.txt   OK
 No errors detected in compressed data of genome_Mikolaj_Habryn_20080522154706.zip.

SNP data is reported in an ASCII file containing comma-separated values (TAB-separated to be precise).

 unzip -c genome_Mikolaj_Habryn_20080522154706.zip genome_Mikolaj_Habryn_20080522154706.txt  | head -25
 Archive:  genome_Mikolaj_Habryn_20080522154706.zip
   inflating: genome_Mikolaj_Habryn_20080522154706.txt  
 # This data file generated by 23andMe at: Thu May 22 15:47:06 2008
 # Below is a text version of your data. Fields are TAB-separated
 # Each line corresponds to a single SNP.  For each SNP, we provide its identifier 
 # (an rsid or an internal id), its location on the reference human genome, and the 
 # genotype call oriented with respect to the plus strand on the human reference 
 # sequence.  We are using reference human assembly build 36.  Note that it is possible 
 # that data downloaded at different times may be different due to ongoing improvements 
 # in our ability to call genotypes.
 # More information on reference human assembly build 36:
 # http://www.ncbi.nlm.nih.gov/projects/mapview/map_search.cgi?taxid=9606&build=36
 # rsid  chromosome      position        genotype
 rs3094315       1       742429  AG
 rs12562034      1       758311  AG
 rs3934834       1       995669  CC
 rs9442372       1       1008567 AG
 rs3737728       1       1011278 AG
 rs11260588      1       1011521 GG
 rs6687776       1       1020428 CC
 rs9651273       1       1021403 AG
 rs4970405       1       1038818 AA

The fourth column of the SNP data contains the actual genotype (AG, CC, CC and so on). Further analysis of this data set reveals that 23andMe (unlike deCODEme) reports a very small number of deletions and insertions, coded as II (occurs 30 times), DD (occurs 28 times), and DI (occurs 3 times). The actual numbers can be calculated like this (in a Unix command shell):

 unzip -c genome_Mikolaj_Habryn_20080522154706.zip genome_Mikolaj_Habryn_20080522154706.txt  |\
 awk 'NF==4{print $4}' | sort | uniq -c | sort -n 
       3 DI
      28 DD
      30 II
     234 AT
     354 CG
    5491 --
   15980 GT
   16301 AC
   70180 CT
   70451 AG
   93323 AA
   94030 TT
  105381 GG
  105994 CC


genotyping http://www.google.com/patents/US8428886

rs10513789 http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO2&Sect2=HITOFF&p=1&u=/netahtml/PTO/search-bool.html&r=1&f=G&l=50&co1=AND&d=PTXT&s1=8,187,811&OS=8,187,811&RS=8,187,811