23andMe
From SNPedia
23andMe is a private biotech company based in the USA.
In 2007, they began offering direct-to-consumer genetic testing. Their service uses a customized Illumina Hap550+ array.
23andMe customers may be interested in using the Personal Genome Explorer and Promethease to learn more about their own dna.
Here is a list of all of the SNPedia snps on the customized chip
Older posts (from 2007) related to the (extensive) press coverage received by 23andMe:
You can watch the founders explain their plan during this youtube video.
Blog coverage has been extensive, you will find more information at bbgm
According to a story in Fortune, Warren Buffet and Jimmy Buffet submitted DNA tests for 23andMe. Approximately March 2007. Fortune Magazine June 11 2007
NY Times biography of Anne Wojcicki
NY Times coverage of 23andMe
a well written analysis of 23andMed from bioarraynews (login req)
Bio-IT World has announced the three keynote speakers for its sixth annual Bio-IT World Conference & Expo, to be held in Boston April 28-30, 2008).
The keynoters will be Linda Avey, co-founder of 23andMe; Joshua Boger, president & CEO of Vertex Pharmaceuticals; and John Reynders, CIO, Johnson & Johnson Life Sciences Division.
Linda Avey is the co-founder of 23andMe, the Bay Area consumer genomics start-up. She previously worked in sales and business development for Affymetrix and Perlegen Sciences. Prior to that, Avey had stints at Spotfire and Applied Biosystems. Avey founded 23andMe with Anne Wojcicki, wife of Google co-founder Sergey Brin (Google is also an investor in 23andMe). Partnering with Illumina, 23andMe will shortly unveil a consumer genotyping service offering individuals whole-genome analysis to learn about genealogy, ancestry, and medical issues. Avey will speak on Wednesday morning, April 30, 2008.
Various news sites reported on Oct-4-2007 23andMe had raised an additional $9 million in funding. investor include Google, Mohr Davidow Ventures, New Enterprise Associates, and Genentech.
genomeboy found a few new details on 23andMe and Navigenics
photos from Flickr..
[edit] Data
Customers can now download their raw data. The raw data is delivered in a compressed ZIP file (about 5 MB). Sample raw data is available from SNPedia, for example for Mikolaj_Habryn, (download ).
unzip -t genome_Mikolaj_Habryn_20080522154706.zip
Archive: genome_Mikolaj_Habryn_20080522154706.zip
testing: genome_Mikolaj_Habryn_20080522154706.txt OK
No errors detected in compressed data of genome_Mikolaj_Habryn_20080522154706.zip.
SNP data is reported in an ASCII file containing comma-separated values (TAB-separated to be precise).
unzip -c genome_Mikolaj_Habryn_20080522154706.zip genome_Mikolaj_Habryn_20080522154706.txt | head -25 Archive: genome_Mikolaj_Habryn_20080522154706.zip inflating: genome_Mikolaj_Habryn_20080522154706.txt # This data file generated by 23andMe at: Thu May 22 15:47:06 2008 # # Below is a text version of your data. Fields are TAB-separated # Each line corresponds to a single SNP. For each SNP, we provide its identifier # (an rsid or an internal id), its location on the reference human genome, and the # genotype call oriented with respect to the plus strand on the human reference # sequence. We are using reference human assembly build 36. Note that it is possible # that data downloaded at different times may be different due to ongoing improvements # in our ability to call genotypes. # # More information on reference human assembly build 36: # http://www.ncbi.nlm.nih.gov/projects/mapview/map_search.cgi?taxid=9606&build=36 # # rsid chromosome position genotype rs3094315 1 742429 AG rs12562034 1 758311 AG rs3934834 1 995669 CC rs9442372 1 1008567 AG rs3737728 1 1011278 AG rs11260588 1 1011521 GG rs6687776 1 1020428 CC rs9651273 1 1021403 AG rs4970405 1 1038818 AA ...
The fourth column of the SNP data contains the actual genotype (AG, CC, CC and so on). Further analysis of this data set reveals that 23andMe (unlike deCODEme) reports a very small number of deletions and insertions, coded as II (occurs 30 times), DD (occurs 28 times), and DI (occurs 3 times). The actual numbers can be calculated like this (in a Unix command shell):
unzip -c genome_Mikolaj_Habryn_20080522154706.zip genome_Mikolaj_Habryn_20080522154706.txt |\
awk 'NF==4{print $4}' | sort | uniq -c | sort -n
3 DI
28 DD
30 II
234 AT
354 CG
5491 --
15980 GT
16301 AC
70180 CT
70451 AG
93323 AA
94030 TT
105381 GG
105994 CC