Have questions? Visit https://www.reddit.com/r/SNPedia

Talk:Genoset

From SNPedia

Missing Information[edit]

Genosets are great, but empty ones are not! Genosets Gs165 and Gs166 have not yet been assigned a magnitude... I am not sure why there have been added so many Haplogroup genosets without any criteria. That is just an excellent way to fill a database with junk. The genosets are all between Gs1074...Gs3049.

Another interesting genoset is Gs140 for slow metabolizers. (How was that set generated?) It may be useful also to have one for fast ones as well... E:V:A 17:41, 6 October 2010 (UTC)

I've set the magnitudes for gs165 gs166. The gs1074... is the result of Special:Contributions/RebekahThorn and while she added a lot of good info, she left even more stubs undone. I'd hoped that others with an interest in genealogy might pickup where she left off, but at this point I agree. They are becoming cruft and should be removed. Cariaso 19:52, 6 October 2010 (UTC)
Great! If you have an idea where to pick that (work) up, perhaps... Also, please disregard Gs140 comment, just had a look at both Gs138 and Gs139. E:V:A 11:55, 7 October 2010 (UTC)


Is it possible to calculate prevalence of genosets, let's say from openSNP data? I would be quite interested in gs224, and how the prevalence compares to reported literature values for http://www.ncbi.nlm.nih.gov/pubmed/18598896?dopt=Abstract . by Karl, 5 March 2014
possible, certainly, but to make it easy I've sent email to Bastian at opensnp, to learn an answer to a question of my own about the urls for each user's data. --- cariaso 03:51, 5 March 2014 (UTC)
just got this email from the opensnp guys for anyone who's interested:

Hi Karl! Thank you very much :)

Genosets is a good idea - it seems we can just parse all of these from SNPedia (like this: http://snpedia.com/index.php/Gs181/criteria ) and then assign users to each genoset depending on their alleles, and we'd get numbers of users for each set.

I've opened an issue on github for this to discuss with others: https://github.com/gedankenstuecke/snpr/issues/105

It might take quite a while since we've got other issues before that, but you should be able to follow the progress via the link.

Thanks for the suggestion! Philipp Bayer

, by Karl, 5 March 2014

the opensnp issue is 99% resolved, and documented at https://github.com/gedankenstuecke/snpr/issues/105 . I'm waiting for the rest of the records to process, and should have full results in ~24h. --- cariaso 16:26, 5 March 2014 (UTC)