As you know, I’ve been collecting submissions of known relationships and shared cMs for the Shared cM Project for about 1.5 years, and I have more than 10,000 submissions. (If you have submitted yet and would like to, the form is HERE).
I have a ton of data and recently provided an update (see “Update to the Shared cM Project“). However, there is still much to do. For example, I’d like to analyze the following:
- Longest segment analysis
- Known endogamy/pedigree collapse versus no known endogamy pedigree collapse
- Company breakdowns (23andme v. AncestryDNA v. Family Tree DNA v. GEDmatch)
- Others? (I’m open to suggestions!)
But I need help! I have time to oversee and assist with the analysis, but I just don’t have time to do it all myself. If you have statistics experience or are otherwise able to assist me with this project over the next month or two, please contact me at [email protected]. Include your experience and your interest in the project. Thank you!
And if you missed them, there were two excellent posts this week with incredibly valuable data for the genealogical community:
- The Limits of Predicting Relationships Using DNA by Leah Larkin
- The Halves & The Halve Nots by Jim Owson
Be sure to read both! These are great examples of genealogists using data we already have at our fingertips to learn more about using DNA.