As you might recall, a few months ago I sent out a call (“Collecting Sharing Information for Known Relationships“) for information about the amount of DNA shared by people having a known genealogical relationship. I was hoping to get a better picture of the ranges of the amount of DNA shared by people in these relationships (through about the third cousin range). The incredibly generous genetic genealogy community responded by submitting data bout more than 6,000 relationships!
I posted information a few weeks ago (“Collecting Sharing Information for Known Relationships – Part II“), but today I have an update.
This data is shared under a Attribution-NonCommercial-ShareAlike CC license. You are free to share and use the information for non-commercial purposes, as long as you give proper attribution and release anything you create under the same license.
NOTE: The information below is available HERE in an Excel file.
There are several important limitations to keep in mind. For example, this is all self-reported data, which means that there could be errors in relationships as well as in data entry. Additionally, people sometimes reported the same relationship from multiple companies. As a result, be sure to consider these limitations when utilizing this data.
I would love to hear your thoughts about this data, including the results as well as other ways to analyze the data set.