One issue with the Shared cM Project, however, is that it is user-submitted data, meaning there are invariably two inherent problems that will affect that data: (1) data entry errors; and (2) relationships that are not accurate.
It is actually a very simple matter to resolve both of these issues, and that is to provide the distributions for the data. The distributions will show clearly where the outliers (the errors and the incorrect relationships) reside. To generate distributions, I enlisted the help of mathematician Ingrid Baade, who volunteered all of her time. I am forever in her debt for this contribution!