The Effect of Phasing on Reducing False Distant Matches (Or, Phasing a Parent Using GEDmatch)

Genealogical autosomal DNA evidence relies on segments of DNA shared between two or more individuals. When they are true matching segments, they provide information about shared ancestry. One problem that genealogists are currently facing is the inability to decipher between “real” or “true” matching segments and “false” segments.

I won’t get too much into all the different terminology of “real” versus “false” here, because it isn’t important and takes away from the more important discussion. Genealogists, like patent attorneys, can be their own lexicographer, just so long as they are understood by the reader by providing a good definition. So here are my definitions for this post (and I typically use these elsewhere): ... Click to read more!

The Danger of Distant Matches

We know that small segments shared between two individuals can be problematic (see Small Matching Segments – Friend or Foe?), whether the two individuals are closely related or distantly related (or not related at all, as we’ll see). I call small segments (which I usually classify as 5 cM or less) as POISON because it is currently impossible to decipher between which are real segments and which are not.

In the following analysis, I use the wonderful new Match-O-Match tool at DNAGedcom to compare my and my parents’ match lists from AncestryDNA. The Match-O-Match tool is a powerful spreadsheet analysis tool developed by Don Worth. It is available to DNAGedcom subscribers as part of the DNAGedcom Client. For more, see page 10 of the PDF HERE. Thank you Don for this great new tool! ... Click to read more!

The DNA Era of Genealogy

When does DNA prove a relationship? When is a triangulation group sufficiently large enough to prove descent from an ancestral couple? When is a shared DNA segment large enough to prove someone is your first (or second/third/fourth, etc.) cousin? At what point does the DNA prove that I am descended from Samuel Snell? When does the DNA prove that you’ve found your great-grandmother’s biological parents?

NEVER.

And this is, perhaps, one of the greatest misconceptions in the post-DNA era of genealogy.

What is Proof?

Genealogy is the study of lives and relationships. Accordingly, genealogists spend much of their time identifying, hypothesizing, supporting, and sometimes rejecting, relationships.

Unless you have direct knowledge of a relationship (and even sometimes when you do), you identify relationships using evidence that you’ve gathered from multiple different sources (including DNA, census, land, tax, vital, and many other types of records). ... Click to read more!

Visual Phasing: An Example (Part 5 of 5)

Five-Part Series on Visual Phasing:

In “Visual Phasing: An Example (Part 1),” we identified and labeled all of the recombination points in the three siblings, Susan, Brooke, and Felix. Then, in Part 2 of the series, we used the identified and labeled recombination points to assign segments of DNA to the four grandparents (the blue grandparent and grey grandparent pair, and the purple grandparent and green grandparent pair). In Part 3 of the series, we used cousin matching to identify the grandparental source of the chromosomal segments. And finally, in Part 4 of the series, we characterized my paternal chromosome.

After Part 3, we had the following for Brooke, Susan, and Felix:

step4g

Now that we have this information, let’s see if we can use that to explore new matches with Brooke, Susan, and Felix. ... Click to read more!

GUEST POST – What a Difference a Phase Makes

The following is a guest post by Ann Turner, founder of the of the GENEALOGY-DNA mailing list at RootsWeb and co-author (with Megan Smolenyak) of “Trace Your Roots with DNA: Using Genetic Tests to Explore Your Family Tree.” Thank you Ann for this terrific post!

Genetic genealogists use autosomal DNA testing to locate people who share some DNA, enough to point to a relationship in a genealogical time frame. We’re not impressed by accidental matches that occur simply because all humans share 99.9% of their DNA. We want to be confident that the shared DNA segment is Identical by Descent (IBD) from a particular common ancestor, one who lived some number of generations in the past.

Two practical difficulties stand in the way of definitive confirmation. One is that our pedigrees are not complete, and we cannot test every link in the chain to prove that the segment traveled down the pathway we’ve identified through the paper trail. Indeed, as more data accumulates we frequently discover that a match we attributed to one ancestor must have come through an entirely different line. ... Click to read more!

Finding Genetic Cousins – Separating Fact from Fiction

AncestryDNAShort Summary: Before the end of the year, AncestryDNA plans to update our match lists using a new algorithm that reduces the number of false positive matches. For the first time, matching DNA segments will be characterized as IBS (i.e., a false positive) based on something other than simply segment length.

AncestryDNA Day

Last Monday, October 6th, I and six other members of the genetic genealogy community attended a ‘Bloggers Day’ hosted by AncestryDNA at the San Francisco headquarters of Ancestry.com. Two other members of the group have already written about the event:

While at ‘Bloggers Day’ we discussed many issues including the Y-DNA and mtDNA databases originally scheduled for destruction, upcoming changes to AncestryDNA’s matching algorithm (much more below), and other upcoming changes to the AncestryDNA about which you will hopefully soon hear much more. ... Click to read more!

TGG Interview Series VI – Ann Turner

Ann Turner has been a member of the genetic genealogy community since 2000, and during that time she has made great contributions to field (as will become obvious from her interview). According to her brief biography at the Journal of Genetic Genealogy:

Ann Turner is the founder of the GENEALOGY-DNA mailing list at RootsWeb and the co-author (with Megan Smolenyak) of “Trace Your Roots with DNA: Using Genetic Tests to Explore Your Family Tree.” She received her undergraduate degree in biology in 1964 and her M.D. from Stanford University in 1970. In recent years, she developed software for neuropsychological testing and wrote utility programs for the PAF genealogy program. One of these utilities provided a way to split out all people in a database who were related via their mitochondrial DNA, six years before mtDNA tests were commercially available. The inspiration for this feature came from the (then) forward-looking predictions of Dr. Thomas Roderick, now associate editor of JoGG. ... Click to read more!

The Growing Phenomenon of the Unlinked Family Cluster

Have you experienced this? You’ve identified a very clear cluster that includes numerous DNA matches that all descend from a single family, but you have no idea how this family links into your family tree. Try as you might, and despite building numerous trees, you can’t seem to figure out how these DNA matches and this single ancestral family link into your family tree. If this sounds familiar, you have an Unlinked Family Cluster!

Defining “Unlinked Family Cluster”

An Unlinked Family Cluster is a very specific phenomenon in genetic genealogy, one that is becoming increasingly common. We see more and more of these clusters for various reasons; the matching databases get larger and larger meaning that these clusters get larger and easier to identify. Additionally, the more we work with our closest DNA matches, the more we have very promising “left behind” matches that don’t fit into our known ancestry. Often, these “left behind” matches form a shared match cluster or a triangulated cluster around a specific family. This is an Unlinked Family Cluster. ... Click to read more!

Analyzing Segment Frequency at GEDmatch

This post was inspired by an excellent post from Lara Diamond today entitled “Long Segment–But No Close Connection“.

What is DNA Segment Frequency?

In addition to segment size, segment frequency may be another important consideration for genealogists.

There are two ways to think about segment frequency: The first measurement of segment frequency is the frequency of a DNA segment among all humans. This is a number that is currently unknown, and can’t yet reliably be estimated even with simulations; there just aren’t enough people in the world who have tested yet. This is especially true for a segment that might be found outside of people with European descent as testing of these populations has been minimal or practically non-existent. ... Click to read more!

AncestryDNA Plans Update to Matching Algorithm

AncestryDNAAncestryDNA is making several changes to its matching algorithm in the next week or two (an exact time is not yet available). You may recall an announcement that was made earlier this month entitled “New Advances in DNA Science Coming Your Way” (pdf) in which they stated the following:

“These advancements are expected to deliver more-precise predictions of whom you are related to, and how closely, among the million-plus others in the AncestryDNA database.”

There were no specifics in the announcement, however. Last night, AncestryDNA provided additional information about the changes that we will be seeing in our match lists in the next week or two.

Before I launch into the specifics, here is a very high-level summary, based on the information we were provided: ... Click to read more!