How Artificial Intelligence AI and Machine Learning Impact Genealogy

Artificial Intelligence and Genealogy
Elevenses with Lisa Episode 32

In this episode we tackle a few small geeky tech questions about artificial intelligence, better known as AI, that may have a pretty big impact on your genealogy life. Questions like:

  • Is artificial intelligence the same thing as machine learning?
    And if not how are they related?
  • And am I using AI, maybe without even being aware of it?
  • And what impact is AI really having on our lives? Is it all good, or are there some pitfalls we need to know about?

We’re going to approach these with a focus on family history, but pretty quickly I think we’ll discover it’s a much more far-reaching subject. And that means this episode is for everyone.

Free Webinar AI Machine learning and Genealogy

Watch the free video below.

While I’ve done my own homework on this subject and written about it in my book The Genealogist’s Google Toolbox, I’m smart enough to call in an expert in the field. So, my special guest is Benjamin Lee. He is the developer of the Newspaper Navigator, the new free tool that uses artificial intelligence to help you find and extract images from the free historical newspaper collection at The Library of Congress’ Chronicling America. I covered Newspaper Navigator extensively in Elevenses with Lisa episode 26.

Ben  is a 2020 Innovator-in-Residence at the Library of Congress, as well as a third year Ph.D. Student in the Paul G. Allen School for Computer Science & Engineering at the University of Washington, where he studies human-AI interaction with his advisor, Professor Daniel Weld.

He graduated from Harvard College in 2017 and has served as the inaugural Digital Humanities Associate Fellow at the United States Holocaust Memorial Museum,  as well as a Visiting Fellow in Harvard’s History Department. And currently he’s a National Science Foundation Graduate Research Fellow.

Thank you so much to Ben Lee for a really interesting discussion and for making Newspaper Navigator available to researchers. I am really looking forward to hearing from him about his future updates and improvements.

Artificial Intelligence and Genealogy

Covering technology and its application to genealogy is always a bit of a double-edged sword. It can be exciting and helpful, and also problematic in its invasiveness.

Tools like family tree hints, the Newspaper Navigator and Google Lens (learn more about that in Elevenses with Lisa episode 27) all have a lot to offer our genealogy research. But on a personal level, you may be concerned about the long reaching effects of artificial intelligence on the future, and most importantly your descendants. In today’s deeply concerning cancel culture and online censorship, AI can seriously impact our privacy, security and even our freedom.

As I did my research for this episode I discovered a few things. Artificial Intelligence and machine learning is having the same kind of massive and disrupting impact that DNA has had on genealogy, with almost none of the same publicity. (For background on DNA data usage, listen to Genealogy Gems Podcast episode 217. That episode covers the use of DNA in criminal cases and how our data potentially has wide-reaching appeal to many other entities and industries.)

A quick search of artificial intelligence ancestry.com in Google Patents reveals that work continues on ways to apply AI to DNA and genealogy. (See image below)

Patents for AI machine learning and DNA

Patent search result: a pending patent involving AI and DNA by Regeneron Pharmaceuticals, Inc.

AI now makes our genealogical research and family tree data just as valuable to others outside of genealogy.

This begs the question, who else might be interested in our family tree research and data?

Who Is Interested in Your Genealogy Data

One answer to this question is academic researchers. During my research on this subject The Record Linking Lab at Brigham Young University surfaced as just one example. It’s run by a BYU Economics Professor who published a research paper on their work called Combining Family History and Machine Learning to Link Historical Records. The paper was co-authored with a Notre Dame Economics and Women’s Studies professor.

In this example, their goals are driven by economic, social, and political issues rather than genealogy. Their published paper does offer an eye-opening look at the value that those outside the genealogy community place on all of the personal data we’re collecting and the genealogical records we are linking. Our work is about our ancestors, and therefore it is about ourselves. Even if living people are not named on our tree, they are named in the records we are linking to it. We are making it all publicly available.

In the past, historical records like birth and death, military and the census have been available to these researchers, but on an individual basis. This made them difficult to work with. Academic (and industry) researchers couldn’t easily follow these records for individual people, families, and generations of families through time in order to draw meaningful conclusions. But for the first-time machine learning is being applied to online genealogy research data making it possible to link these records to living and deceased individuals and their families.  

It’s a lot to think about, but it’s important because it is our family history data.  We need to understand how our data is being used inside and outside the genealogy sandbox.

Answers to Your Live Chat Questions About AI

One of the advantages of tuning into the live broadcast of each Elevenses with Lisa show is participating in the Live Chat and asking your questions.

Elevenses with Lisa Q&A on AI and Genealogy

www.GenealogyGems.com/Elevenses

From Linda J: ​What about all the “people search” sites (not genealogy) that have all, or a lot of, our personal date?
Lisa’s Answer: My understanding is that much of the information provided on many of the “people search” websites comes from public information. So while the information is much easier to access these days, it’s been publicly available for years. That information isn’t as accessible to projects like the one discussed in this episode because those websites don’t make their Application Programming Interface (known as API) publicly available like FamilySearch does.

From Doug H: Wouldn’t that potentially find errors in our trees?
Lisa’s Answer: Yes.

From Sheryl T: ​Do these academic researchers have access to the living people on the trees? Or are those protected from them as it is to the public?
Lisa’s Answer: They have access to all information attached to people marked as “Living Person.” Therefore, if the attached record names them, their identity would then be known. Click a hint on your tree at Ancestry for example, and the found records clearly spell out the name of the person they believe is your “Living” person.

From Nancy M: ​How long do the show notes stay available? am looking for Google Books two weeks ago and last week’s Allen Co Library.
Lisa’s Answer: The show notes remain available until the episode is archived in Premium Membership. You can find all of the currently available free Elevenses with Lisa episodes on our website in the menu under VIDEOS click Elevenses with Lisa.

Nannie A: I heard a rumor that Ancestry .com has been sold. Do you know if that’s true?
Lisa’s Answer: Yes, they were sold again this year. Read:
Private equity firm Blackstone Group Inc. buying Ancestry.com for $4.7 billion
Private equity wants to own your DNA by CBS News.

Resources

Get My Free Genealogy Gems Newsletter – click here.
Bonus Download exclusively for Premium Members: Download the show notes handout. 
Become a Genealogy Gems Premium Member today. 

 

Google Drive: A Challenger to Dropbox and Evernote

Google Drive Packs Powerful PunchGoogle Drive is giving some of their competitors a run for their money. This free google tool is just what genealogists are looking for to create, consolidate, and organize their files.

I have been using Google Drive for about a year now. I upload my family photos, GEDCOMs, and my family history notes to the drive. I love the ease in which I can save these things to the cloud and rest knowing my hard work is safely backed up. You can imagine my excitement when our Google expert, Lisa Louise Cooke, shared her new premium video: All About Google Drive. There is so much more I didn’t know Google Drive could do!

Lisa shares ten benefits to using Google Drive and how it packs a powerful punch. Used as a file hosting service, Google Drive can offer you more free storage than Dropbox. Further, Google Drive may be a viable competitor to Evernote for several reasons. You can store files, create files, and edit them all via Google Drive. What’s even better is that Google Drive works across all different computing devices like PC, Mac, Windows, Android, and Apple. This means that syncing and accessing it all has never been easier.

Getting More from Google Drive

But wait, there’s more! Just when you thought you have heard it all, Lisa shares the power of the companion tool, Google Docs, to create documents, drawings, forms, and more. Haven’t had the money to purchase Microsoft Office yet? Not a problem! Google Docs is free to use. Lisa walks you through how to create and save a document and other files by using Google Docs. It is so easy!

Google Drive and Google Docs

You will continue to be amazed at the Google Extensions that are available from the Google Store. I had no idea there were so many. I was particularly excited to hear how I could easily save and clip items from webpages. Imagine finding a digital image of your great-grandmother’s obituary you want to save. How do you do that without having to save the whole page? There’s a Google Extension for that!

Google Drive, Google Docs, and the many extensions available really pack a powerful punch. Watch All About Google Drive to learn more about these knock-out features!

The Genealogy Gems Premium website members have exclusive access to all our full length video tutorials on topics ranging from research strategies to technology tools. They also have access to the full audio archive of The Genealogy Gems Premium Podcast. Click here to learn more about The Genealogy Gems Premium Membership.

Watch a preview:

More Gems on Google Drive and Tools

How to Use Google to Search for Family History & Genealogy

7 Free Google Search Features Every Genealogist Should UseGoogle Drive and other tips

Google Keyword Search Tips

“I Found 130 Letters by My Ancestor!” Why Use Google Books for Genealogy

Betty has at least 130 good reasons to use Google Books for genealogy! She used this powerful Google tool to find her ancestor’s name in a book–which led to a treasure trove of his original letters in an archive. Here’s what happened–and how to try this with your own family history research. 

You’ve heard me say that Google Books is the tool I turn to every day. Now, you may be thinking, “But my ancestors wouldn’t be in history books!” Resist the temptation to make assumptions about sources, and about your ancestors. With over 25 million books, Google Books is more likely to have something pertinent to your genealogy research than you think. And as I often tell my audiences, those books can include source citations, providing a trail to even more treasures.

Why to Use Google Books for Genealogy: Success Story!

At the National Genealogical Society conference this past spring, Betty attended my class and then stopped by the Genealogy Gems booth to share her story. I recorded it, and here’s a transcription:

Betty: I was stuck on my Duncan Mackenzie ancestor, so I put his name in Google Books, because when you’re stuck, that’s what you do!

Lisa: Yes, I do!

Betty: So, up popped this history of Mississippi, it was sort of a specific history, and it said Duncan Mackenzie had written a letter to his brother-in-law in North Carolina from Covington County, Mississippi. And of course I already had my tax records and my census records that placed him in Covington County. This was in the 1840s. I thought, this just couldn’t be him! Why would any of my relatives be in a book? [Sound familiar?]

So, finally, weeks later, it occurred to me to go back and look at the footnotes in the book, and I found that the letters could be found in the Duncan McLarin papers at Duke University. So, I didn’t even think to even borrow the microfilm. I just told my husband, “next time you go East for work, we need to go by Duke University.” So I set up a time, and I went, and it WAS my great-great-grandfather who wrote those letters! I have now transcribed 130 letters from that collection. They let me scan them all, and I’ve been back again to scan the rest of the legal papers.

Lisa: So, an online search into Google Books not only help you find something online, but it led you to the offline gems!

Betty: And it just changed my life! Because I spend all my time on these letters. It’s distracted me from other lines! [LOL! I get that!]

How to Use Google Books for Genealogy

Are you ready to put Google Books to work in your own research and discover some genealogy gems of your own? Here, I re-create Betty’s search for you, so you can see how to get started:

1. Go to Google Books (books.google.com). Enter search terms that would pertain to your ancestor, like a name and a place.

2. Browse the search results. The first three that show up here all look promising. Click on the first one.

3. Review the text that comes up in the text screen. As you can see here, Duncan McKenzie of Covington County is mentioned–and the source note at the bottom of the page tells you that the original letter cited in the book is at Duke University.

Learn More about Using Google Books for Genealogy

Learn more by watching my free Google Books video series at the Genealogy Gems YouTube Channel. Click the video below to watch the first one. (And be sure to subscribe while you’re there, because there are more videos to come!)

Then, watch the video below for a quick preview of my full one hour video class (and downloadable handout) called Google Books: The Tool You Need Every Day!, available to all Genealogy Gems Premium Members.

AncestryDNA Works Toward Genetics + Genealogy Integration

 

AncestryDNA Review GEDCOM DNA integrationThe ideal genetic genealogy interface creates a seamless transition between genetics technology and genealogical research findings. Most currently available tools are either DNA technology without much genealogy, or genealogy without much DNA technology. AncestryDNA is really pioneering the genetic and genealogical integration with its newest AncestryDNA product update.

The goal of genetic genealogy is to aid your traditional research by verifying known connections and providing clues to as yet unknown ancestors. DNA was never meant to replace traditional research methods, nor has it ever claimed that ability. Rather, it is meant to aid your traditional research by verifying known connections and providing clues to as-yet unknown ancestors.

I admit, I dream of a future technology so precise that it pinpoints the locations of ancestors and defines our exact relationships to others. While we are not there yet, many have experienced a genetic test’s power to obliterate previously-held beliefs about relationship and heritage, and create new intricate and personal relationships where before there were only blank spaces. In this sense, genetic genealogy can be viewed as a kind of police force of the genealogy world, righting wrongs and taking names. But I digress.

For now, the ideal must remain a seamless transition between genetics technology and traditional research results, so that the two so completely complement each other that we can’t see where one stops and the other begins. Yet the two worlds are often separated by a chasm of misunderstanding and just plain ignorance. Of the three testing companies, two are making mediocre efforts at best to try to help you incorporate your genetics into your genealogy. They are basically dishing out a serving of genetics, offering a vending machine of genealogy snacks and calling it a full meal.

With one exception.

AncestryDNA has put genetic and genealogical integration at the forefront of its product.  They are the only company making a serious effort to integrate your genetics and your genealogy. To be successful, they need two things: tons of people and their genealogy. The more people test, the better the database becomes. Not just in terms of the matches you find, but also in terms of statistics and the power that numbers have to solve complex problems, like relatedness.

So, how do they get more people interested in genetic genealogy?

This reminds me of my early days at Relative Genetics, one of the first genetic genealogy companies.  I was fresh out of college and tasked with training our CEO, CFO, QA director, and marketing director about what exactly it was that we did as a genetic genealogy company. None of these men had any experience in genetics or genealogy. In those meetings as we were trying to figure out ways to grow our company in an unknown industry, I felt like I was the constant downer to the party.  As a scientist I had been trained that there are no absolutes. Whenever we talk about outcomes it is always in terms of “most likely” or “less likely” and to never, ever say “always.” So when they would get excited about an idea and propose wording for an ad campaign, I was always reining them in.

After reading a recent announcement by AncestryDNA, I feel like their marketing department had a meeting on the day their scientific advisor was out sick and without his or her corralling, they started a stampede.

Which, of course, was exactly what they wanted.

In their press release, Ancestry’s Dr. Ken Chahine, SVP and GM of AncestryDNA said, “It is effectively a shortcut through time—you take the test today and we tell you who your ancestors were, for example, in the 1700s. You don’t need to research records or build a family tree — AncestryDNA now transports you to the past.”

Which is exactly what people want to hear, especially non-genealogists who are curious about their past, but don’t have the tools or know-how or interest in doing the actual genealogy work.

But is it true? Is genetic genealogy a short cut through time?

“Absolutely,” says the marketing team.

“Sometimes, and that depends on factor A, and factor B and situation C and…” say the scientists.

And they are both right. The trick is to hear them both as you review these kinds of new advances in genetic genealogy.

What makes the “absolutely” true is that one of the dreams of genetic genealogy is to use the DNA of living people today to actually reconstruct the genetics of our ancestors. So that their actual DNA profile is known. Then it will be easy to identify their descendants as we will be able to see immediately what part of our DNA came from which of our ancestors. Ancestry has demonstrated their ability to do this in a large scale study of the descendants of a 19th-century American and his two successive wives.

Now, time for the “Sometimes.” This full genome reconstruction hasn’t been done yet for your grandparents, or great grandparents. Right now the best we can do is use your DNA to link you to living individuals, then rely on your traditional genealogy to help you find your common ancestor. Ancestry is trying to help you do that using their DNA circles, and now with their New Ancestor Discoveries.

Remember that to be included in a DNA circle you have to have a “ticket” to the party, meaning both your genetics and your genealogy match with at least two other people in the database and a circle is created around the host of the party, who is your common ancestor.

With New Ancestor Discoveries, we are letting those with just a genetic ticket into the party. Meaning that if you share DNA with two or more people in a DNA Circle, the host of that circle is named as an ancestor who might be on your pedigree chart.

Did you notice how I said “might?” That this newly discovered ancestor MIGHT be in your pedigree chart?

As an idea, New Ancestor Discoveries is VERY EXCITING, don’t you think? To be able to find out using both genetics and genealogy that a particular person living 100 years ago might just be the one who belongs in that blaring blank space on your pedigree chart? And it will be. But right now, Ancestry needs to work out some bugs, starting with a stronger acknowledgement that the ancestor listed in the Discoveries is by no means an absolute, but just a hint.

Genetic Genealogy and DNAIn coming posts I will share with you how I am using the New Ancestry Discoveries to discover more about my genealogy, even if it isn’t exactly in the way Ancestry intended. For now, learn more by reading my recent posts: from the left side of the Genealogy Gems home page, search on the category “DNA.”

And click here to visit my website and learn more about how I can help you navigate the exciting world of genetic genealogy.

Pin It on Pinterest

MENU