How Artificial Intelligence AI and Machine Learning Impact Genealogy

Artificial Intelligence and Genealogy
Elevenses with Lisa Episode 32

In this episode we tackle a few small geeky tech questions about artificial intelligence, better known as AI, that may have a pretty big impact on your genealogy life. Questions like:

  • Is artificial intelligence the same thing as machine learning?
    And if not how are they related?
  • And am I using AI, maybe without even being aware of it?
  • And what impact is AI really having on our lives? Is it all good, or are there some pitfalls we need to know about?

We’re going to approach these with a focus on family history, but pretty quickly I think we’ll discover it’s a much more far-reaching subject. And that means this episode is for everyone.

Free Webinar AI Machine learning and Genealogy

Watch the free video below.

While I’ve done my own homework on this subject and written about it in my book The Genealogist’s Google Toolbox, I’m smart enough to call in an expert in the field. So, my special guest is Benjamin Lee. He is the developer of the Newspaper Navigator, the new free tool that uses artificial intelligence to help you find and extract images from the free historical newspaper collection at The Library of Congress’ Chronicling America. I covered Newspaper Navigator extensively in Elevenses with Lisa episode 26.

Ben  is a 2020 Innovator-in-Residence at the Library of Congress, as well as a third year Ph.D. Student in the Paul G. Allen School for Computer Science & Engineering at the University of Washington, where he studies human-AI interaction with his advisor, Professor Daniel Weld.

He graduated from Harvard College in 2017 and has served as the inaugural Digital Humanities Associate Fellow at the United States Holocaust Memorial Museum,  as well as a Visiting Fellow in Harvard’s History Department. And currently he’s a National Science Foundation Graduate Research Fellow.

Thank you so much to Ben Lee for a really interesting discussion and for making Newspaper Navigator available to researchers. I am really looking forward to hearing from him about his future updates and improvements.

Artificial Intelligence and Genealogy

Covering technology and its application to genealogy is always a bit of a double-edged sword. It can be exciting and helpful, and also problematic in its invasiveness.

Tools like family tree hints, the Newspaper Navigator and Google Lens (learn more about that in Elevenses with Lisa episode 27) all have a lot to offer our genealogy research. But on a personal level, you may be concerned about the long reaching effects of artificial intelligence on the future, and most importantly your descendants. In today’s deeply concerning cancel culture and online censorship, AI can seriously impact our privacy, security and even our freedom.

As I did my research for this episode I discovered a few things. Artificial Intelligence and machine learning is having the same kind of massive and disrupting impact that DNA has had on genealogy, with almost none of the same publicity. (For background on DNA data usage, listen to Genealogy Gems Podcast episode 217. That episode covers the use of DNA in criminal cases and how our data potentially has wide-reaching appeal to many other entities and industries.)

A quick search of artificial intelligence ancestry.com in Google Patents reveals that work continues on ways to apply AI to DNA and genealogy. (See image below)

Patents for AI machine learning and DNA

Patent search result: a pending patent involving AI and DNA by Regeneron Pharmaceuticals, Inc.

AI now makes our genealogical research and family tree data just as valuable to others outside of genealogy.

This begs the question, who else might be interested in our family tree research and data?

Who Is Interested in Your Genealogy Data

One answer to this question is academic researchers. During my research on this subject The Record Linking Lab at Brigham Young University surfaced as just one example. It’s run by a BYU Economics Professor who published a research paper on their work called Combining Family History and Machine Learning to Link Historical Records. The paper was co-authored with a Notre Dame Economics and Women’s Studies professor.

In this example, their goals are driven by economic, social, and political issues rather than genealogy. Their published paper does offer an eye-opening look at the value that those outside the genealogy community place on all of the personal data we’re collecting and the genealogical records we are linking. Our work is about our ancestors, and therefore it is about ourselves. Even if living people are not named on our tree, they are named in the records we are linking to it. We are making it all publicly available.

In the past, historical records like birth and death, military and the census have been available to these researchers, but on an individual basis. This made them difficult to work with. Academic (and industry) researchers couldn’t easily follow these records for individual people, families, and generations of families through time in order to draw meaningful conclusions. But for the first-time machine learning is being applied to online genealogy research data making it possible to link these records to living and deceased individuals and their families.  

It’s a lot to think about, but it’s important because it is our family history data.  We need to understand how our data is being used inside and outside the genealogy sandbox.

Answers to Your Live Chat Questions About AI

One of the advantages of tuning into the live broadcast of each Elevenses with Lisa show is participating in the Live Chat and asking your questions.

Elevenses with Lisa Q&A on AI and Genealogy

www.GenealogyGems.com/Elevenses

From Linda J: ​What about all the “people search” sites (not genealogy) that have all, or a lot of, our personal date?
Lisa’s Answer: My understanding is that much of the information provided on many of the “people search” websites comes from public information. So while the information is much easier to access these days, it’s been publicly available for years. That information isn’t as accessible to projects like the one discussed in this episode because those websites don’t make their Application Programming Interface (known as API) publicly available like FamilySearch does.

From Doug H: Wouldn’t that potentially find errors in our trees?
Lisa’s Answer: Yes.

From Sheryl T: ​Do these academic researchers have access to the living people on the trees? Or are those protected from them as it is to the public?
Lisa’s Answer: They have access to all information attached to people marked as “Living Person.” Therefore, if the attached record names them, their identity would then be known. Click a hint on your tree at Ancestry for example, and the found records clearly spell out the name of the person they believe is your “Living” person.

From Nancy M: ​How long do the show notes stay available? am looking for Google Books two weeks ago and last week’s Allen Co Library.
Lisa’s Answer: The show notes remain available until the episode is archived in Premium Membership. You can find all of the currently available free Elevenses with Lisa episodes on our website in the menu under VIDEOS click Elevenses with Lisa.

Nannie A: I heard a rumor that Ancestry .com has been sold. Do you know if that’s true?
Lisa’s Answer: Yes, they were sold again this year. Read:
Private equity firm Blackstone Group Inc. buying Ancestry.com for $4.7 billion
Private equity wants to own your DNA by CBS News.

Resources

Get My Free Genealogy Gems Newsletter – click here.
Bonus Download exclusively for Premium Members: Download the show notes handout. 
Become a Genealogy Gems Premium Member today. 

 

AncestryDNA® Ethnicity Estimates Updated

Here’s the latest DNA update quoted from Ancestry®:

Ancestry DNA ethnicity update

Ancestry® Expands Reference Panel to Deliver More Precise Results and New Regions

“Today, Ancestry® announced their latest update to AncestryDNA® ethnicity estimates.

This update was made possible thanks to an increase in the AncestryDNA reference panel.

The reference panel is now more than double its previous size with samples from more places around the world, allowing Ancestry to determine ethnic breakdowns with a higher degree of precision.  

New ethnicity estimates will roll out to new and existing customers over several months, resulting in these potential developments for customers.”

New Ethnicity Regions

From their blog post:

“For example, previously we had North and South America as two large regions: Native American–Andean and Native American–North, Central, South.

With this new update, we are able to refine the areas into 11 smaller ones.

If you received one of the older regions before, your new report will most likely have one of the newer, more precise regions instead like Indigenous Eastern South America, Indigenous Cuba, and Indigenous Americas–Mexico, among others.” 

More Global Regions

“This advancement will enable AncestryDNA to deliver even more regions globally to enhance the experience across diverse populations including improvements and region realignment in West Africa, northwestern Europe, the Americas, Oceania, and South Asia.”

Ancestry DNA ethnicity update offers more global regions

When You Will See the Update

“It’s important to note that we are phasing the update over time to ensure individual attention is given to delivering each result; therefore, some may see results earlier or later than others.”

when you will see the ancestry dna ethnicity results update

Read the Full Announcement

Get all the details on this new update announcement by reading their article Ancestry® Expands Reference Panel to Deliver More Precise Results and New Regions

List of AncestryDNA® Regions

“More than 1,000 global regions make up the ethnicities displayed in our DNA test. As DNA science improves, the number of regions we test for (and the countries covered in each region) may change.

This article lists each region, but to see which areas of the globe are included in the regions, you’ll need to view the list from your DNA Story page (which will highlight an area of the map when you click a region).

To see all the regions, click See other regions tested at the bottom of your ethnicity estimate and click on a region on the next page. 

Ethnicity Estimate FAQ

Check out the interactive map and watch the explanatory video: FAQ for new AncestryDNA ethnicity estimate.

ancestry dna ethnicity FAQ

Click here for AncestryDNA ethnicity estimate FAQ

Results May Vary, Here’s an Example

If you’ve taken a DNA test, you may have received different ethnicity results than you expected and different from your family members. DNA expert Diahan Southard explains why this happens in the Genealogy Gems article “Results May Vary:” One Family’s DNA Ethnicity Percentages. Click here to start reading now.  

Click here to pick from our vast collection of DNA articles including DNA Ethnicity Accuracy: How It’s Getting More Specific.

More Resources

Get the DNA SUPER BUNDLE: 10 Quick Reference genetic genealogy guides by Diahan Southard at the Genealogy Gems store. 

10 DNA Genetic Genealogy quick reference guides by Diahan Southard

10 DNA Genetic Genealogy quick reference guides by Diahan Southard available now at the Genealogy Gems Store.

What Do You Think?

Have you noticed the update in your AncestryDNA® account? Did this update deliver any surprises? Please leave a comment below and share what you learned. 

1950 Census Substitute: What To Use Until its Release Date

The 1950 federal U.S. census will not be released to the public until April 2022. Are you as excited about that as I am? This census will provide volumes of new information about our families and their lives.

An enumerator interviews President Truman and the First Family for the 1950 Census. Image from www.census.gov.

An enumerator interviews President Truman and the First Family for the 1950 Census. Image from www.census.gov.

Answers to Your Questions about the 1950 Census

Here are answers to four of the common questions we receive about the 1950 census:

What will I be able to learn from the 1950 census?

With each decade the federal government has asked more detailed questions. The information collected has expanded our understanding of the families, their backgrounds, and their lifestyle.

Here’s what the front page of the 1950 Census of Population and Housing form looked like:

1950 census form page 1

As you can see there is a wealth of information that will be of interest to family historians. 20 questions were asked of everyone. The detailed questions at the bottom of the form were asked of 5% of the population. 

The back side of the form may not be as familiar to you, but it too collected a vast amount of fascinating data about housing:

1950 census form page 2

Let’s take a closer look at one of the rows:

1950 census up close

1950 census instructions population schedule

Instructions regarding the front and back of the Population and Housing Schedule Form P1

As you can see the back side of the form is focused on housing. Here you’ll find answers to questions about:

  • Type of Living Quarters
  • Type of Structure
  • Whether a business was run from the house
  • The condition of the building
  • If there are any inhabitants who may be somewhere else at the time the census was taken
  • How many rooms
  • Type of water, toilet and shower / bath facilities
  • Kitchen and cooking facilities
  • Occupancy
  • Financial and rental arrangements

Additional questions were not asked of all, but rather were asked on a rotating basis. These centered around additional features of the home such as radio, television, cooking fuel, refrigeration, electricity and the year the home was built.

Are enumerator instructions available for the 1950 census?

The instructions issued to enumerators can provide you with further insight into the records themselves. It can also clarify the meaning of marks and numbers you may find on the documents.

And yes, the US Census Bureau has indeed published the instructions for the 1950 census on their website here. According to their site:

“During the 1950 census, approximately 143,000 enumerators canvassed households in the United States, territories of Alaska and Hawaii, American Samoa, the Canal Zone, Guam, Puerto Rico, the Virgin Islands, and some of the smaller island territories. The U.S. Census Bureau also enumerated Americans living abroad for the first time in 1950. Provisions were made to count members of the armed forces, crews of vessels, and employees of the United States government living in foreign countries, along with any members of their families also abroad.”

1950 census manual

Also on that web page you’ll find instructions for the following years: 1790, 1850, 1860, 1870, 1890, 1900, 1910, 1920, 1930, and 1940.

Can I request individual census entry look-ups?

Yes, you may apply to receive copies of individual census entries from 1950-2010 for yourself or immediate relatives. It’s not cheap—it’s $65 per person, per census year. (Check the website for current pricing.) But if you’re having research trouble you think would be answered by a census entry, it might be worth it. Click here to learn buy lithium medication online more about the “Age Search Service” offered through the Census Bureau.

Is there a 1950 census substitute database?

Yes, Ancestry has one. You might find it a little gimmicky, because it’s just taken from their city directory collection from the mid-1940s to the mid-1950s. But it’s a good starting point to target your U.S. ancestors living during that time period. The annual listings in city directories can help you track families from year to year.

More 1950 Census Resources

Your 1950s family history may appear in other records as well, and I’ve got some tips to help you in your search:

The 1950 Census for Genealogy

Watch my video All About the 1950 Census

Pin It on Pinterest

MENU