How Artificial Intelligence (AI) and Machine Learning Impact Genealogy
Artificial Intelligence and Genealogy
Elevenses with Lisa Episode 32
In this episode we tackle a few small geeky tech questions about artificial intelligence, better known as AI, that may have a pretty big impact on your genealogy life. Questions like:
- Is artificial intelligence the same thing as machine learning? And if not, how are they related?
- And am I using AI, maybe without even being aware of it?
- And what impact is AI really having on our lives? Is it all good, or are there some pitfalls we need to know about?
We’re going to approach these with a focus on family history, but pretty quickly I think we’ll discover it’s a much more far-reaching subject. And that means this episode is for everyone.
While I’ve done my own homework on this subject and written about it in my book The Genealogist’s Google Toolbox, I’m smart enough to call in an expert in the field. So, my special guest is Benjamin Lee. He is the developer of the Newspaper Navigator, the new free tool that uses artificial intelligence to help you find and extract images from the free historical newspaper collection at The Library of Congress’ Chronicling America. I covered Newspaper Navigator extensively in Elevenses with Lisa episode 26.
Ben is a 2020 Innovator-in-Residence at the Library of Congress, as well as a third-year Ph.D. student in the Paul G. Allen School of Computer Science & Engineering at the University of Washington, where he studies human-AI interaction with his advisor, Professor Daniel Weld.
He graduated from Harvard College in 2017 and has served as the inaugural Digital Humanities Associate Fellow at the United States Holocaust Memorial Museum, as well as a Visiting Fellow in Harvard’s History Department. And currently he’s a National Science Foundation Graduate Research Fellow.
Thank you so much to Ben Lee for a really interesting discussion and for making Newspaper Navigator available to researchers. I am really looking forward to hearing from him about his future updates and improvements.
Artificial Intelligence and Genealogy
Covering technology and its application to genealogy is always a bit of a double-edged sword. It can be exciting and helpful, and also problematic in its invasiveness.
Tools like family tree hints, the Newspaper Navigator and Google Lens (learn more about that in Elevenses with Lisa episode 27) all have a lot to offer our genealogy research. But on a personal level, you may be concerned about the far-reaching effects of artificial intelligence on the future and, most importantly, on your descendants. In today’s deeply concerning cancel culture and online censorship, AI can seriously impact our privacy, security and even our freedom.
As I did my research for this episode, I discovered a few things. Artificial intelligence and machine learning are having the same kind of massive and disruptive impact that DNA has had on genealogy, with almost none of the same publicity. (For background on DNA data usage, listen to Genealogy Gems Podcast episode 217. That episode covers the use of DNA in criminal cases and how our data potentially has wide-reaching appeal to many other entities and industries.)
A quick search of artificial intelligence ancestry.com in Google Patents reveals that work continues on ways to apply AI to DNA and genealogy.
AI now makes our genealogical research and family tree data just as valuable to others outside of genealogy.
This raises the question: who else might be interested in our family tree research and data?
Who Is Interested in Your Genealogy Data
One answer to this question is academic researchers. During my research on this subject, The Record Linking Lab at Brigham Young University surfaced as just one example. It’s run by a BYU Economics Professor who published a research paper on the lab’s work called Combining Family History and Machine Learning to Link Historical Records. The paper was co-authored with a Notre Dame Economics and Women’s Studies professor.
In this example, their goals are driven by economic, social, and political issues rather than genealogy. Their published paper does offer an eye-opening look at the value that those outside the genealogy community place on all of the personal data we’re collecting and the genealogical records we are linking. Our work is about our ancestors, and therefore it is about ourselves. Even if living people are not named on our tree, they are named in the records we are linking to it. We are making it all publicly available.
In the past, historical records like birth and death, military and the census have been available to these researchers, but on an individual basis. This made them difficult to work with. Academic (and industry) researchers couldn’t easily follow these records for individual people, families, and generations of families through time in order to draw meaningful conclusions. But for the first time, machine learning is being applied to online genealogy research data, making it possible to link these records to living and deceased individuals and their families.
It’s a lot to think about, but it’s important because it is our family history data. We need to understand how our data is being used inside and outside the genealogy sandbox.
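To make the idea of "linking records" a little more concrete, here is a minimal sketch in Python. It is not the method used by the Record Linking Lab or by any genealogy website, and it is not machine learning: it is a hand-written rule that compares two records by name similarity and birth year. The record fields, the sample people, and the thresholds are all made up for illustration. A machine learning approach would learn how much weight to give each clue from examples of records already known to match.

```python
from difflib import SequenceMatcher


def name_similarity(a: str, b: str) -> float:
    """Return a 0.0-1.0 similarity score for two names, ignoring case."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def likely_same_person(rec_a: dict, rec_b: dict,
                       name_threshold: float = 0.85,
                       year_tolerance: int = 2) -> bool:
    """Hand-written linking rule (an assumption, not a trained model):
    names must be similar AND birth years must be close."""
    close_name = name_similarity(rec_a["name"], rec_b["name"]) >= name_threshold
    close_year = abs(rec_a["birth_year"] - rec_b["birth_year"]) <= year_tolerance
    return close_name and close_year


# Hypothetical records, as they might appear in two different record sets.
census_entry = {"name": "Jon Miller", "birth_year": 1851}
death_record = {"name": "John Miller", "birth_year": 1850}
unrelated = {"name": "Mary Smith", "birth_year": 1850}

print(likely_same_person(census_entry, death_record))
print(likely_same_person(census_entry, unrelated))
```

Even this toy version shows why linked family trees are so useful to researchers: every link a genealogist has already confirmed is an example that a rule like this, or a trained model, can be checked against.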
Answers to Your Live Chat Questions About AI
One of the advantages of tuning into the live broadcast of each Elevenses with Lisa show is participating in the Live Chat and asking your questions.
From Linda J: What about all the “people search” sites (not genealogy) that have all, or a lot of, our personal data?
Lisa’s Answer: My understanding is that much of the information provided on many of the “people search” websites comes from public information. So while the information is much easier to access these days, it’s been publicly available for years. That information isn’t as accessible to projects like the one discussed in this episode because those websites don’t make their Application Programming Interface (known as API) publicly available like FamilySearch does.
From Doug H: Wouldn’t that potentially find errors in our trees?
Lisa’s Answer: Yes.
From Sheryl T: Do these academic researchers have access to the living people on the trees? Or are those protected from them as they are from the public?
Lisa’s Answer: They have access to all information attached to people marked as “Living Person.” Therefore, if the attached record names them, their identity would then be known. Click a hint on your tree at Ancestry for example, and the found records clearly spell out the name of the person they believe is your “Living” person.
From Nancy M: How long do the show notes stay available? am looking for Google Books two weeks ago and last week’s Allen Co Library.
Lisa’s Answer: The show notes remain available until the episode is archived in Premium Membership. You can find all of the currently available free Elevenses with Lisa episodes on our website in the menu under VIDEOS click Elevenses with Lisa.
From Nannie A: I heard a rumor that Ancestry.com has been sold. Do you know if that’s true?
Lisa’s Answer: Yes, they were sold again this year. Read:
Private equity firm Blackstone Group Inc. buying Ancestry.com for $4.7 billion
Private equity wants to own your DNA by CBS News.
Resources
Get My Free Genealogy Gems Newsletter – click here.
Bonus Download exclusively for Premium Members: Download the show notes handout.
Become a Genealogy Gems Premium Member today.
How to Get Back Into Genealogy
Show Notes: Restart Your Genealogy!
Getting back into genealogy can actually be a bit daunting. Where did you leave off? Where should you start back up?
How to Jump Back into Your Genealogy
Has it been a while since you worked on your genealogy research? As passionate as we may be about genealogy, the reality is that a little thing called life can get in the way.
In my case, my daughter got married earlier this year. There were plans to make, bridal showers to throw, and the wedding itself, which meant planning a trip because it was a destination wedding. Needless to say, I didn’t work on family history for several months.
If it’s been months or even years since you had your hands in genealogy, you’re in the right place. In this article and companion video we’re going to talk about how to pick up your genealogy after a hands-off spell so that you can quickly and efficiently get back on the trail of your ancestors.
Even if you haven’t taken a break, you might be feeling a little out of control and disorganized in what you’ve been doing so far. This quick genealogy audit can help you get back on track too!
Genealogy Restart Checklist
I love a good to-do list where I can have the satisfaction of checking things off and knowing that at the end of it I have accomplished something. Some of the things on this list may not apply depending on how long your genealogy hiatus has been. If that’s the case you get to check them off right away!
Get my comprehensive downloadable Genealogy Restart Checklist. (Premium Membership required)
Step 1: Find Out Where You Left Off in Your Research
Do you remember where you left off the last time you were researching your family tree? If not, your search history is a great place to start. For example, if you used the popular genealogy website Ancestry.com, you can pull up your past search history.
How to find your search history at Ancestry.com
At the Ancestry® home page you will see a box at the top that highlights the recently modified items in your family tree. According to one source at Ancestry.com, this “shows a list of last modified nodes in the tree. For a shared tree – any user who has access to the hint can modify the nodes and it will show up in that list. It (also) shows a hint leaf for the nodes that have at least one undecided hint.”
This could be a place to start, but I recommend reviewing Your Recent Searches if you want to pick up where you left off.
You’ll find your search history in the menu under Search. Click All Collections. Toward the top of the All Collections page you’ll see Your Recent Searches. It’s just above the map. You’ll see a few buttons listed for the most recent names you searched. Next, click the View All button to get a more comprehensive view of your activity history, starting with the most recent activity.
On the Recent Activity page, you’ll see the names you searched for and the details you included such as a place and time frame. Ancestry also tells you the date you ran the search.
If you see searches in the list that you don’t need anymore, click the trash can button to delete them.
Notice over on the left that you are viewing Recent Searches, but you do have other options:
- All Recent (activity)
- Viewed Content (records you’ve viewed)
- Viewed Collections (record collections you accessed)
All Recent provides the best overall picture of your past search history. This is a great tool for jogging your memory and helping you decide where to pick back up.
Review your activity history in your genealogy software.
You can also review your most recent activity in your genealogy database software.
In RootsMagic, for example, go to Search > History in the menu, or click the History tab at the top of the side bar on the left side of the screen.
Step 2: Identify Gaps that Need to be Filled
Many people enjoy focusing their research on their direct ancestors (grandparents, great grandparents, etc.). While you may have traced back many generations, you may have missed a few things along the way. This is a good time to start with yourself and work backwards through the direct ancestors in your family tree. Look for gaps in your timelines and information, and then start back up by researching to fill them in. Of course, you can also do this with any relative that you want to learn more about.
Once you’ve identified the person you want to work on, create a research plan. If you’ve never created a research plan before, don’t worry, it doesn’t have to be complicated. You can create and track it on paper, in a spreadsheet, or in any number of note-taking programs. The important thing is that you identify:
- your specific research question,
- the records you think you’ll need to answer it, and
- the locations where you think those records may be housed.
See this in action in my video Hard to Find Records, a Case Study.
Premium Members check out these classes with downloadable handouts:
- How Alice the Genealogist Avoids the Rabbit Hole which includes creating a research plan.
- Using Evernote to Create a Research plan
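If you like to keep your notes in a structured form, the three parts of a research plan described above (the question, the records, and the locations) can be sketched as a small Python data structure. This is only one possible layout, not a standard format. The class name, the field names, and the sample question, records, and repositories are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class ResearchPlan:
    """One research question plus the records and places that might answer it."""
    question: str
    records_needed: list[str] = field(default_factory=list)
    locations: list[str] = field(default_factory=list)
    searched: set[str] = field(default_factory=set)

    def mark_searched(self, record: str) -> None:
        """Check off a record once you have looked for it."""
        if record not in self.records_needed:
            raise ValueError(f"{record!r} is not part of this plan")
        self.searched.add(record)

    def remaining(self) -> list[str]:
        """Records still to be searched, in the order they were planned."""
        return [r for r in self.records_needed if r not in self.searched]


# Hypothetical example plan.
plan = ResearchPlan(
    question="Who were the parents of my great-grandmother?",
    records_needed=["marriage record", "death certificate", "1900 census"],
    locations=["county courthouse", "state archives", "FamilySearch"],
)
plan.mark_searched("marriage record")
print(plan.remaining())
```

The same three columns work just as well in a spreadsheet or a note. The point is simply that every plan names a question, the records, and where to look.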
Step 3: Prepare for Genealogy Research Success Going Forward
Since you’re picking your genealogy back up, this is the perfect time to check to make sure you’re set up for success going forward. These remaining items will help ensure that your new discoveries will be well-documented, organized, and protected from loss.
Genealogy software database
If you already have genealogy database software, open it up and see if there’s a newer version available. Look for Check for Updates in the menu.
If you don’t have a genealogy database software program on your computer, go get one now! We’re talking about a software program that you install on your computer. It’s a database specifically designed to record all the information you find. It keeps it organized and searchable, allows for source citations, photos, links, and more. It also gives you tremendous flexibility in running reports. This is something with which an online tree can’t compete. And most importantly all your data resides on your computer hard drive. This means it’s completely within your control and not subject to a paid subscription, or problems with a website such as the site being closed or sold off. The tree you build can be synced to an online tree if you wish to do so. Back in the old days (early 2000s) a database on your computer was the only option, and it remains your best option today.
Genealogy software is typically very affordable. You can even download Family Tree Builder at MyHeritage for free. If you’re willing to invest a few dollars there are several excellent programs to choose from such as RootsMagic, Family Tree Maker, Legacy, etc. I use RootsMagic but all of these programs have been around a long time and are great. The one you pick really depends on which user interface you like, and to what extent you may want to sync your tree online.
Premium Member Resource Video: Take Control of Your Family Tree.
Cloud backup
If you don’t have a cloud backup program running on your computer, now is the time to get one. What’s the point of restarting your genealogy research if you’re going to risk losing everything if your computer is damaged or stolen? I’ve used Backblaze for years because it’s reliable, affordable, has an app, and automatically backs up all my files including video. There are several out there to choose from. The important thing is to pick one and get it installed on your computer. It will run automatically in the background, giving you peace of mind that your files are backed up offsite in the cloud in a secure location.
Status of Genealogy Website Subscriptions
Now that you have the tools you need to restart your genealogy research, it’s time to check genealogy websites. Did you have subscriptions to some of the popular genealogy websites like MyHeritage or Ancestry? Log in and go to your account to see if they are still active, and if they are, when they are set to renew. This will help you decide where to spend your time first. Start with the subscription that is up for renewal first. Then you can determine if you want to allow it to renew or cancel and try another genealogy website subscription to round out your research.
If you don’t have any current subscriptions, consider focusing first on FamilySearch, the largest free genealogy website. Then, depending on your research goals, you can select the paid subscription(s) that will support your research plan.
A Paper Filing System
While we don’t generate as much paper these days as we used to, some paper is inevitable. Don’t add to the paper clutter. If you don’t have a paper filing system in place, take a moment and set one up. Pick a filing system and stick to it. Then as you start your genealogy research you’ll always have a place to put things.
Filing Digital Content
The same goes for digital files as goes for paper files. Don’t jump back into your research without a filing system in place. It’s important to download the digital records you find so that you have access to them even when your subscriptions run out. Avoid a messy computer and commit to a digital filing system and file naming convention.
Check out all of my organization system classes.
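A file naming convention only helps if it is applied the same way every time. As an illustration, here is a short Python sketch that builds a file name from a few details about a record. The pattern it uses (SURNAME_Given_Year_recordtype) is just an assumption made for this example, not a convention prescribed in my classes, and the function name and sample person are hypothetical. Pick whatever pattern suits you and stick to it.

```python
import re


def _clean(part: str) -> str:
    """Keep only unaccented letters and digits, joining separate words with hyphens."""
    return "-".join(re.findall(r"[A-Za-z0-9]+", part))


def record_filename(surname: str, given: str, year: int,
                    record_type: str, ext: str = "jpg") -> str:
    """Build a name like SURNAME_Given_Year_recordtype.ext (assumed pattern)."""
    parts = [
        _clean(surname).upper(),      # surname in capitals so it stands out
        _clean(given),
        str(year),
        _clean(record_type).lower(),
    ]
    return "_".join(parts) + "." + ext.lstrip(".").lower()


# Hypothetical record.
print(record_filename("Smith", "Mary Ann", 1900, "Census"))
# prints SMITH_Mary-Ann_1900_census.jpg
```

Putting the surname first means files sort by family in any folder view, and including the year keeps each person’s records in roughly chronological order.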
Source Citation Brush Up
Were you citing your sources consistently when you last worked on your family history research? If not, STOP EVERYTHING and watch my video Source Citations for Genealogy. Citing your sources will save you headaches down the road. You may discover that a previous conclusion was incorrect, and you’ll want to review the source where you got that information. A downloaded record usually doesn’t include specific details as to where you got it. Going forward, as you download records and add the details into your database, be sure to also add the source citation.
With this in mind, familiarize yourself with the source citation tool in your genealogy program. If it looks daunting, don’t panic. Head to the menu and click Help, and then search for source citation. There you’ll find the instructions you need to once and for all get a handle on how to cite sources in your software.
Now’s the Time to Restart Your Genealogy
Don’t let the passing of time stop you from getting back into your favorite hobby. By following this checklist you will quickly get back into goal-oriented research and exciting discoveries about your family.
Resources
Downloadable ad-free Show Notes handout (Premium Membership Required.)
Bonus Download: Genealogy Restart Checklist (Premium Membership required)
The Genealogy FAN Club Principle Overcomes Genealogy Brick Walls
Another brick wall…busted! We all have trouble spots in our family history research. Sometimes, we just need a little help breaking through. Here’s a tried-and-true method for using the genealogy FAN club principle to overcome brick walls in your family history research from guest author Amie Bowser Tennant.
FAN stands for Family, Associates, and Neighbors. Using the FAN club principle is a process in which genealogists identify a list of people (family, associates, and neighbors) that lived and associated with a given ancestor. By researching these other people, you may uncover some new hints for your own research. Ultimately, identifying our ancestor’s FAN club is an effective tool for overcoming brick walls in genealogy research.
Renowned genealogist and author Elizabeth Shown Mills coined the phrase “FAN Club” for genealogical purposes. She points out the significance of not only searching records for an ancestor’s surname, but also paying attention to documents about the ancestor’s “FAN Club” (Friends, Associates, Neighbors). Historical information, she says, is like real estate: the true value of any piece of information is unknown until it is put into community context. Learn more in Elizabeth’s “QuickSheet: The Historical Biographer’s Guide to Cluster Research (the FAN Principle).”
Step 1: “F” Stands for Family
Searching out other family members may prove helpful. In the case of Michael Knoop of Miami County, Ohio, I noticed there was another man in the county named Jacob Knoop. What was even more striking was that both Michael and Jacob were born in New Brunswick. How unusual, I thought! Two men with the same last name, both born in New Brunswick, living in a small farming area in Ohio! They had to be related, and they were. Jacob was Michael’s older brother.
Because I was having trouble finding when Michael had come to America, I traced Jacob instead. I located the passenger list with Jacob’s name on it and in doing so, I viewed all the passengers and found Michael, their mother, and lots of siblings!
In the case of Catherine Fearer Coddington, wife of James Coddington, I was having difficulty finding who her parents were. By searching for other Fearer individuals in the area, I discovered a biographical sketch of a John Fearer, Jr. The sketch, in Historical Encyclopedia of Illinois, Volume 2, reads:
“In 1836[,]John Fearer [Jr.] brought his family to Illinois. From Wheeling, West Va., the journey was made entirely by water. A landing on the Illinois soil was made at Hennepin. James Coddington, from near the Fearer’s old home in Maryland had already settled north of Princeton, in Bureau County, and later married John Fearer’s sister Catherine. The family found a home at Coddington’s until Mr. Fearer rented land near by.”
Catherine had a brother! With this new information, I was able to easily trace John’s father to John Fearer, Sr. of Allegany County, Maryland and finally connect Catherine to her parents through a probate record.
It’s easy to see what a powerful strategy researching the relatives of your ancestors can be!
Step 2: “A” Stands for Associates
An associate could be a business partner, a witness on a document, a pastor, a lawyer, or the man who bailed Grandpa out of jail! Associates are often related. To create a list of associates, you might start by gathering the names of all witnesses in records such as baptismal or christening records, marriage records, probate files, land records, and affidavits.
Were the courthouse records in your targeted area destroyed? Check the local newspapers for clues for possible associates. As an example, Jacob Trostel was a signee and vouched for Harvey D. Wattles’ tavern license. The license and names of the vouchers were listed in the newspaper, too. Eleven other men of the community appear on that petition. Later, Jacob himself petitions for a tavern license. That petition is signed by twelve men: George Filler, Conrad Slaybaugh, Lebright E. Hartzell, William G. Eicholtz, Isaac Yount, Joseph Dull, Isaac Myers, George W. Rex, Daniel Filler, William Harlan, and John Bream.
In both of these examples, relatives of Jacob Trostel had been vouchers. By tracing them, we were able to find out more about Jacob and his family.
Step 3: “N” Stands for Neighbors
Where can we find a list of our ancestors’ neighbors? A census, of course! When looking at a census page, we look for other people on the page with the same surname as our targeted ancestor. There’s a good chance those folks could also be related. But your ancestor’s neighbors may also hold rich clues that can help you in your research. Many neighbors intermarried, sold land to each other, and even migrated to new locations together.
Besides looking at individuals listed on the same census page as your ancestor, remember to turn the page! Sometimes, a neighbor is not on the same page as your ancestor, but rather the pages before or after. Keep in mind that just because a person appears directly after your ancestor on the census rolls doesn’t necessarily mean they were neighbors; it only indicates the order in which the census taker visited the homes. You might also be able to identify close neighbors by looking at land ownership maps for the area. In this way, you can easily identify who lived nearby.
If you are having difficulty determining where your ancestors came from, researching the neighbors may give the answer. Many neighbors migrated together. Always check at least one page before your ancestor and one page after your ancestor in any given census.
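If you have transcribed census entries into a spreadsheet or a list, the "one page before, one page after" habit can be sketched in a few lines of Python. This is only an illustration: the Household fields, the function name, and the sample households are all hypothetical, and remember that page order reflects the census taker’s route, not necessarily who lived next door.

```python
from typing import NamedTuple


class Household(NamedTuple):
    page: int
    line: int
    surname: str
    given: str


def fan_candidates(entries, ancestor, pages_around=1):
    """List households within `pages_around` pages of the ancestor's page,
    putting people who share the ancestor's surname first."""
    nearby = [
        e for e in entries
        if e != ancestor and abs(e.page - ancestor.page) <= pages_around
    ]
    # False sorts before True, so same-surname households come first.
    return sorted(nearby, key=lambda e: (e.surname != ancestor.surname, e.page, e.line))


# Hypothetical transcription of a few census lines.
ancestor = Household(12, 5, "Miller", "John")
entries = [
    Household(11, 30, "Baker", "Ann"),
    ancestor,
    Household(12, 6, "Hayes", "Tom"),
    Household(13, 2, "Miller", "Jacob"),
    Household(15, 1, "Stone", "Eli"),   # too many pages away to be listed
]

for person in fan_candidates(entries, ancestor):
    print(person.page, person.line, person.given, person.surname)
```

Raising `pages_around` widens the net, which can help in rural areas where families were spread across several pages.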
Genealogy Fan Club: Comments and More Resources
There are likely dozens of successful ways to create a FAN club for your ancestor. We would love to hear your examples in the comments below. For even more ways to break through those genealogy brick walls, enjoy the links below.
Read our article Solve Your Genealogy Brick Walls: 3 Tips for Breaking Through!
Even better: Genealogy Gems Premium Members can watch Lisa’s one-hour video class Brick Walls: Cold Case Investigative Techniques. In this video you’ll not only learn how to apply criminal cold case strategies to your brick walls, but you’ll also get loads of fresh and innovative ideas you can try right away. If you are not a Premium Member yet, learn more about becoming a Genealogy Gems Premium Member here.