From Research to Data with AI 1 of 5: From Records to Raw Data—Extracting with AI

Andrew Redfern, Fiona Brooker
Mar 12, 2026
1.1K views
CC

About this webinar

Use AI to extract, clean, organise, and analyse your family history research. Intermediate level, focused on workflows and data handling; ideal for users managing large research projects; activities include table-building, clustering, and data cleaning.

About the speakers

Andrew Redfern is an enthusiastic family historian and accomplished speaker, having delivered presentations both in his home country of Australia and internationally over many years. His innovative wo...
Learn more...
Fiona Brooker is a professional genealogist (Memories In Time) who has been actively researching her family history for over 35 years, inspired by two marriage certificates and a collection of family ...
Learn more...

Key points and insights


Navigating the transition from dusty archival documents to organized digital data is a pivotal challenge for the modern family historian. In this insightful webinar, "From Records to Raw Data: Extracting with AI," expert genealogists Andrew Redfern and Fiona Brooker demonstrate how artificial intelligence is revolutionizing this workflow. Recorded in March 2026, the session serves as a practical guide for researchers looking to move beyond simple searches and into deep data extraction. By focusing on the nuances of AI-powered Optical Character Recognition (OCR) and sophisticated prompting techniques, the presenters showcase how these tools can bridge the gap between illegible cursive and structured, searchable information. This session is essential for anyone seeking to accelerate their research while maintaining the rigorous standards of genealogical proof.

  • Context is King with AI-Enhanced OCR: Unlike traditional OCR that merely identifies characters, AI-enhanced OCR utilizes Large Language Models to understand the context and meaning of surrounding text. This allows the AI to accurately "fill in the blanks" of faded or damaged documents by predicting words based on linguistic patterns and historical document structures.
  • The Power of Diplomatic Transcription: The webinar introduces the concept of "diplomatic transcription"—a verbatim, literal record that captures every abbreviation, spelling variation, and punctuation mark exactly as it appears. Using specific "faithful" prompts, researchers can force AI to avoid modernizing text, ensuring the original evidence is preserved for future analysis.
  • Strategic "Chunking" for Accuracy: To combat the risk of "hallucinations" (where AI makes up information), the experts recommend breaking large documents into manageable, single-page chunks. This incremental approach allows for a more thorough verification process and prevents the AI from exceeding its "context window," leading to significantly higher data integrity.

    Watching the full recording of this webinar offers a unique opportunity to see these techniques applied to real-world genealogical puzzles, including complex coroner’s reports and elusive birth registrations with surname variations. Witnessing the presenters troubleshoot live AI responses—including moments where the AI "runs off the rails"—provides invaluable "critical thinking" skills that cannot be learned from a manual alone. To truly transform your workflow, explore the additional resources and bonus prompts included in the webinar syllabus. These tools are designed to help you experiment with your own ancestors' records and discover the hidden stories waiting within your data.


Comments (68)

Sort byNewest
  1. RB
    Robert Barry
    7 days ago

    Very informative. I learnt a lot.

  2. TS
    Torhild Shirley
    7 days ago

    Love that Andrew and Fiona are going live with the risk of AI not behaving - that happens to us all.

  3. BO
    Brian O'Farrell
    7 days ago

    Interesting and timely. I would have liked to see how Claude and Gemini might have handled the prompt. I recently tried Claude after mostly using ChatGPT for the past year. Claude was much improved from when I last tried and returned significantly better results than ChatGPT.

  4. AS
    Angela Southworth
    7 days ago

    I enjoyed walking through examples in real time and hearing how the presenters differed in their verbage selection.

  5. DK
    Dale A Ketcheson
    7 days ago

    The first set of these AI classes were amazing and this second set has started out just as well. I especially learn from seeing what prompts and processes Fiona and Andrew use in their research. It helps to see an 'answer' before I can then figure out how to do it on my own.

  6. AW
    Allison Willis
    7 days ago

    I really liked this webinar, and there is a wealth of knowledge here!!!! Worth watching again too!!

  7. MR
    Maureen S ROSO
    7 days ago

    This course has come handy in my current research thanks.

  8. KM
    Kathleen Marine
    7 days ago

    As one who thought A I was a bad thing for Genealogist, after this presentation I have to eat my words. I can be a useful tool when used correctly. I hope the subsequent classes teach me how to use it correctly.

From Research to Data with AI 1 of 5: From Records to Raw Data—Extracting with AI - Legacy Family Tree Webinars