Introduction to OCR and Image Derivatives

The goal of digitizing historical and archival materials is to create fresh access to these special collections.

Enabling full-text search in a digital image collection makes it easier for a student, researcher, or other library patron to find the content they're looking for. But how do you optimize your text conversion to get the best results?

Optical character recognition (OCR) isn't new technology, but it is still evolving and improving.

In this webinar, recorded July 20, 2017, Backstage's Alicia Sell and Eric Larson walk you through the basics of OCR and the derivative files you'll need to enable text discovery in your digital collections.

Alicia and Eric cover what you can expect from today's OCR engines, what the software can and can't do. They'll talk about optimizing images at the capture stage to make OCR more reliable. And they'll discuss the pairing of images and text in various formats.

Video temporarily offline. Please contact for help.


How can we help you plan for your digitization project? Call us at 1-800-288-1265 or drop us a note.

Contact Backstage

  • NEWS
  • It's all about access.