Improvement of Omnipage18's Efficiency

Comments

OCR, is it Text or

OCR, is it Text or Handwriting?

Ever wonder how OCR might one day distinguish handwriting from text? In this paper published in May 2012, Karl-Heinz Steinke illustrates his latest work using OCR software Omnipage 18 and Omnipage SDK (that's software development kit to: 1) improve detection of otherwise missed text and 2) distinguish printed text from handwriting. Clear illustrations are used to explain the process of how the software finds text and handwriting and determines which is which. While the examples are of text and writing on herbarium sheets, the methods ought to apply to text and writing on other label types as well. Steinke also notes the ability to distinguish between handwriting and text means that nonsense text produced by OCR of handwriting could be algorithmically removed from the OCR output leaving behind only the text.

Improvement of Omnipage18's Efficiency

Comments

Language