Known OCR, ML, NLP Issues

From iDigBio
Revision as of 22:37, 10 January 2013 by Dpaul (Talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Specific Issues Needing Work

  • how to get OCR to ignore a map (reduce OCR confusion)
  • ...
    and
    ___ 
    present a challenge and confuse OCR and parsing.
  • figure out an algorithm that would separate images into sets with no handwriting, little handwriting (mostly text typed or printed), lots of handwriting