Difference between revisions of "Augmenting OCR"

From iDigBio
Jump to: navigation, search
m (adding more about the workingn group's efforts. removing language about the stub)
Line 1: Line 1:
This page is just a stub for the Augmenting OCR Working Group(aOCR). Over time, members of this group and the community will work together to develop content and deliverables that will serve the broader digitization community.
+
This is a community derived encyclopedia of information about the Augmenting OCR working group. Members of this working group and the community are encouraged to work together to develop content and deliverables that will serve the broader digitization community.  
  
Please let us know if you need assistance modifying this page: [https://www.idigbio.org/contact/Website_feedback iDigBio Help Desk]
+
We are focusing the efforts of the working group to put together materials to help the community get more from their OCR strategies. Topics we are gathering material on include:  
  
Also, if you would like to learn more about wiki syntax:
+
*known effective practices for getting the most from any OCR software.<br>
[http://meta.wikimedia.org/wiki/Help:Wikitext_examples Mediawiki Wikitext Examples]
+
*known issues that hinder good (useful) OCR output.<br>
 +
*reporting findings after working with real image data and programmers to improve parsing of OCR output.<br>
 +
*lists of OCR software currently being utilized by the natural history collections community.<br>
  
*[[OCR Resources]]
+
Check out the following pages. We welcome your input!<br>
*[[Technical Issues]]
+
 
 +
*[[OCR Resources]]  
 +
*[[Technical Issues]]  
 
*[[OCR / NLP Workflows]]
 
*[[OCR / NLP Workflows]]
 +
 +
Please let us know if you need assistance modifying this page: [https://www.idigbio.org/contact/Website_feedback iDigBio Help Desk]
 +
 +
Also, if you would like to learn more about wiki syntax: [http://meta.wikimedia.org/wiki/Help:Wikitext_examples Mediawiki Wikitext Examples]
 +
 +
<br>

Revision as of 11:43, 9 August 2012

This is a community derived encyclopedia of information about the Augmenting OCR working group. Members of this working group and the community are encouraged to work together to develop content and deliverables that will serve the broader digitization community.

We are focusing the efforts of the working group to put together materials to help the community get more from their OCR strategies. Topics we are gathering material on include:

  • known effective practices for getting the most from any OCR software.
  • known issues that hinder good (useful) OCR output.
  • reporting findings after working with real image data and programmers to improve parsing of OCR output.
  • lists of OCR software currently being utilized by the natural history collections community.

Check out the following pages. We welcome your input!

Please let us know if you need assistance modifying this page: iDigBio Help Desk

Also, if you would like to learn more about wiki syntax: Mediawiki Wikitext Examples