OCR / NLP Workflows

== OCR / NLP Workflow & Protocol Documents ==
=== The SALIX Method ===
The SALIX Method: A semi-automated workflow for herbarium specimen digitization. Barber, A.C., Lafferty, D., & Landrum, L.R. ''in press''. Taxon, Volume 62, Number 3, 17 June 2013, pp. 581-590(10). DOI: http://dx.doi.org/10.12705/623.16
'''Using OCR in Specimen Cataloging:''' Although parsing algorithms are still imperfect, OCR output can already save considerable effort: information extracted from the OCR text can be used to sort yet-to-be-cataloged specimens before data entry (for example, by label type). For some ideas on how to do this, refer to the following PowerPoint presentations: [https://www.idigbio.org/sites/default/files/workshop-presentations/aocr-wgw/gottschalk_gainesville.pptx OCR implementation in The Caribbean Plants Digitization Project] and [https://www.idigbio.org/sites/default/files/workshop-presentations/aocr-wgw/Watson-Tri-Trophic-Digitization-OCR.pptx Tri-Trophic Digitization: Putting the OCR in Workflow].
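As a rough illustration of sorting uncataloged specimens by label type, the sketch below groups specimen identifiers by keywords found in their raw OCR text. It is not taken from either presentation: the label type names, the keyword lists, and the function names <code>classify_label</code> and <code>sort_by_label_type</code> are all hypothetical, and a real project would likely need keywords tuned to its own labels.

```python
from collections import defaultdict

# Hypothetical label types and indicative keywords (assumptions for illustration only).
LABEL_KEYWORDS = {
    "herbarium_header": ["herbarium", "flora of", "plants of"],
    "determination": ["det.", "determined by", "annotated"],
    "collector": ["coll.", "collected by", "leg."],
}


def classify_label(ocr_text):
    """Return the label type whose keywords occur most often in the OCR text."""
    text = ocr_text.lower()
    scores = {
        label: sum(text.count(keyword) for keyword in keywords)
        for label, keywords in LABEL_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    # Text with no keyword hits goes to a catch-all pile for manual review.
    return best if scores[best] > 0 else "unsorted"


def sort_by_label_type(records):
    """Group specimen identifiers by the label type guessed from their OCR text."""
    groups = defaultdict(list)
    for specimen_id, ocr_text in records.items():
        groups[classify_label(ocr_text)].append(specimen_id)
    return dict(groups)


if __name__ == "__main__":
    # Made-up OCR snippets keyed by made-up specimen identifiers.
    sample = {
        "A1": "Flora of Jamaica. Herbarium sheet",
        "A2": "Det. J. Smith 1998",
        "A3": "@@##",
    }
    print(sort_by_label_type(sample))
```

Keyword counting is deliberately crude; its only purpose here is to batch similar labels together so that a cataloger can work through one label layout at a time.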
----
== OCR Workflow at the Royal Botanic Garden Edinburgh ==
'''[http://www.idigbio.org/sites/default/files/working-groups/aocr/OCRWorkflowRBGE.docx Draft OCR workflow for RBGE]'''