Presentations & Reports: Difference between revisions

From iDigBio
Jump to navigation Jump to search
Line 7: Line 7:
::;[http://manuscripttranscription.blogspot.com/2013/02/improving-ocr-inputs-from-ocr-outputs.html Improving OCR Inputs from OCR Outputs] - Ben Brumfield: Efforts to improve the quality of OCR by pre-processing images based on the output of 'naive' OCR execution.  Topics included handwriting detection within Dataset 1 ([http://manuscripttranscription.blogspot.com/2013/02/detecting-handwriting-in-ocr-text.html final report]) and label extraction from Dataset 3 ([http://manuscripttranscription.blogspot.com/2013/02/results-of-ocrocrop-approach-to.html final report]).
::;[http://manuscripttranscription.blogspot.com/2013/02/improving-ocr-inputs-from-ocr-outputs.html Improving OCR Inputs from OCR Outputs] - Ben Brumfield: Efforts to improve the quality of OCR by pre-processing images based on the output of 'naive' OCR execution.  Topics included handwriting detection within Dataset 1 ([http://manuscripttranscription.blogspot.com/2013/02/detecting-handwriting-in-ocr-text.html final report]) and label extraction from Dataset 3 ([http://manuscripttranscription.blogspot.com/2013/02/results-of-ocrocrop-approach-to.html final report]).
::;Image Segmentation - Phuc Nguyen
::;Image Segmentation - Phuc Nguyen
::;Parsing Dataset 1 - Robert Anglin
::;[https://www.idigbio.org/sites/default/files/workshop-presentations/aocr-hackathon/HackathonPresentation.doc Parsing Dataset 1] - Robert Anglin
::;LabelX - Bryan Heidorn & Qianjin Zhang
::;LabelX - Bryan Heidorn & Qianjin Zhang
::;Parsing Dataset 2 - Dmitry Mozzherin
::;Parsing Dataset 2 - Dmitry Mozzherin

Revision as of 14:55, 12 March 2013

Talks & Reports from Hackathon 1


Hackathon Overview & Intro to iDigBio - Deborah Paul
Overview of the Hackathon goals and introduction to iDigBio for those new to the project. Review this presentation before the other hackathon presentations to learn about the aOCR working group and the talks, presentations, and work being done at this hackathon and after.
Hackathon Metrics - Alex Thompson
Parsing Dataset 1 - Daryl Lafferty
Improving OCR Inputs from OCR Outputs - Ben Brumfield
Efforts to improve the quality of OCR by pre-processing images based on the output of 'naive' OCR execution. Topics included handwriting detection within Dataset 1 (final report) and label extraction from Dataset 3 (final report).
Image Segmentation - Phuc Nguyen
Parsing Dataset 1 - Robert Anglin
LabelX - Bryan Heidorn & Qianjin Zhang
Parsing Dataset 2 - Dmitry Mozzherin
Services and Workflow UIs - Robin and Paul Schroeder
Workflows - John Pickering
DarwinScore and Apiary - Jason Best

Back to Hackathon Wiki