Search results

Page title matches

OCR SaaS
=OCR SaaS= *Process the ocr with the available OCR engines

1 KB (190 words) - 15:48, 17 April 2014
OCR Tips
...400–600 dpi for texts printed in smaller fonts (9pt or smaller). For best OCR results vertical and horizontal resolutions must be the same. See [http://f ...Setting an extremely low resolution (less than 150 dpi) adversely affects OCR quality. See [http://finereader.abbyy.com/guide/ User's Guide] for addition

5 KB (776 words) - 17:33, 3 January 2014
OCR Resources
[[Category:OCR]] == OCR Software used by ADBC projects ==

7 KB (1,027 words) - 13:33, 25 August 2014
Augmenting OCR
[[Category:OCR]] === Augmenting OCR Working Group (A-OCR) Overview ===

5 KB (743 words) - 17:59, 18 June 2014
OCR / NLP Workflows
== OCR / NLP Workflow & Protocol Documents == === Adding OCR to your Workflow ===

4 KB (581 words) - 13:15, 25 August 2014
Augmenting OCR Logistics
Augmenting OCR Working Group Workshop, presented by iDigBio, the National Resource (Home U .../www.idigbio.org/wiki/index.php/IDigBio_Augmenting_OCR_Workshop Augmenting OCR Workshop Agenda]

4 KB (650 words) - 15:51, 26 September 2012
Known OCR, ML, NLP Issues
...g list of known topics where work is needed that would improve things like OCR output, overall parsing results, and meaningful data set creation for digit <li>how to get OCR to ignore a map (reduce OCR confusion)</li>

957 bytes (157 words) - 18:30, 31 January 2013
IDigBio Augmenting OCR Workshop
...ound:#D58B28;text-align:center;font-size:9pt" | Quick Links for Augmenting OCR Workshop |[http://tinyurl.com/AOCRTwoDayAgenda Augmenting OCR Workshop Agenda]

6 KB (879 words) - 17:23, 3 February 2015
Transcription Hackathon OCR Integration Planning
...esentation/d/13Vugd2gaZBza5WZKfGqkpT7TLwqqqNLjCBEjOvfQxlw/edit?usp=sharing OCR Group Presentation][https://www.idigbio.org/sites/default/files/workshop-pr

648 bytes (85 words) - 16:32, 14 January 2014
Digitization Projects Using OCR / ML / NLP
[[Category:OCR]] == Use Cases for OCR / ML / NLP in Current Digitization Efforts ==

3 KB (409 words) - 18:00, 18 June 2014
Augmenting OCR Working Group Resources
== Augmenting OCR Working Group Discussion Forum == ...rg/forums/augmenting-ocr-and-nlp https://www.idigbio.org/forums/augmenting-ocr-and-nlp]

2 KB (369 words) - 14:40, 27 January 2014

Page text matches

Known OCR, ML, NLP Issues
...g list of known topics where work is needed that would improve things like OCR output, overall parsing results, and meaningful data set creation for digit <li>how to get OCR to ignore a map (reduce OCR confusion)</li>

957 bytes (157 words) - 18:30, 31 January 2013
Tools for data use and research
== OCR ==

417 bytes (43 words) - 16:54, 25 May 2016
ISchools2013
...enges of digitizing natural history collections with a focus on the use of OCR and NLP. Panel presentations planned encompass an all-inclusive world view ...eanest OCR possible and 2) to use NLP algorithms in an effort to parse the OCR output into Darwin Core fields. Participants' outputs will be compared and

2 KB (245 words) - 16:47, 26 October 2012
OCR SaaS
=OCR SaaS= *Process the ocr with the available OCR engines

1 KB (190 words) - 15:48, 17 April 2014
Augmenting OCR Working Group Resources
== Augmenting OCR Working Group Discussion Forum == ...rg/forums/augmenting-ocr-and-nlp https://www.idigbio.org/forums/augmenting-ocr-and-nlp]

2 KB (369 words) - 14:40, 27 January 2014
Digitization Projects Using OCR / ML / NLP
[[Category:OCR]] == Use Cases for OCR / ML / NLP in Current Digitization Efforts ==

3 KB (409 words) - 18:00, 18 June 2014
Technical Issues
...20 pixels or better is preferred. See [http://code.google.com/p/tesseract-ocr/wiki/FAQ Tesseract's FAQ] for more information concerning this issue. ...hics or even a simple black border can cause interference that will affect OCR output. For example, the following images will produce no output from Tesse

2 KB (250 words) - 11:07, 9 May 2013
OCR / NLP Workflows
== OCR / NLP Workflow & Protocol Documents == === Adding OCR to your Workflow ===

4 KB (581 words) - 13:15, 25 August 2014
Augmenting OCR
[[Category:OCR]] === Augmenting OCR Working Group (A-OCR) Overview ===

5 KB (743 words) - 17:59, 18 June 2014
Fixed Dataset Errata
::::/webroot/datasets/lichens/gold/ocr/NY01075763_lg.txt fixed --[[User:Dpaul|Dpaul]] 16:33, 27 February 2013 (EST ::datasets/lichens/gold/ocr/WIS-L-0012040_lg.txt: Longitude recorded as L49 (capitalized for clarity) i

9 KB (1,267 words) - 16:40, 3 July 2013
Transcription Hackathon OCR Integration Planning
...esentation/d/13Vugd2gaZBza5WZKfGqkpT7TLwqqqNLjCBEjOvfQxlw/edit?usp=sharing OCR Group Presentation][https://www.idigbio.org/sites/default/files/workshop-pr

648 bytes (85 words) - 16:32, 14 January 2014
OCR Resources
[[Category:OCR]] == OCR Software used by ADBC projects ==

7 KB (1,027 words) - 13:33, 25 August 2014
IDigBio Augmenting OCR Workshop
...ound:#D58B28;text-align:center;font-size:9pt" | Quick Links for Augmenting OCR Workshop |[http://tinyurl.com/AOCRTwoDayAgenda Augmenting OCR Workshop Agenda]

6 KB (879 words) - 17:23, 3 February 2015
Presentations & Reports
...and used extensively at Arizona State University. The purpose is to parse OCR'd label data into the respective data fields (e.g. Collector, collection nu ...tp://manuscripttranscription.blogspot.com/2013/02/detecting-handwriting-in-ocr-text.html final report]) and label extraction from Dataset 3 ([http://manus

4 KB (627 words) - 13:12, 13 June 2013
OCR Tips
...400–600 dpi for texts printed in smaller fonts (9pt or smaller). For best OCR results vertical and horizontal resolutions must be the same. See [http://f ...Setting an extremely low resolution (less than 150 dpi) adversely affects OCR quality. See [http://finereader.abbyy.com/guide/ User's Guide] for addition

5 KB (776 words) - 17:33, 3 January 2014
Hackathon FAQ
...n parsing again. Given the 2-day agenda, it's probably not feasible to run OCR and output algorithms on all 10,000 images in a dataset at the hackathon. ...ld?''' : For the scope of this hackathon, getting any taxon name from the OCR output and into the CSV file into the field aocr: verbatimScientificName is

4 KB (567 words) - 19:44, 13 January 2013
Image Selection and Processing Protocols
==== OCR Images ==== *200 Additional Hand Parsed labels from Raw OCR Output (Silver)

4 KB (609 words) - 13:25, 16 July 2013
Five iConference2013 Talks
....pdf/iConference2013WorkshopDP.pptx Introducing iDigBio and the Augmenting OCR Working Group] === :::;Jason Best: The Apiary Project: combining OCR technology, OCR output from herbarium specimen or other images containing museum specimen d

8 KB (1,125 words) - 14:38, 18 March 2013
Hackathon Challenge
...versity museum collections specimen data can be sped up if the output from OCR can be parsed faster and more accurately and packaged into semantically mea ...utput or repeat the OCR with the software of choice and then parse the new OCR output attempting to successfully populate as many of the selected Darwin C

9 KB (1,485 words) - 13:23, 6 January 2014
2013 AOCR Hackathon Wiki
|[https://www.idigbio.org/content/update-idigbio-augmenting-ocr-working-group iDigBio AOCR Hackathon Workshop Report] ...& Reports#Paul_Schroder|Paul & Robin Schroeder: iDigBio Hackathon: iDigBio OCR Hackathon Initial Results - Rest API]]

8 KB (1,061 words) - 16:32, 9 July 2015
IConference 2013 iDigBio AOCR WG Wiki
..., & Ulate, W. (2013). Help iDigBio reveal hidden data - iDigBio Augmenting OCR working group needs you. iConference 2013 Proceedings (pp. 1019-1021). doi ...owledge and collaboration as part of our multi-faceted approach to improve OCR strategies and natural language processing (NLP) algorithms used in digitiz

11 KB (1,417 words) - 13:59, 29 May 2013
ScioTR
...ouch-enabled Windows 8 app which integrates Optical Character Recognition (OCR), Natural Language Parsing (NLP), Model Strategies and crowdsourcing to pro

1 KB (159 words) - 11:57, 24 July 2013
User Interface Wish List
While this first hackathon centers around the task of getting data from OCR output parsed into semantically meaningful parts for insertion into a datab

914 bytes (135 words) - 01:39, 11 January 2013
Transcription Hackathon
*[[Transcription Hackathon OCR Integration Planning| OCR Integration Track]] ...lt/files/workshop-presentations/citscribe/lightningtalk-miaochen.pdf Using OCR]

9 KB (1,240 words) - 15:35, 3 February 2015
Specimen Image Processing
# optical character recognition (OCR).

1 KB (173 words) - 10:55, 2 October 2014
RESTful Documentation by Paul Schroeder
{{Caution|This page refers to an API developed as part of the 2014 Augmenting OCR hackathon which is not supported by iDigBio and is no longer available.}} ...own OCR software to use this puppy. A drawback is that I see a lot of the OCR results are terrible.

14 KB (1,847 words) - 11:57, 9 July 2018
Augmenting OCR Logistics
Augmenting OCR Working Group Workshop, presented by iDigBio, the National Resource (Home U .../www.idigbio.org/wiki/index.php/IDigBio_Augmenting_OCR_Workshop Augmenting OCR Workshop Agenda]

4 KB (650 words) - 15:51, 26 September 2012
Access to Digitization Tools and Methods
...ata/metadata capture and enrichment such as Optical Character Recognition (OCR), text mining, Natural Handwriting Recognition (NHR), Natural Language Proc ...Data Discovery and Doer Happiness: Uses for Optical Character Recognition (OCR) Output.'''] (recording) Deborah Paul, Andrea Matsunaga, Miao Chen, Jason B

11 KB (1,463 words) - 12:43, 3 February 2015
2013 Hackathon Logistics
...e Botanical Research Institute of Texas (BRIT)] and the iDigBio Augmenting OCR Working Group, supported by iDigBio, the National Resource (Home Uniting Bi *[https://www.idigbio.org/wiki/index.php/2013_AOCR_Hackathon_Wiki Augmenting OCR Hackathon Agenda and Wiki]

8 KB (1,182 words) - 18:55, 9 February 2013
Participant Related Projects
...mThePage.com], and read more about the world of crowdsourced transcription/OCR-correction at [http://manuscripttranscription.blogspot.com Ben's blog] ...t represents a very very new tool that SilverBiology is developing for the OCR meeting and some internal projects.

9 KB (1,328 words) - 13:48, 17 January 2013
Technology Glossary
| Optical Character Rrecognition (OCR)||The mechanical or electronic conversion of scanned images of handwritten,

4 KB (547 words) - 15:20, 13 May 2016
IDigBio Virtual Appliances
- Optical Character Recognition(OCR) appliance

4 KB (502 words) - 16:47, 4 August 2014
IDigBio Listservs
...https://www.idigbio.org/wiki/index.php/2013_AOCR_Hackathon_Wiki Augmenting OCR Working Group Hackathon (Feb 2013)]

6 KB (776 words) - 11:26, 22 February 2024
Input CSV Format
| Output of the process of applying OCR to the multimedia object. | Free form text describing the software utilized for OCR as well as any additional technique (cropping, color alteration applied, co

9 KB (1,298 words) - 11:13, 10 October 2017
Citstitch Hackathon
...erty and Ed Gilbert: on building restful web services for returning parsed OCR output

5 KB (711 words) - 12:34, 3 February 2015
Digitizing and Imaging Collections: New Methods, Ideas, and Uses
...-- to using the images with tools like Inselect, and using text search of OCR output from these images to maximize a digitization workflow. We then look | 6 ||4:40 - 5:00 ||10:40-11:00am||Using OCR for QC in the digitisation workflow of RBGE herbarium||'''Robyn Drinkwater'

8 KB (1,025 words) - 14:30, 11 July 2016
Wiki Home
**[[Augmenting OCR Working Group Resources]]

5 KB (707 words) - 14:20, 15 November 2021
Dataset Errata
...ent. ('''Daryl)''' ('''Bryan''': Agreed. Should be fixed to match the OCR label.) ...ixed to match the label as best as possible. If it is not clear follow the OCR file.)

77 KB (10,277 words) - 18:06, 15 September 2013
Progress in Digitization
...r-recognition-ocr-output-digitization Using optical character recognition (OCR) output in digitization: see your data before it's in the database and afte ...ization-incorporating-ocr-digitisation-and-curation-workflow Incorporating OCR into a digitisation and curation workflow]''' ('''Elspeth Haston''', Robyn

15 KB (1,733 words) - 12:56, 3 February 2015
Small Herbarium Workshop FSU
.../default/files/workshop-presentations/small-herbarium2013/OCRoverview.pptx OCR, it's not just for parsing! Think Sort.] [pptx] Deborah Paul

9 KB (1,122 words) - 13:04, 22 September 2015
NANSH Working Group
*OCR, Machine Learning, Natural Language Processing

9 KB (1,216 words) - 14:05, 25 April 2015
Digitization Template 2
...gumenting-ocr-hackathon-fort-worth-texas-february-13-14 iDigBio Augmenting OCR Hackathon Workshop Report] ...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]

21 KB (2,495 words) - 16:16, 23 September 2014
Digitization Template 1
...gumenting-ocr-hackathon-fort-worth-texas-february-13-14 iDigBio Augmenting OCR Hackathon Workshop Report] ...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]

22 KB (2,626 words) - 16:14, 23 September 2014
IDigBio Summit 2015
*[[Media:iDigBio-Summit-V_AOCR-Poster.pdf|Augmenting OCR]] (Deb Paul) *[[Media:iDigBio-Summit-V_AOCR-Poster.pdf|Augmenting OCR]] (Deb Paul)

23 KB (2,798 words) - 13:37, 7 September 2022
Public Participation Platforms
FromThePage focuses on textual transcription, OCR correction and metadata description of digitized documents.

9 KB (1,254 words) - 15:02, 12 December 2023
IDigBio Working Groups
[[Category:OCR]] == Augmenting OCR (aOCR) ==

78 KB (10,722 words) - 18:41, 15 December 2022
TCN Resources
|valign="top"|label imaging, using OCR for data capture [http://lbcc1.acis.ufl.edu/?q=project_workflow workflow] ...ls; data capture from label images; data capture via voice recognition and OCR

33 KB (4,485 words) - 14:27, 15 November 2021
SWG Webinar Series
...r transcribing Stage 1 records into Stage 2 records with the assistance of OCR and NLP.

8 KB (1,098 words) - 16:17, 16 August 2019
PacificDigitization
*[[Media:26March2014UsesForOCR.pdf|Uses for Optical Character Recognition (OCR) Output (Deborah Paul)]]

10 KB (1,344 words) - 13:08, 3 February 2015
Specimens Full Circle SPNHC 2015
...ll-circle/410_Gilbert_OCR_2015-05-20.pdf Specimen label digitization using OCR/NLP tools integrated within the Symbiota processing toolkit (pdf)]||'''Ed G

13 KB (1,695 words) - 11:14, 4 January 2017
Digitization Template 6
...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]

18 KB (2,010 words) - 16:47, 24 February 2015
Citizen Science/Crowdsourcing Tools for Digitization
*[http://evernote.com/ Evernote] (NLP/OCR API) “Evernote makes it easy to remember things big and small from you

12 KB (1,680 words) - 13:08, 3 September 2014
Data Management Interest Group
*Macroalgal TCN using Voice Recognition and OCR output to speed up digitization

17 KB (2,330 words) - 01:57, 29 January 2020
Digitization Template 5
...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]

23 KB (2,733 words) - 16:05, 16 October 2014
Digitization Template 4
...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]

23 KB (2,739 words) - 16:06, 16 October 2014
Glossary of Tools
|valign="top"|Commercial OCR software that can be used to convert typed and typeface label data from spe |valign="top"|An advanced optical character recognition (OCR) or rather more specific handwriting recognition system that allows fonts a

72 KB (10,638 words) - 12:39, 23 May 2016
Managing Natural History Collections Data for Global Discoverability
:On OCR, NLP, duplicate harvesting

22 KB (3,098 words) - 22:25, 21 September 2015
IDigBio Workshops
*integration of OCR into the transcription workflow ...igbio-augumenting-ocr-hackathon-fort-worth-texas-february-13-14 Augmenting OCR Hackathon]

148 KB (18,946 words) - 18:51, 24 August 2020
Glossary of Projects and Organizations
...ough the online reader or downloaded in part or as a complete work in PDF, OCR text, or JPG2000 file formats.

33 KB (4,698 words) - 13:33, 23 May 2016
Glossary of Terms
|valign="top"|OCR, software |valign="top"|OCR, software 

174 KB (25,314 words) - 13:20, 22 March 2019
5th Annual Digital Data Conference, Florida Museum of Natural History
...oncurrent Session 7'' '''Digitizing insect specimen photographs with an OCR and machine-learning enabled information extraction pipeline''' Neha Kum

49 KB (6,555 words) - 17:10, 14 June 2021

Search results

Page title matches

Page text matches

Navigation menu

Search