Search results

Page title matches

  • =OCR SaaS= *Process the ocr with the available OCR engines
    1 KB (190 words) - 15:48, 17 April 2014
  • ...400–600 dpi for texts printed in smaller fonts (9pt or smaller). For best OCR results vertical and horizontal resolutions must be the same. See [http://f ...Setting an extremely low resolution (less than 150 dpi) adversely affects OCR quality. See [http://finereader.abbyy.com/guide/ User's Guide] for addition
    5 KB (776 words) - 17:33, 3 January 2014
  • [[Category:OCR]] == OCR Software used by ADBC projects ==
    7 KB (1,027 words) - 13:33, 25 August 2014
  • [[Category:OCR]] === Augmenting OCR Working Group (A-OCR) Overview ===
    5 KB (743 words) - 17:59, 18 June 2014
  • == OCR / NLP Workflow & Protocol Documents == === Adding OCR to your Workflow ===
    4 KB (581 words) - 13:15, 25 August 2014
  • Augmenting OCR Working Group Workshop, presented by iDigBio, the National Resource (Home U .../www.idigbio.org/wiki/index.php/IDigBio_Augmenting_OCR_Workshop Augmenting OCR Workshop Agenda]
    4 KB (650 words) - 15:51, 26 September 2012
  • ...g list of known topics where work is needed that would improve things like OCR output, overall parsing results, and meaningful data set creation for digit <li>how to get OCR to ignore a map (reduce OCR confusion)</li>
    957 bytes (157 words) - 18:30, 31 January 2013
  • ...ound:#D58B28;text-align:center;font-size:9pt" | Quick Links for Augmenting OCR Workshop |[http://tinyurl.com/AOCRTwoDayAgenda Augmenting OCR Workshop Agenda]
    6 KB (879 words) - 17:23, 3 February 2015
  • ...esentation/d/13Vugd2gaZBza5WZKfGqkpT7TLwqqqNLjCBEjOvfQxlw/edit?usp=sharing OCR Group Presentation][https://www.idigbio.org/sites/default/files/workshop-pr
    648 bytes (85 words) - 16:32, 14 January 2014
  • [[Category:OCR]] == Use Cases for OCR / ML / NLP in Current Digitization Efforts ==
    3 KB (409 words) - 18:00, 18 June 2014
  • == Augmenting OCR Working Group Discussion Forum == ...rg/forums/augmenting-ocr-and-nlp https://www.idigbio.org/forums/augmenting-ocr-and-nlp]
    2 KB (369 words) - 14:40, 27 January 2014

Page text matches

  • ...g list of known topics where work is needed that would improve things like OCR output, overall parsing results, and meaningful data set creation for digit <li>how to get OCR to ignore a map (reduce OCR confusion)</li>
    957 bytes (157 words) - 18:30, 31 January 2013
  • == OCR ==
    417 bytes (43 words) - 16:54, 25 May 2016
  • ...enges of digitizing natural history collections with a focus on the use of OCR and NLP. Panel presentations planned encompass an all-inclusive world view ...eanest OCR possible and 2) to use NLP algorithms in an effort to parse the OCR output into Darwin Core fields. Participants' outputs will be compared and
    2 KB (245 words) - 16:47, 26 October 2012
  • =OCR SaaS= *Process the ocr with the available OCR engines
    1 KB (190 words) - 15:48, 17 April 2014
  • == Augmenting OCR Working Group Discussion Forum == ...rg/forums/augmenting-ocr-and-nlp https://www.idigbio.org/forums/augmenting-ocr-and-nlp]
    2 KB (369 words) - 14:40, 27 January 2014
  • [[Category:OCR]] == Use Cases for OCR / ML / NLP in Current Digitization Efforts ==
    3 KB (409 words) - 18:00, 18 June 2014
  • ...20 pixels or better is preferred. See [http://code.google.com/p/tesseract-ocr/wiki/FAQ Tesseract's FAQ] for more information concerning this issue. ...hics or even a simple black border can cause interference that will affect OCR output. For example, the following images will produce no output from Tesse
    2 KB (250 words) - 11:07, 9 May 2013
  • == OCR / NLP Workflow & Protocol Documents == === Adding OCR to your Workflow ===
    4 KB (581 words) - 13:15, 25 August 2014
  • [[Category:OCR]] === Augmenting OCR Working Group (A-OCR) Overview ===
    5 KB (743 words) - 17:59, 18 June 2014
  • ::::/webroot/datasets/lichens/gold/ocr/NY01075763_lg.txt fixed --[[User:Dpaul|Dpaul]] 16:33, 27 February 2013 (EST ::datasets/lichens/gold/ocr/WIS-L-0012040_lg.txt: Longitude recorded as L49 (capitalized for clarity) i
    9 KB (1,267 words) - 16:40, 3 July 2013
  • ...esentation/d/13Vugd2gaZBza5WZKfGqkpT7TLwqqqNLjCBEjOvfQxlw/edit?usp=sharing OCR Group Presentation][https://www.idigbio.org/sites/default/files/workshop-pr
    648 bytes (85 words) - 16:32, 14 January 2014
  • [[Category:OCR]] == OCR Software used by ADBC projects ==
    7 KB (1,027 words) - 13:33, 25 August 2014
  • ...ound:#D58B28;text-align:center;font-size:9pt" | Quick Links for Augmenting OCR Workshop |[http://tinyurl.com/AOCRTwoDayAgenda Augmenting OCR Workshop Agenda]
    6 KB (879 words) - 17:23, 3 February 2015
  • ...and used extensively at Arizona State University. The purpose is to parse OCR'd label data into the respective data fields (e.g. Collector, collection nu ...tp://manuscripttranscription.blogspot.com/2013/02/detecting-handwriting-in-ocr-text.html final report]) and label extraction from Dataset 3 ([http://manus
    4 KB (627 words) - 13:12, 13 June 2013
  • ...400–600 dpi for texts printed in smaller fonts (9pt or smaller). For best OCR results vertical and horizontal resolutions must be the same. See [http://f ...Setting an extremely low resolution (less than 150 dpi) adversely affects OCR quality. See [http://finereader.abbyy.com/guide/ User's Guide] for addition
    5 KB (776 words) - 17:33, 3 January 2014
  • ...n parsing again. Given the 2-day agenda, it's probably not feasible to run OCR and output algorithms on all 10,000 images in a dataset at the hackathon. ...ld?''' : For the scope of this hackathon, getting any taxon name from the OCR output and into the CSV file into the field aocr: verbatimScientificName is
    4 KB (567 words) - 19:44, 13 January 2013
  • ==== OCR Images ==== *200 Additional Hand Parsed labels from Raw OCR Output (Silver)
    4 KB (609 words) - 13:25, 16 July 2013
  • ....pdf/iConference2013WorkshopDP.pptx Introducing iDigBio and the Augmenting OCR Working Group] === :::;Jason Best: The Apiary Project: combining OCR technology, OCR output from herbarium specimen or other images containing museum specimen d
    8 KB (1,125 words) - 14:38, 18 March 2013
  • ...versity museum collections specimen data can be sped up if the output from OCR can be parsed faster and more accurately and packaged into semantically mea ...utput or repeat the OCR with the software of choice and then parse the new OCR output attempting to successfully populate as many of the selected Darwin C
    9 KB (1,485 words) - 13:23, 6 January 2014
  • |[https://www.idigbio.org/content/update-idigbio-augmenting-ocr-working-group iDigBio AOCR Hackathon Workshop Report] ...& Reports#Paul_Schroder|Paul & Robin Schroeder: iDigBio Hackathon: iDigBio OCR Hackathon Initial Results - Rest API]]
    8 KB (1,061 words) - 16:32, 9 July 2015
  • ..., & Ulate, W. (2013). Help iDigBio reveal hidden data - iDigBio Augmenting OCR working group needs you. iConference 2013 Proceedings (pp. 1019-1021). doi ...owledge and collaboration as part of our multi-faceted approach to improve OCR strategies and natural language processing (NLP) algorithms used in digitiz
    11 KB (1,417 words) - 13:59, 29 May 2013
  • ...ouch-enabled Windows 8 app which integrates Optical Character Recognition (OCR), Natural Language Parsing (NLP), Model Strategies and crowdsourcing to pro
    1 KB (159 words) - 11:57, 24 July 2013
  • While this first hackathon centers around the task of getting data from OCR output parsed into semantically meaningful parts for insertion into a datab
    914 bytes (135 words) - 01:39, 11 January 2013
  • *[[Transcription Hackathon OCR Integration Planning| OCR Integration Track]] ...lt/files/workshop-presentations/citscribe/lightningtalk-miaochen.pdf Using OCR]
    9 KB (1,240 words) - 15:35, 3 February 2015
  • # optical character recognition (OCR).
    1 KB (173 words) - 10:55, 2 October 2014
  • {{Caution|This page refers to an API developed as part of the 2014 Augmenting OCR hackathon which is not supported by iDigBio and is no longer available.}} ...own OCR software to use this puppy. A drawback is that I see a lot of the OCR results are terrible.
    14 KB (1,847 words) - 11:57, 9 July 2018
  • Augmenting OCR Working Group Workshop, presented by iDigBio, the National Resource (Home U .../www.idigbio.org/wiki/index.php/IDigBio_Augmenting_OCR_Workshop Augmenting OCR Workshop Agenda]
    4 KB (650 words) - 15:51, 26 September 2012
  • ...ata/metadata capture and enrichment such as Optical Character Recognition (OCR), text mining, Natural Handwriting Recognition (NHR), Natural Language Proc ...Data Discovery and Doer Happiness: Uses for Optical Character Recognition (OCR) Output.'''] (recording) Deborah Paul, Andrea Matsunaga, Miao Chen, Jason B
    11 KB (1,463 words) - 12:43, 3 February 2015
  • ...e Botanical Research Institute of Texas (BRIT)] and the iDigBio Augmenting OCR Working Group, supported by iDigBio, the National Resource (Home Uniting Bi *[https://www.idigbio.org/wiki/index.php/2013_AOCR_Hackathon_Wiki Augmenting OCR Hackathon Agenda and Wiki]
    8 KB (1,182 words) - 18:55, 9 February 2013
  • ...mThePage.com], and read more about the world of crowdsourced transcription/OCR-correction at [http://manuscripttranscription.blogspot.com Ben's blog] ...t represents a very very new tool that SilverBiology is developing for the OCR meeting and some internal projects.
    9 KB (1,328 words) - 13:48, 17 January 2013
  • | Optical Character Recognition (OCR)||The mechanical or electronic conversion of scanned images of handwritten,
    4 KB (547 words) - 15:20, 13 May 2016
  • - Optical Character Recognition (OCR) appliance
    4 KB (502 words) - 16:47, 4 August 2014
  • ...https://www.idigbio.org/wiki/index.php/2013_AOCR_Hackathon_Wiki Augmenting OCR Working Group Hackathon (Feb 2013)]
    6 KB (776 words) - 11:26, 22 February 2024
  • | Output of the process of applying OCR to the multimedia object. | Free form text describing the software utilized for OCR as well as any additional technique (cropping, color alteration applied, co
    9 KB (1,298 words) - 11:13, 10 October 2017
  • ...erty and Ed Gilbert: on building restful web services for returning parsed OCR output
    5 KB (711 words) - 12:34, 3 February 2015
  • ...-- to using the images with tools like Inselect, and using text search of OCR output from these images to maximize a digitization workflow. We then look | 6 ||4:40 - 5:00 ||10:40-11:00am||Using OCR for QC in the digitisation workflow of RBGE herbarium||'''Robyn Drinkwater'
    8 KB (1,025 words) - 14:30, 11 July 2016
  • **[[Augmenting OCR Working Group Resources]]
    5 KB (707 words) - 14:20, 15 November 2021
  • ...ent. ('''Daryl)''' <br> ('''Bryan''': Agreed. Should be fixed to match the OCR label.) <br> ...ixed to match the label as best as possible. If it is not clear follow the OCR file.)
    77 KB (10,277 words) - 18:06, 15 September 2013
  • ...r-recognition-ocr-output-digitization Using optical character recognition (OCR) output in digitization: see your data before it's in the database and afte ...ization-incorporating-ocr-digitisation-and-curation-workflow Incorporating OCR into a digitisation and curation workflow]''' ('''Elspeth Haston''', Robyn
    15 KB (1,733 words) - 12:56, 3 February 2015
  • .../default/files/workshop-presentations/small-herbarium2013/OCRoverview.pptx OCR, it's not just for parsing! Think Sort.] [pptx] Deborah Paul
    9 KB (1,122 words) - 13:04, 22 September 2015
  • *OCR, Machine Learning, Natural Language Processing
    9 KB (1,216 words) - 14:05, 25 April 2015
  • ...gumenting-ocr-hackathon-fort-worth-texas-february-13-14 iDigBio Augmenting OCR Hackathon Workshop Report] ...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]
    21 KB (2,495 words) - 16:16, 23 September 2014
  • ...gumenting-ocr-hackathon-fort-worth-texas-february-13-14 iDigBio Augmenting OCR Hackathon Workshop Report] ...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]
    22 KB (2,626 words) - 16:14, 23 September 2014
  • *[[Media:iDigBio-Summit-V_AOCR-Poster.pdf|Augmenting OCR]] (Deb Paul) *[[Media:iDigBio-Summit-V_AOCR-Poster.pdf|Augmenting OCR]] (Deb Paul)
    23 KB (2,798 words) - 13:37, 7 September 2022
  • FromThePage focuses on textual transcription, OCR correction and metadata description of digitized documents.
    9 KB (1,254 words) - 15:02, 12 December 2023
  • [[Category:OCR]] == Augmenting OCR (aOCR) ==
    78 KB (10,722 words) - 18:41, 15 December 2022
  • |valign="top"|label imaging, using OCR for data capture [http://lbcc1.acis.ufl.edu/?q=project_workflow workflow] ...ls; data capture from label images; data capture via voice recognition and OCR
    33 KB (4,485 words) - 14:27, 15 November 2021
  • ...r transcribing Stage 1 records into Stage 2 records with the assistance of OCR and NLP.
    8 KB (1,098 words) - 16:17, 16 August 2019
  • *[[Media:26March2014UsesForOCR.pdf|Uses for Optical Character Recognition (OCR) Output (Deborah Paul)]]
    10 KB (1,344 words) - 13:08, 3 February 2015
  • ...ll-circle/410_Gilbert_OCR_2015-05-20.pdf Specimen label digitization using OCR/NLP tools integrated within the Symbiota processing toolkit (pdf)]||'''Ed G
    13 KB (1,695 words) - 11:14, 4 January 2017
  • ...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]
    18 KB (2,010 words) - 16:47, 24 February 2015
  • *[http://evernote.com/ Evernote] (NLP/OCR API)<br>“Evernote makes it easy to remember things big and small from you
    12 KB (1,680 words) - 13:08, 3 September 2014
  • *Macroalgal TCN using Voice Recognition and OCR output to speed up digitization
    17 KB (2,330 words) - 01:57, 29 January 2020
  • ...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]
    23 KB (2,733 words) - 16:05, 16 October 2014
  • ...io.org/content/hackathon-and-iconference-update-part-ii iDigBio Augmenting OCR Hackathon Update Part II Workshop Report]
    23 KB (2,739 words) - 16:06, 16 October 2014
  • |valign="top"|Commercial OCR software that can be used to convert typed and typeface label data from spe |valign="top"|An advanced optical character recognition (OCR) or rather more specific handwriting recognition system that allows fonts a
    72 KB (10,638 words) - 12:39, 23 May 2016
  • :On OCR, NLP, duplicate harvesting
    22 KB (3,098 words) - 22:25, 21 September 2015
  • *integration of OCR into the transcription workflow ...igbio-augumenting-ocr-hackathon-fort-worth-texas-february-13-14 Augmenting OCR Hackathon]
    148 KB (18,946 words) - 18:51, 24 August 2020
  • ...ough the online reader or downloaded in part or as a complete work in PDF, OCR text, or JPG2000 file formats.
    33 KB (4,698 words) - 13:33, 23 May 2016
  • |valign="top"|OCR, software |valign="top"|OCR, software<br>
    174 KB (25,314 words) - 13:20, 22 March 2019
  • ...oncurrent Session 7''<br>'''Digitizing insect specimen photographs with an OCR and machine-learning enabled information extraction pipeline'''<br>Neha Kum
    49 KB (6,555 words) - 17:10, 14 June 2021