iConference2013: Introducing iDigBio and the Augmenting OCR Working Group

Publication Type: Presentation
Year of Publication: 2013
Authors: Paul, Deborah L.
Keywords: Augmenting Optical Character Recognition, Darwin Core, iConference2013, iDigBio, Information Science, Natural Language Processing, Parsing, XML
Abstract: Highlights of this introduction include: What is iDigBio? Who is the AOCR wg? Why are we here at iConference2013? Where did we come from and what makes us unique? Where is our data coming from? How can the iSchools community get involved? The AOCR wg is working to find ways to speed up the digitization of natural history museum specimen data and improve access to it, and we think the Information Science community can help. Five talks follow, each explaining a key part of our story. Note that the iDigBio* Augmenting Optical Character Recognition Working Group (AOCR wg) put together four submissions for iSchools iConference2013: this workshop, a paper, a poster, and an alternative event. All of these are concurrent with a Hackathon at the Botanical Research Institute of Texas (BRIT) as part of a strategic outreach effort. Interested parties are encouraged to participate in our Hackathon, join an existing iDigBio working group, propose and host a workshop, and contribute to our forums and online materials. Integrated Digitized Biocollections (iDigBio) is a National Science Foundation (NSF) project funded under the Advancing Digitization of Biological Collections (ADBC) program. Thematic Collection (Museum) Networks (TCNs) are NSF-funded to digitize the specimen data needed to answer grand challenge questions and to provide that data to iDigBio. iDigBio is building a cyberinfrastructure to integrate data from museums across the USA, making it accessible to everyone.