Difference between revisions of "Participant Related Projects"

From iDigBio
Jump to: navigation, search
m (DarwinCore Parser)
m (DarwinCore Parser)
Line 22: Line 22:
  
 
== DarwinCore Parser ==
 
== DarwinCore Parser ==
:::; DarwinCore Parser : From Hackathon participant, Michael Giddens, a new tool written in Node.js. It represents a very very new tool that SilverBiology is developing for the OCR meeting and some internal projects.
+
The DarwinCore Parser comes to us from Hackathon participant, [[2013_Hackathon_Participants#Michael_Giddens| Michael Giddens]], and is a new tool written in Node.js. It represents a very very new tool that SilverBiology is developing for the OCR meeting and some internal projects.
 
See: https://github.com/silverbiology/dwc-parser
 
See: https://github.com/silverbiology/dwc-parser
 
*Will be used to send OCR as text and:
 
*Will be used to send OCR as text and:

Revision as of 18:57, 13 January 2013

Projects of Various Participants with Relevance to the Hackathon

Tolkin

Hackathon participant Chris Dell is associated with the Tolkin Project. Another Tolkin Informatics staff member, Elvis Wu attended the AOCR Working Group meeting in October of 2012 to help us plan the February 2013 hackathon. Reed Beaman, iDigBio Senior Personnel, is one of the Principal Investigators on this project. From the website:

TOLKIN is an information management and analytical web application to provide informatics support for phylodiversity and biodiversity research projects. As a web-based application, TOLKIN is able to support collaborative projects by providing shared access to a variety of data on voucher specimens, taxonomy, bibliography, morphology, DNA samples and sequences.

SALIX

Participant Daryl Lafferty is the developer on this project, Semi-Automated Label Information Extraction System (SALIX).

BiSciCol

Participant Bryan Heidorn and iDigBio staff member Reed Beaman are part of the Biological Science Collections Tracker project. Read more about this endeavor at http://biscicol.blogspot.com/ From their website:

BiSciCol (Biological Science Collections) Tracker is a funded NSF collaborative project with the goal of building an infrastructure designed to tag and track scientific collections and all of their derivatives.

From the Page

Hackathon participant, Ben Brumfield's consulting organization is From The Page. Check out the blog at FromThePage.com for more about Ben's experience with developing user interfaces for transcription, and OCR, and,... From the website:

FromThePage is free software that allows volunteers to transcribe handwritten documents on-line. It's easy to index and annotate subjects within a text using a simple, wiki-like mark-up. Users can discuss difficult writing or obscure words within a page to refine their transcription. The resulting text is hosted on the web, making documents easy to read and search.

DarwinCore Parser

The DarwinCore Parser comes to us from Hackathon participant, Michael Giddens, and is a new tool written in Node.js. It represents a very very new tool that SilverBiology is developing for the OCR meeting and some internal projects. See: https://github.com/silverbiology/dwc-parser

  • Will be used to send OCR as text and:
    • Get Dates from blob of text with ratings
    • Get Lat/Lng in various formats
    • Wrapper for GlobalNames and GBIF Checklist Bank for Name Recognition
    • Higher Taxa lookup from GBIF Checklist Bank
    • Wrapper for Python Lat/Lng format converter
    • Type status detection
    • Experimental: Using geos and GDAM for higher geography lookup and potential any shapefile lookup like Ecological data.
    • more stuff.... looking for help...

The Apiary Project

Find out more about Apiary at http://www.apiaryproject.org

Biodiversity Heritage Library

For an introduction to this project see: http://biodiversitylibrary.org

Back to the 2013 AOCR Hackathon Wiki