Difference between revisions of "2013 AOCR Hackathon Wiki"

From iDigBio
Jump to: navigation, search
m (Overview of the Challenge)
m (Overview of the Challenge)
Line 32: Line 32:
 
[[Known OCR, ML, NLP Issues]] and challenges
 
[[Known OCR, ML, NLP Issues]] and challenges
 
    
 
    
link to user interface wish list
+
Human-in-the-loop: [[User Interface Wish List]]
   link labelx to apiary and symbiota
+
    
  what else?
+
 
+
 
<pre>*Thank you Nesent, Hilmar Lapp and the HIP working group for this model.</pre>
 
<pre>*Thank you Nesent, Hilmar Lapp and the HIP working group for this model.</pre>

Revision as of 22:48, 10 January 2013

Welcome to the 2013 iDigBio AOCR Hackathon Wiki

  • Short URL to this hackathon wiki http://tinyurl.com/aocrhackathonwiki
  • Those participating in the first iDigBio AOCR Hackathon need an iDigBio account.
  • Note: This wiki page undergoing frequent updates and some participants have wiki edit permissions and will add to / update / edit these pages before, during and after the hackathon.

Links to Logistics, Communication, and Participant Information

Overview of the Challenge

  "core" fields   

link to explanations and examples of the 3 data sets

  set 1: LBCC label images
  set 2: NYBG and BRIT label images
  set 3: CalBug ENT label images

link to page summarizing the rules we followed to transcribe the gold set (and others)

Text Transcription Issues

Known OCR, ML, NLP Issues and challenges

Human-in-the-loop: User Interface Wish List

*Thank you Nesent, Hilmar Lapp and the HIP working group for this model.