4,713
edits
m (→Scope) |
|||
Line 22: | Line 22: | ||
***This will allow some algorithms to exploit spatial information to identify elements. This format is, however, not a main focus for this hackathon. | ***This will allow some algorithms to exploit spatial information to identify elements. This format is, however, not a main focus for this hackathon. | ||
*Some data dictionaries and authority files may be provided (or you may use those you have access to) in efforts to have cleaner OCR output before parsing. | *Some data dictionaries and authority files may be provided (or you may use those you have access to) in efforts to have cleaner OCR output before parsing. | ||
*Those wishing to pursue other goals such as image segmentation, finding specific elements, or improving usability & user interfaces to the OCR and parsing tools are encouraged to do so and report back to the group at the hackathon. | *Those wishing to pursue other goals such as image segmentation, finding specific elements, or improving usability & user interfaces to the OCR output and parsing tools are encouraged to do so and report back to the group at the hackathon. | ||
== Metrics and Evaluation == | == Metrics and Evaluation == |