Hackathon Challenge: Difference between revisions

Line 111: Line 111:
Gold Parsed WIS-L-0011732_lg.csv (and many other lichen gold parsed labels) removes a space from verbatimLatitude and from verbatimLongitude, changing this: 60° 33.579'N into this: 60°33.579'N.  The space removal is inconsistent: it occurs on some labels but not on others.
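
One way to keep this spacing difference from counting as a parsing error is to normalize both values before comparing them. The following is only a minimal sketch: the function names are hypothetical and not part of any hackathon tooling, and it assumes the only difference of interest is whitespace directly after the degree sign.

```python
import re


def normalize_coordinate_spacing(value: str) -> str:
    """Remove any whitespace that directly follows a degree sign."""
    return re.sub(r"°\s+", "°", value.strip())


def same_coordinate(a: str, b: str) -> bool:
    """Compare two verbatim coordinates, ignoring spacing after the degree sign."""
    return normalize_coordinate_spacing(a) == normalize_coordinate_spacing(b)


if __name__ == "__main__":
    # The two forms quoted above compare equal after normalization.
    print(same_coordinate("60° 33.579'N", "60°33.579'N"))
```

Whether the space should be kept or dropped in the gold files themselves is a separate decision; the sketch only makes a comparison insensitive to it.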


Gold Parsed NY01075791_lg.csv converts the "u" in "Mull" to "ü", yielding "Müll".  This matches the original label, but not the Gold OCR file NY01075791_lg.txt, which has "Mull".
Gold Parsed NY01075791_lg.csv converts the "u" in "Mull" to "ü", yielding "Müll".  This matches the original label, but not the Gold OCR file NY01075791_lg.txt, which has "Mull".  The same applies to NY01075792_lg.csv.


'''Gold OCR Errors'''
Line 121: Line 121:
TENN-L-0000029_lg.txt adds a "1" to the scientificName ("Actinogyra muhlenbergii 1 (Ach.) Schol.").


  NY01075791_lg.txt converted "Müll" on the original label NY01075791_lg.jpg to "Mull" (converted umlaut "ü" to "u").  We may want to do this, but if we do, it should be standardized and consistent across all the labels.
  NY01075791_lg.txt converted "Müll" on the original label NY01075791_lg.jpg to "Mull" (converted umlaut "ü" to "u").  We may want to do this, but if we do, it should be standardized and consistent across all the labels.  Same for NY01075791_lg.txt.
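
If the umlaut-to-plain-letter conversion were adopted, applying one function to every label would keep it consistent. The sketch below is only an illustration under that assumption: `fold_diacritics` is a hypothetical name, and it uses Python's standard `unicodedata` module to strip all combining marks, which affects every accented letter and not only "ü".

```python
import unicodedata


def fold_diacritics(text: str) -> str:
    """Return text with combining marks removed, e.g. "ü" becomes "u"."""
    # NFD splits an accented letter into its base letter plus combining marks.
    decomposed = unicodedata.normalize("NFD", text)
    # Keep only the characters that are not combining marks.
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


if __name__ == "__main__":
    print(fold_diacritics("Müll"))
```

Characters with no decomposition, such as the degree sign in the coordinate fields, pass through unchanged.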


== Parameters ==
