Fixed Dataset Errata
Notes on Errata Fixed
Errors noted below are fixed
- D. Lafferty Label NY01075759_lg.txt has authority (part of verbatimScientificName) as: "Kocourková & F. Berger". Gold Parsed NY01075759_lg.csv has "Kocourkova & F. Berger", without the accent on the "a". (Or should we convert foreign characters to English characters???)
- (Bryan: All "special characters should be preserved by using UTF-8)
- (Ed: Accented "á" fixed)
- Gold label NY01075763_lg.txt has Pyrenidium actinellurn, should be Pyrenidium actinellum. Gold Parsed copies the error verbatim (as it should) and needs to be corrected if the .txt file is corrected.
- /home/aocr/datasets/lichens/gold/outputs/human/NY01075763_lg.txt fixed --Dpaul 17:28, 26 February 2013 (EST)
- /home/aocr/datasets/lichens/gold/parsed/human/NY01075763_lg.csv fixed --Dpaul 17:28, 26 February 2013 (EST)
- /webroot/datasets/lichens/gold/ocr/NY01075763_lg.txt fixed --Dpaul 16:33, 27 February 2013 (EST)
- /webroot/datasets/lichens/gold/parsed/NY01075763_lg.csv fixed --Dpaul 16:33, 27 February 2013 (EST)
- datasets/lichens/gold/ocr/WIS-L-0012040_lg.txt: Longitude recorded as L49 (capitalized for clarity) instead of 149