|
|
Line 732: |
Line 732: |
| === [[Fixed Dataset Errata]] === | | === [[Fixed Dataset Errata]] === |
| <br> | | <br> |
|
| |
| == Errors noted below are fixed ==
| |
| <br>
| |
| ::::D. Lafferty Label NY01075759_lg.txt has authority (part of verbatimScientificName) as: "Kocourková & F. Berger". Gold Parsed NY01075759_lg.csv has "Kocourkova & F. Berger", without the accent on the "a". (Or should we convert foreign characters to English characters???)
| |
| ::::'''(Bryan:''' All "special characters should be preserved by using UTF-8)
| |
| ::::'''(Ed:''' Accented "á" fixed) <br> <br>
| |
|
| |
| ::Gold label NY01075763_lg.txt has Pyrenidium actinellurn, should be Pyrenidium actinellum. Gold Parsed copies the error verbatim (as it should) and needs to be corrected if the .txt file is corrected.
| |
|
| |
| ::::/home/aocr/datasets/lichens/gold/outputs/human/NY01075763_lg.txt fixed --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST)
| |
| ::::/home/aocr/datasets/lichens/gold/parsed/human/NY01075763_lg.csv fixed --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST)
| |
| ::::/webroot/datasets/lichens/gold/ocr/NY01075763_lg.txt fixed --[[User:Dpaul|Dpaul]] 16:33, 27 February 2013 (EST)
| |
| ::::/webroot/datasets/lichens/gold/parsed/NY01075763_lg.csv fixed --[[User:Dpaul|Dpaul]] 16:33, 27 February 2013 (EST)
| |
|
| |
| ::datasets/lichens/gold/ocr/WIS-L-0012040_lg.txt: Longitude recorded as L49 (capitalized for clarity) instead of 149
| |
|
| |
| ::::/webroot/datasets/lichens/gold/ocr/WIS-L-0012040_lg.txt fixed --[[User:Dpaul|Dpaul]] 16:39, 27 February 2013 (EST)
| |
| ::::/webroot/datasets/lichens/gold/parsed/WIS-L-0012040_lg.csv fixed --[[User:Dpaul|Dpaul]] 16:39, 27 February 2013 (EST)
| |
|
| |
|
| == Unicode Reserved character (single quote) == | | == Unicode Reserved character (single quote) == |