4,713
edits
Line 2: | Line 2: | ||
::Gold label NY01075763_lg.txt has Pyrenidium actinellurn, should be Pyrenidium actinellum. Gold Parsed copies the error verbatim (as it should) and needs to be corrected if the .txt file is corrected. | ::Gold label NY01075763_lg.txt has Pyrenidium actinellurn, should be Pyrenidium actinellum. Gold Parsed copies the error verbatim (as it should) and needs to be corrected if the .txt file is corrected. | ||
<br> | |||
::::/home/aocr/datasets/lichens/gold/outputs/human/NY01075763_lg.txt fixed --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST) | ::::/home/aocr/datasets/lichens/gold/outputs/human/NY01075763_lg.txt fixed --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST) | ||
::::/home/aocr/datasets/lichens/gold/parsed/human/NY01075763_lg.csv fixed --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST) | ::::/home/aocr/datasets/lichens/gold/parsed/human/NY01075763_lg.csv fixed --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST) | ||
Line 8: | Line 9: | ||
::Inconsistency in capitalization of verbatim fields in many Gold Parsed lichens. Example: NY01075763_lg.csv. In the label and OCR text the county is capitalized as ST. FRANCOIS, but in NY01075763_lg.csv it is title case: St. Francois. The state MISSOURI is capitalized in both the .txt and the .csv file. The scoring program is case sensitive, so any difference between the gold .csv and the program generated .csv will be marked wrong. | ::Inconsistency in capitalization of verbatim fields in many Gold Parsed lichens. Example: NY01075763_lg.csv. In the label and OCR text the county is capitalized as ST. FRANCOIS, but in NY01075763_lg.csv it is title case: St. Francois. The state MISSOURI is capitalized in both the .txt and the .csv file. The scoring program is case sensitive, so any difference between the gold .csv and the program generated .csv will be marked wrong. | ||
<br> | |||
::::Alex will change the metrics to be case-insensitive. --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST) | ::::Alex will change the metrics to be case-insensitive. --[[User:Dpaul|Dpaul]] 17:28, 26 February 2013 (EST) | ||
::Gold Parsed NY01075759_lg.csv: verbatimEventDate is 1998-04-19, should be 19 April 1998. | ::Gold Parsed NY01075759_lg.csv: verbatimEventDate is 1998-04-19, should be 19 April 1998. | ||
<br> | |||
::::/home/aocr/datasets/lichens/gold/parsed/human/NY01075759_lg.csv fixed --[[User:Dpaul|Dpaul]] 18:06, 26 February 2013 (EST) | ::::/home/aocr/datasets/lichens/gold/parsed/human/NY01075759_lg.csv fixed --[[User:Dpaul|Dpaul]] 18:06, 26 February 2013 (EST) | ||
::::/home/aocr/datasets/lichens/silver/parsed/human/NY01075759_lg.csv fixed --[[User:Dpaul|Dpaul]] 18:06, 26 February 2013 (EST) | ::::/home/aocr/datasets/lichens/silver/parsed/human/NY01075759_lg.csv fixed --[[User:Dpaul|Dpaul]] 18:06, 26 February 2013 (EST) |