Dataset Errata: Difference between revisions

Line 139:
NY01075779_lg habitat concatenation (Bryan: "on Protoblastenia rupestris" appears before the location and habitat section of the label. However, habitat says "dolomite rock along lake shore and adjacent Thuja forest; on Protoblastenia rupestris". There was a period after "forest". The period was removed and a ";" added. Then the "on Protoblastenia rupestris" from the earlier part of the label was concatenated.)
:::changed dwc:habitat to:
<pre>on Protoblastenia rupestris dolomite rock along lake shore and adjacent Thuja forest.</pre> --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075780_lg NEW YORK BOTANICAL GARDEN (Bryan: the label said "GARDEN". The OCR said "CARDEN". Silver should be "CARDEN"; Gold should be "GARDEN".)
:::the OCR I see says "Garden" in /home/aocr/webroot/datasets/lichens/silver/ocr --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)
:::fixed the Gold (changed from Carden to Garden), left the silver alone.--[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075789_lg catalogNumber (NY01075789) in the csv file; but it is (01075789) in the text file.  
:::fixed in gold csv  (path is /home/aocr/webroot/datasets/lichens/gold/parsed) --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075797_lg recordedBy ( William Russell Buck) in the csv file; but it is (William R. Buck) in the text file.
:::fixed in gold csv (path is /home/aocr/webroot/datasets/lichens/gold/parsed) --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075805_lg stateProvince (South Carolina) in the csv file; but it is (S.C.) in the text file.
:::changed to S.C. in the csv --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075812_lg recordedBy( William Russell Buck) in the csv file; but it is (William R. Buck) in the text file.  
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075816_lg recordedBy( William Russell Buck) in the csv file; but it is (William R. Buck) in the text file.  
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075817_lg recordedBy( William Russell Buck) in the csv file; but it is (William R. Buck) in the text file.
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075818_lg no scientificName
::: not null in the csv record I see in /home/aocr/webroot/datasets/lichens/gold/parsed
:::but the umlaut was missing from the txt file and the csv -- so I fixed that. --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075819_lg recordedBy( William Russell Buck) in the csv file; but it is (William R. Buck) in the text file.
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075820_lg recordedBy( William Russell Buck) in the csv file; but it is (William R. Buck) in the text file.
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075821_lg scientificName (null)  
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075821_lg no scientificName  
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075822_lg no scientificName
:::fixed --[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


NY01075823_lg identifiedBy (Bryan: ?? I do not see the problem)
:::From Deb. We (the herb set) did put determinations on a separate line, as done in this file. We did not do it the same way, however. Need to discuss.--[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)
:::I did put in umlauts for the u's in the name Müll. (they are in the image, not in the csv or txt, but they should be).--[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)


TODO: fix all instances of Mull to Müll in the txt and csv gold lichen files at paths
/home/aocr/webroot/datasets/lichens/gold/ocr and
/home/aocr/webroot/datasets/lichens/gold/parsed
--[[User:Dpaul|Dpaul]] 23:19, 30 June 2013 (EDT)
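
One possible way to carry out the TODO above, offered only as a rough sketch and not as the procedure actually used on the gold set. The function names <code>fix_mull</code> and <code>fix_tree</code> are hypothetical, and the sketch assumes the gold files are UTF-8 encoded and end in .txt or .csv; neither assumption is confirmed on this page. It replaces "Mull" only where it stands as a whole word, so longer names that merely start with those letters are left alone.

```python
import re
from pathlib import Path

# Whole-word match: "Mull." is rewritten, "Mullen" and an already
# corrected "Müll." are not. (Assumption: only the standalone author
# abbreviation needs the umlaut.)
MULL = re.compile(r"\bMull\b")


def fix_mull(text):
    """Return text with every standalone 'Mull' replaced by 'Müll'."""
    return MULL.sub("Müll", text)


def fix_tree(root, suffixes=(".txt", ".csv"), encoding="utf-8"):
    """Rewrite matching files under root in place; return the changed paths.

    Hypothetical helper. The encoding default is an assumption about
    the gold files, not something stated in the errata.
    """
    changed = []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file() and path.suffix.lower() in suffixes:
            original = path.read_text(encoding=encoding)
            updated = fix_mull(original)
            if updated != original:
                path.write_text(updated, encoding=encoding)
                changed.append(path)
    return changed
```

Under those assumptions, calling <code>fix_tree("/home/aocr/webroot/datasets/lichens/gold/ocr")</code> and then the same for the parsed directory would cover both paths named in the TODO. Working on a copy first and reviewing the returned list of changed files would be prudent, since the silver set is meant to keep the raw OCR text.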


TENN-L-0000001_lg verbatimLocality mixed with verbatimElevation  