Dataset Errata: Difference between revisions

m
Line 650: Line 650:
==== Silver Parsed CSV WIS Issues ====
==== Silver Parsed CSV WIS Issues ====
WIS-L-0011726_lg character encoding in verbatimScientificName character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates misspelling in verbatimElevation character encoding in recordedBy  
WIS-L-0011726_lg character encoding in verbatimScientificName character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates misspelling in verbatimElevation character encoding in recordedBy  
:::no error seen in verbatimScientificName, copied with all krelb from silver ocr as '''Cetraria ’u.x9\a~i&{cc. (‘Cl''' --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)
:::verbatimLatitude missing a space
:::verbatimElevation fixed (had 1,800 ft, but ocr had a '''dot''' as in 1.800 ft) --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)
:::verbatimLongitude fixed (the ocr had a ? instead of minutes symbol, since silver, removed the minutes symbol and put the ? in) --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)
:::verbatimCoordinates had missing space and same issue as above --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)
:::recordedBy no error seen? --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)
:::changed csv encoded to utf-8 to match ocr text file, commited to GitHub --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)


WIS-L-0011727_lg character encoding in verbatimScientificName separated verbatimLocality into two columns misspelling in verbatimLocality character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates  
WIS-L-0011727_lg character encoding in verbatimScientificName separated verbatimLocality into two columns misspelling in verbatimLocality character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates
:::Looks correctly parsed to me.  --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)
:::I need to check this and the next 2 files to see if csv / txt encoding is same or diff. --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)


WIS-L-0011728_lg character encoding in verbatimScientificName character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates character encoding in habitat  
WIS-L-0011728_lg character encoding in verbatimScientificName character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates character encoding in habitat
:::fixed habitat, other errors not seen by this editor (dp); to be certain, i copied and pasted over from the ocr text into the csv file again for certainty. --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)


WIS-L-0011729_lg separated verbatimLocality into two columns character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates  
WIS-L-0011729_lg separated verbatimLocality into two columns character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates
:::comma removed from verbatimLocality (it's not present in ocr output file) --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)
:::other errors not seen. copied and pasted over again from txt to csv just for certainty. --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)


WIS-L-0011730_lg character encoding in verbatimScientificName character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates misspelling in habitat  
WIS-L-0011730_lg character encoding in verbatimScientificName character encoding in verbatimLatitude character encoding in verbatimLongitude character encoding in verbatimCoordinates misspelling in habitat  
:::csv encoding was ANSI, txt was utf-8. changed csv to utf-8 and fixed encoding issues and other items noted above after changing the csv encoding. --[[User:Dpaul|Dpaul]] 18:01, 19 July 2013 (EDT)


WIS-L-0011731_lg character encoding in verbatimScientificName character encoding in identifiedBy separated verbatimLocality into two columns misspelling in associatedTaxa misspelling in verbatimElevation  
WIS-L-0011731_lg character encoding in verbatimScientificName character encoding in identifiedBy separated verbatimLocality into two columns misspelling in associatedTaxa misspelling in verbatimElevation  
4,707

edits