Data Quality Toolkit 2024: Difference between revisions



== Geography ==
=== Improperly Negated Latitudes/Longitudes ===
'''Problem:''' The sign of the latitude (decimalLatitude) or longitude (decimalLongitude) does not match the hemisphere of the given country. For example, all longitudes in the contiguous United States should be negative.
'''How to FIND this Problem in Your Dataset:'''
* '''Arctos:'''
* '''Excel:'''
* '''OpenRefine:'''
* '''Specify:'''
* '''Symbiota:'''
* '''TaxonWorks:'''
'''How to FIX this Problem in your Dataset:'''
* '''Arctos:'''
* '''Excel:'''
* '''OpenRefine:'''
* '''Specify:'''
* '''Symbiota:'''
* '''TaxonWorks:'''
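For datasets exported to a flat file, the check described above can be sketched in Python. This is a minimal illustration rather than a feature of any of the tools listed: it assumes each record is a dictionary keyed by the Darwin Core terms <code>country</code>, <code>decimalLatitude</code>, and <code>decimalLongitude</code>, and the <code>EXPECTED_SIGNS</code> lookup and <code>sign_mismatches</code> function are hypothetical names introduced here. The lookup covers only a few countries that lie entirely within one pair of hemispheres; countries that span the equator, the prime meridian, or the 180th meridian are deliberately left out.

```python
# Hypothetical lookup: expected sign (+1 or -1) of latitude and longitude
# for a few countries that lie entirely within one pair of hemispheres.
EXPECTED_SIGNS = {
    "Canada": (1, -1),
    "Mexico": (1, -1),
    "Australia": (-1, 1),
    "Argentina": (-1, -1),
}


def sign_mismatches(record):
    """Return the coordinate fields whose sign contradicts the record's country."""
    expected = EXPECTED_SIGNS.get(record.get("country"))
    if expected is None:
        return []  # country not in the lookup, so nothing can be checked
    problems = []
    fields = ("decimalLatitude", "decimalLongitude")
    for field, wanted_sign in zip(fields, expected):
        value = record.get(field)
        if value is None or value == "":
            continue  # missing values are a separate check
        try:
            number = float(value)
        except (TypeError, ValueError):
            continue  # non-numeric values are a separate check
        # Zero has no sign, so only non-zero values can mismatch.
        if number != 0 and (number > 0) != (wanted_sign > 0):
            problems.append(field)
    return problems
```

Flagged records should still be reviewed by a person before any sign is changed, because the country value may be the field that is wrong.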
=== Missing Latitudes/Longitudes ===
'''Problem:''' A record has a latitude value but no longitude value, or a longitude value but no latitude value.
'''How to FIND this Problem in Your Dataset:'''
* '''Arctos:'''
* '''Excel:'''
* '''OpenRefine:'''
* '''Specify:'''
* '''Symbiota:''' Use the [https://biokic.github.io/symbiota-docs/editor/edit/ Record Search form]. For Custom Field 1, select Decimal Latitude IS NULL. For Custom Field 2, select Decimal Longitude IS NOT NULL. Then conduct a similar search with Decimal Latitude IS NOT NULL and Decimal Longitude IS NULL.
* '''TaxonWorks:'''
'''How to FIX this Problem in your Dataset:'''
* '''Arctos:'''
* '''Excel:'''
* '''OpenRefine:'''
* '''Specify:'''
* '''Symbiota:''' Batch fixing is not possible. Review each record and either add the missing latitude or longitude value or remove the orphaned value.
* '''TaxonWorks:'''
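Outside of the tools listed, the same check can be run on exported records with a short script. The sketch below is one possible approach, assuming each record is a dictionary keyed by the Darwin Core terms <code>decimalLatitude</code> and <code>decimalLongitude</code>; the function names <code>is_blank</code> and <code>orphaned_coordinate</code> are hypothetical and introduced only for illustration.

```python
def is_blank(value):
    """Treat None, empty strings, and whitespace-only strings as missing."""
    return value is None or str(value).strip() == ""


def orphaned_coordinate(record):
    """Return the name of the missing field when exactly one coordinate is present.

    Returns None when both coordinates are present or both are missing.
    """
    lat_missing = is_blank(record.get("decimalLatitude"))
    lon_missing = is_blank(record.get("decimalLongitude"))
    if lat_missing == lon_missing:
        return None
    return "decimalLatitude" if lat_missing else "decimalLongitude"
```

A record that returns a field name has an orphaned coordinate and needs manual review, either to supply the missing value or to remove the value that is present.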


=== Misspelled Geographic Unit Names ===