Data Ingestion Guidance: Difference between revisions



==Specimen metadata==
*Each specimen record should have an identifier in the ''dwc:occurrenceID'' field that is unique within the dataset. When the ingestion software detects duplicate identifiers, the duplicated records are flagged as an error and are not ingested. This is the most common reason for records to be rejected. Identifiers that are not GUIDs (or, more specifically, UUIDs) typically take the form known as the DwC (Darwin Core) triplet:<br>
  <''dwc:institutionCode''>:<''dwc:collectionCode''>:<''dwc:catalogNumber''>
Example with a prefix: <pre>urn:catalog:TNHC:Herpetology:122</pre>
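
Data providers may want to build triplet identifiers and check for duplicates before submitting a dataset. The following is a minimal Python sketch of both steps, not the ingestion software's actual code. The function names (<code>dwc_triplet</code>, <code>find_duplicate_ids</code>, <code>split_records</code>) and the dictionary-based record layout are hypothetical. The sketch also assumes that every record sharing a duplicated identifier is rejected; the real ingestion behavior may differ.

```python
from collections import Counter


def dwc_triplet(institution_code, collection_code, catalog_number,
                prefix="urn:catalog"):
    """Join the three Darwin Core fields with colons.

    The optional prefix mirrors the 'urn:catalog' example above;
    pass prefix=None to get the bare triplet.
    """
    parts = [institution_code, collection_code, str(catalog_number)]
    if prefix:
        parts.insert(0, prefix)
    return ":".join(parts)


def find_duplicate_ids(records):
    """Return the set of occurrenceID values that appear more than once."""
    counts = Counter(record["occurrenceID"] for record in records)
    return {occurrence_id for occurrence_id, n in counts.items() if n > 1}


def split_records(records):
    """Separate records with unique identifiers from those with duplicates.

    Assumption: all records sharing a duplicated identifier are rejected.
    """
    duplicates = find_duplicate_ids(records)
    accepted = [r for r in records if r["occurrenceID"] not in duplicates]
    rejected = [r for r in records if r["occurrenceID"] in duplicates]
    return accepted, rejected


if __name__ == "__main__":
    records = [
        {"occurrenceID": dwc_triplet("TNHC", "Herpetology", 122)},
        {"occurrenceID": dwc_triplet("TNHC", "Herpetology", 123)},
        {"occurrenceID": dwc_triplet("TNHC", "Herpetology", 123)},
    ]
    accepted, rejected = split_records(records)
    print(len(accepted), "accepted;", len(rejected), "rejected")
```

Running a check like this locally before submission helps catch the duplicate-identifier problem described above.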