Data Ingestion Guidance: Difference between revisions

No edit summary
Line 5: Line 5:
= Data Ingestion Workflow  =


Working copy 1.1 (June 2015)
Working copy 1.2 (July 2015)


Audience: iDigBio data ingestion staff and data providers
Line 33: Line 33:
#Custom RSS feed following the guidance at: [[CYWG iDigBio DwC-A Pull Ingestion| iDigBio RSS specification]]


* Standard DwC-A uses field names from:
* DwC-A uses field names from:
** Darwin Core: http://rs.tdwg.org/dwc/terms/
** Specimen: Darwin Core: http://rs.tdwg.org/dwc/terms/
** Audubon Core: http://terms.tdwg.org/wiki/Audubon_Core_Term_List
** Media: Audubon Core: http://terms.tdwg.org/wiki/Audubon_Core_Term_List


* A custom CSV allows providers to send data beyond standards such as Dublin Core and Darwin Core. For example, providers can send tribe taxonomic information in the field "idigbio:tribe". When creating additional fields, use field names that follow the DwC format (camel case), and consult the [[MISC-Authority-File-Working-Group#Data_Element_Lists_by_Data_Model_Concept|MISC field names]] (local iDigBio extensions to DwC). The host association terms are an example of an extension found in the MISC. Use XML-style field names that include the domain (namespace prefix) of the schema, e.g., dwc:termName, ac:termName. Non-standard field names are indexed and available through the search API.
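
The naming guidance above can be sketched as a quick pre-submission check on a CSV header. This is only an illustration and not iDigBio's own validation: the regular expression, the function name <code>nonconforming_fields</code>, and the sample row are assumptions made for this example. It treats a field name as conforming when it has a lowercase prefix, a colon, and a camel-case term name.

```python
import csv
import io
import re

# Assumed pattern for this sketch: lowercase prefix, colon, camel-case term
# name (e.g. dwc:catalogNumber, ac:accessURI, idigbio:tribe).
FIELD_NAME = re.compile(r"^[a-z][a-z0-9]*:[a-z][A-Za-z0-9]*$")


def nonconforming_fields(header):
    """Return the header names that do not look like prefix:camelCase."""
    return [name for name in header if not FIELD_NAME.match(name)]


# Hypothetical custom CSV with one standard-style extension field
# (idigbio:tribe) and one field that ignores the naming guidance.
sample = io.StringIO(
    "dwc:catalogNumber,dwc:scientificName,idigbio:tribe,Host Plant\n"
    "A-1,Apis mellifera,Apini,clover\n"
)
header = next(csv.reader(sample))
bad = nonconforming_fields(header)
print(bad)
```

A check like this only flags names that break the prefix and camel-case convention; it does not confirm that a term actually exists in Darwin Core, Audubon Core, or the MISC lists.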