Preparing a dataset in order to publish on an aggregator like Canadensys or GBIF or iDigBio can be quite fastidious, but data cleaning and standardisation are the key points to good quality and useful datasets. Fortunately, a lot of tools and protocols have been develop through time to help preparing biodiversity datasets.
This workshop aims to give curators, collection managers, students, and everyone willing to share their data on GBIF, a set of useful tools and methodologies to go from their in-house database to the publication of data on GBIF.
We will even go further, and discover how to search and use data published on Canadensys, which is now based on the Atlas of Living Australia framework, and on GBIF.
In particular, the workshop will focus on:
- Data cleaning with Open Refine
- How to map data to DarwinCore
- How to prepare a DarwinCore Archive on an IPT
- GBIF tools: Data Validator, Species matching, Datasets Metrics
- How to search and use the Canadensys and GBIF Explorers
- Open Data usage
More information about the workshop is available here and here.
All the material developed for the workshop (thanks to the help from the GBIF Secretariat and GBIF Spain) is available here.
This workshop is funded by the GBIF CESP Program, and is organized by Canadensys, GBIF, and iDigBio in Ottawa from 7 - 9 November 2019.