UK-SWANSPracticalDigitisation: Difference between revisions

From iDigBio
Jump to navigation Jump to search
 
 
(36 intermediate revisions by the same user not shown)
Line 1: Line 1:
short URL to this page: http://bit.ly/ukswansdigitise
{| class="wikitable" style="float:right;margin: .46em 0 0 .2em;"
{| class="wikitable" style="float:right;margin: .46em 0 0 .2em;"
|-
|-
Line 36: Line 38:
| Welcome & 5 minute stand ups<br>
| Welcome & 5 minute stand ups<br>
:your collection in focus
:your collection in focus
::[[Media:Day_2_SWANS_Morgenroth.pdf|Royal Albert Museum Exeter]] H. Morgenroth
::[[Media:RoyalAlbertMemorialMuseumExeterBurbageH.pdf|Royal Albert Museum Exeter]] H. Burbage
::[[Media:Bristol_Culture_-_Hutchinson_D.pdf|Bristol Culture Museum]] D. Hutchinson
::[[Media:BristolCultureRowsonR.pdf|Bristol Culture Museum]] R. Rowson
::[[Media:PlymouthMuseumFreedmanJ.pdf|Plymouth Museum]] J. Freedman
::[[Media:UniversityBristolEarthSciencesHildebrandtC.pdf|University of Bristol Earth Sciences]] C. Hildebrandt
| All
| All
|-
|-
Line 44: Line 52:
|-
|-
| 9:50
| 9:50
| What is digitisation? Getting to GBIF and beyond
| [[Media:WhatIsDigitization.pdf|What is digitisation?]] Getting to GBIF and beyond
| Greg Riccardi
| Greg Riccardi
|-
|-
| 10:00
| 10:00
| A framework: the 5+ task clusters
| [[Media:Bristol_Framework.pdf|A framework: the 5+ task clusters]]<br/>
[[Media:Bristol_workflows.pdf|Workflows]]
:pre-digitization curation and staging
:pre-digitization curation and staging
:specimen image capture
:specimen image capture
Line 74: Line 83:
|-
|-
| 11:20
| 11:20
| Electronic data capture
| [[Media:DataCapture.pdf|Data capture]]
:best practices, options, lessons learned
:best practices, options, lessons learned
:[https://www.idigbio.org/sites/default/files/working-groups/Herb_Workflows/APPS_Published/S11_Module%2011_%20Data%20Capture.pdf Data Capture workflow tasks] (.pdf) [https://www.dropbox.com/s/6ziinr8xt8bcq2y/Module%2011_%20Data%20Capture.docx?dl=0 docx version]
| Deborah Paul
| Deborah Paul
|-
|-
| 11:50
| 11:50
| Practical: Data standards (Darwin Core+)
| [[Media:DataStandardsOverviewBeforeHandsOnBit.pdf|Practical: Data standards (Darwin Core+)]]
:making data shareable
:making data shareable
| Greg Riccardi
:[http://rs.tdwg.org/dwc/terms/ Darwin Core Terms]
:[https://www.dropbox.com/s/cquahupx3q41phf/Royal%20Albert%20Memorial%20Museum%20Exeter%20-%20sample%20datat%20set.xlsx?dl=0 Royal Albert Dataset]
:[https://www.dropbox.com/s/madclzo2mh7goqj/Mapping_RoyalAlbertExercise.docx?dl=0 Royal Albert Mapping Exercise]
:[https://www.dropbox.com/preview/iDigInfo%20Team%20Folder/Deb/BristolUK_Workshops/day2documents/datasets/ukdata_mapped%20and%20georeferenced%20by%20idigbio/RoyalExeterMappingData/mappingRoyalExeter.xlsx?role=work Royal Albert mapped spreadsheet]
:[https://www.dropbox.com/preview/iDigInfo%20Team%20Folder/Deb/BristolUK_Workshops/day2documents/datasets/UKData_Mapped%20and%20georeferenced%20by%20iDigBio/BristolCulture_Biology-JM.xlsx?role=work Bristol Cultural Museum - Biology]
:[https://www.dropbox.com/preview/iDigInfo%20Team%20Folder/Deb/BristolUK_Workshops/day2documents/datasets/UKData_Mapped%20and%20georeferenced%20by%20iDigBio/UniversityBristolGeologyCollectionsDatabaseExport-CHFeb2018-JM.xlsx?role=work University of Bristol - Geology Collection]
| Greg Riccardi, Deborah Paul
|-
|-
| 13:00
| 13:00
Line 95: Line 111:
|-
|-
| 14:30
| 14:30
| Practical: Geo-referencing specimen data
| [[Media:GeoreferencingChoices_Bristol.pdf|Practical Exercise for Geo-referencing specimen data]]
:getting your specimens on the map
:getting your specimens on the map
:[https://www.idigbio.org/sites/default/files/working-groups/Herb_Workflows/APPS_Published/S13_Module%2013_%20Georeferencing.pdf Georeferencing Task Module] (pdf) (assumes legacy data)
:[https://www.idigbio.org/sites/default/files/workshop-presentations/geotrain/GeoreferencingQuickReferenceGuide20121008.pdf Georeferencing Quick Reference Guide] (pdf)
::[[Media:NHM_georef_guidelines_v4_DerivedFromGeoreferecingQRG.pdf|NHM Version of the Georeferencing QRG]]
:[http://herpnet.org/herpnet/documents/biogeomancerguide.pdf Guide to Best Practices for Georeferencing] - Chapman, A.D. and J. Wieczorek (eds). 2006
:[http://www.idigbio.org/sites/default/files/working-groups/gwg/GoodBadLocalitiesV27Oct2015.doc Capturing New Locality Data - Good-Bad Localities] (doc)
:[https://www.idigbio.org/sites/default/files/workshop-presentations/ttt2/GeoreferencingConceptsandLocalityTypes2013.pptx Georeferencing Concepts and Locality Types] (pptx)
:[[GWG_Second_Train_the_Trainers_Workshop|'''Georeferencing 5-day Course''': materials at iDigBio]]
:[http://www.museum.tulane.edu/geolocate/web/WebFileGeoref.aspx GEOLocate by CSV] web app
:[https://www.dropbox.com/s/guzr6lvycii4h59/DEMO_GEOLocateFileUncert_Practice.csv?dl=0 DEMO localities to georeference]
:[https://doi.org/10.3897/BDJ.4.e9559 iCollections – Digitising the British and Irish Butterflies in the Natural History Museum, London]
:[https://www.dropbox.com/s/8kgv3n7oeqji5vz/InternetResources2016.pptx?dl=0 Internet Resources]
:[http://www.streetmap.co.uk/ Streetmap]
:[http://www.geonames.org/ Geonames]
:[[UK-SWANSPracticalDigitisation#Marine_Georeferencing|Marine Georeferencing Hints]]
| Deborah Paul
| Deborah Paul
|-
|-
| 15:30
| 15:30
| Coffee
| Coffee
|
|-
|-
| 15:50
| 15:50
| Data publishing
| [[Media:Riccardi_sharing_information_long_version.pdf|Data publishing]]
:pathways to publishing
:pathways to publishing
:importance of metadata
:importance of metadata
Line 113: Line 142:
| Community resources
| Community resources
:sharing knowledge of worldwide expertise, materials
:sharing knowledge of worldwide expertise, materials
*[[Digitization_Resources| iDigBio Digitization Resources]]
*[http://gridreferencefinder.com/?xy=427675%7C561912%7CMy%20New%20Point%7C0,427726%7C562026%7CAnother_Point%7C0 Grid Reference Finder]
*[http://hasbrouck.asu.edu/sandbox/index.php? Symbiota Sandbox]
| Deborah Paul, All
| Deborah Paul, All
|-
|-
Line 119: Line 151:
:final surgery
:final surgery
:assessment
:assessment
::See [https://docs.google.com/spreadsheets/d/1Xw23zoLWohMwWPMQvBxlYJMFxiWSiu2fStasVkUIZdw/edit?usp=sharing Mini Self-Assessment Review] and possible next steps
:next steps
:next steps
:group photo
:group photo
Line 127: Line 160:
|  
|  
|}
|}
== Marine Georeferencing ==
This section added post-workshop to address outstanding questions in the group about how to georeference (geocode) marine localities.
=== General guidelines and hints ===
For a locality like "Puget Anne Sound, 10 fathoms deep", suggestions here from the Georeferencing Working Group (iDigBio GWG) and OBIS.
#'''Ship and date of collection''' -- if the name of the ship and the date of collection are known, then one can look up the ship logs/itineraries, and find where that particular ship was on that particular day. Sometimes, even coordinates are given in the ship logs. Other times, the ship log will not include specific data for the collecting event, but flanking events will be included, so one can "interpolate" between the two.
#'''Depth''' -- when just a ballpark area is given, then the depth can help narrow down the uncertainty radius (much like elevation can do this for localities on land).
#'''Reefs or other ocean floor topography''' are always useful, if they are mentioned.
#Only other helpful information I can think of are '''bearings given from lighthouses, or even buoys''' (when georeferencing Fish collections from the Great Lakes, it was common for collectors to give these as reference points).
*Another response to my query shared that marine data is not always so fit for use (better perhaps in OBIS than in other places). So we are looking into how we might help to better reach those using Darwin Core fields for what is expected / needed in particular fields to make these marine localities useful for research.
*Pieter Provoost p.provoost@UNESCO.ORG, from OBIS, georeferenced "Puget Anne Sound, 10 fathoms deep" and shared his process and results here:
**We make extensive use of [http://marineregions.org/ MarineRegions], and we also have a simple map tool at [http://iobis.org/maptool/ Maptool] http://iobis.org/maptool/ which connects to the [http://marineregions.org/ MarineRegions] API for georeferencing and makes it easier to produce WKT strings for points, lines and polygons.
**This is how I would handle "Puget Anne Sound, 10 fathoms deep" for example:
::locationID: http://marineregions.org/mrgid/15254
::verbatimLocality: Puget Sound
::locality: Puget Sound
::verbatimDepth: 10 fathoms deep
::minimumDepthInMeters: 18
::maximumDepthInMeters: 18
::decimalLongitude: -122.43
::decimalLatitude: 47.83
::coordinateUncertaintyInMeters: 20000
::footprintWKT: POLYGON ((-122.40555 47.56726, -122.50168 47.59505, -122.53601 47.91450, -122.32178 47.90438, -122.40555 47.56726))
*From Deb: a bit about the above Darwin Core fields. We discussed most of them (briefly!) at the workshop.
**First, note they have used the locationID field. In the Marine Regions database, certain marine areas have a globally unique identifier. So, using the Marine Regions database, they are populating the locationID field with this unique string. You don't have to use the locationID field, but in situations where the specific region is well defined in Marine Regions it can make good sense to do so - to be very clear about "where" you are. Note that some groups might visit the same marine location repeatedly (right?). And they may have given that marine locality a name, like "Station 67." That's a "local ID" good for communicating a locality to this specific group. IF, "Station 67" provides a unique locationID within the dataset you are publishing, you can put "Station 67" in the dwc:locationID field. In this case, you'll often have local knowledge (maps, journals, ships logs, etc.) about the exact area that is meant by "Station 67" - that you can / will use to georeference that entry.
**It's clear what Pieter put for verbatimLocality, locality, and verbatimDepth. You can see then that he converted fathoms to meters.
**Next, he placed a point in the center of Puget Sound and gave it an uncertainty. In essence he put a circle around the point - to enclose all the possible places in Puget Sound - where this specimen could be collected. Note that the value is really a "radius" of the circle (that has as its center, the latitude and longitude provided). The footprintWKT is a set of x,y coordinates that when joined by lines produces a polygon around the lat/lon point. Often polygons allow one to reduce the uncertainty compared to putting a circle around a point. But, they can get quite large (and so difficult to store in a spreadsheet or in a database). The take-home msg for this part is to talk to your IT people.
Hope you find this useful. Note that Pieter shares they'll add some more examples to the [http://iobis.org/manual/darwincore/#location OBIS manual].
Some more potentially useful bits I found:
*[http://iobis.org/manual/darwincore/#location OBIS Manual - pages for Location]
*Short training initiative 2016 http://classroom.oceanteacher.org/pluginfile.php/12805/mod_resource/content/1/Presenting%20the%20trainers.pdf
*OBIS http://www.iobis.org/ http://classroom.oceanteacher.org/course/view.php?id=319
*OBIS Simple map tool at http://iobis.org/maptool/
*Geocoding using the OBIS map tool http://classroom.oceanteacher.org/mod/lesson/view.php?id=7747 (not working yet?)
*Visualize data points (this is very cool!) http://iobis.github.io/plotter/
*Georeferencer (overlay old maps with current day!) https://www.georeferencer.com/
*And Jessica Utrup (jessica.bazeley AT YALE.EDU) added:
**NOAA provides amazing nautical charts which often have marine "landmarks" (rock formations, canyons, points, etc.) that aren’t usually found online. And what’s nice, the names almost never change. So if you have an old locality that says "off Such & Such Point" the name probably hasn’t changed in 100 years. Also, the depth information can help limit the possible localities. Here is a site that provides online charts for free.
***http://www.oceangrafix.com/


[[Category:Workshop]][[Category: Biodiversity informatics]][[Category:Imaging]][[Category:Project management]][[Category:Data use]][[Category: Collections]][[Category:Digitisation]][[Category:International collaboration]]
[[Category:Workshop]][[Category: Biodiversity informatics]][[Category:Imaging]][[Category:Project management]][[Category:Data use]][[Category: Collections]][[Category:Digitisation]][[Category:International collaboration]]

Latest revision as of 14:21, 3 April 2018

short URL to this page: http://bit.ly/ukswansdigitise

UK-SWANS Practical Digitisation Workshop
Quick Links
Swanslogo.png
Date: 9 March 2018, 8:45 AM - 16:45 PM
Agenda
Abstract

Abstract

UK-SWANS Practical Digitisation Workshop. Participants at this event are staff caring for small and regional collections, essentially, non-national museum curators. This workshop will key in on ideas, models, and training for incorporating digitization at this level. The goals are to focus on practical recommendations that require very little in the way of additional budget or expertise where possible. Practical training may be offered in one or two key areas, for example, georeferencing, data standards, and review of recommendations based on the Five Task Clusters paper (Nelson, et al 2012). Future plans include repeating this event for other regional (non-national museum) UK curators.

Organizers

This one-day event is funded by the John Ellerman Foundation and led by the UK's Bristol City Council's Culture Team as part of the 'South West Area Natural Sciences' (SWANS) project. Organizers include Bristol Culture, iDigBio and the Natural History Museum, London. People coordinating this workshop include: Isla Gladstone, Deborah Paul, Greg Riccardi, and Gil Nelson.

Agenda

group notes google doc

Time Topic Presenter / Facilitator
8:45 Coffee
9:00 Welcome & 5 minute stand ups
your collection in focus
Royal Albert Museum Exeter H. Morgenroth
Royal Albert Museum Exeter H. Burbage
Bristol Culture Museum D. Hutchinson
Bristol Culture Museum R. Rowson
Plymouth Museum J. Freedman
University of Bristol Earth Sciences C. Hildebrandt
All
9:45 Workshop goals & structure
why are we here?
Isla Gladstone
9:50 What is digitisation? Getting to GBIF and beyond Greg Riccardi
10:00 A framework: the 5+ task clusters

Workflows

pre-digitization curation and staging
specimen image capture
specimen image processing
electronic data capture
georeferencing specimen data
+ personnel
+ workflows
+ biodiversity informatics managers (or not)
Gil Nelson
10:05 Pre-digitisation curation
decisions, decisions, practical and policy
10:30 Specimen image capture & processing
best practices, current trends
Gil Nelson
11:00 Coffee
11:20 Data capture
best practices, options, lessons learned
Data Capture workflow tasks (.pdf) docx version
Deborah Paul
11:50 Practical: Data standards (Darwin Core+)
making data shareable
Darwin Core Terms
Royal Albert Dataset
Royal Albert Mapping Exercise
Royal Albert mapped spreadsheet
Bristol Cultural Museum - Biology
University of Bristol - Geology Collection
Greg Riccardi, Deborah Paul
13:00 Lunch
14:00 Managing data, a new resource
who will do it?
workflows documentation and sharing
personnel
Gil Nelson
14:30 Practical Exercise for Geo-referencing specimen data
getting your specimens on the map
Georeferencing Task Module (pdf) (assumes legacy data)
Georeferencing Quick Reference Guide (pdf)
NHM Version of the Georeferencing QRG
Guide to Best Practices for Georeferencing - Chapman, A.D. and J. Wieczorek (eds). 2006
Capturing New Locality Data - Good-Bad Localities (doc)
Georeferencing Concepts and Locality Types (pptx)
Georeferencing 5-day Course: materials at iDigBio
GEOLocate by CSV web app
DEMO localities to georeference
iCollections – Digitising the British and Irish Butterflies in the Natural History Museum, London
Internet Resources
Streetmap
Geonames
Marine Georeferencing Hints
Deborah Paul
15:30 Coffee
15:50 Data publishing
pathways to publishing
importance of metadata
expectations, benefits, research
Greg Riccardi
16:20 Community resources
sharing knowledge of worldwide expertise, materials
Deborah Paul, All
16:30 Wrap up
final surgery
assessment
See Mini Self-Assessment Review and possible next steps
next steps
group photo
16:45 Close

Marine Georeferencing

This section added post-workshop to address outstanding questions in the group about how to georeference (geocode) marine localities.

General guidelines and hints

For a locality like "Puget Anne Sound, 10 fathoms deep", suggestions here from the Georeferencing Working Group (iDigBio GWG) and OBIS.

  1. Ship and date of collection -- if the name of the ship and the date of collection are known, then one can look up the ship logs/itineraries, and find where that particular ship was on that particular day. Sometimes, even coordinates are given in the ship logs. Other times, the ship log will not include specific data for the collecting event, but flanking events will be included, so one can "interpolate" between the two.
  2. Depth -- when just a ballpark area is given, then the depth can help narrow down the uncertainty radius (much like elevation can do this for localities on land).
  3. Reefs or other ocean floor topography are always useful, if they are mentioned.
  4. Only other helpful information I can think of are bearings given from lighthouses, or even buoys (when georeferencing Fish collections from the Great Lakes, it was common for collectors to give these as reference points).
  • Another response to my query shared that marine data is not always so fit for use (better perhaps in OBIS than in other places). So we are looking into how we might help to better reach those using Darwin Core fields for what is expected / needed in particular fields to make these marine localities useful for research.
  • Pieter Provoost p.provoost@UNESCO.ORG, from OBIS, georeferenced "Puget Anne Sound, 10 fathoms deep" and shared his process and results here:
    • We make extensive use of MarineRegions, and we also have a simple map tool at Maptool http://iobis.org/maptool/ which connects to the MarineRegions API for georeferencing and makes it easier to produce WKT strings for points, lines and polygons.
    • This is how I would handle "Puget Anne Sound, 10 fathoms deep" for example:
locationID: http://marineregions.org/mrgid/15254
verbatimLocality: Puget Sound
locality: Puget Sound
verbatimDepth: 10 fathoms deep
minimumDepthInMeters: 18
maximumDepthInMeters: 18
decimalLongitude: -122.43
decimalLatitude: 47.83
coordinateUncertaintyInMeters: 20000
footprintWKT: POLYGON ((-122.40555 47.56726, -122.50168 47.59505, -122.53601 47.91450, -122.32178 47.90438, -122.40555 47.56726))
  • From Deb: a bit about the above Darwin Core fields. We discussed most of them (briefly!) at the workshop.
    • First, note they have used the locationID field. In the Marine Regions database, certain marine areas have a globally unique identifier. So, using the Marine Regions database, they are populating the locationID field with this unique string. You don't have to use the locationID field, but in situations where the specific region is well defined in Marine Regions it can make good sense to do so - to be very clear about "where" you are. Note that some groups might visit the same marine location repeatedly (right?). And they may have given that marine locality a name, like "Station 67." That's a "local ID" good for communicating a locality to this specific group. IF, "Station 67" provides a unique locationID within the dataset you are publishing, you can put "Station 67" in the dwc:locationID field. In this case, you'll often have local knowledge (maps, journals, ships logs, etc.) about the exact area that is meant by "Station 67" - that you can / will use to georeference that entry.
    • It's clear what Pieter put for verbatimLocality, locality, and verbatimDepth. You can see then that he converted fathoms to meters.
    • Next, he placed a point in the center of Puget Sound and gave it an uncertainty. In essence he put a circle around the point - to enclose all the possible places in Puget Sound - where this specimen could be collected. Note that the value is really a "radius" of the circle (that has as its center, the latitude and longitude provided). The footprintWKT is a set of x,y coordinates that when joined by lines produces a polygon around the lat/lon point. Often polygons allow one to reduce the uncertainty compared to putting a circle around a point. But, they can get quite large (and so difficult to store in a spreadsheet or in a database). The take-home msg for this part is to talk to your IT people.

Hope you find this useful. Note that Pieter shares they'll add some more examples to the OBIS manual.

Some more potentially useful bits I found:

  • And Jessica Utrup (jessica.bazeley AT YALE.EDU) added:
    • NOAA provides amazing nautical charts which often have marine "landmarks" (rock formations, canyons, points, etc.) that aren’t usually found online. And what’s nice, the names almost never change. So if you have an old locality that says "off Such & Such Point" the name probably hasn’t changed in 100 years. Also, the depth information can help limit the possible localities. Here is a site that provides online charts for free.