Managing Natural History Collections Data for Global Discoverability: Difference between revisions

Jump to navigation Jump to search
m
Line 83: Line 83:
|-
|-
|09:15-9:35
|09:15-9:35
|General Concepts and Best Practices<br/>
|General Concepts and Best Practices
:brief introduction to data modeling, the data life-cycle, and relational databases
:brief introduction to data modeling, the data life-cycle, and relational databases
|Ed Gilbert and Amber Budden
|Ed Gilbert and Amber Budden
|-
|-
|9:35-9:55
|9:35-9:55
|Overview of Data standards<br/>
|Overview of Data standards
:Darwin Core, EML, Audubon Core, GGBN, DwC-A, Identifiers (GUIDs vs local)
:Darwin Core, EML, Audubon Core, GGBN, DwC-A, Identifiers (GUIDs vs local)
|Ed Gilbert, Deb Paul
|Ed Gilbert, Deb Paul
|-
|-
|10:00-10:30
|10:00-10:30
|Hands-on Exercise with Specimen Data Set<br/>
|Hands-on Exercise with Specimen Data Set
:data set with known mapping / standardization issues.
:data set with known mapping / standardization issues.
|All
|All
Line 102: Line 102:
|-
|-
|10:50-11:30
|10:50-11:30
|Data Management Planning<br/>
|Data Management Planning
:choosing a database, data flow, data backup, field-to-database, metadata
:choosing a database, data flow, data backup, field-to-database, metadata
|Amber Budden
|Amber Budden
Line 115: Line 115:
|-
|-
|1:00-1:30
|1:00-1:30
|Images and media issues: a brief intro<br/>
|Images and media issues: a brief intro
:choosing a camera, issues across different database platforms, image submissions, linking images to occurrence records, batch processing
:choosing a camera, issues across different database platforms, image submissions, linking images to occurrence records, batch processing
|Ed Gilbert
|Ed Gilbert
|-
|-
|1:30-1:50
|1:30-1:50
|Digitization workflows and process<br/>
|Digitization workflows and process
:getting started, prioritization, specimen collecting, new database, integrating old data
:getting started, prioritization, specimen collecting, new database, integrating old data
|Deb Paul, Ed Gilbert & Katja Seltmann
|Deb Paul, Ed Gilbert & Katja Seltmann
|-
|-
|1:50-2:10
|1:50-2:10
|Common Workflows<br/>
|Common Workflows
:image to data, specimen to data, skeletal records, crowd-sourcing, OCR/NLP, georeferencing, metadata
:image to data, specimen to data, skeletal records, crowd-sourcing, OCR/NLP, georeferencing, metadata
|Deb Paul, Ed Gilbert & Katja Seltmann
|Deb Paul, Ed Gilbert & Katja Seltmann
|-
|-
|2:10-2:25
|2:10-2:25
|Optimization:
|Optimization  
:Reviewing your own workflow, common bottlenecks, policy, documentation  
:Reviewing your own workflow, common bottlenecks, policy, documentation  
|Katja Seltmann, Deb Paul & Ed Gilbert
|Katja Seltmann, Deb Paul & Ed Gilbert
Line 143: Line 143:
|-
|-
|3:20-3:50
|3:20-3:50
|Georeferencing Data (Georeferencing Workflow)<br/>
|Georeferencing Data (Georeferencing Workflow)
:visualization tools, when to georeference, best practices
:visualization tools, when to georeference, best practices
|Ed Gilbert
|Ed Gilbert
|-
|-
|3:50-4:10
|3:50-4:10
|GEOLocate Exercise (May be DEMO)<br/>
|GEOLocate Exercise (May be DEMO)
:CoGe, GPS Visualizer, re-integration, qc
:CoGe, GPS Visualizer, re-integration, qc
|Ed Gilbert
|Ed Gilbert
Line 162: Line 162:
|-
|-
|8:30-12:00
|8:30-12:00
|[http://www.dbg.org/ Desert Botanical Garden (DBG) Field Trip] and Lunch<br/>meet at 8:30 in Hotel Lobby, depart at 8:40 for DBG; garden from 9-11:30, lunch 11:30 - 12:30, depart 12:40 to ASU
|[http://www.dbg.org/ Desert Botanical Garden (DBG) Field Trip] and Lunch
:meet at 8:30 in Hotel Lobby, depart at 8:40 for DBG; garden from 9-11:30, lunch 11:30 - 12:30, depart 12:40 to ASU
|  
|  
|-
|-
Line 170: Line 171:
|-
|-
|1:00-1:25
|1:00-1:25
|Welcome Back and Intro to Data Quality<br/>
|Welcome Back and Intro to Data Quality
:inside the data-life-cycle, cost of data quality, quality vs completeness
:inside the data-life-cycle, cost of data quality, quality vs completeness
:volunteer presentations
|Amber Budden, Ed Gilbert
|Amber Budden, Ed Gilbert
|-
|-
|1:25-1:40
|1:25-1:40
|Data Cleaning<br/>
|Data Cleaning
:where, when and how does it happen?, what kind of feedback to expect
:where, when and how does it happen?, what kind of feedback to expect
:types of common errors and omissions, best practices strategies, feedback and annotation, error tracking, automation, policies and protocols  
:types of common errors and omissions, best practices strategies, feedback and annotation, error tracking, automation, policies and protocols  
Line 181: Line 183:
|-
|-
|1:40-2:20
|1:40-2:20
|Data Cleaning Exercise I<br/>
|Data Cleaning Exercise I
:(opt: quick exercise - spot the snafus)
:(opt: quick exercise - spot the snafus)
:better spreadsheet skills (Data Carpentry)
:better spreadsheet skills (Data Carpentry)
Line 187: Line 189:
|-
|-
|2:20-2:50
|2:20-2:50
|Data Cleaning Exercise II<br/>
|Data Cleaning Exercise II
:Open Refine, part I (facets, clustering)
:Open Refine, part I (facets, clustering)
|Deb Paul & Katja Seltmann
|Deb Paul & Katja Seltmann
Line 224: Line 226:
|-
|-
|10:35-12:00
|10:35-12:00
|Break out groups<br/>TNRS,ECAT,QGIS,GEOLocate,CoGe,Data Cleaning: what is scripting? what is regex? examples in Open Refine, possibly in Symbiota, your own data issues / requests,Data Cleaning Exercise II - using Open Refine, part II (Using APIs, Taxonomic Name Resolution Services)
|Break out groups
:TNRS,ECAT,QGIS,GEOLocate,CoGe,Data Cleaning: what is scripting? what is regex? examples in Open Refine, possibly in Symbiota, your own data issues / requests,Data Cleaning Exercise II - using Open Refine, part II (Using APIs, Taxonomic Name Resolution Services)
|All
|All
|-
|-
Line 232: Line 235:
|-
|-
|1:00-1:25
|1:00-1:25
|Data Publishing: in the context of the data life cycle<br/>benefits, concerns, aggregators, citation, attribution
|Data Publishing: in the context of the data life cycle
:benefits, concerns, aggregators, citation, attribution
| tbd
| tbd
|-
|-
Line 248: Line 252:
|-
|-
|3:20-4:20
|3:20-4:20
|Second round of break-out groups<br/>DWC-A publishing Exercise (or DEMO): using IPT instance OR Symbiota DwC-A mapping and publishing exercise
|Second round of break-out groups
:DWC-A publishing Exercise (or DEMO): using IPT instance OR
:Symbiota DwC-A mapping and publishing exercise,
:others
|
|
|-
|-
|4:20-4:40
|4:20-4:40
|Closing topics<br/>a greater network, the global landscape, next steps
|Closing topics
:a greater network, the global landscape, next steps
|Katja Seltmann & Nico Franz
|Katja Seltmann & Nico Franz
|-
|-
Line 260: Line 268:
|-
|-
|5:10 - 5:30
|5:10 - 5:30
|Review Data Life Cycle we’ve walked through.<br/>discussion, survey, next steps, and conclusions
|Review Data Life Cycle we’ve walked through.
:discussion, survey, next steps, and conclusions
|all
|all
|}
|}
4,713

edits

Navigation menu