4,713
edits
m (→Agenda) |
m (→Agenda) |
||
Line 83: | Line 83: | ||
|- | |- | ||
|09:15-9:35 | |09:15-9:35 | ||
|General Concepts and Best Practices | |General Concepts and Best Practices | ||
:brief introduction to data modeling, the data life-cycle, and relational databases | :brief introduction to data modeling, the data life-cycle, and relational databases | ||
|Ed Gilbert and Amber Budden | |Ed Gilbert and Amber Budden | ||
|- | |- | ||
|9:35-9:55 | |9:35-9:55 | ||
|Overview of Data standards | |Overview of Data standards | ||
:Darwin Core, EML, Audubon Core, GGBN, DwC-A, Identifiers (GUIDs vs local) | :Darwin Core, EML, Audubon Core, GGBN, DwC-A, Identifiers (GUIDs vs local) | ||
|Ed Gilbert, Deb Paul | |Ed Gilbert, Deb Paul | ||
|- | |- | ||
|10:00-10:30 | |10:00-10:30 | ||
|Hands-on Exercise with Specimen Data Set | |Hands-on Exercise with Specimen Data Set | ||
:data set with known mapping / standardization issues. | :data set with known mapping / standardization issues. | ||
|All | |All | ||
Line 102: | Line 102: | ||
|- | |- | ||
|10:50-11:30 | |10:50-11:30 | ||
|Data Management Planning | |Data Management Planning | ||
:choosing a database, data flow, data backup, field-to-database, metadata | :choosing a database, data flow, data backup, field-to-database, metadata | ||
|Amber Budden | |Amber Budden | ||
Line 115: | Line 115: | ||
|- | |- | ||
|1:00-1:30 | |1:00-1:30 | ||
|Images and media issues: a brief intro | |Images and media issues: a brief intro | ||
:choosing a camera, issues across different database platforms, image submissions, linking images to occurrence records, batch processing | :choosing a camera, issues across different database platforms, image submissions, linking images to occurrence records, batch processing | ||
|Ed Gilbert | |Ed Gilbert | ||
|- | |- | ||
|1:30-1:50 | |1:30-1:50 | ||
|Digitization workflows and process | |Digitization workflows and process | ||
:getting started, prioritization, specimen collecting, new database, integrating old data | :getting started, prioritization, specimen collecting, new database, integrating old data | ||
|Deb Paul, Ed Gilbert & Katja Seltmann | |Deb Paul, Ed Gilbert & Katja Seltmann | ||
|- | |- | ||
|1:50-2:10 | |1:50-2:10 | ||
|Common Workflows | |Common Workflows | ||
:image to data, specimen to data, skeletal records, crowd-sourcing, OCR/NLP, georeferencing, metadata | :image to data, specimen to data, skeletal records, crowd-sourcing, OCR/NLP, georeferencing, metadata | ||
|Deb Paul, Ed Gilbert & Katja Seltmann | |Deb Paul, Ed Gilbert & Katja Seltmann | ||
|- | |- | ||
|2:10-2:25 | |2:10-2:25 | ||
|Optimization | |Optimization | ||
:Reviewing your own workflow, common bottlenecks, policy, documentation | :Reviewing your own workflow, common bottlenecks, policy, documentation | ||
|Katja Seltmann, Deb Paul & Ed Gilbert | |Katja Seltmann, Deb Paul & Ed Gilbert | ||
Line 143: | Line 143: | ||
|- | |- | ||
|3:20-3:50 | |3:20-3:50 | ||
|Georeferencing Data (Georeferencing Workflow) | |Georeferencing Data (Georeferencing Workflow) | ||
:visualization tools, when to georeference, best practices | :visualization tools, when to georeference, best practices | ||
|Ed Gilbert | |Ed Gilbert | ||
|- | |- | ||
|3:50-4:10 | |3:50-4:10 | ||
|GEOLocate Exercise (May be DEMO) | |GEOLocate Exercise (May be DEMO) | ||
:CoGe, GPS Visualizer, re-integration, qc | :CoGe, GPS Visualizer, re-integration, qc | ||
|Ed Gilbert | |Ed Gilbert | ||
Line 162: | Line 162: | ||
|- | |- | ||
|8:30-12:00 | |8:30-12:00 | ||
|[http://www.dbg.org/ Desert Botanical Garden (DBG) Field Trip] and Lunch | |[http://www.dbg.org/ Desert Botanical Garden (DBG) Field Trip] and Lunch | ||
:meet at 8:30 in Hotel Lobby, depart at 8:40 for DBG; garden from 9-11:30, lunch 11:30 - 12:30, depart 12:40 to ASU | |||
| | | | ||
|- | |- | ||
Line 170: | Line 171: | ||
|- | |- | ||
|1:00-1:25 | |1:00-1:25 | ||
|Welcome Back and Intro to Data Quality | |Welcome Back and Intro to Data Quality | ||
:inside the data-life-cycle, cost of data quality, quality vs completeness | :inside the data-life-cycle, cost of data quality, quality vs completeness | ||
:volunteer presentations | |||
|Amber Budden, Ed Gilbert | |Amber Budden, Ed Gilbert | ||
|- | |- | ||
|1:25-1:40 | |1:25-1:40 | ||
|Data Cleaning | |Data Cleaning | ||
:where, when and how does it happen?, what kind of feedback to expect | :where, when and how does it happen?, what kind of feedback to expect | ||
:types of common errors and omissions, best practices strategies, feedback and annotation, error tracking, automation, policies and protocols | :types of common errors and omissions, best practices strategies, feedback and annotation, error tracking, automation, policies and protocols | ||
Line 181: | Line 183: | ||
|- | |- | ||
|1:40-2:20 | |1:40-2:20 | ||
|Data Cleaning Exercise I | |Data Cleaning Exercise I | ||
:(opt: quick exercise - spot the snafus) | :(opt: quick exercise - spot the snafus) | ||
:better spreadsheet skills (Data Carpentry) | :better spreadsheet skills (Data Carpentry) | ||
Line 187: | Line 189: | ||
|- | |- | ||
|2:20-2:50 | |2:20-2:50 | ||
|Data Cleaning Exercise II | |Data Cleaning Exercise II | ||
:Open Refine, part I (facets, clustering) | :Open Refine, part I (facets, clustering) | ||
|Deb Paul & Katja Seltmann | |Deb Paul & Katja Seltmann | ||
Line 224: | Line 226: | ||
|- | |- | ||
|10:35-12:00 | |10:35-12:00 | ||
|Break out groups | |Break out groups | ||
:TNRS,ECAT,QGIS,GEOLocate,CoGe,Data Cleaning: what is scripting? what is regex? examples in Open Refine, possibly in Symbiota, your own data issues / requests,Data Cleaning Exercise II - using Open Refine, part II (Using APIs, Taxonomic Name Resolution Services) | |||
|All | |All | ||
|- | |- | ||
Line 232: | Line 235: | ||
|- | |- | ||
|1:00-1:25 | |1:00-1:25 | ||
|Data Publishing: in the context of the data life cycle | |Data Publishing: in the context of the data life cycle | ||
:benefits, concerns, aggregators, citation, attribution | |||
| tbd | | tbd | ||
|- | |- | ||
Line 248: | Line 252: | ||
|- | |- | ||
|3:20-4:20 | |3:20-4:20 | ||
|Second round of break-out groups | |Second round of break-out groups | ||
:DWC-A publishing Exercise (or DEMO): using IPT instance OR | |||
:Symbiota DwC-A mapping and publishing exercise, | |||
:others | |||
| | | | ||
|- | |- | ||
|4:20-4:40 | |4:20-4:40 | ||
|Closing topics | |Closing topics | ||
:a greater network, the global landscape, next steps | |||
|Katja Seltmann & Nico Franz | |Katja Seltmann & Nico Franz | ||
|- | |- | ||
Line 260: | Line 268: | ||
|- | |- | ||
|5:10 - 5:30 | |5:10 - 5:30 | ||
|Review Data Life Cycle we’ve walked through. | |Review Data Life Cycle we’ve walked through. | ||
:discussion, survey, next steps, and conclusions | |||
|all | |all | ||
|} | |} |