Data Management Interest Group: Difference between revisions

From iDigBio
Jump to navigation Jump to search
 
(60 intermediate revisions by 6 users not shown)
Line 20: Line 20:
*[http://idigbio.adobeconnect.com/p893lqq26wj/ Data quality, usage, and issue tracking using GitHub], by John Wieczorek, et al at VertNet, recorded Friday 23 April, 2015. 4 - 5 PM EDT.
*[http://idigbio.adobeconnect.com/p893lqq26wj/ Data quality, usage, and issue tracking using GitHub], by John Wieczorek, et al at VertNet, recorded Friday 23 April, 2015. 4 - 5 PM EDT.
*[http://idigbio.adobeconnect.com/p7ht0zf5i7p/ Towards user-definable, semi-automated workflows for curating biodiversity data (recording)]. Presenters (abc order): David Lowery, James A. Macklin, Timothy  McPhillips, Paul J. Morris, Tianghong Song. Recorded 28 May 2015 2 - 4 PM EDT
*[http://idigbio.adobeconnect.com/p7ht0zf5i7p/ Towards user-definable, semi-automated workflows for curating biodiversity data (recording)]. Presenters (abc order): David Lowery, James A. Macklin, Timothy  McPhillips, Paul J. Morris, Tianghong Song. Recorded 28 May 2015 2 - 4 PM EDT
*[http://idigbio.adobeconnect.com/p6dr3k8f7y2/ Improving Data Quality: iDigBio Recordset data cleaning methods, tools, and data flags]. Presenters: Alex Thompson (iDigBio IT), Matt Collins (iDigBio IT), and guests: Heather Appleby and Katja Seltmann. Recorded 23 October 2015 - 2 - 3 PM EDT.
* [http://idigbio.adobeconnect.com/p79nl3ak8x8/ Variations on the theme of tracking loans, gifts, sampling, and more] Presenters: Simon Checksfield with Nicole Fisher, CSIRO; Andrew Bentley from University of Kansas Biodiversity Institute, Specify, and SPNHC; Christine Johnson, Entomology, AMNH; Tiffany Adrain, University of Iowa Paleontology;    Elspeth Haston, RBGE.
*[http://idigbio.adobeconnect.com/p3914lttxpu/ Shaping the semantic layer by mining digitised data: an encounter between iDigBio's plant records and the Environment Ontology (ENVO)] Presenters: Dr. Pier Luigi Buttigieg, HGF-MPG Group for Deep Sea Ecology and Technology, c/o Max Planck Institute for Marine Microbiology, Bremen, Germany, Email: pbuttigi@mpi-bremen.de; and Grant Godden, Research Associate, Michigan State University, Email: goddengr@msu.edu
**If you have ideas for next steps with this work, or would like to be involved in the next steps conversation, please send a note to idigbio@acis.ufl.edu
*[https://vimeo.com/idigbio/review/146166848/bca77fe5f2 DAMmed if you Do or Don't : Archiving, what is it anyway? and just what is a DAM?], by Larry Gall, Yale Peabody Museum, recorded 17 November 2015. 4 - 5 PM EST.
** A ''follow-up'' webinar on this topic, panel-style, is planned for early 2016. Stay tuned for more about this.
* Insights into Inselect Software: automating image processing, barcode reading, and validation of user-defined metadata
** [http://idigbio.adobeconnect.com/p7qo63aeo4a/ Adobe Connect webinar recording]
** [https://vimeo.com/160792078 ​MP4 Version on Vimeo]
** by Lawrence Hudson and Ben Price, Natural History Museum, London. Recorded 29 March, 2016. 11 - 12 PM EDT.
* [http://idigbio.adobeconnect.com/p8dpn6d3oyr Webinar Panel: DAMs and Archival Issues for Large and Small Collections: options, considerations, resources]
** [https://www.idigbio.org/sites/default/files/working-groups/dm/NOAA%20Archiving%20and%20Data%20Mangement.pptx Brian's Presentation]
** [https://www.idigbio.org/sites/default/files/working-groups/dm/DAMs%20and%20Preservica.pptx Euan's Presentation]
** [https://www.idigbio.org/sites/default/files/working-groups/dm/Digital%20Asset%20Presentation.pptx Mike's Presentation]
* from ''Pensoft Publishers'' and ''Biodiversity Data Journal'' Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts
**[http://idigbio.adobeconnect.com/p7sg0aym3e3/ Adobe Connect webinar recording] by Viktor Senderov - Marie Curie PhD Student at Pensoft, datascience@pensoft.net and Lyubomir Penev - Managing Directory and Founder of Pensoft Publishers, penev@pensoft.net. Recorded 16 June, 2016. 9 - 10 am EDT.
*[http://idigbio.adobeconnect.com/p2tfsebw717/ Mass Digitizing a Working Herbarium using a conveyor belt: Workflows, Strategies, Challenges] presented by Sylvia Orli, IT and Digitization Manager, US Herbarium, Smithsonian. Recorded 18 October 2016. 3 - 4 PM EDT.
=== Darwin Core Hour Recordings ===
All Darwin Core Hour resources are on or linked through Git Hub https://github.com/tdwg/dwc-qa/wiki/Webinars
* Chapter 0. [http://idigbio.adobeconnect.com/p200obyl8yp/ Introduction to Darwin Core Hour Webinar Series (adobe connect)] presented by John Wieczorek, Paula Zermoglio, and Deborah Paul. Recorded 2017-02-07. On [https://vimeo.com/idigbio/review/203288520/0d6ebbd70c Vimeo as mp4].
* Chapter 1. [http://idigbio.adobeconnect.com/p200obyl8yp/ Introduction to Darwin Core (adobe connect)] presented by John Wieczorek. Recorded 2017-02-07. On [https://vimeo.com/idigbio/review/203288520/0d6ebbd70c Vimeo as mp4]
* Chapter 2. [http://idigbio.adobeconnect.com/p2ewna3h59c/ Even Simple is Hard (adobe connect)] presented by John Wieczorek. Recorded 2017-03-07. On [https://vimeo.com/209909970 Vimeo as mp4]
* Chapter 3. [http://idigbio.adobeconnect.com/p8iboc9x62j/ Thousands of shades for “Controlled” Vocabularies (adobe connect)] presented by Paula Zermoglio. Recorded 2017-04-04. On [https://vimeo.com/album/4407185/video/212404502 Vimeo as mp4]
* Chapter 4a+b. [http://idigbio.adobeconnect.com/p46cvi3c2bi/ Evolution of Darwin Core Terms and Extensions - two extant examples for community input (adobe connect)] presented by Andy Bentley and Quentin Groom. Recorded 2017-05-02. On [https://vimeo.com/216167534 Vimeo as mp4]
* Chapter 5. [http://idigbio.adobeconnect.com/ps7hz22eiyu9 Darwin Core in Practice: Introduction to the GBIF IPT (adobe connect)] presented by Kyle Braak, Laura Russel, and Carole Sinou. Recorded 2017-06-13. On [https://vimeo.com/idigbio/review/221477895/4393ffe9b2 Vimeo as mp4]
* Chapter 6. [http://idigbio.adobeconnect.com/piczk3aht1po Where am I, exactly? Darwin Core geoferencing terms (adobe connect)] presented by David Bloom, Town Peterson, and John Wieczorek. Recorded 2017-07-11. On [https://vimeo.com/225132217 Vimeo as mp4]
* Chapter 7. Aggregators - a Darwin Core View [http://idigbio.adobeconnect.com/pjkbe6j6yljb/?OWASP_CSRFTOKEN=1b2485174a9e340bed05188cff572c35d3aaaac82add77a3f314658e6a7bdef8 Part I: GBIF & iDigBio (adobe connect)] and [http://idigbio.adobeconnect.com/p8j48suki286/ Part II: (More Than Vert)Net (adobe connect)] presented by GBIF, iDigBio, Vertnet, ALA, and Canadensys. Recorded 2017-08-15. On Vimeo as mp4: [https://vimeo.com/229759064 Part I] and [https://vimeo.com/234428277 Part II]
* Chapter 8. [http://idigbio.adobeconnect.com/p8ltdquluyy1/ A bite from the core - testing for data quality (adobe connect)] presented by Lee Belbin and Arthur Chapman. Recorded 2017-09-05 (North America) and 2017-09-06 (Oceania). On [https://vimeo.com/239698443 Vimeo as mp4]
* Chapter 9. [http://idigbio.adobeconnect.com/p1zkmx6is2c6/ Kurator Web: for Cleaner Biodiversity Data (adobe connect)] presented by John Wieczorek. Recorded 2017-10-24. On [https://vimeo.com/239711651 Vimeo as mp4]
* Chapter 10. [http://idigbio.adobeconnect.com/pqw2om865mvv/ Audubon Core and 3D Biodiversity Data: Metadata, Practice, and Unification of Efforts (adobe connect)] presented by Gary Motz and John Wieczorek. Recorded 2017-11-21. On [https://vimeo.com/244677506 Vimeo as mp4]
* Chapter 11. [http://idigbio.adobeconnect.com/pu48ho08pl43/ DwC Hour Brainstorming – Inviting the Community to Plan for Next Year (adobe connect)]. Recorded 2017-12-04. On [https://vimeo.com/245929146 Vimeo as mp4]
* Chapter 12. [http://idigbio.adobeconnect.com/pfvb99zx6nle/ Making DNA and tissue collections available by using the GGBN extensions with IPT (adobe connect)] presented by Gabriela Dröge and Katherine Barker. Recorded 2018-02-21. On [https://vimeo.com/260435567 Vimeo as mp4]
* Chapter 13. [http://idigbio.adobeconnect.com/pskdy7p1yo8m/ The Problem of Time: Dealing with Paleontological and Zooarchaeological Specimens in Darwin Core (adobe connect)] presented by Laura Brenskelle. Recorded 2018-04-24. On [https://vimeo.com/267290743 Vimeo as mp4]
2020 Darwin Core Hours coming soon. Watch this space.


==Collaborative Notes and Interest Group Documents==
==Collaborative Notes and Interest Group Documents==
Line 29: Line 65:
*[https://docs.google.com/document/d/1khoQ8yAoO1Oi2fYXcKVWxWUNR9XErN0ajMSOYXJw-cE/edit# DMI Google Doc notes for iDigBio Webinar: Data quality, usage, and issue tracking using GitHub] 23 April 2015
*[https://docs.google.com/document/d/1khoQ8yAoO1Oi2fYXcKVWxWUNR9XErN0ajMSOYXJw-cE/edit# DMI Google Doc notes for iDigBio Webinar: Data quality, usage, and issue tracking using GitHub] 23 April 2015
*[https://docs.google.com/document/d/1hEqOhWe89tm4SOx85SGuc46b1sCRMviKZ3yBwxMknBc/edit DMI Meeting 28 August - Planning a Webinar Series] Group Notes
*[https://docs.google.com/document/d/1hEqOhWe89tm4SOx85SGuc46b1sCRMviKZ3yBwxMknBc/edit DMI Meeting 28 August - Planning a Webinar Series] Group Notes
*[https://docs.google.com/document/d/1P3TGIRcyd3PEEoQXUEKaSlkTJovH_RS0Jzp0Qw_wHxI/edit 9 January 2017 DMI Organizational Meeting Notes]


==Presentations, Posters, Upcoming Topics==
==Presentations, Posters, Upcoming Topics==
Line 41: Line 78:
**[https://www.idigbio.org/sites/default/files/working-groups/dm/FP-Akka-iDigBio-webinar3.pdf Kurator presentation (pdf)]  
**[https://www.idigbio.org/sites/default/files/working-groups/dm/FP-Akka-iDigBio-webinar3.pdf Kurator presentation (pdf)]  
**Part 2 of Webinar: designed for IT-oriented folks wanting to install and test please go here http://wiki.datakurator.net/web/iDigBioWebinar_May2015 Follow the instructions and you'll have some opportunities in the second half of the webinar to get input into use of this tool.
**Part 2 of Webinar: designed for IT-oriented folks wanting to install and test please go here http://wiki.datakurator.net/web/iDigBioWebinar_May2015 Follow the instructions and you'll have some opportunities in the second half of the webinar to get input into use of this tool.
*[https://www.idigbio.org/content/improving-data-quality-idigbio-recordset-data-cleaning-method-tools-and-data-flags iDigBio Recordset Data Cleaning tools and flags: where do they come from? how can data providers use them to enhance their datasets?]
**Alex Thompson and Matt Collins, Friday, October 23rd, 2 PM EDT
** [https://docs.google.com/presentation/d/1dLq-aJNrQ81Z0zSnj_79Xnomm0TgK6GarTdLIbFJZbU/edit?usp=sharing Slides available here]
**Check out [https://www.idigbio.org/content/summer-learning-r-clean-data-idigbio-portal-recordset-correction-feature blog post by Heather Appleby and Katja Seltmann] about their experience using the information in the data flags provided by iDigBio. What did they learn? What did we learn at iDigBio? What's next?
*[https://www.idigbio.org/content/webinar-variations-theme-tracking-loans-gifts-sampling-and-more Variations on the theme of tracking loans, gifts, sampling, and more]
**Simon Checksfield, Nicole Fisher, Andrew Bentley, Matt Woodburn, Vince Smith, Christine Johnson, Tiffany Adrain, and Elspeth Haston Friday, October 30, 2015, 21:00:00 (UTC) Friday 5:00 PM (EDT);  Friday 4:00 PM (Kansas City);  Friday 9:00 PM (Edinburgh, London);  Sat 8:00 AM (Sydney)
***[[Media:2015_iDigBioWebinar_Loans.pdf |Christine Johnson, AMNH]]
***[[Media:RBGEWebinariDigBio30Oct2015LoanTracking.pdf |Elspeth Haston, RBGE]]
*[https://www.idigbio.org/content/webinar-shaping-semantic-layer-mining-digitised-data-encounter-between-idigbios-plant Shaping the semantic layer by mining digitised data: an encounter between iDigBio's plant records and the Environment Ontology (ENVO)]
**Dr. Pier Luigi Buttigieg, Max Plank Institute; Tuesday, November 10, 2015 - 9:00am to 10:00am EST
*Announcement: [https://www.idigbio.org/content/webinar-dammed-if-you-do-or-don%E2%80%99t DAMmed if you Do or Don't] : Archiving, what is it anyway? and just what is a DAM?
** PowerPoint [https://www.idigbio.org/sites/default/files/working-groups/DMIg/idigbio-dam-lfg.ppt DAMmed if you Do or Don’t] (ppt) by Larry Gall
** [https://vimeo.com/idigbio/review/146166848/bca77fe5f2 Recording],by Larry Gall (Yale Peabody Museum); Tuesday, 17 November 2015 at 4 PM EST (that's 21:00 UTC).
* DEMO and Webinar Announcement: [https://www.idigbio.org/content/insights-inselect-software-automating-image-processing-barcode-reading-and-validation-user Insights into Inselect Software: automating image processing, barcode reading, and validation of user-defined metadata]
** [[Media:IDigBio_Inselect_Demo.pdf |Insights into Inselect presentation]] (pdf); Tuesday, 29 March 2016 11 AM EDT, 4 PM BST by software developers Lawrence Hudson and Ben Price from the Natural History Museum (NHM) in London.
* from ''Pensoft Publishers'' and ''Biodiversity Data Journal'' Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts
**[http://www.idigbio.org/sites/default/files/working-groups/DMIg/iDigBio_Webinar_2.pdf Webinar Presentation (pdf)] by Viktor Senderov - Marie Curie PhD Student at Pensoft, datascience@pensoft.net and Lyubomir Penev - Managing Directory and Founder of Pensoft Publishers, penev@pensoft.net. Recorded 16 June, 2016. 9 - 10 am EDT. ([http://www.idigbio.org/sites/default/files/working-groups/DMIg/iDigBio_Webinar_2.pptx pptx])
*Providing Data to iDigBio - Getting Feedback from iDigBio: Experiencing the Data Life Cycle
** Mare Nazaire (date to be decided after their data is ingested)


== Potential Topics ==
== Potential Topics ==
*Linking specimens, notes, and literature-- what systems have you found that best serve those linkages?
*More about Archiving Options and Challenges
*Macroalgal TCN using Voice Recognition and OCR output to speed up digitization


== Relevant Papers and Documents==
== Relevant Papers and Documents==

Latest revision as of 01:57, 29 January 2020

iDigBio's Digitization Resources Wiki Home

Data Management Interest Group (DMI)

This page is devoted to resources and discussion for the DMI Group. Keeping up with data requires certain skills and infrastructure. This Interest Group plans to discuss issues surrounding shared data and the help and information the biodiversity community needs in order to ensure, if possible, that the provider has the most up-to-date versions of their own datasets. We intend to provide a forum for discussion, and act as a resource for guidance to point providers toward potential solutions. Do you need help to re-integrate data into your database? Are you able to help, or know of resources? We invite anyone with an interest in this topic to join us and contribute your observations and potential solutions to this challenging topic. Anyone is welcome to join the interest group.

The interest group schedules regular discussion sessions via Adobe Connect for the purpose of sharing techniques, strategies, uses, improvements, and technology associated with re-integrating enhanced data back into a provider's database. Resources, related documents, and discussion notes are stored below. Our first meeting: Webinar 7 August 2014, 1:00 - 2:00 PM EDT.

Dmg poster dp js.jpg

Interest Group Members

DMI Members

Meeting Recordings

Darwin Core Hour Recordings

All Darwin Core Hour resources are on or linked through Git Hub https://github.com/tdwg/dwc-qa/wiki/Webinars

2020 Darwin Core Hours coming soon. Watch this space.

Collaborative Notes and Interest Group Documents

Presentations, Posters, Upcoming Topics

Potential Topics

  • Linking specimens, notes, and literature-- what systems have you found that best serve those linkages?
  • More about Archiving Options and Challenges
  • Macroalgal TCN using Voice Recognition and OCR output to speed up digitization

Relevant Papers and Documents

A specialist’s audit of aggregated occurrence records Robert Mesibov

Relevant Links

Data Carpentry