Difference between revisions of "Data Management Interest Group"

From iDigBio
Jump to: navigation, search
m (Collaborative Notes and Interest Group Documents)
m (Darwin Core Hour Recordings)
 
(85 intermediate revisions by 6 users not shown)
Line 14: Line 14:
 
*[http://idigbio.adobeconnect.com/p9hrd10i13l/ First Meeting of the Data Management Interest Group (DMI) August 07, 2014]
 
*[http://idigbio.adobeconnect.com/p9hrd10i13l/ First Meeting of the Data Management Interest Group (DMI) August 07, 2014]
 
*[http://idigbio.adobeconnect.com/p6f5ziuxva4/ Partnering with libraries for data management, by Brian Westra, Recorded Monday October 20th, 2014. Noon - 1 PM EDT]
 
*[http://idigbio.adobeconnect.com/p6f5ziuxva4/ Partnering with libraries for data management, by Brian Westra, Recorded Monday October 20th, 2014. Noon - 1 PM EDT]
*Recording: Issues in Re-integrating Georeferenced Data, the FishNet2 Experience, by Nelson Rios, recorded Monday March 30th, 2015. 2 - 3 PM EDT
+
*[http://idigbio.adobeconnect.com/p2xbxdd55nn/ Issues in Re-integrating Georeferenced Data, the FishNet2 Experience, by Nelson Rios, recorded Monday March 30th, 2015. 2 - 3 PM EDT]
 +
**[http://fishnet2.net/search.aspx FishNet 2]
 +
**[http://fishnet2.net/api/v1/apihelp.htm the FishNet 2 API]
 +
**[http://fishnet2.net/repatriation.html FishNet 2 Project Results - Publicly Available]
 +
*[http://idigbio.adobeconnect.com/p893lqq26wj/ Data quality, usage, and issue tracking using GitHub], by John Wieczorek, et al at VertNet, recorded Friday 23 April, 2015. 4 - 5 PM EDT.
 +
*[http://idigbio.adobeconnect.com/p7ht0zf5i7p/ Towards user-definable, semi-automated workflows for curating biodiversity data (recording)]. Presenters (abc order): David Lowery, James A. Macklin, Timothy  McPhillips, Paul J. Morris, Tianghong Song. Recorded 28 May 2015 2 - 4 PM EDT
 +
*[http://idigbio.adobeconnect.com/p6dr3k8f7y2/ Improving Data Quality: iDigBio Recordset data cleaning methods, tools, and data flags]. Presenters: Alex Thompson (iDigBio IT), Matt Collins (iDigBio IT), and guests: Heather Appleby and Katja Seltmann. Recorded 23 October 2015 - 2 - 3 PM EDT.
 +
* [http://idigbio.adobeconnect.com/p79nl3ak8x8/ Variations on the theme of tracking loans, gifts, sampling, and more] Presenters: Simon Checksfield with Nicole Fisher, CSIRO; Andrew Bentley from University of Kansas Biodiversity Institute, Specify, and SPNHC; Christine Johnson, Entomology, AMNH; Tiffany Adrain, University of Iowa Paleontology;    Elspeth Haston, RBGE.
 +
*[http://idigbio.adobeconnect.com/p3914lttxpu/ Shaping the semantic layer by mining digitised data: an encounter between iDigBio's plant records and the Environment Ontology (ENVO)] Presenters: Dr. Pier Luigi Buttigieg, HGF-MPG Group for Deep Sea Ecology and Technology, c/o Max Planck Institute for Marine Microbiology, Bremen, Germany, Email: pbuttigi@mpi-bremen.de; and Grant Godden, Research Associate, Michigan State University, Email: goddengr@msu.edu
 +
**If you have ideas for next steps with this work, or would like to be involved in the next steps conversation, please send a note to idigbio@acis.ufl.edu
 +
*[https://vimeo.com/idigbio/review/146166848/bca77fe5f2 DAMmed if you Do or Don't : Archiving, what is it anyway? and just what is a DAM?], by Larry Gall, Yale Peabody Museum, recorded 17 November 2015. 4 - 5 PM EST.
 +
** A ''follow-up'' webinar on this topic, panel-style, is planned for early 2016. Stay tuned for more about this.
 +
* Insights into Inselect Software: automating image processing, barcode reading, and validation of user-defined metadata
 +
** [http://idigbio.adobeconnect.com/p7qo63aeo4a/ Adobe Connect webinar recording]
 +
** [https://vimeo.com/160792078 ​MP4 Version on Vimeo]
 +
** by Lawrence Hudson and Ben Price, Natural History Museum, London. Recorded 29 March, 2016. 11 - 12 PM EDT.
 +
* [http://idigbio.adobeconnect.com/p8dpn6d3oyr Webinar Panel: DAMs and Archival Issues for Large and Small Collections: options, considerations, resources]
 +
** [https://www.idigbio.org/sites/default/files/working-groups/dm/NOAA%20Archiving%20and%20Data%20Mangement.pptx Brian's Presentation]
 +
** [https://www.idigbio.org/sites/default/files/working-groups/dm/DAMs%20and%20Preservica.pptx Euan's Presentation]
 +
** [https://www.idigbio.org/sites/default/files/working-groups/dm/Digital%20Asset%20Presentation.pptx Mike's Presentation]
 +
* from ''Pensoft Publishers'' and ''Biodiversity Data Journal'' Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts
 +
**[http://idigbio.adobeconnect.com/p7sg0aym3e3/ Adobe Connect webinar recording] by Viktor Senderov - Marie Curie PhD Student at Pensoft, datascience@pensoft.net and Lyubomir Penev - Managing Directory and Founder of Pensoft Publishers, penev@pensoft.net. Recorded 16 June, 2016. 9 - 10 am EDT.
 +
*[http://idigbio.adobeconnect.com/p2tfsebw717/ Mass Digitizing a Working Herbarium using a conveyor belt: Workflows, Strategies, Challenges] presented by Sylvia Orli, IT and Digitization Manager, US Herbarium, Smithsonian. Recorded 18 October 2016. 3 - 4 PM EDT.
 +
 
 +
=== Darwin Core Hour Recordings ===
 +
All Darwin Core Hour resources are on or linked through Git Hub https://github.com/tdwg/dwc-qa/wiki/Webinars
 +
 
 +
* Chapter 0. [http://idigbio.adobeconnect.com/p200obyl8yp/ Introduction to Darwin Core Hour Webinar Series (adobe connect)] presented by John Wieczorek, Paula Zermoglio, and Deborah Paul. Recorded 2017-02-07. On [https://vimeo.com/idigbio/review/203288520/0d6ebbd70c Vimeo as mp4].
 +
* Chapter 1. [http://idigbio.adobeconnect.com/p200obyl8yp/ Introduction to Darwin Core (adobe connect)] presented by John Wieczorek. Recorded 2017-02-07. On [https://vimeo.com/idigbio/review/203288520/0d6ebbd70c Vimeo as mp4]
 +
* Chapter 2. [http://idigbio.adobeconnect.com/p2ewna3h59c/ Even Simple is Hard (adobe connect)] presented by John Wieczorek. Recorded 2017-03-07. On [https://vimeo.com/209909970 Vimeo as mp4]
 +
* Chapter 3. [http://idigbio.adobeconnect.com/p8iboc9x62j/ Thousands of shades for “Controlled” Vocabularies (adobe connect)] presented by Paula Zermoglio. Recorded 2017-04-04. On [https://vimeo.com/album/4407185/video/212404502 Vimeo as mp4]
 +
* Chapter 4a+b. [http://idigbio.adobeconnect.com/p46cvi3c2bi/ Evolution of Darwin Core Terms and Extensions - two extant examples for community input (adobe connect)] presented by Andy Bentley and Quentin Groom. Recorded 2017-05-02. On [https://vimeo.com/216167534 Vimeo as mp4]
 +
* Chapter 5. [http://idigbio.adobeconnect.com/ps7hz22eiyu9 Darwin Core in Practice: Introduction to the GBIF IPT (adobe connect)] presented by Kyle Braak, Laura Russel, and Carole Sinou. Recorded 2017-06-13. On [https://vimeo.com/idigbio/review/221477895/4393ffe9b2 Vimeo as mp4]
 +
* Chapter 6. [http://idigbio.adobeconnect.com/piczk3aht1po Where am I, exactly? Darwin Core geoferencing terms (adobe connect)] presented by David Bloom, Town Peterson, and John Wieczorek. Recorded 2017-07-11. On [https://vimeo.com/225132217 Vimeo as mp4]
 +
* Chapter 7. Aggregators - a Darwin Core View [http://idigbio.adobeconnect.com/pjkbe6j6yljb/?OWASP_CSRFTOKEN=1b2485174a9e340bed05188cff572c35d3aaaac82add77a3f314658e6a7bdef8 Part I: GBIF & iDigBio (adobe connect)] and [http://idigbio.adobeconnect.com/p8j48suki286/ Part II: (More Than Vert)Net (adobe connect)] presented by GBIF, iDigBio, Vertnet, ALA, and Canadensys. Recorded 2017-08-15. On Vimeo as mp4: [https://vimeo.com/229759064 Part I] and [https://vimeo.com/234428277 Part II]
 +
* Chapter 8. [http://idigbio.adobeconnect.com/p8ltdquluyy1/ A bite from the core - testing for data quality (adobe connect)] presented by Lee Belbin and Arthur Chapman. Recorded 2017-09-05 (North America) and 2017-09-06 (Oceania). On [https://vimeo.com/239698443 Vimeo as mp4]
 +
* Chapter 9. [http://idigbio.adobeconnect.com/p1zkmx6is2c6/ Kurator Web: for Cleaner Biodiversity Data (adobe connect)] presented by John Wieczorek. Recorded 2017-10-24. On [https://vimeo.com/239711651 Vimeo as mp4]
 +
* Chapter 10. [http://idigbio.adobeconnect.com/pqw2om865mvv/ Audubon Core and 3D Biodiversity Data: Metadata, Practice, and Unification of Efforts (adobe connect)] presented by Gary Motz and John Wieczorek. Recorded 2017-11-21. On [https://vimeo.com/244677506 Vimeo as mp4]
 +
* Chapter 11. [http://idigbio.adobeconnect.com/pu48ho08pl43/ DwC Hour Brainstorming – Inviting the Community to Plan for Next Year (adobe connect)]. Recorded 2017-12-04. On [https://vimeo.com/245929146 Vimeo as mp4]
 +
* Chapter 12. [http://idigbio.adobeconnect.com/pfvb99zx6nle/ Making DNA and tissue collections available by using the GGBN extensions with IPT (adobe connect)] presented by Gabriela Dröge and Katherine Barker. Recorded 2018-02-21. On [https://vimeo.com/260435567 Vimeo as mp4]
 +
* Chapter 13. [http://idigbio.adobeconnect.com/pskdy7p1yo8m/ The Problem of Time: Dealing with Paleontological and Zooarchaeological Specimens in Darwin Core (adobe connect)] presented by Laura Brenskelle. Recorded 2018-04-24. On [https://vimeo.com/267290743 Vimeo as mp4]
 +
2020 Darwin Core Hours coming soon. Watch this space.
  
 
==Collaborative Notes and Interest Group Documents==
 
==Collaborative Notes and Interest Group Documents==
Line 21: Line 62:
 
*[https://docs.google.com/document/d/1YFYIZ3KkxJeI0PvHcyuPEWpnI5v6oHU1ZmGhvci1UIU/edit DMI Google Notes, Partnering with Librarians for Data Mgmt August 07, 2014]
 
*[https://docs.google.com/document/d/1YFYIZ3KkxJeI0PvHcyuPEWpnI5v6oHU1ZmGhvci1UIU/edit DMI Google Notes, Partnering with Librarians for Data Mgmt August 07, 2014]
 
**[[Media:AdobeConnectChat20October2014.pdf |DMI Meeting Chat Transcript, Partnering with Librarians for Data Mgmt August 07, 2014]]
 
**[[Media:AdobeConnectChat20October2014.pdf |DMI Meeting Chat Transcript, Partnering with Librarians for Data Mgmt August 07, 2014]]
*DMI Google Notes, Issues in Re-integrating Georeferenced Data, the FishNet2 Experience March 30, 2015
+
*[[Media:30032015_DMIG_NelsonRios_FishNet2_Reintegration.pdf |DMI Meeting Chat Transcript, Issues in Re-integrating Georeferenced Data, the FishNet2 Experience March 30, 2015]]
**DMI Meeting Chat Transcript, Issues in Re-integrating Georeferenced Data, the FishNet2 Experience March 30, 2015
+
*[https://docs.google.com/document/d/1khoQ8yAoO1Oi2fYXcKVWxWUNR9XErN0ajMSOYXJw-cE/edit# DMI Google Doc notes for iDigBio Webinar: Data quality, usage, and issue tracking using GitHub] 23 April 2015
 +
*[https://docs.google.com/document/d/1hEqOhWe89tm4SOx85SGuc46b1sCRMviKZ3yBwxMknBc/edit DMI Meeting 28 August - Planning a Webinar Series] Group Notes
 +
*[https://docs.google.com/document/d/1P3TGIRcyd3PEEoQXUEKaSlkTJovH_RS0Jzp0Qw_wHxI/edit 9 January 2017 DMI Organizational Meeting Notes]
  
==Presentations, Posters, Potential Topics==
+
==Presentations, Posters, Upcoming Topics==
 
*[https://www.idigbio.org/sites/default/files/working-groups/dm/DataRepatriationRe-integration.pptx Data Management: The Data Re-integration Step] Presentation from first meeting (Webinar), 7 August 2014
 
*[https://www.idigbio.org/sites/default/files/working-groups/dm/DataRepatriationRe-integration.pptx Data Management: The Data Re-integration Step] Presentation from first meeting (Webinar), 7 August 2014
 
*[https://www.idigbio.org/sites/default/files/working-groups/dm/iDigBioPresentation_20141020.pptx Partnering with libraries for data management, by Brian Westra, 20 October 2014]
 
*[https://www.idigbio.org/sites/default/files/working-groups/dm/iDigBioPresentation_20141020.pptx Partnering with libraries for data management, by Brian Westra, 20 October 2014]
 
*[https://www.idigbio.org/wiki/images/2/2c/Dmg_poster_dp_js.jpg DMI Poster at DigBio Summit IV, 27-28 October 2014]
 
*[https://www.idigbio.org/wiki/images/2/2c/Dmg_poster_dp_js.jpg DMI Poster at DigBio Summit IV, 27-28 October 2014]
 
**Poster presented by Mare Nazaire, content by working group, design by Jeremy Spinks.
 
**Poster presented by Mare Nazaire, content by working group, design by Jeremy Spinks.
*Webinar: [https://www.idigbio.org/content/webinar-issues-re-integrating-georeferenced-data-fishnet2-experience FishNet2 on re-integrating georeferences back into local collections databases] (March 30, 2015)
+
*Webinar Calendar Announcement: [https://www.idigbio.org/content/webinar-issues-re-integrating-georeferenced-data-fishnet2-experience FishNet2 on re-integrating georeferences back into local collections databases] (March 30, 2015)
*Macroalgal TCN using Voice Recognition and OCR output to speed up digitization (may be in April 2015)
+
*Webinar Calendar Announcement [https://www.idigbio.org/content/webinar-data-quality-usage-and-issue-tracking-using-github Data quality, usage, and issue tracking using GitHub: the view from VertNet] (23 April 2015)
*Filtered PUSH, Kepler Kurator, *Akka (coming in May 2015)
+
**Presentation Slides from Data quality, usage, and issue tracking using GitHub: the view from VertNet
 +
*Webinar Calendar Announcement: [https://www.idigbio.org/content/webinar-towards-user-definable-semi-automated-workflows-curating-biodiversity-data Towards user-definable, semi-automated workflows for curating biodiversity data].(Filtered PUSH, Kepler Kurator, *Akka) May 28th, 2015
 +
**[https://www.idigbio.org/sites/default/files/working-groups/dm/FP-Akka-iDigBio-webinar3.pdf Kurator presentation (pdf)]
 +
**Part 2 of Webinar: designed for IT-oriented folks wanting to install and test please go here http://wiki.datakurator.net/web/iDigBioWebinar_May2015 Follow the instructions and you'll have some opportunities in the second half of the webinar to get input into use of this tool.
 +
*[https://www.idigbio.org/content/improving-data-quality-idigbio-recordset-data-cleaning-method-tools-and-data-flags iDigBio Recordset Data Cleaning tools and flags: where do they come from? how can data providers use them to enhance their datasets?]
 +
**Alex Thompson and Matt Collins, Friday, October 23rd, 2 PM EDT
 +
** [https://docs.google.com/presentation/d/1dLq-aJNrQ81Z0zSnj_79Xnomm0TgK6GarTdLIbFJZbU/edit?usp=sharing Slides available here]
 +
**Check out [https://www.idigbio.org/content/summer-learning-r-clean-data-idigbio-portal-recordset-correction-feature blog post by Heather Appleby and Katja Seltmann] about their experience using the information in the data flags provided by iDigBio. What did they learn? What did we learn at iDigBio? What's next?
 +
*[https://www.idigbio.org/content/webinar-variations-theme-tracking-loans-gifts-sampling-and-more Variations on the theme of tracking loans, gifts, sampling, and more]
 +
**Simon Checksfield, Nicole Fisher, Andrew Bentley, Matt Woodburn, Vince Smith, Christine Johnson, Tiffany Adrain, and Elspeth Haston Friday, October 30, 2015, 21:00:00 (UTC) Friday 5:00 PM (EDT);  Friday 4:00 PM (Kansas City);  Friday 9:00 PM (Edinburgh, London);  Sat 8:00 AM (Sydney)
 +
***[[Media:2015_iDigBioWebinar_Loans.pdf |Christine Johnson, AMNH]]
 +
***[[Media:RBGEWebinariDigBio30Oct2015LoanTracking.pdf |Elspeth Haston, RBGE]]
 +
*[https://www.idigbio.org/content/webinar-shaping-semantic-layer-mining-digitised-data-encounter-between-idigbios-plant Shaping the semantic layer by mining digitised data: an encounter between iDigBio's plant records and the Environment Ontology (ENVO)]
 +
**Dr. Pier Luigi Buttigieg, Max Plank Institute; Tuesday, November 10, 2015 - 9:00am to 10:00am EST
 +
*Announcement: [https://www.idigbio.org/content/webinar-dammed-if-you-do-or-don%E2%80%99t DAMmed if you Do or Don't] : Archiving, what is it anyway? and just what is a DAM?
 +
** PowerPoint [https://www.idigbio.org/sites/default/files/working-groups/DMIg/idigbio-dam-lfg.ppt DAMmed if you Do or Don’t] (ppt) by Larry Gall
 +
** [https://vimeo.com/idigbio/review/146166848/bca77fe5f2 Recording],by Larry Gall (Yale Peabody Museum); Tuesday, 17 November 2015 at 4 PM EST (that's 21:00 UTC).
 +
* DEMO and Webinar Announcement: [https://www.idigbio.org/content/insights-inselect-software-automating-image-processing-barcode-reading-and-validation-user Insights into Inselect Software: automating image processing, barcode reading, and validation of user-defined metadata]
 +
** [[Media:IDigBio_Inselect_Demo.pdf |Insights into Inselect presentation]] (pdf); Tuesday, 29 March 2016 11 AM EDT, 4 PM BST by software developers Lawrence Hudson and Ben Price from the Natural History Museum (NHM) in London.
 +
* from ''Pensoft Publishers'' and ''Biodiversity Data Journal'' Online direct import of specimen records from iDigBio infrastructure into taxonomic manuscripts
 +
**[http://www.idigbio.org/sites/default/files/working-groups/DMIg/iDigBio_Webinar_2.pdf Webinar Presentation (pdf)] by Viktor Senderov - Marie Curie PhD Student at Pensoft, datascience@pensoft.net and Lyubomir Penev - Managing Directory and Founder of Pensoft Publishers, penev@pensoft.net. Recorded 16 June, 2016. 9 - 10 am EDT. ([http://www.idigbio.org/sites/default/files/working-groups/DMIg/iDigBio_Webinar_2.pptx pptx])
 +
*Providing Data to iDigBio - Getting Feedback from iDigBio: Experiencing the Data Life Cycle
 +
** Mare Nazaire (date to be decided after their data is ingested)
 +
 
 +
== Potential Topics ==
 
*Linking specimens, notes, and literature-- what systems have you found that best serve those linkages?
 
*Linking specimens, notes, and literature-- what systems have you found that best serve those linkages?
 +
*More about Archiving Options and Challenges
 +
*Macroalgal TCN using Voice Recognition and OCR output to speed up digitization
  
 
== Relevant Papers and Documents==
 
== Relevant Papers and Documents==

Latest revision as of 01:57, 29 January 2020

iDigBio's Digitization Resources Wiki Home

Data Management Interest Group (DMI)

This page is devoted to resources and discussion for the DMI Group. Keeping up with data requires certain skills and infrastructure. This Interest Group plans to discuss issues surrounding shared data and the help and information the biodiversity community needs in order to ensure, if possible, that the provider has the most up-to-date versions of their own datasets. We intend to provide a forum for discussion, and act as a resource for guidance to point providers toward potential solutions. Do you need help to re-integrate data into your database? Are you able to help, or know of resources? We invite anyone with an interest in this topic to join us and contribute your observations and potential solutions to this challenging topic. Anyone is welcome to join the interest group.

The interest group schedules regular discussion sessions via Adobe Connect for the purpose of sharing techniques, strategies, uses, improvements, and technology associated with re-integrating enhanced data back into a provider's database. Resources, related documents, and discussion notes are stored below. Our first meeting: Webinar 7 August 2014, 1:00 - 2:00 PM EDT.

Dmg poster dp js.jpg

Interest Group Members

DMI Members

Meeting Recordings

Darwin Core Hour Recordings

All Darwin Core Hour resources are on or linked through Git Hub https://github.com/tdwg/dwc-qa/wiki/Webinars

2020 Darwin Core Hours coming soon. Watch this space.

Collaborative Notes and Interest Group Documents

Presentations, Posters, Upcoming Topics

Potential Topics

  • Linking specimens, notes, and literature-- what systems have you found that best serve those linkages?
  • More about Archiving Options and Challenges
  • Macroalgal TCN using Voice Recognition and OCR output to speed up digitization

Relevant Papers and Documents

A specialist’s audit of aggregated occurrence records Robert Mesibov

Relevant Links

Data Carpentry