Difference between revisions of "Data Carpentry"

From iDigBio
Jump to: navigation, search
Line 144: Line 144:
 
*[http://software-carpentry.org/blog/2014/05/our-first-data-carpentry-workshop.html Our First Data Carpentry Workshop] by Karen Cranston
 
*[http://software-carpentry.org/blog/2014/05/our-first-data-carpentry-workshop.html Our First Data Carpentry Workshop] by Karen Cranston
 
*[https://www.idigbio.org/content/tales-data-carpentry-workshop-demand Tales from the First Data Carpentry Workshop] by Deb Paul
 
*[https://www.idigbio.org/content/tales-data-carpentry-workshop-demand Tales from the First Data Carpentry Workshop] by Deb Paul
*[https://datacarpentry.etherpad.mozilla.org/13] EtherPad document for workshop notes
+
*[https://datacarpentry.etherpad.mozilla.org/13 EtherPad document for workshop notes]
 
*[https://github.com/datacarpentry/2014-09-29-iDigBio Data Carpentry Materials on GitHub]
 
*[https://github.com/datacarpentry/2014-09-29-iDigBio Data Carpentry Materials on GitHub]
 
*[http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1003542 Ten Simple Rules for the Care and Feeding of Scientific Data. Goodman et al]
 
*[http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1003542 Ten Simple Rules for the Care and Feeding of Scientific Data. Goodman et al]
 
*[http://faculty.chicagobooth.edu/matthew.gentzkow/research/CodeAndData.pdf Code and Data for the Social Sciences: A Practitioner's Guide. Matthew Gentzkow, Jesse M. Shapiro Chicago Booth and NBER March 10,2014]
 
*[http://faculty.chicagobooth.edu/matthew.gentzkow/research/CodeAndData.pdf Code and Data for the Social Sciences: A Practitioner's Guide. Matthew Gentzkow, Jesse M. Shapiro Chicago Booth and NBER March 10,2014]
 
*[http://dx.doi.org/10.4033/iee.2013.6b.6.f Nine simple ways to make it easier to (re)use your data. White et al.]
 
*[http://dx.doi.org/10.4033/iee.2013.6b.6.f Nine simple ways to make it easier to (re)use your data. White et al.]

Revision as of 13:33, 3 September 2014

This wiki supports the Data Carpentry Workshop to be held at the University of Florida at iDigBio. (Date to be announced). It is the first in a series of four biodiversity informatics workshops planned in the upcoming year (2014-2015).

Digitization Training Workshops Wiki Home

Planning Team

  • François Michonneau (FLMNH - iDigBio)
  • Katja Seltmann (TTD-TCN, AMNH)
  • Matt Collins (ACIS - iDigBio)
  • Dan Stoner (ACIS - iDigBio)
  • Deborah Paul (FSU - iDigBio)
  • Tracy K. Teal (BEACON)
  • Pam Soltis (FLMNH - iDigBio PI)
  • Derek Masaki (USGS)
  • Shari Ellis (iDigBio)
  • Kevin Love (iDigBio)
  • Mike Smorul (SESYNC)
  • Juliet Pulliam (UF)
  • and assistance from Nirav Merchant at iPlant.

Workshop Evaluation

  • link to pre-workshop survey
  • link here at end of workshop

Agenda

  • pre-workshop meeting (online)
    • Software installed?
    • Instructors available
    • Questions?
  • pre-workshop meeting / dinner day before
    • All welcome. Place/time TBA.
Course Overview - Day 1
8:30-9:00 Introductions / Overview / Why Data Carpentry? / How to organize data projects All
9:00-10:00 Better use of spreadsheets, part I Tracy Teal
10:00-10:30 Break
10:30-12:00 Better use of spreadsheets part II Tracy Teal
12:00-1:30 Lunch (with OpenRefine Demo) Deb Paul
1:30-3:00 SQL Introduction Matt Collins
3:00-3:30 Break
3:30-5:00 SQL part II Matt Collins
5:00-5:30 Review / Wrap up for tomorrow
Course Overview - Day 2
8:30-10:00 Introduction to the shell Tracy Teal
10:00-10:30 Break
10:30-12:00 Introduction to R François Michonneau
12:00-1:30 Lunch (with OpenRefine Demo) Deb Paul
1:30-3:00 Manipulating and plotting data in R François Michonneau
3:00-3:30 Break
3:30-4:30 Getting data in and out of R: How to integrate R in your workflow François Michonneau
4:30-5:00 Scaling it up: Demo using the iPlant Discovery Environment (DE) François Michonneau
5:00-5:30 Advanced shell Matt Collins
5:30-6:00 Review / Wrap up / Evaluation and Feedback

Link to Workshop Report

Logistics

Remote Participation

Remote participation will be provided via Adobe Connect: [room URL to be decided]

  • Remote can join those present for notes in (Google Doc) or (MoPad)?

Presentation Documents

  • links to any presentations (like power points) here.

Workshop Recordings

Day 1

  • [8:30am-10:00am]
  • [10:30am-12:00pm]
  • [1:00pm-3:00pm]
  • [3:30-6pm]

Day2

  • [8:30am-10:00am]
  • [10:30am-12:00pm]
  • [1:00pm-3:00pm]
  • [3:30-6pm]

Data Carpentry Resources and Links