Most of the position's activities are those of a systems programmer and technology implementer for the National Resource for Digitization of Biological Collections (iDigBio) project. Design, implement, and support complex ETL mappings to migrate large data volumes from heterogeneous source systems into a central NoSQL data store. Develop and use tools to perform data analytics, data manipulation, and reporting according to data consumer needs. Participate in the design of new or changing data mappings and workflows, evolving the iDigBio data model as data standards are updated and as data growth requires. Produce technical specifications and documentation to communicate effectively with data providers and consumers.
Develop software for data-related cloud middleware and web portals. Design, implement, and maintain storage, infrastructure, platform, and software clouds, including software and hardware selection. Integrate external cloud and distributed data resources with resources developed as part of the projects. Collect and report performance and quality metrics to ensure resources are meeting project goals. Create documentation and software packages to make work usable by other institutions. Train collaborators and end users on the cloud and software resources created. Liaise with developers and users of the biological collection management systems that provide data to iDigBio in order to understand data requirements and functionality, address data transformation issues, develop tools that facilitate data quality improvement, and enable bi-directional data flows.
Coordinate ingestion processes across national and international partner organizations. Interpret iDigBio project needs, stakeholder requests, and PI directives to drive prioritization and triage of tasks and issues. Interface directly with data providers when technical complexity exceeds the capabilities of mobilization staff. Collaborate with national and international organizations as needed to improve data quality and data sharing standards. Collaborate with Partner Projects and potential new partners through in-person and virtual meetings. Manage the Data Ingestion and Mobilization meeting.
Assist in maintaining existing computer, networking, and software infrastructure in the ACIS laboratory. Integrate infrastructure developed as part of the project into the overall resource offerings of the ACIS laboratory. Document best practices and develop technical support materials for ACIS hardware and software.
11:55 p.m. (ET) 16 November 2018