Loading…
TDWG 2016 has ended

Monday, December 5 • 17:00 - 17:15
IDQ: Integrating Data Quality into Biodiversity Workflows

Sign up or log in to save this to your schedule and see who's attending!

This talk will provide an overview of the Integrated Data Quality (IDQ) software package,its design philosophy, some of the history behind it at iDigBio, and its future as a reference implementation candidate of some of the ongoing harmonization work on data quality in TDWG and GBIF task groups.  IDQ is a software package for building data quality processes to maximize their ability to integrate into diverse workflows. The ultimate goal of the project is to provide a robust set of pre-packaged test, assertion and correction tools that can be utilized by users of all skill levels across a wide variety of biodiversity data.  It grew out of work done on data quality at the iDigBio project and is being spun out of the main code base in order to open up the tools and methods used to a broader community of users. The base library provides tools and interfaces for easily constructing efficient data quality workflows, and separate modules build upon the core to provide the actual library of tests and assertions. It is intended to be usable at all scales, from working on individual records to aggregator sized data processing pipelines and all the steps in between. The code is hosted on iDigBio’s Github organization at: https://github.com/iDigBio/idq .
 


Monday December 5, 2016 17:00 - 17:15
Auditorium CTEC

Attendees (7)