What Is Data Curation?

Data curation, as defined by The University of Illinois’ Graduate School of Library and Information Science: “is the active and ongoing management of data through its life cycle of interest and usefulness.” Sayeed Choudhury, Associate Dean for Research Data Management at Johns Hopkins University (JHU) and leader of the Data Conservancy, further breaks down Data Curation iterative activities to:

Preserving: Collecting and taking care of research data.
Sharing: Revealing data’s potential across domains
Discovering: Promoting the re-use and new combinations of data

According to Alation:

“In practice, data curation is more concerned with maintaining and managing the metadata rather than the database itself and, to that end, a large part of the process of data curation revolves around ingesting metadata such as schema, table and column popularity, usage popularity, top joins/filters/queries. Data curators not only create, manage, and maintain data, but may also be involved in determining best practices for working with that data. Data curators often present the data in a visual format such as a chart, dashboard or report.”

[dv-promo buttontext=’REGISTER FOR OUR DATA CATALOG TRAINING PROGRAM’ buttonurl=’https://training.dataversity.net/learning-paths/odc0-optimizing-your-data-catalog-learning-plan?utm_source=dataversity&utm_medium=inline_ad&utm_campaign=ODC_LP_temp2&utm_content=copy3′]

Other Data Curation Definitions Include:

“Digital curation involves maintaining, preserving and adding value to digital research data throughout its lifecycle.” (Digital Curation Centre)
“The process of “caring” for Data, including to organizing, describing, cleaning, enhancing and preserving data for public use. Through curation the ICPSR (the International Leader in Data Stewardship) provides meaningful and enduring access to data.” (ICPSR)
“A means of managing data that makes it more useful for users engaging in data discovery and analysis.” (Alation)

Businesses Perform Data Curation To:

Enable data discovery and retrieval
Maintain Data Quality
Add value
Provide for data reuse over time
Maximize Access
Leverage human responses towards customized knowledge
Compliment work in Data Governance

Data Curation Processes:

Image used under license from Shutterstock.com

What Is Data Curation?

Other Data Curation Definitions Include:

Businesses Perform Data Curation To:

Data Curation Processes:

What Is AI Governance?

What Is Data Stewardship?

What Is Data Modeling?

Thanks!

What Is Data Curation?

Other Data Curation Definitions Include:

Businesses Perform Data Curation To:

Data Curation Processes:

Related Data Concepts

What Is AI Governance?

What Is Data Stewardship?

What Is Data Modeling?

Lead the Data Revolution from Your Inbox.

Thanks!