TrainingUpcoming remote training

Data Management Training

Build capacity in data stewardship, data science and research reproducibility

Open reproducible research is becoming ever more critical for answering today’s complex questions at the scale and speed needed for solutions. In collaboration with the National Center for Ecological Synthesis and Analysis, DataONE has developed lessons, best practices, and training programs in data management to support research efficiency, productivity, and transparency.

In-Person and Remote Training Workshops

Learn Directly from DataONE Experts

An innovator in data management and infrastructure, DataONE has partnered with NCEAS, leaders in data-intensive synthesis research, to offer access to experienced trainers, phenomenal resources, and an inclusive and interactive learning environment.

Our courses are short but intensive introductions that will build your skills in a variety of data science topics, ranging from the basics of programming in a new language to advanced computing techniques. As active practitioners in advancing the field of data science, our instructors are able to incorporate the latest advancements into the curriculum.

Curriculum At A Glance

Metadata - What is it and how to write a quality data description

Data Modeling - Tidy data for efficient access and storage

Publication - Data publishing, citation and credit

R - Data munging and writing functions in R

GitHub - Working collaboratively in git and GitHub

Workflows - Packages for publishing reproducible research

Data visualization - Working with ggplot and leaflet

GitHub pages - Publishing analytical webpages

RMarkdown - Literate analysis with RMarkdown

Open to researchers and students from any discipline or sector, courses are offered at NCEAS in Santa Barbara, California - and we welcome locals and travelers alike! We can also arrange a customized training at your home institution by request.
Due to COVID-19 our training has moved from in-person to an immersive remote environment
Upcoming Training

Reproducible Research Techniques for Synthesis

Dates: November 12-13 and 17-18
Location: Remote (via Zoom)

This five-day workshop is designed to help researchers stay abreast of current best practices and initiatives and get started on acquiring good data science skills to maximize their productivity, share their data with the scientific community effectively and efficiently, and benefit from the re-use of their data by others.

The course will be held remotely and run on November 12 and 13, break for the weekend, and resume for November 16, 17 and 18. Full details and registraiton information available at the link below.

Resources for Teaching and Self Learning

Support Elevated Data Literacy in Your Community

DataONE lessons and best practices are available through the Data Management Skillbuilding Hub; a repository for open educational resources for use in data management instruction and learning development. The Skillbuilding Hub is a community developed resource with materials applicable across a range of contexts, intended for use by researchers, teachers, librarians, information managers, or anyone who wants to learn or teach better data management practices. All the materials are CC0 licensed and designed for you to adopt and adapt to your needs.