You are here
A Reproducible Network Analysis of the DataONE Linked Open Data Graph
Audrey McCombs is a co-major PhD student in Ecology and Statistics at Iowa State University. She holds a Master's degree in philosophy and an MFA in creative writing, and worked for many years in natural resources management before heading back to grad school. Her research interests involve applying complexity science to ecological systems, specifically the study of graph-theoretic characteristics of empirical ecological networks. Originally from the San Francisco Bay Area, she misses the ocean but enjoys the summer thunderstorms and fireflies of the Midwest.
With over 800,000 datasets accessible through programmatic interfaces, DataONE provides a rich corpus of machine readable metadata that is also expressed as a linked open data (LOD) graph. The goal of this project is to explore the LOD graph of DataONE and provide a network analysis on the graph and how the network differs from the content available through the traditional DataONE Application Programming Interface (API). For example: How interconnected are data sets and researchers? How many individual authors contributed to how many data sets? Can fields such as keywords be normalized to a small set of controlled vocabularies? How do network analysis measures differ by metadata standard, year of publication, or other facets?