Needs and expectations of DataONE tools:
To assess his latest model, Andreas wants to be able to compare his prediction of maximum spring bloom biomass with what was actually observed. Monitoring data are gathered by the government and are available online, but significant work must be put in to the dataset before it can be used. If DataONE could provide the data in a more usable format, it would save considerable work.
Like most modelers, Andreas hopes that his models can elucidate both the specific biogeochemical dynamics for the area in question as well as be applicable to comparable systems elsewhere. Currently, too much of his time is focused on acquiring and managing the data in the specific context of his research, and he cannot afford to test the applicability of his model to other systems. DataONE could potentially solve that problem, expanding Andreas' research capabilities and revealing his work to a broader base of researchers.
Intellectual and physical skills that can be applied:
Andreas has significant programming knowledge in MatLab and Fortran. He is likely to be able to overcome functional deficiencies in DataONE tools as long as he doesn't have to spend too much time cleaning up the datasets themselves. Andreas' work is likely to illustrate some of the more powerful analytical capabilities gained from sharing datasets via DataONE, but only if he uses the data referencing protocols so that users can track those links to his models.
Technical support available:
Andreas is part of a highly sophisticated technical community, with whom he can work both formally and informally. However, he has no additional technical support for his own work beyond himself.
Personal biases about data sharing and reuse (and data management more generally):
Andreas needs real world data about his area of interest, both to calibrate and to assess his model.
Some of Andreas’s model code is publicly available and open source and he has already published on most of it. However, he is confused as to what he should do with his model output. Normally it just sits on his servers and is used for a couple years by him alone. Other researchers have asked to look at his code, but no one has asked about his model output. Andreas doesn’t think the older model output is very useful, in part because it is difficult for anyone other than him to understand, and would like to delete it to free up server space for new output, but worries that he might be losing.