Finalized:Wednesday, April 26, 2017
Author(s):Peckham, S. D., M. Stoica, E. Jafarov, A. Endalamaw, and W. R. Bolton
Modern geoscientists have online access to an abundance of different data sets and models, but these resources differ from each other in myriad ways and this heterogeneity works against interoperability as well as reproducibility. The purpose of this paper is to illustrate the main issues and some best practices for addressing the challenge of reproducible science in the context of a relatively simple hydrologic modeling study for a small Arctic watershed near Fairbanks, Alaska. This study requires several different types of input data in addition to several, coupled model components. All data sets, model components and processing scripts (e.g., for preparation of data and figures, and for analysis of model output) are fully documented and made available online at persistent URLs. Similarly, all source codes for the models and scripts are open source, version controlled, and made available online via GitHub. Each model component has a Basic Model Interface to simplify coupling and its own HTML help page that includes a list of all equations and variables used. The set of all model components (TopoFlow) has also been made available as a Python package for easy installation. Three different graphical user interfaces for setting up TopoFlow runs are described, including one that allows model components to run and be coupled as web services.
Peckham, S. D., M. Stoica, E. Jafarov, A. Endalamaw, and W. R. Bolton (2017), Reproducible, component-based modeling with TopoFlow, a spatial hydrologic modeling toolkit, Earth and Space Science, 4, 377–394, doi:10.1002/2016EA000237.This material is based upon work supported by the National Science Foundation under Grant No. 1440332, 1440333. Opinions, findings, conclusions or recommendations expressed are those of the authors and do not reflect the views of the NSF.