Finalized: Tuesday, August 1, 2017
Author(s): Elag, M. M., Kumar, P., Marini, L., Myers, J. D., Hedstrom, M., and Plale, A. B.
Scientists' ability to synthesize and reuse long-tail scientific data lags far behind their ability to collect and produce these data. Many Earth Science Cyberinfrastructures enable sharing and publishing their data over the web using metadata standards. While profiling data attributes advances the Linked Data approach, it has become clear that building information-networks among distributed data silos is essential to increase their integration and reusability. In this research, we developed a Long-Tail Information-Network (LTIN) model, which uses a metadata-driven approach to build semantic information-networks among datasets published over the web and aggregate them around environmental events. The model identifies and characterizes the spatial and temporal contextual association links and dependencies among datasets. This paper presents the design and application of the LTIN model, and an evaluation of its performance. The model capabilities were demonstrated by inferring the information-network of a stream discharge located at the downstream end of the Illinois River.
Elag, M. M., Kumar, P., Marini, L., Myers, J. D., Hedstrom, M., and Plale, A. B. "Identification and characterization of information-networks in long-tail data collections," Env. Modeling and Software, v.94, 2016, p. 100. doi:10.1016/j.envsoft.2017.03.032
This material is based upon work supported by the National Science Foundation under Grant No. 1440315. Opinions, findings, conclusions or recommendations expressed are those of the authors and do not reflect the views of the NSF.