1 post / 0 new
Data discovery use cases in the geosciences: what we can and cannot answer now; a review of tools and the CINERGI approach

Title: Data discovery use cases in the geosciences: what we can and cannot answer now; a review of tools and the CINERGI approach

Names of session leads: 

  • Leslie Hsu (C4P RCN, SEN RCN, CINERGI), Amarnath Gupta (CINERGI, GEAR), Ilya Zaslavsky (CINERGI, GEAR, co-chair of the Architecture WG)
  • Science use case discussants: Bernhard Peucker-Ehrenbrink (EC member), Leslie Hsu, Emma Aronson (co-chair of the EC Science Committee, BiG CZ SSI), Karen Stocks (co-chair of the EarthCube TAC Use Cases WG)

Description: Working with geoscientists from different fields, we developed and deconstructed a number of interesting data discovery use cases. Some parts of these discovery requests can be answered with the CINERGI inventory, but many still cannot. There are several common characteristics for those queries that we can address only partially: they may involve spatial, temporal and/or semantic joins; expect to return datasets, samples, time series or individual observations; require semantic rewriting or reasoning; may require a number of additional processing steps and sub-queries, etc. We will review several query examples and demonstrate CINERGI strategies for extending the inventory of geoscience resources to satisfy them. Please bring your own use cases that you would like to discuss, or participate in the discussion of discovery use cases that have been developed so far. See a list of discovery use cases being discussed within the CINERGI project at https://github.com/CINERGI/UseCases/issues; the ideas come from several geoscience domains including paleogeoscience, hydrology, climate modeling, geochemistry, metagenomics, sedimentology, stratigraphy, and critical zone science.

Tentative structure:

  • Presentation of use cases developed within CINERGI (45 mins)
  • Presentation of CINERGI pipeline and its extensions to new use cases (15 mins)
  • Discussion of additional discovery use cases, from other domains (30 mins)

Target audience: geoscientists, data facility managers, CI researchers, anybody interested in resource sharing and discovery.

Session leads and discussants are committed to attending - but it would be important to schedule the session in the first 1.5 days because of other commitments.