The Australian Geoscience Data Cube - foundations and lessons learned

Date

2017-04-12

Authors

Lewis, Adam
Oliver, Simon
Lymburner, Leo
Evans, Benjamin
Wyborn, Lesley
Mueller, Norman
Roberts, Dale
Trenham, Claire
Sixsmith, J
Wu, Wenjun

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Abstract

The Australian Geoscience Data Cube (AGDC) aims to realise the full potential of Earth observation data holdings by addressing the Big Data challenges of volume, velocity, and variety that otherwise limit the usefulness of Earth observation data. There have been several iterations and AGDC version 2 is a major advance on previous work. The foundations and core components of the data cube are: (1) data preparation, including geometric and spectral radiometric corrections to Earth observation data to produce standardised surface reflectance measurements that support time-series analysis, and collection management systems which track the provenance each data cube product and formalise re-processing decisions; (2) the software environment used to manage and interact with the data, including a minimal relational model that uses ‘not-only-SQL’ to simplify the process of adding new datasets to the data cube, or to simply ‘reference’ external datasets; and (3) the supporting, integrated, high performance computing - high performance data environment (HPC-HPD) provided by the Australian National Computational Infrastructure which supports both large scale analysis within the NCI, and direct access to data using standards-based web services. A growing number of exemplars demonstrate that the data cube approach allows analysts to extract rich new information from Earth observation time series, including through new methods that draw on the full spatial and temporal coverage of the Earth observation archives. To enable easy-uptake of the AGDC, and to facilitate future cooperative development, our code is developed under an open-source, Apache License, Version 2.0. This open-source approach is enabling other organisations, including the Committee on Earth Observing Satellites (CEOS), to explore the use of similar data cubes in developing countries.

Description

Keywords

Landsat, Time-series, Big data, Data cube, High performance computing, High performance data, Collection management, Geometric correction, Pixel quality, Australian Geoscience Data Cube

Citation

Source

Remote Sensing of Environment

Type

Journal article

Book Title

Entity type

Access Statement

Open Access

License Rights

Creative Commons Attribution 4.0 International (CC BY 4.0)

Restricted until