The Australian Geoscience Data Cube - foundations and lessons learned
Date
2017-04-12
Authors
Lewis, Adam
Oliver, Simon
Lymburner, Leo
Evans, Benjamin
Wyborn, Lesley
Mueller, Norman
Roberts, Dale
Trenham, Claire
Sixsmith, J
Wu, Wenjun
Journal Title
Journal ISSN
Volume Title
Publisher
Elsevier
Abstract
The Australian Geoscience Data Cube (AGDC) aims to realise the full potential of Earth observation data holdings by addressing the Big Data challenges of volume, velocity, and variety that otherwise limit the usefulness of Earth observation data. There have been several iterations and AGDC version 2 is a major advance on previous work. The foundations and core components of the data cube are: (1) data preparation, including geometric and spectral radiometric corrections to Earth observation data to produce standardised surface reflectance measurements that support time-series analysis, and collection management systems which track the provenance each data cube product and formalise re-processing decisions; (2) the software environment used to manage and interact with the data, including a minimal relational model that uses ‘not-only-SQL’ to simplify the process of adding new datasets to the data cube, or to simply ‘reference’ external datasets; and (3) the supporting, integrated, high performance computing - high performance data environment (HPC-HPD) provided by the Australian National Computational Infrastructure which supports both large scale analysis within the NCI, and direct access to data using standards-based web services. A growing number of exemplars demonstrate that the data cube approach allows analysts to extract rich new information from Earth observation time series, including through new methods that draw on the full spatial and temporal coverage of the Earth observation archives. To enable easy-uptake of the AGDC, and to facilitate future cooperative development, our code is developed under an open-source, Apache License, Version 2.0. This open-source approach is enabling other organisations, including the Committee on Earth Observing Satellites (CEOS), to explore the use of similar data cubes in developing countries.
Description
Keywords
Landsat, Time-series, Big data, Data cube, High performance computing, High performance data, Collection management, Geometric correction, Pixel quality, Australian Geoscience Data Cube
Citation
Collections
Source
Remote Sensing of Environment
Type
Journal article
Book Title
Entity type
Access Statement
Open Access
License Rights
Creative Commons Attribution 4.0 International (CC BY 4.0)
Restricted until
Downloads
File
Description