WHAT IS THE OPEN DATA CUBE?
The Open Data Cube (ODC) is an open source solution for accessing, managing, and analyzing large quantities of Geographic Information System (GIS) data - namely Earth Observation (EO) data. It presents a common analytical framework composed of a series of data structures and tools which facilitate the organization and analysis of large gridded data collections. test
The Open Data Cube was developed for the analysis of temporally-rich earth observation data, however the flexibility of the platform also allows other gridded data collections to be included and analyzed. Such data may include elevation models, geophysical grids, interpolated surfaces and model outputs.
The Open Data Cube project is a collection of libraries for working with Earth Observation Data with a consistent interface. The goal of the project is to simplify the loading of this data from variety of sources sources and formats to a simple loading function call. The three core components to enable this vision are ODC-Geo, ODC-STAC, and the Datacube Core.
[describe function]
Datacube Core
Is responsible for maintaining large indexes of geospatial metadata that may be locally or remotely accessible. Simplifies data loading...
Geo is all the geometry transformations for mapping from pixel coordinates to the real world
https://odc-geo.readthedocs.io/en/latest/intro.html
ODC-STAC provides the same loading interface directly from STAC Catalogues
A key characteristic of the Open Data Cube is that every unique observation is kept, which contrasts with many other methods used to handle large gridded data collections. Some of the major advantages of ODC are the following:
Flexible framework
User maintains control and ownership over their data
Paradigm shift from scene-based analysis to pixel based
Lower barrier to entry for remote sensing data analysis.
The Open Data Cube system is designed to:
Catalogue large amounts of Earth Observation data
Provide a Python based API for high performance querying and data access
Give scientists and other users easy ability to perform Exploratory Data Analysis
Allows scalable processing of the stored data on a continental scale
Track the provenance of all the contained data to allow for quality control and updates
The Open Data Cube (ODC) is an open source project born out of the need to better access and manage satellite data. Most raster datasets can be indexed to your ODC, including commercial, in-situ, or derived products.
It has evolved to support interactive data science and scientific computing. ODC will always be 100% open source software, free for all to use and released under the liberal terms of the Apache 2.0 license
ODC ECOSYSTEM GEOSPATIAL
DATA MANAGEMENT & ANALYSIS SOFTWARE
ODC CORE
ODC ALGORITHMS
ODC APPS
SATELLITE DATA
Examples:
-
Landsat
-
Sentinel
-
MODIS
FLEXIBLE DEPLOYMENT
Depending on your application, the Open Data Cube can be deployed on HPC, Cloud, and local installations. Typical installations run on Linux, MacOS, and Windows.
INFORMED DECISIONS
Examples:
-
Deforestation
-
Water Quality
-
Illegal Mining
WHAT CAN IT DO?
The Open Data Cube (ODC) is an open source project born out of the need to better access and manage satellite data. It provides the foundation of several international, regional to national scale data architecture solutions, such as Digital Earth Australia, Digital Earth Africa, and others. The Open Data Cube works well with multiple types of data including CEOS Analysis Ready Data (CEOS-ARD) and data commonly found from other providers. Most raster datasets can be indexed to your ODC, including commercial, in-situ, or derived products.
The Open Data Cube system is designed to:
-
Catalogue large amounts of Earth Observation data
-
Provide a Python based API for high performance querying and data access
-
Give scientists and other users easy ability to perform Exploratory Data Analysis
-
Allows scalable processing of the stored data on a continental scale
-
Track the provenance of all the contained data to allow for quality control and updates
It has evolved to support interactive data science and scientific computing. ODC will always be 100% open source software, free for all to use and released under the liberal terms of the Apache 2.0 license.
The ODC supports a broad range of applications including land, water, cloud, and time series analysis:
-
Mosaic Creation - A mosaic is a composite image created by combining the most appropriate pixels from a collection of source images. A common use case is to create cloud-free images for applications that are not time-dependent. There are a number of approaches.
-
Spectral Products - Spectral Index algorithms calculate the relative magnitudes of wavelength components. The particular wavelength components used determine the product calculated.Common Spectral Indices include NDVI, NDBI, NDWI, and products such as Fractional Cover.
-
Water Mapping - Water Observations from Space (WOfS) - WOfS is an automated water mapping algorithm created by Geoscience Australia. Surface water is detected in satellite images and for each location the number of occurrences is summed through time. The result is a percentage value of the number of times water was observed at the location.
-
Land Classification and Land Change
-
Cloud Statistics - Cloud statistics are valuable information for performing analyses. For example, if there are extensive clouds during a season, it may significantly impact mosaic products or index values. Users may also want a way to find dates when there are few clouds, so that they may assess land features.
-
Time Series Analysis
ODC APPLICATIONS
The ODC core serves as a layer between satellite data providers and applications. A set of open source tools exist to help scientists conduct research using data managed by the ODC. Popular tools used within the community that utilizes the ODC Core as its basis include:
-
Command Line Tools: A tool used by programmers/developers to interface with the ODC.
-
Open Data Cube Explorer: A visual and interactive web application that lets users explore their inventory of available data.
-
Open Data Cube Stats: An optimized means of defining and executing advanced analysis on ODC system. This tool is oriented towards scientists.
-
Jupyter Notebooks: Research documents centered around techniques in EO sciences. A notebook contains executable code detailing examples of how the data cube is used in a research setting, and therefore is an invaluable reference material for new users.
-
Open Geospatial Consortium (OGC) Web Services: Adapters that can connect non-ODC applications to the ODC.