Geo Engine offers good integration into the RDC. The VAT instance is running fully containerized on the de.NBI cloud and by supporting SSO (Single Sign-On) it allows users to login using their GFBio credentials. A prototypical implementation exists for accessing data from the Aruna Object Storage. Furthermore, the VAT system is connected to the Query Portal (GFBio Search) through the GFBio Data Collection Service.
The easiest way to become familiar with Geo Engine is to take a look at the publicly accessible VAT instance, which is a community-agnostic GIS web application backed by Geo Engine with several biodiversity research related datasets available.
You can also use the Python package, either connecting to the VAT instance's backend, which runs at https://vat.gfbio.org/api/, or to your own backend.
For running your own backend (and frontend) take a look at the backend and frontend GitHub repositories. You can find instructions on how to compile and run them in the respective readme files.
Geo Engine provides many operators which can be applied to spatio-temporal datasets. Several datasets and operator invocations can be combined in workflows to create arbitrarily complex processing pipelines whose results can be further reused through the python API and visualized in the frontend.
In the following, we describe a simple example on how you can combine different datasets and operators in the VAT instance. In this example, you will take elephant occurrence datasets of two distinct species and combine them with a vegetation index dataset to visualize the difference in their habitats.
There is documentation available where you can learn more about Geo Engine's architecture and e.g. all operators available. Also see the Python API reference and the Jupyter Notebook examples.
Since Geo Engine is open-source, you can easily extend and customize it to your own needs. Most relevant for creating new applications are the options to add datasets, data providers and create custom dashboards.
As an admin of an existing Geo Engine instance you can already manage datasets via the python API (https://python.docs.geoengine.io/datasets.html), but you also have the option to extend the backend with entirely new data provider implementations here.
New dashboards can be added to the frontend here, where you can also take a look at existing dashboards. A dashboard can reuse existing core components and thus offers an easy way to only perform customization where needed to create a new dashboard.
Beilschmidt, C., Drönner, J., Mattig, M., & Seeger, B. (2023). Geo Engine: Workflow-driven Geospatial Portals for Data Science. Datenbank-Spektrum, 1-9.