Discover RegenAg-x
The agri-food data space demonstrator for sovereign data sharing and Artificial Intelligence services
This demonstrator is conceived with a significant effort in guidance and use‑case development to support participants who have shown interest but have not had the opportunity or capacity to present their use cases separately for this call.
In addition, it involves a wide and diverse network of participants interested in being part of the demonstrator, broadly representing the different types of actors involved in the agri‑food sector, especially the productive side, since we want to maintain close collaboration with the future Common European Agricultural Data Space (CEADS), which focuses on this sector.
The data space will be governed by an explicit governance code, ensuring greater transparency, with special attention to preserving the fairness of participants and their non‑discrimination, as well as their sustainability.
Participants will play different roles, including data producers and consumers, data consumers and service providers, IT service providers and operators of ecosystem services of the data space. Computing providers will also act as data intermediaries, in particular in the role of trusted third party where information is processed.
Intermediation for data processing will be carried out using compute‑to‑data technology, so that data sovereignty can be guaranteed by design. This service will be contracted with a cloud "data rooms" provider, who will provide the high‑performance environment for compute‑to‑data processing, including Artificial Intelligence processes.
The data space governance code will enable the enactment of access and resource‑usage policies. It will also define mechanisms to create incentives for sharing data and services, as well as for the sustainability of the data space itself. These mechanisms will be based on an electronic Euro that uses distributed ledger technologies, with no real economic value during the demonstration phase, but which will allow all participants to explore possible data economies for the future sustainability of the data space.
The distributed ledger infrastructure used for the monetization of the data space will also allow tracking of the data space transactions and their audit. Based on this ledger, dispute‑resolution mechanisms stipulated in the governance code will also be deployed.
The demonstrator will be based on the FAIR principles to facilitate the discovery and access to the resources shared through it, and to make them interoperable and reusable. To this end, all resources will have descriptions of their characteristics and conditions of use, using semantic technologies that facilitate their automatic processing.
The data space will host both proprietary resources (data and services for processing) shared under conditions of use, and open data sources, especially high‑value datasets that are highly relevant to the agri‑food sector (Chamorro‑Padial, García, Gil, 2024). Especially open data sources relevant to the agri‑food sector such as those generated by the Galileo and Copernicus programs.
The services shared through the data space will include algorithms that implement mechanisms to validate the quality level of the data contributed by participants. These algorithms, for example Exploratory Data Analysis (EDA), can be applied to the data to compute quality metrics while ensuring their sovereignty. This will be due to the very design of the data space, which implements compute‑to‑data mechanisms that can ensure that the owner does not lose control over the data.
To facilitate interoperability and reuse of data and services, the data space will integrate services for transforming data into representations based on semantic technologies that will refer to formalisms such as ontologies of common vocabularies in the agri‑food domain.
The proposed demonstrator will be deployed using components from the Pontus‑X data space ecosystem. These components are open source and any new development or improvement carried out during the project will be shared in the same way as in the Pontus‑X open source code repositories.
Services will be contracted from a company that provides Pontus‑X components to facilitate the integration of the demonstrator with other data spaces that are already underway in this ecosystem. In addition, this company will provide the components in a dedicated manner for the test version of the data space, which will operate independently. These components, both in their shared and dedicated versions, provide the following technologies:
- Compute‑to‑data technologies to guarantee privacy and data sovereignty by design, so that it can be guaranteed that data is processed in a protected and confidential manner. They also enable federated learning.
- Mechanisms to share data, but also data processing services, preferably encapsulated as containers (Docker and Kubernetes) with all their code and dependencies to facilitate their portability and execution in the compute‑to‑data environment. These services include advanced descriptive, predictive and prescriptive analytics tools.
- Distributed ledger technologies that allow tracing all transactions carried out in the ecosystem of data spaces, and therefore across multiple connected data spaces.
- Smart contract technologies that allow implementing monetization mechanisms for data, services and the computational costs of the compute‑to‑data mechanism. Distributed ledger and smart contracts based on the EVM (Ethereum Virtual Machine) standard. They facilitate independence from the underlying technological solution, enabling portability and deployment across different infrastructures.
- Digital Wallet technologies to ensure the sovereignty of participants and the self‑management of attributes related to their identity.