• Home
  • About
    • Objectives and work packages
    • Team members
    • Our Partners
    • Collaborations
  • Get involved
    • Open positions
    • Events
  • News
  • Publications and outputs
    • Project deliverables
    • Publications
  • Home
  • About
    • Objectives and work packages
    • Team members
    • Our Partners
    • Collaborations
  • Get involved
    • Open positions
    • Events
  • News
  • Publications and outputs
    • Project deliverables
    • Publications
  • September 10, 2025

BIOcean5D data hub: making a holistic vision of marine biodiversity possible

BIOcean5D is dedicated to the collaborative and multidisciplinary exploration of marine life, to advance understanding about how it changes with space, time and human impact​. Achieving that objective requires the collection, integration and harmonisation of a vast amount of complex, heterogeneous and scattered information – made possible by the BIOcean5D data hub.
Credit: Unsplash | Conny Schneider

Presented in a previous article following its launch last year, the data hub operates as a central repository that enables all data used within BIOcean5D to be accessed via a dedicated platform. We caught up with the EMBL team responsible for the conceptualisation of the data hub and the data flows into the hub to learn more about how their work makes BIOcean5D’s holistic exploration of marine biodiversity possible.

“The main challenge of the BIOcean5D data hub is the design of an infrastructure and data upload procedure capable of accommodating the highly diverse range of data used within the project,” explains Kerstin Leberecht, data manager for the Traversing European Coastlines (TREC) expedition. “Among others, the data includes sequencing, imaging and historic data, modelling data outputs, acoustic recordings, in addition to a wide range of physical, chemical and environmental context data.”

Development of the BIOcean5D data hub was carried out in parallel to an equivalent data hub dedicated to the TREC – Tara Europa expedition, with the design of both platforms supporting significant data flow between the projects. “The data integrated into the BIOcean5D data hub from TREC – Tara Europa includes aerosol, sediment and shallow water samples collected on land by EMBL’s mobile laboratories, together with water samples collected in parallel from further at sea by the Tara Ocean Foundation’s schooner Tara,” explains Kerstin.

Credit: Kogia | Sumer Verma

In addition to the diversity of data types and formats, the BIOcean5D hub also needs to ensure compatibility of data originating from a variety of sources. Firstly, from the 31 partners across 11 countries involved in the project. But furthermore, from the combination of newly-generated data with existing historical datasets and archives from European marine stations, recent major ocean biodiversity surveys, as well as relevant data from a scattered network of previous and ongoing EU, international, national and local projects and citizen science initiatives.

Compliance must also be ensured with international data management standards, in particular, the Open Science (OS) framework, FAIR (Findable, Accessible, Interoperable, Reusable) principles and regulations concerning data protection. “These requirements mean that each dataset submitted to the data hub must be accompanied and enriched by well-curated and detailed metadata,” explains Matej Trojak, Planetary Biology Biocurator at EMBL. Metadata enriches raw data with descriptive (such as how and why the data was created) or contextual information (within the context of TREC, for example, where the sample was collected). The difference between data and metadata, however, is subtle and context-dependent. “Temperature, for example, could be considered data or metadata depending on the research context,” explains Matej. The diversity of data and number of partners further complicated the management of metadata submission and, as a result, the system developed for the BIOcean5D data hub underwent multiple cycles of refinement and optimisation.

Credit: Kogia | Sumer Verma

The overarching ambition of collecting, integrating and harmonising such a diverse range of new and existing biodiversity data is to enable a holistic exploration and understanding of marine biodiversity through multidisciplinary collaboration. “We’ve worked to ensure the data hub is user-friendly, intuitive and as seamless as possible, which has required careful balancing between compliance and usability,” explains Kerstin. Dedicated data hub training sessions have also been organised. “These sessions have proved to be a valuable source of feedback and inspiration for further development, helping us to identify and prioritise improvements.”

With the data hub now up and running, with continuous technical improvements and integration of ever more data, “The rest is now up to the scientists!” explains Anthony Fullam, Senior Bioinformatics Software Engineer at EMBL. “Most data hubs specialise in one particular type of data, there aren’t many that integrate such a range and scale of data. The analyses made possible by connecting and combining the information available in the BIOcean5D data hub could be quite powerful and unique.” The immense potential of collaborative data sharing has not gone unnoticed! “We’re getting very positive feedback about the data hub as well as requests to help set up similar platforms for other major international research projects,” explains Kerstin.     

It is planned that the BIOcean5D data hub will become publicly accessible with continued maintenance to support ongoing access, usability and the generation of insights to guide the protection and restoration of our Ocean.

Read more about the BIOcean5D data hub here

Recent posts

Lamprey + citizen power to reveal plankton diversity

The Lamprey and Curiosity microscope are two invaluable tools used by BIOcean5D to harness the immense potential of citizen science, improve ocean literacy, engage and empower the general public and increase awareness of the immense challenges facing our Ocean. Following on from the Curiosity microscope last week, discover the Lamprey here!

Bringing plankton discovery inland with the Curiosity microscope

The Curiosity microscope and Lamprey are two invaluable tools used by BIOcean5D to harness the immense potential of citizen science, improve ocean literacy, engage and empower the general public and increase awareness of the immense challenges facing our Ocean. Discover the Curiosity microscope here, followed by the Lamprey next week!

A decisive moment for the future of our Ocean

World leaders, political decision-makers and marine science experts gathered in Nice, France this month for the largest Ocean summit ever organised.

BBNJ: protecting biodiversity in the Wild West of the high seas

Recently published policy briefs provide recommendations to support the rapid and effective implementation of the BBNJ Treaty and activate unprecedented protection of our Ocean and its biodiversity.

Share this story

Back to all news

PrevPreviousLamprey + citizen power to reveal plankton diversity

Dive deeper

Back to all news

BIOcean5D data hub: making a holistic vision of marine biodiversity possible

This website is co-funded by the European Union (GA#101059915). Views and opinions expressed are however those of the author(s) only and do not necessarily reflect those of the European Union. Neither the European Union nor the granting authority can be held responsible for them.

This work is supported by the UK government Horizon Europe Guarantee, UKRI Grant Reference Number 10039266.

This work has received funding from the Swiss State Secretariat for Education, Research and Innovation (SERl) under contract #22.00255.

Join our newsletter

Keep up to date with our latest news and opportunities to get involved. 

We are grateful to our friends at Kogia for access to their beautiful photo and video gallery.

Contact us

Linkedin X-twitter

© 2024 | Privacy policy