2026-03-17
Integrating the biodiversity genomics continuum: harmonising data from barcodes to reference genomes
Publication
Publication
Research Ideas and Outcomes , Volume 12 - Issue e187033
Biodiversity genomics is converging from historically separate approaches — DNA barcoding and reference genome sequencing — into an integrated digital ecosystem driven by shared data stewardship principles: transparent provenance, persistent identifiers and interoperable repositories. We demonstrate how these workflows can operate within a unified informatics architecture spanning data generation, validation, publication and reuse. We describe coordinated infrastructure, including the European BOLD mirror, ERGA Genome Tracking Console and metadata platforms COPO and PlutoF. These systems employ harmonised validation pipelines, shared metadata standards that bridge the Darwin Core and Genomic Standards Consortium vocabularies and automated data exchange amongst ENA, UNITE and GBIF. Workflows in Galaxy, Nextflow and Snakemake are registered in WorkflowHub as Research Object Crates (RO-Crates), ensuring reproducibility and complete provenance. Key outcomes include comprehensive data flow documentation, automated quality control using BUSCO and ERGA Assembly Reports and robust specimen-to-data linkage. We identify challenges in metadata harmonisation, distributed tracking, collaborative attribution and infrastructure sustainability and provide recommendations for addressing them through existing platforms and emerging RO-Crate standards. This work establishes practical foundations for treating biodiversity molecular data as a continuum, demonstrating how FAIR principles can scale to continental initiatives.
| Additional Metadata | |
|---|---|
| , , , , , , | |
| Pensoft Publishers | |
| doi.org/10.3897/rio.12.e187033 | |
| Research Ideas and Outcomes | |
| Released under the CC-BY 4.0 (“Attribution 4.0 International”) License | |
| Organisation | Staff publications |
|
Heil, Katharina, Alioto, Tyler, Böhne, Astrid, Brown, Tom, Chadwick, Eli, Chua, Physilia, … Vos, R. (2026). Integrating the biodiversity genomics continuum: harmonising data from barcodes to reference genomes. Research Ideas and Outcomes, 12(e187033). doi:10.3897/rio.12.e187033 |
|