2025-12-16
Quantification of plant trait data from herbarium scans in the DiSSCo Research Infrastructure
Publication
Publication
Research Ideas and Outcomes , Volume 11 - Issue e160367
The Distributed System for Scientific Collections (DiSSCo) is a research infrastructure to integrate European natural science collections (NSCs) digitally. The aim is to facilitate and enhance the access, management and analysis of collection assets in one unified digital collection. The Machine Annotation Services (MAS) are essential components of DiSSCo’s Digital Specimen Architecture (DSArch). These services automate the annotation of digital objects to enable labelling and categorisation of NSC's digital assets. To further advance this, a Machine Learning as a Service (MLaaS) approach was developed which provides researchers with the access to pre-trained machine-learning models for complex tasks, such as instance segmentation and morphological analysis of datasets. MLaaS enhances the DiSSCo’s scalability and flexibility and allows the integration of machine-learning tools in close alignment with the FAIR (Findable, Accessible, Interoperable, Reusable) principles. This study employs DiSSCO's MLaaS framework for the quantitative analysis of herbarium specimens. Machine-learning models, such as Mask R-CNN and YOLO11, are comparatively applied to detect and generate the pixel-level masks of plant organs in herbarium sheets. Subsequently, these models are used to reconstruct the scale in the herbarium sheet and to calculate the surface area of identified plant organs. The determination of quantitative characteristics of plant specimens, such as measuring leaf area or the timestamp of the floral transition, opens up herbarium data for reuse in the large prognosis platforms currently developed in the framework of the Common European Data Spaces. In this way, plant trait data mobilised from natural science collections can improve the predictive capability of the vegetation model components of climate-related data spaces.
| Additional Metadata | |
|---|---|
| , , , , , , , , , | |
| Pensoft Publishers | |
| doi.org/10.3897/rio.11.e160367 | |
| Research Ideas and Outcomes | |
| Released under the CC-BY 4.0 (“Attribution 4.0 International”) License | |
| Organisation | Staff publications |
|
R. Rajendran (Rajapreethi), C. Weiland (Claus), J. Grieb (Jonas), S. Theocharides (Soulaine), Leeflang, S., Addink, W., & Islam, S. (2025). Quantification of plant trait data from herbarium scans in the DiSSCo Research Infrastructure. Research Ideas and Outcomes, 11(e160367). doi:10.3897/rio.11.e160367 |
|