13–16 Oct 2025
Brisbane Convention & Exhibition Centre
Australia/Brisbane timezone

Leveraging AI to Automatically Link Controlled Vocabulary Terms in Metadata

13 Oct 2025, 18:00
1h 30m
Brisbane Convention & Exhibition Centre

Merivale St, South Brisbane QLD 410
Poster
Rigorous, responsible and reproducible science in the era of FAIR data and AI
Poster Session

Speaker

Vyacheslav Tykhonov (DANS-KNAW)

Description

Automatically linking controlled vocabulary terms in metadata improves semantic consistency and data interoperability across systems, particularly by connecting terms from sources such as OntoPortal, Skosmos, and Wikidata. This work presents an AI-driven approach that combines Large Language Models (LLMs) with knowledge graph techniques to identify and establish meaningful connections between controlled vocabulary terms. By drawing on the contextual understanding of LLMs and the structural capabilities of knowledge graphs, the method automates the enrichment and alignment of metadata vocabularies. The approach reduces manual curation effort, supports scalable metadata harmonization, and opens new possibilities for intelligent data integration across domains.
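
The abstract does not describe the implementation, so the following is only a minimal sketch of the general idea of linking free-text metadata keywords to vocabulary concepts and recording the result as graph-style triples. Everything in it is an assumption made for illustration: the `Concept` class, the `link_terms` function, the `example.org` URIs, and the 0.8 threshold are hypothetical, and plain string similarity from Python's `difflib` stands in for the LLM-based matching the poster refers to. The predicates `skos:exactMatch` and `skos:closeMatch` are standard SKOS mapping properties.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Concept:
    uri: str     # concept identifier (hypothetical example URI)
    label: str   # preferred label of the concept
    source: str  # vocabulary service the concept came from (hypothetical)


def similarity(a: str, b: str) -> float:
    """Placeholder scorer: case-insensitive string similarity in [0, 1].

    In an LLM-based pipeline, a model call or embedding comparison
    would go here; difflib is only a stand-in for this sketch.
    """
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def link_terms(keywords, concepts, threshold=0.8):
    """Return (keyword, predicate, concept URI) triples.

    Each keyword is linked to its best-scoring concept, if that score
    reaches the (assumed) threshold; otherwise it is left unlinked.
    """
    triples = []
    for kw in keywords:
        best = max(concepts, key=lambda c: similarity(kw, c.label))
        score = similarity(kw, best.label)
        if score == 1.0:
            triples.append((kw, "skos:exactMatch", best.uri))
        elif score >= threshold:
            triples.append((kw, "skos:closeMatch", best.uri))
    return triples


# Hypothetical concepts, as they might be retrieved from two vocabulary services.
CONCEPTS = [
    Concept("https://example.org/vocab/climate-change", "Climate change", "example-skosmos"),
    Concept("https://example.org/vocab/oceanography", "Oceanography", "example-ontoportal"),
]

links = link_terms(["climate change", "Oceanographie", "poetry"], CONCEPTS)
for triple in links:
    print(triple)
```

In this sketch the triples play the role of knowledge graph edges. A keyword with no sufficiently similar concept (here, "poetry") is left unlinked rather than forced onto the nearest term, which is one way to keep automated linking from introducing wrong mappings that would later need manual correction.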

Primary author
