Significance of the Issues
Australia’s healthcare system generates a vast amount of data, however, data systems are highly fragmented, with information captured across diverse and often incompatible systems. This lack of interoperability creates major barriers to the integration and analysis of health data at scale, limiting the nation’s ability to conduct efficient, multi-centre...
Session Title:
Building an Open Data Collaborative Network in Asia-Oceania
Session Contents:
One of the primary objectives in the current Open Science movement is to initiate new research fields and technologies leveraging the vast amounts of data now being generated across numerous scientific domains. Adherence to the FAIR Principles for data has been recognized as a standard for...
Enhancing data literacy is important for Aboriginal and Torres Strait Islander Peoples because it essential for effective engagement and communication with health services, self-determination, personal health management and Indigenous Data Sovereignty. Data literacy incorporates accessing, using and sharing data in order to make decisions within personal and professional spheres and it is a...
Significance of the issues to be tackled:
Earth and environmental (E&E) datasets have a critical role to play in the Sustainable Development of our Planet. They contribute to the prediction of natural hazards, effective development of our natural resources, long term monitoring of vegetation changes, etc., and underpin many UN Sustainable Development Goals. Since the 1990s there have been...
This session will be conducted in a panel format and explore the central question “How is data empowering Indigenous communities?” It will bring together 5 speakers from diverse backgrounds across Australia and Canada, to present for 10-15 minutes each, followed by a facilitated discussion and Q&A with the audience.
The panel members will offer their own perspectives on the use of data in...
As part of International Data Week, we (the 3 Early Career Researcher networks of the hosting organisations, CODATA Connect, WDS ECR Network and RDA Early Career and Engagement IG) are organising a dynamic 3-minute scientific research pitch competition targeted at early career researchers. This event will provide a platform for emerging scientists to showcase their innovative research,...
Research funding organizations in the Asia Pacific are moving forward to develop support for research data management (RDM), especially for data-intensive research. At the same time, researchers and students are becoming more proactive in sharing datasets to enhance research visibility and impact. The convergence of top-down data policies and bottom-up initiatives is shaping a culture of data...
Overview
The [NIH Office of Data Science Strategy][1] launched the [Generalist Repository Ecosystem Initiative][2] (GREI) in February 2022, in recognition of the key role generalist repositories play in the NIH data sharing landscape to support the FAIR sharing of data and other research outputs. The GREI program represents a groundbreaking collaborative model that brings together seven...
The [Australian Reference Genome Atlas][1] (ARGA) is a next-generation platform designed to index, connect and expose genomic data for Australia’s mega-biodiversity. It sits at the interface between the twin problems of genomic data discovery in an age where data are rapidly proliferating, and the crisis in documenting and understanding Australia’s vulnerable biodiversity. More than 80% of...
A Thematic Research Data Commons is a vehicle for the ARDC and our national partners to collaboratively develop and deliver sustainable digital research infrastructure on a national scale. It is enabling us to best meet the needs of our diverse national research communities in a strategic and comprehensive way.
The Planet Research Data Commons (Planet RDC) is delivering enduring digital...
Background
The digitalisation of Electronic Health Record (EHR) data has unlocked unique opportunities for research. Unlike administrative datasets, EHRs provide granular clinical data, real-time updates within systems, and access to detailed clinical notes. Despite these advantages, EHR data—primarily collected for operational purposes—remains siloed, lacks standardisation between systems,...
The Aotearoa Genomic Data Repository (AGDR, https://data.agdr.org.nz/) provides secure within-nation storage, management, and sharing of non-human genomic data generated from biological and environmental samples originating in Aotearoa New Zealand. Te ao Māori, the worldview of the Indigenous people of Aotearoa New Zealand, recognises all living entities as taonga (treasured or precious) that...
Overview
Understanding unnecessary data, known as Dark Data is a major operational challenge in large-scale shared storage. We propose an alternative approach that leverages HPC workflow tools to collect extended metadata at each stage of job execution, minimizing the need for fundamental system changes.
Background
The High Performance Computing Infrastructure (HPCI) project was...
Founded in 1966, CODATA is the Committee on Data of the International Science Council (ISC). CODATA’s vision “is of a world in which science is empowered to address universal challenges through the transparent, trustworthy and equitable use of data and information” [CODATA 2025]. CODATA’s mission “is to connect data and people to advance science and improve our world” [CODATA 2025]. CODATA...
Indigenous Data Sovereignty (IDSov) is a global Indigenous developed and led philosophy asserting Indigenous rights in data. In Australia, an Anglo-colonised state, IDSov has increasingly been identified and communicated as a high priority for the nation state. The nation state has committed to supporting ‘data sharing’ through a National Partnership Agreement for Closing the Gap by providing...
The increasing digitalization of science, coupled with the push for Open Science and FAIR data (Findable, Accessible, Interoperable, Reusable), presents significant challenges for managing diverse research outputs effectively throughout their lifecycle. Traditional Data Management Plans (DMPs) often lack the detail and machine-actionability needed for dynamic research processes, while Current...
Agriculture Victoria Research (AVR) undertook a project for multimodal data analysis and anomaly detection for beehive health to address the critical challenge of determining hive health comprehensively in a pollination environment where diverse modalities of data are at interplay. Honeybee populations play an essential role in pollination within Victoria's horticulture industry, making the...
Fragmented health data systems across Africa perpetuate inequities in crisis response and research participation, echoing colonial legacies of extractive data practices. In a decisive move toward sovereignty, African Ministries of Health and national data custodians are advancing federated approaches that retain local governance while enabling cross-border collaboration through standardized...
Academic institutions often hold large volumes of unstructured text data—such as chat transcripts, research publications, and strategic documents—but may lack accessible methods to analyze and interpret these resources effectively. This presentation shares how Singapore Management University (SMU) Libraries leveraged BERTopic, an AI-driven topic modeling tool for text clustering, along with...
This study presents the first findings of the project: Open science and the management of traditional and scientific knowledge: A case study of the Takinahakỹ Center for Indigenous Higher Education at the Federal University of Goiás, Brazil. The main aim is to improve the capacity of indigenous students to effectively manage and safeguard the records of traditional and scientific knowledge...
About Data Commons and Data Meshes
A data commons is a cloud-based software platform with a governance framework that enables a research community to manage, analyze and share its data. A data mesh is a collection of two or more data commons, cloud-based computational resources, and other cloud-based resources that interoperate using a common set of core software services and a hybrid...
Australia is a massively multilingual country, in one of the world’s most linguistically diverse regions. Significant collections of this intangible cultural heritage have been amassed, including collections of Aboriginal and Torres Strait Islander languages, Australian Englishes, and regional languages of the Pacific, as well as collections important for cyber-security and for emergency...
In response to inefficiencies in governmental regulation, such as excessive regulation, an overabundance of laws and procedures, lack of flexibility, and disregard for the costs, countries around the world began efforts to optimize regulation, in part by privatization. The trend toward privatization of social services necessitated substantial development of governmental supervision practices....
Introduction to the Africa PID Alliance and the Digital Object Container Identifier (DOCiD ™️)
The Africa PID Alliance is an Open Infrastructure program of the Training Centre In
Communication that is focused on producing African African-originated persistent Identifiers designed to enhance the representation, sovereignty, and visibility of African research output globally. This...
Data sharing is considered one of the effective practical means to enhance research transparency. Data repositories are pivotal e-infrastructure in fostering this. As a generalist data repository, Science Data Bank (ScienceDB) offers free services to the global community for sharing and dissemination of non-traditional research outputs, such as datasets and codes. It has been built and...
The EU-funded [OSTrails project][1] is advancing a federated approach to Open Science by addressing a key challenge: the fragmentation of research data management (RDM) practices across disciplines, tools, and institutions. By building a network of interoperable services for planning, tracking, and assessing research activities, OSTrails promotes reproducible, FAIR-aligned, and responsible...
Our university is built on the traditional and contemporary homelands of the Dakota people, a federally recognized Tribal Nation made up of four communities and their sovereign governments. We recognize the importance of acknowledging the People on whose land we live, learn, and work, but understand that words are not enough. Within our institutional data repository, we seek to improve and...
With the pace of research accelerating into the age of Quantum computing and AI, data mobility has become the lifeblood of modern scientific research. As unprecedented volumes of research data are being created at increasing speed, it is imperative that data can be easily moved and shared to be findable and accessible. Yet achieving data mobility in scientific research faces significant...
Vertical Federated Learning (VFL) has emerged as a transformative approach in collaborative machine learning, enabling multiple parties to jointly train models while maintaining data privacy through vertical partitioning of features. This paradigm has gained significant traction in privacy-sensitive domains such as healthcare and finance, where different organisations possess distinct feature...
Addressing the requirements for transparent, cost effective and high-impact research in this era of big data and cross-disciplinary research will require significant community changes to how research software is created, managed and maintained. In this talk I will introduce Astronomy Data And Computing Services (ADACS): a highly successful initiative of the Australian astronomy community...
Many current solutions for data management are expensive or require considerable technical underpinnings (or both). The global data community needs to consider simpler approaches in order to include more participants and to improve equity, but this requires guidance about minimal requirements. The Protocols for the Implementation of Archival Repository Services are an attempt to start the...
The Australian National Persistent Identifier (PID) Strategy is a critical national initiative that aims to accelerate Australian research quality, efficiency and impact through universal use of connected persistent identifiers. It supports a vision where researchers, institutions, and infrastructures are connected through a universal, trusted, and interoperable system of PIDs. This strategy...
The Health Studies Australian National Data Asset (HeSANDA), led by the Australian Research Data Commons (ARDC), is building national research infrastructure to enhance the discoverability, access, and reuse of data from health research studies across Australia. HeSANDA was established as a response to the critical need for more accessible and interoperable health research data.
The...
As demand grows to improve health outcomes for Aboriginal and Torres Strait Islander clients, health practitioners and researchers are increasingly embedding Indigenous perspectives through co-design and applying Indigenous data governance frameworks. This paper shares the preliminary reflections from a research program co-led by a Queensland-based Indigenous community-controlled organisation...
The COVID-19 pandemic has altered how health data is regarded and was a distinct driver for change. The need for rapid analysis and assessment of health data at scale brought sharp focus to the challenges, highlighting the importance of Findable, Accessible, Interoperable and Reuseable (FAIR) data. The heterogeneous nature of health data, together with the wide array of systems and associated...
This presentation explores ways of improving researchers’ data skills by creating an environment that engages learners, helps them form networks and gives them greater control over what happens. It draws on three years’ experience facilitating a national research data skills summer school for the Australian Research Data Commons (ARDC). The presentation is suitable for anyone who mentors,...
In the current data-driven era, open government data serves as a catalyst for open innovation and the development of value-added services, while also promoting governmental transparency. These attributes collectively contribute to advancing the United Nations' Sustainable Development Goals (SDGs), a set of 17 objectives outlined in the 2030 Agenda for Sustainable Development, which encompass...
National metrology institutes (NMIs) serve to establish measurement standards for their respective nations and disseminate these standards to end users, including various industries. Through this process, NMIs facilitate freedom in economic activities by overcoming technical trade barriers through compliance with the International Committee for Weights and Measures Mutual Recognition...
Many universities have adopted the use of Data Management Plans (DMPs) for research teams to outline how their research data will be handled both during and after a project. DMPs support responsible data management in accordance with the FAIR principles: Findable, Accessible, Interoperable, and Reusable. The objectives are to contribute towards research integrity, reproducibility, and...
This study focuses on intelligent data mining and knowledge discovery services, addressing the critical challenges researchers face in processing massive datasets and optimizing experimental schemes. We propose an innovative solution integrating knowledge graph and artificial intelligence technologies, with a specific application to thermoelectric domain through the development of a...
The Taiwan Gateway to Health Data (GHD TW) is a government-funded project that collaborates with all primary data custodians and data controllers in Taiwan. We establish a data portal for various data users, including industrial and academic researchers, to promote community health and clinical and biomedical research. Our primary responsibility is to provide services that enhance the...
The RO-Crate (Research Object Crate) specification (Sefton et al. 2023) is a method for describing data sets with rich, interoperable Linked Data metadata. This presentation will show how we, the Language Data Commons of Australia project (LDaCA), use well described RO-Crate data packages (Soiland-Reyes et al. 2022) to enable CARE (Carroll et al. 2020) and FAIR (Wilkinson et al. 2016)...
With the onset of the Open Science movement, research sites and clinical research sponsors are becoming increasingly entrusted with the storage of large amounts of research data and samples. The prospect of sharing a wide array of health data is an exciting one, as the collaboration of ideas and the expansion of shared knowledge promises to lead to accelerated research outcomes. However,...
The Cross Domain Interoperability Framework (CDIF) provides a set of implementation guidelines designed to lower the barriers to cross-domain research data reuse. CDIF provides standards and methodologies for addressing interoperability issues preventing cross-domain research data utilization. CDIF’s initial version comprises five core profiles: Discovery, Access, Controlled Vocabularies, Data...
Background
This study investigated mental health research data analysis across institutions where privacy of data is key, regulatory restrictions, and variation in how data is structured and stored. These limitations are especially pronounced in resource-constrained settings and federated data analysis offers a promising solution. The Observational Medical Outcomes Partnership Common Data...
The Open Science landscape today is full of good intentions, grand infrastructures, and endless new portals. Yet for many researchers, the daily reality has changed very little. Managing data, publications, and workflows remains fragmented, bureaucratic, and painfully slow. FAIR principles are widely endorsed, but in practice, they are often difficult and time-consuming to implement. The...
Over the past decade, open science has moved from the margins to the mainstream. Yet for many researchers, putting open science into practice remains challenging — due to disciplinary norms, fragmented support systems, and tools that prioritize policy over usability. At Springer Nature, we ar evolving our approach to open science support by embedding FAIR principles into product design and...
Introduction
Citizens' juries and assemblies are increasingly popular public participation methodologies for deliberation on data and artificial intelligence. They are formal, top-down exercises that aim to address power imbalances in the design, application, or regulation of data and AI processes through allowing residents to debate and provide recommendations on specific questions. In March...
The International Meridian Circle Program (IMCP) represents a pivotal international initiative aimed at advancing coordinated space weather observations to address critical global scientific challenges and enhance operational applications. Effective data governance, sharing, and utilization form the cornerstone for transforming multi-national observational synergies into scientific...
Douala General Hospital is a first-class hospital in Cameroon where we meet a
multidisciplinary medical team treating several thousand patients each year. This hospital hosts
numerous patient records that may be useful for public health research. However, majority of
these records are paper-based, hence limiting their exploitation. For some cases, particularly
the pulmonology department,...
Data repositories recognize that the CARE Principles (Collective Benefit, Authority to Control, Responsibility and Ethics) and data sovereignty are integral when working with indigenous communities, but it can be difficult to put words into action. Ocean Networks Canada (ONC) has been working with Local Contexts to address this gap.
ONC has been working on integrating Local Context Labels...
Australia is one of the most food secure countries in the world. However, long-term strategies are needed to ensure Australia has a resilient and sustainable food industry that maintains its ability and reputation for delivering high-quality food nationally and internationally.
The Australian Research Data Commons (ARDC) established its Food Security Data Challenges program to support the...
Polar Environment Data Science Center (PEDSC) of the Joint Support-Center for Data Science Research (DS), the Research Organization of Information and Systems (ROIS) aimed to promote opening and sharing scientific data obtained by research activities in polar regions. Its purpose is to strengthen collaboration with universities and other communities, and to support creation of further...
The Australian Internet Observatory (AIO - https://internetobservatory.org.au/) has been funded by the Australian Research Data Commons (ARDC – www.ardc.edu.au) in 2024 to support large-scale access to and use of social media and digital data more broadly by Australian researchers. A core part of the AIO is the Australian Internet observatory Research Dashboard (AIReD -...
To address the pressing challenge of capturing complex non-linear structures in semi-supervised multi-view clustering, we introduce a fundamentally novel framework:Label Propagation Assisted Soft-constrained Deep Non-negative Matrix Factorization for Semi-supervised Multi-view Clustering (LapSDNMF). Unlike prior approaches,LapSDNMF innovatively integrates deep hierarchical modelling with label...
With the rise of cyber threats, automating Named Entity Recognition (NER) in open-source documents is crucial for Cyber Threat Intelligence (CTI). However, cybersecurity NER models face challenges in maintaining large annotated datasets due to the ever-evolving threat landscape. To address this, we introduce LETNER, a label-efficient NER framework that balances performance and annotation...
Automatically linking controlled vocabulary terms in metadata enhances semantic consistency and improves data interoperability across systems—particularly by connecting terms from frameworks such as OntoPortal, Skosmos, Wikidata, and others. This work presents an AI-driven approach that leverages Large Language Models (LLMs) in combination with knowledge graph techniques to identify and...
The field of corpus linguistics has revolutionised linguistic research by providing data-driven insights into the structure, usage, and evolution of languages. By leveraging large-scale text corpora, researchers can uncover patterns in grammar, vocabulary, syntax, and language use that are not easily observable through traditional methods (Omarova et al., 2025). This data-driven approach...
Introduction:China has rich geographical and biological diversity, and cultural resources, which have enriched people’s lives.Its diverse and complex geographical environment has given rise to a wealth of geographical products.Geographical Indications (GIs) products hold significant importance in promoting agriculture, enhancing product quality, preserving cultural heritage, and driving...
Effective research data management (RDM) is essential for ensuring that data adheres to the FAIR principles of Findability, Accessibility, Interoperability, and reusability. In this session we will examine how these principles drive the life cycle of metagenomic data at the Uniklinikum University in Aachen, from data generation to long-term storage and reuse.
The process begins when...
Due to the permafrost develops at certain subsurface depths, it cannot be directly observed by remote sensing, and ground-based surveys are costly. As a result, there remains considerable uncertainty in our current understanding of permafrost distribution. This study employs an ensemble simulation approach using multiple machine learning models, integrating the most comprehensive international...
The Macro View, reported at IDW2023, set out to estimate the national scale of research data that is under management for the purpose of future access in Australia and New Zealand. Two key observations can be made:
- The participating institutions lacked internal reports on data as an
asset, from which a total could be easily aggregated. Instead one
off measurement tasks were...
Systemic reform in science continues to face a collective action problem: researchers agree that contributions such as data sharing, peer review, software development, and community engagement are essential, yet these remain structurally undervalued in current incentive systems. Although the Open Science movement has promoted greater transparency and expanded recognition, uptake of alternative...
Dataspaces can unlock the potential of data-intensive research by enabling trusted data sharing. This session explores the challenges and opportunities facing organisations and researchers as they navigate the adoption of trusted sharing using dataspaces.
Bringing together key stakeholders, including researchers, policymakers, and infrastructure providers, this forum will identify barriers,...
In Australia, the interest in Indigenous Data Sovereignty (IDSov) has increased over the past decade. There are now emerging competing interests between the state as holders of vast Indigenous data assets and how these data are governed and Indigenous communities to drive their agenda on Indigenous data priorities including data sharing from the state to communities and state actors...
The Intergovernmental Panel on Climate Change (IPCC) has been providing climate Assessment Reports (ARs) since 1988, which document the state of climate change and future projections under various options for action. These ARs form the basis of international agreements and actions. UN Secretary-General Guterres described climate action as “the 21st century's greatest opportunity to drive...
The Terrestrial Ecosystem Research Network (TERN) is Australia's national collaborative research Infrastructure for long-term environmental monitoring, data-driven ecological research, and evidence-based decision-making. TERN provides an integrated, standardised, and openly accessible data infrastructure that facilitates collecting, curating, analysing, and distributing high-quality ecological...
Introduction: Proper data management is essential to ensure the integrity, transparency, and reuse of data in scientific research. Objectives: This study aimed to investigate the practices and perceptions related to data management among researchers at a university hospital in southern Brazil. Methods: This was a cross-sectional, exploratory study. Consecutive sampling was used to...
Whole-slide images (WSIs) drive state-of-the-art computational pathology, but hospitals typically restrict their analysis to isolated, air-gapped workstations because these gigapixel slides contain highly sensitive patient data. On such systems the workflow for a single case is onerous: (i) technicians copy the multi-gigabyte WSI to a removable medium and walk it to the secure workstation;...
Dengue surveillance in many countries still relies on a simple outbreak rule: declare an alert when reported cases exceed the historical mean + 2 standard deviations. Although easy to apply, this cut‑off does not adapt to changing transmission patterns, generates frequent false positives, and ignores forecast uncertainty. We develop and evaluate risk‑based outbreak thresholds that incorporate...
Against the global backdrop of "data sharing and reuse", publishing scientific data as a formal research output—compared to traditional data submission or repository archiving—proves more effective in advancing open data sharing and academic recognition. Currently, data publishing remains in its early exploratory stages in China and worldwide, with dedicated scientific data publishing journals...
Since 2015, China has introduced policies and regulations to encourage the sharing of scientific data. Among them, the publication of scientific data is an important part of scientific data sharing. The publication of scientific data includes two parts: data paper and dataset, in which the data paper is published as journal paper, and the dataset is published by registering DOI or e-journal on...
RADAR, developed and operated by FIZ Karlsruhe – Leibniz Institute for Information Infrastructure, provides a robust and versatile research data repository designed to facilitate adherence to FAIR principles—ensuring data is Findable, Accessible, Interoperable, and Reusable. Since its inception, RADAR has evolved significantly, offering enhanced services tailored to diverse research...
The presentation will discuss the importance of relationship building between the university researchers and local Aboriginal community organisations as part of a national research project. The project from the University of Melbourne is led by Distinguished Prof Marica Langton included the current presenters who are members of the research team.
The research project was codesigned by...
This study investigates the current research status of trustworthiness evaluation in China through literature review and web-based surveys, revealing a lack of tailored evaluation frameworks and practices specifically targeting scientific data management platforms in Chinese universities. Building upon the FAIR principles (Findability, Accessibility, Interoperability, and Reusability), this...
Modern life science research increasingly relies on complex data analysis, demanding robust bioinformatics tools, substantial computational resources, and specialised expertise. The German Network for Bioinformatics Infrastructure (de.NBI) addresses these challenges as a national, academic research infrastructure funded by the German Federal Ministry of Education and Research (BMBF) and...
In the context of the digital economy, data has become an important strategic foundational resource and economic growth engine for a country. China is the first country to elevate data to the level of production factors in its policy system, and officially listed data as another key production factor after land, labor, capital, and technology in 2019. When the transformation of production...
The Australian Research Data Commons (ARDC) funded Bushfire Data Commons (BDC) established a range of projects focused on an ever increasing problem to Australia: bushfires. These projects are highlighted in https://ardc.edu.au/program/bushfire-data-challenges/. One of the ARDC funded projects focused on establishment of a front end dashboard to showcase the data sets and tools arising from...
Artificial intelligence (AI) offers powerful potential to address pressing challenges in transboundary water management, especially in regions with insufficient infrastructure for in-situ water quantity and quality monitoring and modeling. However, the successful application of AI in this context depends on more than algorithmic accuracy and can be challenging to achieve even in a system with...
Crisis management plays a role in achieving a sustainable and resilient future by preparing governments and communities to effectively respond to and recover from disruptions . Crisis management generates large volumes of heterogeneous data, including spanning structured databases, unstructured government reports, real-time news reports, and social media channels. Despite such an availability...
Background: Antimicrobial resistance (AMR) is a growing concern in agribusiness sectors with serious consequences to productivity and public health. A data centric approach is needed to support Australian agribusinesses and water sectors to understand the impact of antimicrobial usage on the emergence of resistance for diseases that farmers are faced on a daily basis. The SAAFE CRC...
As digital health technologies proliferate, the potential to harness real-world data (RWD) for improving healthcare outcomes grows dramatically. However, the realization of a truly responsive Learning Health System remains hindered by the complexities surrounding health data sharing. These complexities span technical, legal, regulatory, financial, organizational, and ethical domains and are...
Background: Antimicrobial resistance (AMR) is a growing concern in agribusiness sectors with serious consequences to productivity and public health. A data-centric approach is needed to support Australian agribusinesses and water sectors to understand the impact of antimicrobial usage on the emergence of resistance for diseases that farmers are faced on a daily basis. The SAAFE CRC...
Defined as the presence of any infectious microorganisms in the bloodstream, bloodstream infections (BSIs) pose a major threat to public health. BSI is an important complication that may affect the recovery time, treatments of injured patients. The studies on patients with injury-related BSIs report data from single or selected hospitals. No population-based studies have been conducted on...
In a context where the amount of data is doubling every three years, there is an urgent need to develop and define policies and harmonised practices for research data at an institutional level, connected to national strategies and international frameworks. The aim of this paper will be to demonstrate how international, national and institutional levels can be connected, through the example of...
The World Data System (WDS) is an affiliated body of the International Science Council (ISC) that supports research data repositories and data service providers worldwide. The WDS Early Career Researcher (WDS-ECR) Network is dedicated to nurturing, advancing, and strengthening the capacities of early career scientists within data-centric fields. The network has ten overarching goals as part of...
Introduction
This workshop will examine how the evolving architectures of data policy and AI governance challenge traditional notions of political sovereignty, including state sovereignty, the sovereignty of the people, and the sovereignty of the individual human being, as recognized in the United Nations Charter and the Universal Declaration of Human Rights (UDHR).
The United Nations...
The data science and research data communities share many common goals and challenges. Despite this, the two communities tend to have separate venues for convening, membership, and educational tracks. This session of short presentations and panel discussion will explore some of the ways that these two worlds can come together in the areas of education, training, and data stewardship practices....
Significance of the issues to be tackled:
Earth and environmental (E&E) datasets covering the six spheres of Earth System Science (geosphere, hydrosphere, biosphere, cryosphere, atmosphere, and anthroposphere) have been collected over centuries. Properly curated and preserved over time, E&E datasets can provide evidence-based inputs into longitudinal monitoring of changes over decades...
Conversations around the data lifecycle from creation to re-use frequently revolve around the challenges of limited resources, lack of understanding by parties at various points of the process of what is involved in making it successful, and lack of interest from funding and re-use communities (e.g. Borgman & Groth, 2025, Specht et al., 2025, Stahlman, 2022). This is exacerbated when the data...
The rapid growth in global data production, particularly from remote sensing and Earth observation, has created significant opportunities to address pressing global challenges such as climate change, biodiversity loss, and sustainable resource management. Open data from satellites, drones, and environmental sensors, although increasingly available, often require complex integration due to...
Fair and equitable sharing of the benefits generated by using genomic sequence data and other related digital data, referred to in policy circles as “Digital Sequence Information” (DSI), is a hot topic in several international fora. It builds off similar discussions related to the sharing of benefits from using physical genetic resources, which led to the creation of the Nagoya Protocol in...
In SSH (Social Science and Humanities) the link between data and publication can be seen from different angles depending on its potential use. The first use that comes to mind is to cite a dataset in a publication for the purposes of scientific verification. It can be done in a number of ways, from a simple text citation, both in the publication or in the description of a dataset, to a PID...
Significance of the issues to be tackled in the session
Effective sample management is essential to ensuring the integrity, reproducibility, and openness of research across diverse disciplines. From the Physical and Life Sciences to the Social Sciences and the Arts, material samples serve as the foundation for countless...
The development of Indigenous data sovereignty as a global movement and the creation of the CARE principles have resulted in a diversity of solutions to meet local issues and contexts. This movement from principles to practice has involved both locally-specific and global principles that are informing change to both Indigenous and non-Indigenous data practice.
This session focuses on...
The rapid changes in the way we undertake scientific research driven by digitalisation and artificial intelligence means we need to look again at the basis of scientific methods, requiring the input from the philosophy of science and one end while being pragmatic about the uses of technology at the other. This approach will provide a way to ensure that our research and teaching is relevant,...
As funding agencies increasingly emphasize responsible data stewardship in alignment with the FAIR (Findable, Accessible, Interoperable, and Reusable) principles, Data Management Plans (DMPs) have become a core requirement in research proposals. This emphasis reflects a growing recognition that data serves as the foundation for scientific discovery and progress. Since 2023, the National...
Physical samples are essential research assets. Systematic curation ensures their accessibility and reusability for future scientific studies. CSIRO Mineral Resources (CMR) scientists collect diverse physical samples—including rock, regolith, water, and vegetation—for research and mineral exploration projects. These samples are costly to obtain, irreplaceable, and critical for generating and...
EcoCommons is a national digital infrastructure purpose-built to advance responsible, reproducible, and scalable ecological modelling in the era of FAIR data. It enables researchers, policymakers, and environmental managers to access integrated datasets, run validated modelling workflows, and share reproducible outputs that can inform biodiversity conservation, climate adaptation, and land-use...
Older data in paper or analog format (e.g., field/lab notebooks, photos, maps) held in labs, offices, and archives across research institutions are an often overlooked resource for potential reuse in new scientific studies. However, with the uncertainty around federal funding and research administration in the United States, there has been speculation that scientists across multiple...
Recent advancements in artificial intelligence (AI) and access to new types of data have led to increased applications of AI in computational social science and humanities (SSH). A wide range of cutting-edge examples shows the results of bringing AI and SSH together, from the latest computer vision AI models used to detect archaeological traces in satellite imagery or to identify mounds on...
Important global geoscience information standards are developed, managed and governed by the Commission for the Management and Application of Geoscience Information (CGI), a commission under the auspices of the International Union of Geological Sciences (IUGS). CGI's standards include logical data models such as GeoSciML and EarthResourceML. These data models are supported by controlled...
Advancing sustainable, high-quality, long-term data stewardship and management is fundamental to ensuring that marine data remains a valuable resource for research, policy, and environmental monitoring. This paper explores the practicalities of establishing organisational policies and methodological approaches that govern how operational and research data are collected, processed, stored,...
Integrated Reference Architecture for AI-Enabled Healthcare Research:
An Australian Harmonized Approach
Gnana K Bharathy and Adrian Burton, Australian Research Data Commons (ARDC)
gnana.bharathy@ardc.edu.au
(SciDATACon Abstract ID: 61 suitable for Healthcare Data, Analytics & AI Commons...
We can state the following on the sharing of research data. Policy shapes awareness, repositories promote practices and culture shapes practice. The existence of reliable and robust data repositories across disciplines, institutions, and geographical areas is a crucial step in the data sharing process. One of the world leaders in research investment and production is South Korea, which ranks...
Funding bodies and publishing venues increasingly require researchers to deposit and share their data in order to support rigorous, responsible, and reproducible science. Rising to the occasion, libraries have been expanding their scope to support research data as a scholarly resource and are increasingly recognized as providers of research data management and repository services. These trends...
Rapid advances in AI technology have the potential to ease or speed Research into research data management challenges. The CODATA and WDS communities are already coming up with ways to leverage AI for data stewardship. With much of the research data community dedicated to FAIR implementation and the colloquial second meaning of FAIR being ‘Fully AI Ready’, there is conflation and confusion...
Identification, selection and appraisal are key digital preservation activities when ingesting data objects for long-term preservation. This paper describes the approach taken to a major new global survey by the EOSC EDEN project, designed to improve understanding of current practices in this area, and the frameworks, standards and guidelines that support digital preservation professionals...
The exponential growth of data has intensified challenges in achieving cross-disciplinary and cross-repository interoperability. Metadata standards—such as DataCite, Dublin Core, and OpenAIRE—play a pivotal role in data discovery and reuse, yet their heterogeneity creates fragmentation. This proposal presents the implementation of the CODATA-CDIF Conceptual Domain Interoperability Framework...
The FAIR guiding principles indicate that scientific datasets should be annotated with “rich” metadata that adhere to relevant community standards. Those standards include metadata reporting guidelines, which enumerate the attributes needed to describe the features of the experiments that led to the corresponding data, and the controlled terms that standardize the values of those metadata...
Core objectives of the EU Mission on Climate Adaptation include ensuring that all Europeans have access to information on climate risks by 2030, supporting local authorities in developing risk management plans, and designing transformative strategies for 150 communities and regions to lead healthier and more prosperous lives. A central point of this work is addressing the challenges of...
This session will explore the intriguing and potentially urgent interaction (and even codependency) between high quality metadata and semantic richness on the one hand and Generative Artificial Intelligence (AI) and Large Language Models (LLMs) on the other. A lot of work is going on to improve the richness, quality and standardisation of metadata and semantics in order to make data sets ‘AI...
This 90-minute session, featuring five 10-minute presentations followed by a 40-minute panel discussion, aims to tackle critical challenges in scientific data repository certification across the Asia-Pacific and Africa (APA) regions. The session seeks to establish a collaborative framework for improving repository management standards and fostering cross-regional synergies. By convening...
Introduction
The understanding of the development of scientific disciplines, knowledge dissemination, and technological evolution is predominantly informed by analyzing scientific publications, collaborative networks, and patent records. Brazil holds a prominent position in Latin America’s scientific production, becoming a key regional player and talent attractor. The growing digitization of...
This session will bring together members of the research data community with experience of and interest in developing consortia and coalitions that advance the development and application of practices, principles and standards relating to research data. It will aim to identify common considerations and challenges encountered when building coalitions that can inform those embarking on a similar...
The exponential growth of data-intensive research demands robust, interconnected infrastructure that can seamlessly translate local scientific efforts into global knowledge resources. This session directly addresses the critical challenge of developing scalable, interoperable research data infrastructure that supports the complex journey of research data from local repositories to...
Data is of ever increasing value to the global research ecosystem, as underlined by recent emphasis on the FAIR principles and Open Science. Research infrastructures and data repositories are key to enabling Open Science and implementing the FAIR principles within the research ecosystem. Early Career Researchers (ECR) play an essential role in shaping and evolving new data practices and...
Research infrastructure initiatives play a critical role in enabling data-intensive science by providing the capabilities and services necessary for researchers to deliver innovative outcomes. In the environmental sciences, continental-scale research infrastructures facilitate consistent and standardised data collection across broad spatial and temporal scales. These datasets, collected...
Vocabulary services are a critical component of data sharing infrastructures. If these can be shared across infrastructures, then a more global ecosystem for data sharing and reuse can be supported. They are a foundational enabler for the very concept of FAIR (Findable, Accessible, Interoperable, Reusable), for without common references to semantic concepts, no data can be interpreted safely....
The proposed CARE Data Maturity model and in practice Sessions at SciDataCon will explore research and practice. The interactive session will include presentations and original research.This session highlights the development of the CARE Principles for Indigenous Data Governance (Collective Benefit, Authority to Control, Responsibility, and Ethics) emerged from a workshop convened at the IDW...
Standardized hazard definitions are a key element of the analysis of disasters. Without them, monitoring and reporting of the impacts of the hazards is difficult, and so is the development of effective early warning systems and response plans. Forecasting of future events and the generation of disaster risks reduction strategies are also hindered by a lack of standardized definition. To...
The challenges of data sovereignty and AI
in the European Health Data Space (EHDS)
Talk abstract contribution to
SciDATACon Abstract ID: 61
Advancing Healthcare Research with Data, Analytics & AI Commons
Francis P. Crawley
Chair, International Data Policy Committee, CODATA
francis@codata.org
Version 3.0, 24 April 2025
The European Health Data Space (EHDS) represents a...
Low-cost sensors are revolutionizing air quality monitoring, especially in under-resourced regions like Kenya. These devices enable affordable, localized data collection, which allows communities to identify pollution hotspots, raise awareness, and advocate for action. However, their integration into formal regulatory frameworks remains limited due to concerns over data reliability and...
Background and Motivation
Controlled vocabularies and ontologies are essential for enabling data interoperability, discovery, and integration across domains. Repositories that host and expose these artifacts play a critical role in implementing the FAIR (Findable, Accessible, Interoperable, Reusable) principles. The OntoPortal Alliance, a collaboration of academic and commercial partners,...
A discoverable inventory of items of value and importance to a community, country or region has many benefits. These benefits can be for reporting (state of the environment, achievement of Sustainable Development Goals, conservation objectives), for research (exploration of international phylogenetics, development of new pharmaceuticals), or for economic reasons (tracking the status of fish...
Solid Earth science seeks to understand the complex chemical and physical processes shaping our planet. This knowledge is essential for addressing key societal challenges, from mitigating natural hazards to managing vital resources. Yet, the scale and nature of the data required for such research—spanning petabytes and crossing geographical, disciplinary, and temporal boundaries—necessitate a...
The increasing use of AI-based approaches such as machine learning (ML) across diverse scientific fields presents challenges for reproducibly disseminating and assessing research. As ML becomes increasingly integral to clinical applications, there is also a critical need for transparent reporting methods to ensure both comprehensibility and the reproducibility of pre-clinical research and...
This contribution gives an overview of the European Marine Observation and Data Network (EMODnet) and focuses on the use cases of EMODnet managed data on marine pollution.
EMODnet has been funded by the European Commission for more than 15 years and is a trusted source for marine in-situ data and data products in Europe. It is a data infrastructure supporting data-intensive research and...
Conducting high-quality research increasingly involves complex workflows and the generation of numerous intermediate datasets. Achieving reproducibility requires the availability of extensive and well-structured information. To enable this, researchers need interfaces and tools that are not only user-friendly but also FAIR-aware.
Data analysis workflows typically span multiple tools and...
Freshwater is one of our most precious resources, essential for drinking water, agriculture, inland fisheries, and recreation. However, both its availability and quality face significant challenges from human activities, with these pressures potentially intensifying due to environmental changes.
Large lakes, while covering only 3% of Earth's surface, hold approximately 87% of surface...
Session description:
The digital revolution is reshaping marine science by enabling unprecedented access to data efficiently and cost-effectively. These advancements are transformative for coral reef conservation, fostering a stronger connection between science and policy to safeguard biodiversity and support the communities that depend on these ecosystems. In this session, we...
- Introduction and Background
Recent policies and laws in the US that threaten data access have highlighted the need for infrastructure to ensure important government data is not lost during regime changes. US federal government information has been in the public domain since the 1895 Printing Act which prohibited any copyright on federal government publications (United States Congress,...
Introduction: The transformative role of health data in achieving SDG 3 cannot be overstated, as data remains “the lifeblood of public health” (Ghebreyesus, 2019). The significant role of data in promoting the Sustainable Development Goals (SDGs), particularly SDG 3 (Good Health and Well-being), and improving disaster resilience continues to gain attention worldwide. Globally, in 2023, the...
Data science has evolved significantly over the past two decades, becoming a force within academic research, public and private sector workplaces, and in government policies and practices. Exponentially increasing volumes of publicly available datasets throughout the social and scientific realms have contributed to an explosion of data science applications, including AI tools and Large...
In 2020, the HASS and Indigenous Research Data Commons (HASS and Indigenous RDC) was established to create a comprehensive digital HASS and Indigenous research infrastructure capability as part of the Australian Research Data Commons (ARDC). The HASS and Indigenous RDC is developing infrastructure across a range of focus areas to serve the needs of diverse HASS and Indigenous research...
Cognitive decline represents one of the most critical public health and societal challenges of the 21st century, with approximately 50 million people affected by dementia worldwide, and nearly 10 million new cases annually (World Health Organization, 2020). As populations age the incidence of age-related cognitive impairments is expected to rise dramatically. However, the global response is...
Background
For 30+ years, the scientific community has worked toward unified ocean and climate data networks. While initiatives like EarthCube, RDA, and WDS established critical foundations, centralized systems remain vulnerable to political shifts, technical failures, and disasters.
This session moves beyond theoretical discussions to practical solutions, building on the...
Overview:
This session aims to showcase the roles and relationships among FAIR Implementation Profiles (FIPs), FAIRsharing, and FAIR²—three complementary efforts that help researchers and data stewards to optimally reuse standards and make research data truly FAIR. The session will provide an overview of key challenges, introduce key technologies, and offer perspectives on how these tools...
Introduction
This workshop addresses the urgent need to translate the high-level vision of the UNESCO Recommendation on Open Science into concrete, actionable data governance mechanisms tailored for crisis contexts. Whether triggered by natural disasters, health emergencies, climate change, or geopolitical conflicts, in times of crisis effective and ethical data management is essential for...
In 2017, RAiD was a budding new concept and the early beginnings of a technical system for identifying and tracking research projects. The idea of RAiD as a project identifier itself came out of a project - an Australian project to better track the research data lifecycle which put research projects at centre stage. Today, RAiD is an ISO standard for Project Identifiers (23527:2022) with a...
This presentation will examine the critical role of Institutional Repositories (IRs) and Institutional Data Repositories (IDRs) as foundational knowledge infrastructures supporting data-intensive research across academic institutions. As data sharing mandates and standards from funding agencies, publishers, and disciplinary societies continue to evolve, understanding the role of institutional...
The UN 2030 Agenda has become a cornerstone of global scientific and policy efforts, calling for urgent, collective action to address poverty, inequality, environmental degradation, climate change, and other systemic challenges. The Global South is at the frontline of the risks and the opportunities inherent in the SDG framework. These regions face the most acute development challenges while...
Significance of the issues to be tackled in the session
A Trusted Research Environment (TRE) is a highly secure computer system where sensitive data is stored. TREs are designed to be safe, allowing only authorised individuals to access the data. Data cannot be added or removed without proper permissions, ensuring transparency...
Introduction: Researchers and scientists are increasingly using programming languages for data processing, visualisation, and analysis. Advancement in machine learning (ML) and artificial intelligence (AI) helps accelerate the process of analysing complex research data and conducting experiments, leading to the discovery of hidden patterns in the data. However, installing and configuring the...
From open data to open methods
Observation, interpretation, and communication are key elements of research. Whereas open science has traditionally emphasized open data and publications, the openness of research methods has received less attention. Methodology – the derivation of results and conclusions from the data – is as critical to the understanding and trust on scientific outcomes....
Open Research Data (ORD) is fundamental to the transparency and reproducibility of scientific results. It fosters scientific exchange and networking. In line with ORD strategies and funding agencies' requirements, research data should be published as openly as possible. Despite the advantages ORD brings to research, however, the publication of research data can be subject to restrictions. This...
Natural history collections are increasingly recognised as critical infrastructure for addressing complex ecological challenges, yet their full potential can only be realised through strategic integration with other ecological data streams. While FAIR principles and the Digital Extended Specimen concept provide theoretical frameworks for data integration, practical implementation requires...
Despite efforts to improve the availability and accessibility of research datasets, interoperability remains a serious barrier to reuse. Data harmonization, the process of aligning data from disparate sources to a standardized schema, plays a key role in addressing data heterogeneity and enabling integration and reuse. Consider a data scientist curating patient blood-glucose measurements as a...
In an era where data is central to scientific discovery, innovation, and public policy, Chile is taking significant steps toward developing a comprehensive national strategy for FAIR data management. This presentation introduces the Estrategia FAIR, a multi-institutional initiative aimed at embedding the principles of Findability, Accessibility, Interoperability, and Reusability (FAIR) into...
Mappings are an essential component in making research data interoperable across infrastructures, domains and disciplines. Correspondences between official and de facto standards related to conceptual models, structures and vocabularies are required to share meaning and transfer information between both humans and machines. Despite their importance, these correspondences, mappings or...
Public trust in governments, organizations and institutions that collect, protect, share and use health data is critical. Health data refers to information that describes a person’s health, their health care or anything about their health status or condition. It can be about individuals (personal health data or information) or about populations (population health data). Access to timely and...
Community-based, not-for-profit organizations are critical partners in public interest research across disciplines. This presentation explores how community engaged researchers, data stewards, and librarians can embed data management training for community partners into existing workflows by leveraging data curation as a source of knowledge about training needs.
While it is well-known that...
Background: Longitudinal studies are necessary for tracking the progression of mental health disorders such as depression, anxiety, and psychosis. However, the integration of diverse mental health data from different sources and waves—especially in low- and middle-income countries—remains complex due to variability in instruments, socio-cultural expressions, and data structural formats. This...
Situation
Recent technological progress has significantly enhanced our capacity to link person-level data across diverse sectors for research in Australia. Key advancements include: legislation to facilitate data access and availability, streamlined governance processes, enriched metadata, more efficient data linkage, and enhanced statistical methods and training. Collectively these...
As global communities continue to adopt the FAIR Principles, many organizations face the challenge of not just FAIRifying individual datasets, but entire ecosystems of data repositories and services. The many tools and assessments developed for FAIRtend to be customized to a particular data architecture, especially data organized in a file with a DOI and listed on a website. The National...
Arctic research faces persistent challenges, including limited accessibility, geopolitical complexities, and a general scarcity of high-quality data. In response, several international consortia—such as INTAROS and Arctic PASSION—have initiated collaborative efforts to address these issues through the development of integrated Earth observing systems (e.g., SIOS and GIOS). The Svalbard...
Strategic and responsible management of research data is essential for Higher Education Institutions (HEIs) such as Universidad Nacional in Costa Rica, particularly within the context of Low and Middle-Income Countries (LMICs) given the existing challenges such as budgetary constraints, demands for transparency, and increasing expectations concerning social, economic, and environmental...
As data managers and practitioners, we've all felt it: "I want the data I handle to be broadly discoverable,interoperable, and preserved according to described best practice, but I first have to address the demands of my data providers and users who have their own special (and well-loved) way of doing things. I want to make sure I'm plugged into the leading practices of the global data...
**
Bridging Data Gaps with Citizen Science for People and Policy
**
There abound evidences to demonstrate how citizen science is making efforts to bridge data gaps for people and policy. In the last few years, the CODATA-WDS Task Group (TG) on Citizen Science for the Sustainable Development Goals has prioritized global...
The rapid rise in adoption of open science practices, coupled with growing mandates from publishers and funders for data to be published, has led to a dramatic increase in supplementary data files published alongside articles and generalist repository uploads. Supplementary data are now submitted with approximately 80% of publications, a substantial increase from about 40% in the early 2000s,...
Increasing amounts of data are routinely collected by governments, but providing insights from this data is often challenging. This can be due to restrictions in accessing the data and/ or data sparsity.
This session showcases innovative, interactive health and environmental data platforms that are providing inequity-focused data insights, ranging from platforms generated using low-cost...
The World Data System (WDS) seeks to cultivate and support a global network of members committed to scientific data repositories and effective data stewardship. To keep pace with advancing technologies, growing data volumes and types, shifting user demands, and deeper integration into scientific workflows, WDS member repositories must continuously evolve. Ensuring that data remains accessible...
As the volume and complexity of research data continue to grow, researchers increasingly rely on robust digital research infrastructure and interoperability across regional and disciplinary boundaries to achieve FAIR and reproducible data, and impactful results. Building new collaborations and extending the reach of digital research infrastructure initiatives will be crucial in fostering a...
In 2019, the Helmholtz Association of German Research Centres launched the Helmholtz Metadata Collaboration (HMC) platform as part of its Information and Data Science Framework to translate global metadata concepts into application and to harmonize scientific practice within the association. HMC's mission is to facilitate the visibility and reusability of data within the Helmholtz Association...
Building data-driven solutions to support climate change adaptation is inherently a cross-disciplinary and cross-organisational challenge. Legal compliance and organisational practices and assumptions will provide requirements and constraints for the technical solutions that can be deployed in the operational environments. This session will investigate these challenges and present an initial...
Abstract:
Modern slavery affects over 50 million individuals globally, millions being subjected to forced labour that infiltrates the supply chains of major corporations. This session explores the pivotal role of data science and artificial intelligence (AI) in combating modern slavery, bringing together perspectives from academia, non-profit organisations, and government agencies. By...
The availability of more data in the public sphere has driven the rise of data journalism practices in media worldwide, including in Asia (Mutsvairo, 2019). Consequently, audiences are now more exposed to data-based products when consuming news. Data journalism serves several functions, from explaining to the audience more accurately because they are presented with numbers and data to helping...
PANGAEA – Data Publisher for Earth & Environmental Science is a worldwide recognised digital data repository that plays a pivotal role in archiving, publishing, and disseminating scientific data related to earth and environmental sciences. As a publicly accessible information system, PANGAEA ensures that high-quality, well-structured, and interoperable datasets are preserved and made available...
In early 2024, a proposal to advance bilateral collaboration around national training strategies and frameworks between the Alliance and the Australian Research Data Commons (ARDC) was introduced. The agreement was formally signed in February 2025, effective through to December 31, 2026, launching the start of a two-year knowledge and staff exchange pilot.
This pilot is in the form of a...
Data fragmentation is a persistent barrier to innovation, interoperability, and student mobility in the global post secondary ecosystem. This session presents the MortarCAPS Higher Learning Data Standard (MCDS) — a sector-aligned, data standard co-developed by post secondary institutions, technology leaders, and policy advocates in Australia and Canada.
Charlsey Pearce (CEO, MortarCAPS) and...
The Wildlife Observatory of Australia (WildObs) is building the Australia’s first national infrastructure dedicated to automated wildlife image analysis. Designed to process large volumes of camera trap data using artificial intelligence, WildObs provides the tools and infrastructure necessary to support scalable, standardised, and reproducible biodiversity monitoring across Australia’s varied...
Data Science—encompassing data collection, storage, analysis, visualization, and interpretation—has become a foundational pillar of modern research, decision-making, and innovation. Its influence extends across scientific, social, industrial, and governmental domains, transforming both methods and outcomes. This contribution discusses the importance of integrating Data Science education across...
RMIT researchers are increasingly challenged by the size, dimension, and complexity of their data, the need to develop and run sophisticated data processing and analysis pipelines, and the need for computing-intensive simulations to compare with and interpret physical experiments. To partially address these challenges, the RMIT Advanced Computing Ecosystem (RACE) model provides scalable,...
ICPSR, one of the world’s largest social science data archives, located at the Institute for Social Research at the University of Michigan, is leading a $38M National Science Foundation project, the Research Data Ecosystem (RDE), to modernize research data infrastructure and support efficient, cutting-edge, and reproducible data-driven science.
In this presentation, we will briefly discuss...
Introduction
As part of efforts to make research data more FAIR, semantic interoperability is important to consider. Standards, controlled vocabularies, and terminologies are well established types of FAIR-enabling resources that help us create interoperable systems and metadata. The [CODATA Research Data Management Terminology][1] (RDMT) is one such semantic resource that emerged...
Despite major investments in Open Science infrastructures, environmental research remains fragmented across domains, systems, and borders. We have no shortage of data. What we lack are better ways to turn it into science. ENVRI-Hub NEXT project tackles this by breaking silos and building real bridges—one infrastructure at a time.
The ENVRI-Hub NEXT advances the ENVRI-Hub, a platform that...
Introduction
The scientific community faces a critical challenge: essential contributions to research progress—including data sharing, code development, peer review, mentorship, and community engagement—remain systematically undervalued. Despite the Open Science movement’s efforts to broaden recognition beyond traditional publications, engagement with alternative evaluation systems...
The Southern Ocean's influence on global climate and marine ecosystems underscores the urgent need for comprehensive and accessible observational data. Here we present a tool for discovery of, and access to, existing Southern Ocean data – SOOSmap, Version 2. SOOSmap is a collaborative effort between the Southern Ocean Observing System and European Marine Observations and Data Network Physics,...
We are entering the fourth research paradigm following the digital revolution, which is evidenced by rapid advancements in the scientific methodology of data-intensive practices. Upskilling the next generation of the research workforce is pivotal. Further, discipline-specific advancements highlight the importance of researchers acquiring new skills to meet evolving demands.
Researchers are...
The Social Sciences and Humanities (SSH) disciplinary sector encompasses research studies that share an epistemological commitment to the critical investigation of human experience, cultural expression, and social organization. SSH research contributes to the development of theoretical frameworks and methodological approaches that are essential for understanding complex societal...
Open Science practices are essential for promoting transparency, collaboration, and accessibility in research. However, developing countries often face significant equity challenges that hinder their participation in the global research ecosystem. These challenges include capacity gaps, infrastructure disparities, lack of awareness, and digital divides. This session aims to address these...
KU Leuven RDR is the CoreTrustSeal certified institutional data repository of KU Leuven, where curation plays an important role in data FAIRification and ensuring the quality of published datasets. The curation phase is not only crucial to have some quality control on the FAIRness of the data by ensuring correct metadata input, the presence of documentation and a choice of license, but to also...