Research Meeting 21383
DDI Cross Domain Integration for FAIR Data Sharing across Discipline Boundaries
( Sep 19 – Sep 24, 2021 )
Organizers
- Arofan Gregory (DDI Alliance / CODATA, US)
- Simon Hodson (CODATA - Paris, FR)
- Hilde Orten (NSD - Bergen, NO)
- Joachim Wackerow (GESIS - Mannheim, DE)
Contact
- Heike Clemens (for administrative matters)
External Homepage
- 2021 Dagstuhl Workshop: Further Development of the DDI Cross Domain Integration Model for FAIR Data Sharing across Discipline Boundaries, Data Documentation Initiative (DDI) Alliance, on their YouTube channel, September 22+24, 2021.
Introduction and Motivation
The Data Documentation Initiative (DDI) Alliance has been a leader in setting metadata standards for the social, behavioural, and economic sciences (SBE) for many years. It has provided specifications which support data collection, management, and dissemination with detailed descriptions of the data typical of those domains. As with many other branches of statistics and research, however, the type, volume, and sources of data have multiplied in the recent past. Many projects are now cross-disciplinary, involving data from different domains. At the same time, computational approaches to the analysis of data and to the reproduction and origination of research have evolved. These factors combine to highlight the need for an enhanced ability to integrate and understand data across domain boundaries, and to understand the provenance and processing of data, even as more and more of the work is performed programmatically by systems which leverage machine learning and other advanced technology approaches.
The DDI Alliance has recently published a new specification intended to fill this need for integrating data from disparate sources: DDI - Cross Domain Integration (DDI-CDI). Unlike earlier DDI work products, DDI-CDI is not domain-specific, but is designed to be used with research data from any domain. The specification provides a model for understanding and integrating data across a wide range of sources, including big data/NoSQL, event history and register data, traditional columnar data, and multi-dimensional data. Further, it provides a way of describing data provenance, with a focus not only on traditional linear processes, but also on declarative "black box" processes employed by many modern systems. DDI-CDI is intended not to replace traditional domain models for data description, but to supplement them when data from different sources and of different types are being integrated. It is designed to work easily with many other popular standards and models, including semantic vocabularies and generic technology specifications for data processing, dissemination, and cataloguing.
With an expected production release at the end of 2021, the current draft of the specification is undergoing finalisation. This workshop will focus on issues of immediate importance leading up to implementation and subsequent revision of DDI-CDI. Experts in the standard and prospective implementers will be in attendance to help refine the development roadmap.
Interoperability, Sustainability, and Alignment with Other Standards
DDI-CDI is fundamentally a model which is intended to be implemented across a wide variety of technology platforms, and in combination with many other standards, models, and specifications. To support this use, it is formalized using a limited subset of the Unified Modelling Language (UML).
The platform-independence of the model makes it more easily applicable across a broad range of applications and helps ensure that it will be sustainable even as the technology landscape evolves. DDI-CDI builds on many other standard models and is aligned with them where appropriate.
Topics
Modular approach
The goal is that specific modules can be used flexibly: standalone, together with other DDI-CDI modules, or together with other specifications. The work will focus on identifying functional packages, defining the function of each package, establishing clear one-way dependencies between packages, and separating functional (core) packages/classes from supporting packages/classes.
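The requirement of clear one-way dependencies between packages can be checked mechanically. The following is a minimal sketch, assuming the package dependencies are available as a simple mapping; the package names are hypothetical placeholders and are not taken from the DDI-CDI specification.

```python
from graphlib import TopologicalSorter, CycleError

# Hypothetical package names for illustration only.
# Each key maps to the set of packages it depends on.
DEPENDENCIES = {
    "DataDescription": {"Conceptual", "Identification"},
    "Process": {"DataDescription", "Identification"},
    "Conceptual": {"Identification"},
    "Identification": set(),
}


def load_order(dependencies):
    """Return the packages so that each one follows everything it depends on.

    Raises ValueError if the dependencies are not one-way, i.e. if they
    contain a cycle.
    """
    try:
        return list(TopologicalSorter(dependencies).static_order())
    except CycleError as exc:
        # exc.args[1] holds the list of nodes forming the cycle.
        raise ValueError(f"circular package dependency: {exc.args[1]}") from exc


order = load_order(DEPENDENCIES)
print(order)
```

A package that appears early in the resulting order has few or no dependencies and is therefore a candidate for standalone use or for combination with other specifications.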
Data structure components (toolkit)
Review an approach for building new data structure types (in addition to the existing traditional wide/rectangular data, long [event] data, multi-dimensional data, and NoSQL/key-value data). Possible additional data structure types include graphs, text, and any object in a “cell” (tables, text, binary objects, arrays of arrays, etc.).
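To illustrate how two of the existing structure types relate, the sketch below converts a small wide (rectangular) table into a long layout and back. It is a plain-Python illustration under simplifying assumptions (one identifier column, no missing values); the function names and the `variable`/`value` column names are hypothetical and do not come from DDI-CDI.

```python
def wide_to_long(rows, id_key):
    """Turn one record per unit into one record per (unit, variable) pair."""
    long_rows = []
    for row in rows:
        for name, value in row.items():
            if name == id_key:
                continue  # the identifier is repeated on every long record
            long_rows.append({id_key: row[id_key], "variable": name, "value": value})
    return long_rows


def long_to_wide(long_rows, id_key):
    """Rebuild one record per unit from (unit, variable, value) records."""
    wide = {}
    for rec in long_rows:
        unit = wide.setdefault(rec[id_key], {id_key: rec[id_key]})
        unit[rec["variable"]] = rec["value"]
    return list(wide.values())


wide_table = [
    {"person": 1, "age": 34, "income": 52000},
    {"person": 2, "age": 51, "income": 61000},
]
long_table = wide_to_long(wide_table, "person")
print(long_table)
```

The same measurements appear in both layouts; only the roles of the columns change, which is the kind of structural information a data structure description needs to capture.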
UML class model interoperable subset (UCMIS)
The strict use of UCMIS enables a robust model which can be imported into many UML tools and expressed in object-oriented syntax representations. The focus here will be the relationship to other specifications (in the light of the modular approach) at the model level and at the syntax representation level. See the documentation and spreadsheet of the subset, previously named “Practitioner's Subset for Data Modeling”.
Syntax representations of the model
Exploration and decisions on OWL/RDF-S, JSON-LD, and ShEx (as a constraint language for RDF). The work will build on an existing mapping from UML to OWL/RDF-S.
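As a rough illustration of what a JSON-LD representation of model instances could look like, the sketch below builds a small document with Python's standard `json` module. The namespace `http://example.org/cdi-sketch#` and all term names are hypothetical and are not the DDI-CDI vocabulary; the `expand_term` helper is a toy that only handles a flat context and is not the JSON-LD expansion algorithm.

```python
import json

# Hypothetical namespace and terms, for illustration only.
EXAMPLE_NS = "http://example.org/cdi-sketch#"

doc = {
    "@context": {
        "ex": EXAMPLE_NS,
        "name": "ex:name",
        "hasVariable": "ex:hasVariable",
    },
    "@id": "ex:dataset1",
    "@type": "ex:WideDataSet",
    "name": "Household survey",
    "hasVariable": [
        {"@id": "ex:age", "name": "age"},
        {"@id": "ex:income", "name": "income"},
    ],
}


def expand_term(term, context):
    """Resolve a term or prefixed name to a full IRI using a flat context."""
    if term.startswith("@"):
        return term  # JSON-LD keywords such as @id are left untouched
    term = context.get(term, term)
    prefix, sep, local = term.partition(":")
    if sep and isinstance(context.get(prefix), str) and not local.startswith("//"):
        return context[prefix] + local
    return term


serialized = json.dumps(doc, indent=2)
print(serialized)
print(expand_term("hasVariable", doc["@context"]))
```

The context is what ties the short keys in the document to globally unique identifiers, which is why the same instance data can also be read as RDF.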
Implementation guides
Identify the methodology by which a community of users will specify how they will employ the model in their own implementations, such that these implementations become more easily interoperable. Intersection with other machine-processable descriptions of data-sharing resources and methods within the community will be a focus.
Related Seminars
- Research Meeting 07432: The Data Documentation Initiative [DDI] XML Standard: Support Preservation, Management, Access and Dissemination Systems for Social Science Data (2007-10-23 - 2007-10-27)
- Research Meeting 08452: The Data Documentation Initiative [DDI] XML Standard: Support Preservation, Management, Access & Dissemination Systems for Social Science Data (2008-11-02 - 2008-11-07)
- Research Meeting 09442: The Data Documentation Initiative [DDI] XML Standard: Support Preservation, Management, Access & Dissemination Systems for Social Science Data (2009-10-25 - 2009-10-30)
- Research Meeting 10432: The Data Documentation Initiative (DDI) XML Standard: Using DDI 3 to Support Production, Management, Dissemination, and Preservation Systems for Data in the Social Sciences and Economics (2010-10-24 - 2010-10-29)
- Research Meeting 11372: Semantic Statistics for Social, Behavioural, and Economic Sciences: Leveraging the DDI Model for the Web (2011-09-11 - 2011-09-16)
- Research Meeting 12422: Semantic Statistics for Social, Behavioural, and Economic Sciences: Leveraging the DDI Model for the Linked Data Web (2012-10-14 - 2012-10-19)
- Research Meeting 13432: Facilitating Process and Metadata-Driven Automation in the Social, Economic, and Behavioural Sciences with the Data Documentation Initiative (DDI) (2013-10-20 - 2013-10-25)
- Research Meeting 14422: DDI: Facilitating Process and Metadata-Driven Automation in the Social, Economic, and Behavioural Sciences with the Data Documentation Initiative (2014-10-12 - 2014-10-17)
- Research Meeting 15423: DDI: Facilitating Process and Metadata-Driven Automation in the Social, Economic, and Behavioural Sciences with the Data Documentation Initiative (2015-10-11 - 2015-10-16)
- Research Meeting 16433: DDI Moving Forward: Improvement and Refinement of Selected Areas (2016-10-23 - 2016-10-28)
- Research Meeting 17433: DDI Moving Forward: Integration of Core Components / DDI-based Infrastructure Vision (2017-10-22 - 2017-10-27)
- Research Meeting 18393: Data Documentation Initiative (DDI) - Train the Trainers (2018-09-23 - 2018-09-28)
- Research Meeting 19403: DDI 4 Core - Development of a Robust and Sustainable Model (2019-09-29 - 2019-10-04)
- Research Meeting 23393: DDI-CDI: Realising interoperable data services in the metadata ecosystem (2023-09-24 - 2023-09-29)
- Research Meeting 24413: Aligning Technology Architectures with Cross-Domain Metadata Models (2024-10-06 - 2024-10-11)
- Research Meeting 25463: Metadata Models and Services Typologies in Digital Resource-Sharing Frameworks (2025-11-09 - 2025-11-14)