TOP
Search the Dagstuhl Website
Looking for information on the websites of the individual seminars? - Then please:
Not found what you are looking for? - Some of our services have separate websites, each with its own search option. Please check the following list:
Schloss Dagstuhl - LZI - Logo
Schloss Dagstuhl Services
Seminars
Within this website:
External resources:
  • DOOR (for registering your stay at Dagstuhl)
  • DOSA (for proposing future Dagstuhl Seminars or Dagstuhl Perspectives Workshops)
Publishing
Within this website:
External resources:
dblp
Within this website:
External resources:
  • the dblp Computer Science Bibliography


Dagstuhl Perspectives Workshop 16151

Foundations of Data Management

( Apr 10 – Apr 15, 2016 )

(Click in the middle of the image to enlarge)

Permalink
Please use the following short url to reference this page: https://www.dagstuhl.de/16151

Organizers

Contact



Schedule

Motivation

The focus of Foundations of Data Management (traditionally termed Database Theory) is to provide the many facets of data management with solid and robust mathematical foundations. The field has a long and successful history and has already grown far beyond its traditional scope since the advent of the Web.

The recent push towards Big Data, including structured, unstructured and multi-media data, is transforming and expanding the field at an unusually rapid pace. However, for understanding numerous aspects of Big Data, a robust research exploration into the principled foundations is still lacking. This transformation will call upon the Database Theory community to substantially expand its body of tools, techniques, and focal questions and to much more fully embrace several other disciplines, most notably statistics and probability theory, natural language processing, data analytics, emerging hardware and software supports for computation, and data privacy and security.

Big Data is not the only force that is driving expansion and tranformation for the Foundations of Data Management. With the increasing digitization of diverse industries, including "smarter cities", education, healthcare, agriculture and others, many diverse kinds of data usage at large scales are becoming crucial. The push towards data-centric business processes, which are especially important for knowledge-worker driven processes, raise fundamentally new questions at the intersection of data and process. And increasing adoption of semantic web and other ontology-based approaches for managing and using meta-data push the boundaries of traditional Description Logics.

The purpose of the Dagstuhl Perspectives Workshop is to explore the degree to which principled foundations are crucial to the long-term success and effectiveness of the new generation of data management paradigms and applications, and to understand what forms of research need to be pursued to develop and advance these foundations.

The workshop intends to bring together specialists from the existing database theory community, and from adjoining areas, particularly from various subdisciplines within the Big Data community, to understand the challenge areas that might be resolved through principled foundations and mathematical theory.

More specifically, the goals of the workshop are:

  • To identify areas, topics and research challenges for Foundations of Data Management in the forthcoming years, in particular, areas that have not been considered as Database Theory before but will be relevant in the future and of which we expect to have papers at PODS and ICDT, the main conferences in the field.
  • To outline the techniques that will be most fruitful as starting points for addressing the new foundational challenges in Data Mangement.
  • To identify and characterize the major challenge areas in Big Data where a principled, mathematically-based approach can provide important contributions.
  • To understand research goals of neighbouring areas that may generate synergies.

Summary

The focus of Foundations of Data Management (traditionally termed Database Theory) is to provide the many facets of data management with solid and robust mathematical foundations. The field has a long and successful history and has already grown far beyond its traditional scope since the advent of the Web.

The recent push towards Big Data, including structured, unstructured and multi-media data, is transforming and expanding the field at an unusually rapid pace. However, for understanding numerous aspects of Big Data, a robust research exploration into the principled foundations is still lacking. This transformation will call upon the Database Theory community to substantially expand its body of tools, techniques, and focal questions and to much more fully embrace several other disciplines, most notably statistics and probability theory, natural language processing, data analytics, emerging hardware and software supports for computation, and data privacy and security.

Big Data is not the only force that is driving expansion and transformation for the Foundations of Data Management. With the increasing digitization of diverse industries, including "smarter cities", education, healthcare, agriculture and others, many diverse kinds of data usage at large scales are becoming crucial. The push towards data-centric business processes, which are especially important for knowledge-worker driven processes, raise fundamentally new questions at the intersection of data and process. And increasing adoption of semantic web and other ontology-based approaches for managing and using meta-data push the boundaries of traditional Knowledge Representation.

The purpose of this Dagstuhl Perspectives Workshop was to explore the degree to which principled foundations are crucial to the long-term success and effectiveness of the new generation of data management paradigms and applications, and to understand what forms of research need to be pursued to develop and advance these foundations.

For this workshop we brought together specialists from the existing database theory community, and from adjoining areas, such as Machine Learning, Database Systems, Knowledge Representation, and Business Process Management, to understand the challenge areas that might be resolved through principled foundations and mathematical theory.

More specifically, during this workshop we worked on:

  • Identifying areas, topics and research challenges for Foundations of Data Management in the forthcoming years, in particular, areas that have not been considered as Database Theory before but will be relevant in the future and of which we expect to have papers at PODS and ICDT, the main conferences in the field.
  • Outlining the techniques that will be most fruitful as starting points for addressing the new foundational challenges in Data Management.
  • Characterising the major challenge areas in Big Data where a principled, mathematically-based approach can provide important contributions.
  • Finding research goals in neighbouring areas that may generate synergies with our own.

The workshop consisted of eight invited tutorials on selected topics:

  1. Managing Data at Scale,
  2. Uncertainty and Statistics in Foundations of Data Management,
  3. Human in the Loop in Data Management,
  4. Machine Learning and Data Management,
  5. Data-Centric Business Processes and Workflows,
  6. Ethical Issues in Data Management,
  7. Knowledge Representation, Ontologies, and Semantic Web, and
  8. Classical DB Questions on New Kind of Data. The abstracts of these talks can be found below in the document.

There were also seven working groups on theory-related topics, which identified the most relevant research challenges for Foundations of Data Management in the forthcoming years, outlined the mathematical techniques required to tackle such problems, and singled out specific topics for insertion in a curriculum for the area. The topics of these working groups were:

  1. Imprecise Data,
  2. Unstructured and Semi-structured Data,
  3. Process and Data,
  4. Data Management at Scale,
  5. Data Management and Machine Learning,
  6. Knowledge-Enriched Data Management, and
  7. Theory and Society.

There was also a working group on curriculum related issues, that collected and enriched the information provided by the working groups about the design of a curriculum on Foundations of Data Management. Each one of these groups worked for two consecutive hours in different days. Workshop participants had to participate in at least two working groups, although most of the people participated in four of them. Summaries of the discussions held in each one of these working groups can be found below in the document.

During the first day of the workshop, there were also five working groups that analysed several community-related aspects. In particular:

  1. Attraction of women and young members,
  2. cross-fertilization with neighbouring areas,
  3. relationship to industry,
  4. impact of our research, and
  5. the publishing process.

The discussion within some of these working groups gave rise to the creation of specific tasks to be accomplished by the community in the following years. These tasks will be coordinated by the councils of PODS and ICDT, the two main conferences in the field.

This Dagstuhl Report will be accompanied by a Dagstuhl Manifesto, in which the outcome of the different working groups will be explained in more detail and several strategies for the development of our field will be proposed.

Copyright Marcelo Arenas, Richard Hull, Wim Martens, Tova Milo, and Thomas Schwentick

Participants
  • Serge Abiteboul (ENS - Cachan, FR) [dblp]
  • Marcelo Arenas (Pontificia Universidad Catolica de Chile, CL) [dblp]
  • Pablo Barcelo (University of Chile - Santiago de Chile, CL) [dblp]
  • Meghyn Bienvenu (University of Montpellier, FR) [dblp]
  • Iovka Boneva (Université de Lille I, FR) [dblp]
  • Angela Bonifati (University Claude Bernard - Lyon, FR) [dblp]
  • Diego Calvanese (Free University of Bozen-Bolzano, IT) [dblp]
  • Claire David (University Paris-Est - Marne-la-Vallée, FR) [dblp]
  • Daniel Deutch (Tel Aviv University, IL) [dblp]
  • Thomas Eiter (TU Wien, AT) [dblp]
  • Ronald Fagin (IBM Almaden Center - San Jose, US) [dblp]
  • Floris Geerts (University of Antwerp, BE) [dblp]
  • Georg Gottlob (University of Oxford, GB) [dblp]
  • Torsten Grust (Universität Tübingen, DE) [dblp]
  • Paolo Guagliardo (University of Edinburgh, GB) [dblp]
  • Richard Hull (IBM TJ Watson Research Center - Yorktown Heights, US) [dblp]
  • Eyke Hüllermeier (Universität Paderborn, DE) [dblp]
  • Benny Kimelfeld (Technion - Haifa, IL) [dblp]
  • Phokion G. Kolaitis (University of California - Santa Cruz, US) [dblp]
  • Maurizio Lenzerini (Sapienza University of Rome, IT) [dblp]
  • Leonid Libkin (University of Edinburgh, GB) [dblp]
  • Carsten Lutz (Universität Bremen, DE) [dblp]
  • Wim Martens (Universität Bayreuth, DE) [dblp]
  • Tova Milo (Tel Aviv University, IL) [dblp]
  • Filip Murlak (University of Warsaw, PL) [dblp]
  • Frank Neven (Hasselt University - Diepenbeek, BE) [dblp]
  • Dan Olteanu (University of Oxford, GB) [dblp]
  • Magdalena Ortiz (TU Wien, AT) [dblp]
  • Reinhard Pichler (TU Wien, AT) [dblp]
  • Andreas Pieris (TU Wien, AT) [dblp]
  • Juan L. Reutter (Pontificia Universidad Catolica de Chile, CL) [dblp]
  • Sudeepa Roy (Duke University - Durham, US) [dblp]
  • Thomas Schwentick (TU Dortmund, DE) [dblp]
  • Jianwen Su (University of California - Santa Barbara, US) [dblp]
  • Dan Suciu (University of Washington - Seattle, US) [dblp]
  • Jens Teubner (TU Dortmund, DE) [dblp]
  • Jan Van den Bussche (Hasselt University, BE) [dblp]
  • Stijn Vansummeren (Université Libre de Bruxelles, BE) [dblp]
  • Victor Vianu (University of California - San Diego, US) [dblp]
  • Domagoj Vrgoc (Pontificia Universidad Catolica de Chile, CL) [dblp]
  • Ke Yi (HKUST - Kowloon, HK) [dblp]

Classification
  • data bases / information retrieval
  • world wide web / internet

Keywords
  • Data Management
  • Database Theory
  • Big Data
  • Machine Learning
  • Data Mining
  • Information Extraction
  • Graph Algorithms
  • Knowledge Representation
  • Semantic Web
  • Scientific Workflow
  • Data-centric Workflow
  • Probabalistic Data