TOP
Search the Dagstuhl Website
Looking for information on the websites of the individual seminars? - Then please:
Not found what you are looking for? - Some of our services have separate websites, each with its own search option. Please check the following list:
Schloss Dagstuhl - LZI - Logo
Schloss Dagstuhl Services
Seminars
Within this website:
External resources:
  • DOOR (for registering your stay at Dagstuhl)
  • DOSA (for proposing future Dagstuhl Seminars or Dagstuhl Perspectives Workshops)
Publishing
Within this website:
External resources:
dblp
Within this website:
External resources:
  • the dblp Computer Science Bibliography


Dagstuhl Seminar 25021

Grand Challenges for Research on Privacy Documents

( Jan 05 – Jan 10, 2025 )

Permalink
Please use the following short url to reference this page: https://www.dagstuhl.de/25021

Organizers

Contact

Motivation

Dagstuhl Seminar will gather an interdisciplinary group of researchers from privacy, natural language processing, human-computer interaction, public policy, and law to identify and characterize key challenges to research on privacy documents: privacy policies, terms of use, cookie policies, and other texts about data practices. In the status quo, privacy documents fail to fulfill the needs of stakeholders in our information society. Although many Internet users have concerns about their privacy, most lack the time, knowledge, and other resources to understand these documents, leaving them underinformed and compromising notice and choice. The needs of other stakeholders, including researchers, policymakers, and privacy practitioners, are similarly stymied. Although a growing body of research is devoted to analyzing, reconstituting, or otherwise using these documents to satisfy stakeholders’ needs, broader interdisciplinary efforts are needed.

This seminar will identify and characterize key challenges in privacy document research and produce a roadmap of how to tackle them in order to move the field forward. The unique gathering of researchers will be an opportunity to develop and explore the challenges interactively in a highly connected interdisciplinary setting. Potential areas of challenges include:

Privacy document selection: Deciding what privacy documents to analyze is itself a complex question. Popularity, ease of collection, sectoral focus, language, and quantity of documents are factors with impacts that are only partially understood. Additionally, duplicated effort has led to siloed knowledge about best practices.

Privacy document retrieval: Privacy documents are typically collected from the Web or from app stores. Collection methods tend to be purpose-driven and organic, in response to basic collection difficulties (e.g., lack of rigid standards for where to post a privacy policy and how to parse its text) and to specific project goals. This represents a space of options that researchers to date have explored only in an ad hoc fashion.

Policy analysis: Methods vary widely from hand-labeling of small quantities of documents to text classification using large language models. Annotation schemes similarly vary in granularity from binary labeling of documents to frame-centric annotation schemes for labeling phrases or sentences. While no one set of methods accommodates all research goals, the hidden work of language resource construction for this domain is often underappreciated, and prior work remains underutilized.

Lack of a dedicated research community or publication outlets: Privacy policy analysis draws from a variety of different research areas that have had limited contact. While interdisciplinary research is widely acknowledged to be important, obstacles remain for computational researchers and law or public policy researchers to form effective research partnerships. Also, the outputs of interdisciplinary projects that focus on privacy documents may lack a venue that recognizes their value.

Availability and maintenance of research tools: The lack of community poses a challenge to discovering existing work on the analysis of privacy documents. Additionally, the lack of a way to discover tools for working with privacy documents is one of several contributing factors to lack of maintenance of them, as perceived utility is diminished.

By bringing together researchers who face these problems, the seminar will serve as a conduit for interdisciplinary discovery of shared goals and possible solutions.

Copyright Florian Schaub, Christine Utz, and Shomir Wilson

Classification
  • Computation and Language
  • Computers and Society
  • Human-Computer Interaction

Keywords
  • Privacy
  • Policy
  • Law
  • Text
  • Usability