TOP
Search the Dagstuhl Website
Looking for information on the websites of the individual seminars? - Then please:
Not found what you are looking for? - Some of our services have separate websites, each with its own search option. Please check the following list:
Schloss Dagstuhl - LZI - Logo
Schloss Dagstuhl Services
Seminars
Within this website:
External resources:
  • DOOR (for registering your stay at Dagstuhl)
  • DOSA (for proposing future Dagstuhl Seminars or Dagstuhl Perspectives Workshops)
Publishing
Within this website:
External resources:
dblp
Within this website:
External resources:
  • the dblp Computer Science Bibliography


Dagstuhl Seminar 16141

Analysis, Interpretation and Benefit of User-Generated Data: Computer Science Meets Communication Studies

( Apr 03 – Apr 08, 2016 )

(Click in the middle of the image to enlarge)

Permalink
Please use the following short url to reference this page: https://www.dagstuhl.de/16141

Organizers

Contact


Schedule

Motivation

The success of the Internet as a communication technology and tool for human interaction in countless contexts, including production and trade, has had a dramatic impact on modern societies. With diffusion rates nearing one hundred percent in most societal groups, there is virtually no one whose life is not influenced by online communication – either directly or indirectly. Every day, private end users and business users act and interact online, producing immense amounts of data. Many disciplines, including computer science, computer linguistics, psychology, and communication studies, have identified "big data" generated by online users as a research field. As a result, big data has become a somewhat over-hyped catch-all term for many different types of data, which are analyzed through varying methods for multiple purposes. This ranges from an analysis of (unstructured) Twitter or Facebook content to rule-structured texts as found in the professional media (i.e., news websites). The implication of value generated through sheer size of data sets is misleading, though – much of the value is based on the nature of these data sets as being user-generated, either on purpose or inevitably (and often unknowingly) as behavioral traces of actions with divergent aims.

Big data sets generated by human users pose some challenges to the scientific disciplines that are interested in them: Typically, computer scientists have the knowledge and tools to access, extract and process big data sets. However, the analysis and interpretation of such data mirrors the interactions of users who produced the data and is not following a purely technological logic. In other words, such data has a human/social component, and in order to interpret and understand it, social-scientific theories and methods are helpful. Social scientists, however, typically do not specialize in the practicalities of online technologies and of programming. While they have theoretical approaches and empirical methods available that can be helpful in the analysis of user generated content – and this is especially true for communication scholars who specialize in the analysis of (online) media content –, their possibilities to access and process data are limited (as this is not core to their field yet).

Consequently, both disciplinary approaches will not be able to fully address the challenges of analyzing big data based on user (inter)action from the perspective of their own "silo". A combination of the two approaches seems fruitful, as each discipline may help in solving the problems of the other, and the sum will be more than its parts – leading to a better understanding of social interaction and human communication in a digitized world. This seminar will bring together both computer scientists interested in the analysis of (large-scale) user-generated data, and communication scholars interested in computer-assisted acquisition and processing of such data. It is intended to start a fruitful dialogue on potential approaches, methods, uses and benefits of a cooperation between the two disciplines, and it will also include the input of practitioners in the field of media and business who will offer valuable insights into practical use cases.


Summary

The success of the Internet as a communication technology and tool for human interaction in countless contexts, including production and trade, has had a dramatic impact on modern societies. With diffusion rates nearing one hundred percent in most societal groups, there is virtually no one whose life is not influenced by online communication -- either directly or indirectly. Every day, private end users and business users act and interact online, producing immense amounts of data. Many disciplines, including computer science, computer linguistics, psychology, and communication studies, have identified ‘big data’ generated by online users as a research field. As a result, big data has become a somewhat over-hyped catch-all term for many different types of data, which are analyzed through varying methods for multiple purposes. This ranges from an analysis of (unstructured) Twitter or Facebook content to rule-structured texts as found in the professional media (i.e., news websites). The implication of value generated through sheer size of data sets is misleading, though -- much of the value is based on the nature of these data sets as being user-generated, either on purpose or inevitably (and often unknowingly) as behavioral traces of actions with divergent aims.

Big data sets generated by human users pose some challenges to the scientific disciplines that are interested in them: Typically, computer scientists have the knowledge and tools to access, extract and process big data sets. However, the analysis and interpretation of such data mirrors the interactions of users who produced the data and is not following a purely technological logic. In other words, such data has a human/social component, and in order to interpret and understand it, social-scientific theories and methods are helpful. Social scientists, however, typically do not specialize in the practicalities of online technologies and of programming. While they have theoretical approaches and empirical methods available that can be helpful in the analysis of user generated content -- and this is especially true for communication scholars who specialize in the analysis of (online) media content --, their possibilities to access and process data are limited (as this is not core to their field yet).

Consequently, both disciplinary approaches will not be able to fully address the challenges of analyzing big data based on user (inter)action from the perspective of their own ‘silo’. A combination of the two approaches seems fruitful, as each discipline may help in solving the problems of the other, and the sum will be more than its parts -- leading to a better understanding of social interaction and human communication in a digitized world. This seminar will bring together both computer scientists interested in the analysis of (large-scale) user-generated data, and communication scholars interested in computer-assisted acquisition and processing of such data. It is intended to start a fruitful dialogue on potential approaches, methods, uses and benefits of a cooperation between the two disciplines, and it will also include the input of practitioners in the field of media and business who will offer valuable insights into practical use cases.

Copyright Thorsten Quandt, German Shegalov, Helle Sjøvaag, and Gottfried Vossen

Participants
  • Christian Baden (The Hebrew University of Jerusalem, IL) [dblp]
  • David Domingo (Free University of Brussels, BE) [dblp]
  • Martin Emmer (FU Berlin, DE) [dblp]
  • Raul Ferrer Conill (Karlstad University, SE) [dblp]
  • Johann-Christoph Freytag (HU Berlin, DE) [dblp]
  • Krishna P. Gummadi (MPI-SWS - Saarbrücken, DE) [dblp]
  • Elisabeth Günther (Universität Münster, DE) [dblp]
  • Ari Heinonen (University of Tampere, FI) [dblp]
  • Jukka Huhtamäki (Tampere University of Technology, FI) [dblp]
  • Seth C. Lewis (University of Oregon, US) [dblp]
  • Wiebke Loosen (Hans-Bredow-Institut - Hamburg, DE) [dblp]
  • Alexander Löser (Beuth Hochschule für Technik - Berlin, DE) [dblp]
  • Truls Pedersen (University of Bergen, NO) [dblp]
  • Thorsten Quandt (Universität Münster, DE) [dblp]
  • Tatjana Scheffler (Universität Potsdam, DE) [dblp]
  • Ralf Schenkel (Universität Trier, DE) [dblp]
  • German Shegalov (Twitter - San Francisco, US) [dblp]
  • Helle Sjøvaag (University of Bergen, NO)
  • Hendrik Stange (Fraunhofer IAIS - St. Augustin, DE) [dblp]
  • Eirik Stavelin (University of Bergen, NO)
  • Martin Theobald (Universität Ulm, DE) [dblp]
  • Heike Trautmann (Universität Münster, DE) [dblp]
  • Damian Trilling (University of Amsterdam, NL)
  • Gottfried Vossen (Universität Münster, DE) [dblp]
  • Rodrigo Zamith (University of Massachusetts - Amherst, US) [dblp]

Related Seminars
  • Dagstuhl Perspectives Workshop 12182: Social, Supply-Chain, Administrative, Business, Commerce, Political networks: a multi-discipline perspective (2012-05-01 - 2012-05-04) (Details)

Classification
  • data bases / information retrieval
  • society / human-computer interaction
  • world wide web / internet

Keywords
  • Information systems
  • data science
  • communication sciences