Keywords
(English)
|
Web-based data collection; Normative Multiagent Systems; Semantic Web; Automatic Collection of Data; Policy; Obligations; Prohibitions; Social Software.
|
Research programmes
(English)
|
COST-Action IS1004 - WEBDATANET: web-based data-collection - methodological challenges, solutions and implementations.
|
Short description
(English)
|
Web-based data collection is becoming increasingly important in many social science fields. It is not restricted to Web surveys; it also includes non-reactive data collected by various techniques from heterogeneous Web sources. A scientific methodology for Web-based data collection has not yet been developed. Such a methodology has to serve two perspectives. For those who will analyse the data, the relevant components are data validity, reliability, and quality. For data providers, it is essential to be able to constrain access to their data and to know how the data will be stored and used. The corresponding guidelines are currently expressed in natural language. As a result, when large amounts of data are extracted automatically by specialized software, complying with those norms becomes very difficult. Developing new technologies to support Web-based data collection is therefore an important open issue. In this project we propose to tackle it by developing models and techniques for expressing Web-based data collection guidelines, rules, and policies at different levels of abstraction. We want to develop new techniques for automatic data extraction from the Web that rely on Semantic Web technologies and automated reasoning to plan actions compliant with the guidelines. Finally, we plan to implement a demonstration tool that uses these technologies for Web-based data collection.
|
Further notes and information
(English)
|
Full name of research-institution/enterprise: Università della Svizzera Italiana Facoltà di scienze della comunicazione
|
Partners and international organisations
(English)
|
AT; BG; HR; CY; DK; EE; FI; FR; F.Y.R. Macedonia; DE; EL; HU; IS; IE; IL; IT; LU; MT; NL; NO; PL; PT; RO; RS; SK; SI; ES; SE; UK
|
Abstract
(English)
|
In recent years, web data has become increasingly ubiquitous on social web sites, devices, and sensors connected by digital networks, and collecting data from the web has become essential for many research areas such as applied economics, sociology, market research, health studies, psychology, and communication sciences. In this context it is crucial, particularly in social media systems, to be able to take into account automatically the legal and ethical constraints on how data can be collected, stored, and used, and to consider the privacy issues raised by the collected data. Legal and ethical constraints are currently expressed in natural language. As a result, when large amounts of data are extracted automatically by specialized software, complying with them becomes very difficult. Developing new technologies to support Web-based data collection is therefore an important open issue. Given these challenges, the overall goal of this project is to study and develop techniques for Web-based data collection that guarantee that the activities performed during data collection comply with given legal and ethical policies. To this end, we propose new techniques for automatic data extraction from social network data that use Semantic Web technologies to represent the extracted data and to reason on their semantics. We also propose to use OWL 2 to formally express the policies that regulate how data should be manipulated in order to comply with ethical guidelines and laws. To the best of our knowledge, there are currently no studies or tools for collecting non-reactive data from the Web that combine formal models of policy/norm representation and reasoning with knowledge extraction techniques in order to comply with online research ethical guidelines and legal constraints.
|
Database references
(English)
|
Swiss Database: COST-DB of the State Secretariat for Education and Research, Hallwylstrasse 4, CH-3003 Berne, Switzerland. Tel. +41 31 322 74 82. Swiss Project-Number: C11.0128
|