Privacy Policy Annotation for Semi-automated Analysis: A Cost-Effective Approach - Trust Management XII
Conference Papers Year : 2018

Privacy Policy Annotation for Semi-automated Analysis: A Cost-Effective Approach

Dhiren A. Audich
  • Function : Author
  • PersonId : 1035463
Rozita Dara
  • Function : Author
  • PersonId : 1035464
Blair Nonnecke
  • Function : Author
  • PersonId : 1035465

Abstract

Privacy policies go largely unread as they are not standardized, often written in jargon, and frequently long. Several attempts have been made to simplify and improve readability with varying degrees of success. This paper looks at keyword extraction, comparing human extraction to natural language algorithms as a first step in building a taxonomy for creating an ontology (a key tool in improving access and usability of privacy policies).In this paper, we present two alternatives to using costly domain experts are used to perform keyword extraction: trained participants (non-domain experts) read and extracted keywords from online privacy policies; and second, supervised and unsupervised learning algorithms extracted keywords. Results show that supervised learning algorithm outperform unsupervised learning algorithms over a large corpus of 631 policies, and that trained participants outperform the algorithms, but at a much higher cost.
Fichier principal
Vignette du fichier
470710_1_En_3_Chapter.pdf (294.74 Ko) Télécharger le fichier
Origin Files produced by the author(s)
Loading...

Dates and versions

hal-01855985 , version 1 (09-08-2018)

Licence

Identifiers

Cite

Dhiren A. Audich, Rozita Dara, Blair Nonnecke. Privacy Policy Annotation for Semi-automated Analysis: A Cost-Effective Approach. 12th IFIP International Conference on Trust Management (TM), Jul 2018, Toronto, ON, Canada. pp.29-44, ⟨10.1007/978-3-319-95276-5_3⟩. ⟨hal-01855985⟩
95 View
124 Download

Altmetric

Share

More