Measuring the Privacy Dimension of Free Content Websites through Automated Privacy Policy Analysis and Annotation

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

16 Scopus citations

Abstract

Websites that provide books, music, movies, and other media free of charge are a central piece of the web ecosystem, although they are vastly unexplored, especially for their security and privacy risks. In this paper, we contribute to the understanding of those websites by focusing on the comparative analysis of their privacy policies, a primary channel where service providers inform users about their data collection and use. To better understand the data usage risks associated with such websites, we study 1,562 websites and their privacy policies in contrast to premium websites. We uncover that premium websites are more transparent in reporting their privacy practices, particularly in categories such as "Data Retention"and "Do Not Track", with premium websites are 85.00% and ≈ 70% more likely to report their practices in comparison to the free content websites. We found the free content websites' privacy policies to be more similar to one another and generic in comparison to the premium websites' privacy policies. Our findings raise several concerns, including that the reported privacy policies may not reflect the data collection practices used by service providers, and various pronounced biases across privacy policy categories. This calls for further investigation of the risks associated with the usage of such free content websites and services through active measurements.

Original languageEnglish
Title of host publicationWWW 2022 - Companion Proceedings of the Web Conference 2022
PublisherAssociation for Computing Machinery, Inc
Pages860-867
Number of pages8
ISBN (Electronic)9781450391306
DOIs
StatePublished - 16 Aug 2022
Externally publishedYes
Event31st Companion of the World Wide Web Conference, WWW 2022 - Virtual, Lyon, France
Duration: 25 Apr 2022 → …

Publication series

NameWWW 2022 - Companion Proceedings of the Web Conference 2022

Conference

Conference31st Companion of the World Wide Web Conference, WWW 2022
Country/TerritoryFrance
CityVirtual, Lyon
Period25/04/22 → …

Keywords

  • Free Content Websites
  • Natural Language Processing
  • Privacy Policy
  • Web Security

Fingerprint

Dive into the research topics of 'Measuring the Privacy Dimension of Free Content Websites through Automated Privacy Policy Analysis and Annotation'. Together they form a unique fingerprint.

Cite this