An industrial perspective on web scraping characteristics and open issues

Chiapponi, Elisa; Dacier, Marc; Thonnard, Olivier; Fangar, Mohamed; Mattsson, Mattias; Rigal, Vincent
DSN 2022, 52nd Annual IEEE/IFIP International Conference on Dependable Systems and Networks, June 27-30, 2022, Baltimore, Maryland, USA

An ongoing battle has been running for more than a decade between e-commerce website owners and web scrapers. Whenever one party finds a new technique to prevail, the other comes up with a solution to defeat it. Based on our industrial experience, we know this problem is far from being solved. New solutions are needed to address automated threats. In this work, we describe the actors taking part in the battle, the weapons at their disposal, and their allies on either side. We present a real-world setup to explain how e-commerce website operators try to defend themselves and the open problems they seek solutions for.


Type: Conference
City: Baltimore
Date: 2022-06-27
Department: Digital Security
Eurecom Ref: 6894
Copyright: © 2022 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

PERMALINK: https://www.eurecom.fr/publication/6894