Tag Archives: antipaedo

Papers

Quantifying paedophile queries in a large P2P system

Matthieu Latapy, Clémence Magnien and Raphaël Fournier

Increasing knowledge of paedophile activity in P2P systems is a crucial societal concern, with important consequences for child protection, policy making, and internet regulation. Because of a lack of traces of P2P exchanges and of rigorous analysis methodology, however, current knowledge of this activity remains very limited. We consider here a widely used P2P system, eDonkey, and focus on two key statistics: the fraction of paedophile queries entered in the system and the fraction of users who entered such queries. We collect hundreds of millions of keyword-based queries; we design a paedophile query detection tool for which we establish false positive and false negative rates using assessment by experts; with this tool and these rates, we then estimate the fraction of paedophile queries in our data; finally, we design and apply methods for quantifying users who entered such queries. We conclude that approximately 0.25 % of queries are paedophile, and that more than 0.2 % of users enter such queries. These statistics are by far the most precise and reliable ever obtained in this domain.
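To illustrate how error rates can feed into such an estimate, here is a minimal sketch of a standard misclassification correction. It is not taken from the paper, whose exact definitions of the rates may differ. The function name `corrected_fraction` and its parameters are hypothetical, and the sketch assumes the false positive rate is measured among truly negative queries and the false negative rate among truly positive ones.

```python
def corrected_fraction(observed, fpr, fnr):
    """Estimate the true fraction of positive items from a detector's output.

    Assumptions (not from the paper): `observed` is the fraction of items the
    detector flags, `fpr` = P(flagged | truly negative), and
    `fnr` = P(not flagged | truly positive). Then
        observed = p * (1 - fnr) + (1 - p) * fpr
    which is solved here for the true fraction p.
    """
    denom = 1.0 - fpr - fnr
    if denom <= 0:
        raise ValueError("detector is no better than chance; cannot correct")
    estimate = (observed - fpr) / denom
    # Sampling noise can push the raw estimate outside [0, 1]; clamp it.
    return min(1.0, max(0.0, estimate))
```

With purely illustrative inputs (6 % of items flagged, `fpr = 0.02`, `fnr = 0.18`), the corrected estimate is 5 %.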

Plots

Quantifying paedophile users on a P2P system

> By Raphaël Fournier and Matthieu Latapy. P2P systems are known to host a large amount of paedophile activity. Thus, quantifying the number of paedophile users on a P2P system is crucial, for many reasons: easy access to such content is a major societal concern, policy making and law-enforcement budgeting rely on this figure and […]

Number of file-id discovered in a client-side eDonkey measurement

> By Christophe Berger, Clémence Magnien, Matthieu Latapy, Firas Bessadok and Phillipe Jarlov. We measure the files available in eDonkey as follows. Our client connects to all eDonkey servers it discovers (it knows an initial list of servers and explores the set of all servers reachable from these). Then it sends every 12 […]

Paedophile keywords in eDonkey queries

> By Raphaël Fournier and Guillaume Valadon. On a P2P system, users submit keyword-based queries to a search engine. Some of them request paedophile content. This plot gives the distribution of the number of paedophile keywords contained in the queries sent to an eDonkey server during a ten-week experiment [1]. We plotted the number of […]

Measurement of eDonkey Activity with Distributed Honeypots

> By Oussama Allali, Matthieu Latapy and Clémence Magnien. In Measurement of eDonkey Activity with Distributed Honeypots, we propose a new way to observe activity in a P2P system. It relies on a set of honeypots, i.e. fake peers that claim to provide some files and record the queries they receive for these files. These […]

File diffusion in an eDonkey P2P system

> By Abdelhamid Salah Brahim, Bénédicte Le Grand and Matthieu Latapy. In the paper Ten Weeks in the Life of an eDonkey Server, we present the continuous capture, over ten weeks, of exchanges between a large eDonkey server and its clients, leading to the observation of approximately 80 million distinct clients and 275 million […]

Time between queries in a P2P system

> By Lamia Benamara and Clémence Magnien. This plot shows the distribution of the time elapsed between two successive queries made by the same peer in a P2P system, over a period of one day. We observed 7 072 292 peers in total. A very large fraction of these time intervals (more than one third) […]
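A distribution of this kind can be computed from a query log in a few lines. The sketch below is only an illustration, not the authors' code: it assumes a hypothetical input of `(peer_id, timestamp)` pairs with timestamps in whole seconds, and the name `inter_query_times` is made up for this example.

```python
from collections import Counter, defaultdict


def inter_query_times(queries):
    """Count the gaps between successive queries made by the same peer.

    `queries` is an iterable of (peer_id, timestamp) pairs (hypothetical
    format; timestamps are assumed to be integers, e.g. seconds).
    Returns a Counter mapping each gap to the number of times it occurs.
    """
    by_peer = defaultdict(list)
    for peer_id, timestamp in queries:
        by_peer[peer_id].append(timestamp)

    distribution = Counter()
    for stamps in by_peer.values():
        stamps.sort()  # the log is not assumed to be ordered
        # Pair each query with the next one from the same peer.
        for earlier, later in zip(stamps, stamps[1:]):
            distribution[later - earlier] += 1
    return distribution
```

Peers that sent a single query contribute no interval, so the number of intervals is smaller than the number of queries.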

Ages in queries and filenames

> By Matthieu Latapy and Clémence Magnien. In the filenames of files exchanged in a P2P system, and in the keyword-based queries sent by users, age indications often appear in the form of a number followed by yo. The plot gives the distribution of these numbers in filenames and queries as observed in a 10 […]

Paedophile content in Peer-to-Peer exchanges

> Matthieu Latapy, Frédéric Aidouni and Clémence Magnien. Some (but not all!) peer-to-peer (P2P) file exchange systems are used to exchange paedophile content (as well as many other kinds of content!). The corresponding filenames often state the claimed age of the children involved, in the form xyo where x is an integer and yo stands for years […]

P2P file size distribution

> Matthieu Latapy and Frédéric Aidouni. In most P2P file exchange systems, the file sizes are visible without downloading the files. Here, we recorded many (22 765 093) such file sizes and plotted, for each file size (in megabytes, horizontal axis), the number of files with this size (i.e. the file size distribution). One observes […]
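As a rough illustration of how such a distribution can be built, the sketch below bins recorded sizes by whole megabyte and counts the files in each bin. It is an assumption-laden example rather than the authors' procedure: the function name `size_distribution` is hypothetical, sizes are assumed to be given in bytes, and one megabyte is taken as 10^6 bytes (the post does not say which convention was used).

```python
from collections import Counter


def size_distribution(sizes_in_bytes, bytes_per_mb=1_000_000):
    """Map each size in whole megabytes to the number of files of that size.

    Assumptions: sizes are non-negative integers in bytes, and sizes are
    rounded down to the megabyte (a file of 1.9 MB falls in bin 1).
    """
    return Counter(size // bytes_per_mb for size in sizes_in_bytes)


def sorted_points(distribution):
    """Return (size_mb, file_count) pairs ordered by size, ready to plot."""
    return sorted(distribution.items())
```

Plotting the pairs returned by `sorted_points` with size on the horizontal axis and count on the vertical axis gives a plot of the kind described above.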
