By admin | Published: March 13, 2011
Matthieu Latapy, Clémence Magnien and Raphaël Fournier
Increasing knowledge of paedophile activity in P2P systems is a crucial societal
concern, with important consequences for child protection, policy making, and
internet regulation. Because of a lack of traces of P2P exchanges and rigorous
analysis methodology, however, current knowledge of this activity remains very
limited. We consider here a widely used P2P system, eDonkey, and focus on two
key statistics: the fraction of paedophile queries entered in the system and the
fraction of users who entered such queries. We collect hundreds of millions of
keyword-based queries; we design a paedophile query detection tool for which we
establish false positive and false negative rates using assessment by experts;
with this tool and these rates, we then estimate the fraction of paedophile
queries in our data; finally, we design and apply methods for quantifying users
who entered such queries. We conclude that approximately 0.25 % of queries are
paedophile, and that more than 0.2 % of users enter such queries. These
statistics are by far the most precise and reliable ever obtained in this
domain.
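
The abstract does not spell out how the error rates are combined with the raw detection count. As a minimal sketch only, the following applies the standard misclassification correction. It assumes that the false positive rate is the probability that a non-paedophile query is flagged, and that the false negative rate is the probability that a paedophile query is missed. The function name and the numbers in the usage note are illustrative and are not taken from the paper.

```python
def corrected_fraction(observed_fraction, false_positive_rate, false_negative_rate):
    """Estimate the true positive fraction from the fraction a detector flags.

    Assumed model (not necessarily the paper's):
        observed = true * (1 - fn) + (1 - true) * fp
    Solving for `true` gives the expression below.
    """
    denominator = 1.0 - false_positive_rate - false_negative_rate
    if denominator <= 0:
        raise ValueError("detector is no better than chance; cannot correct")
    estimate = (observed_fraction - false_positive_rate) / denominator
    # Clamp to [0, 1], since sampling noise can push the estimate outside.
    return min(1.0, max(0.0, estimate))
```

For instance, with made-up rates of 0.1 (false positives) and 0.2 (false negatives), a detector that flags 27.5 % of queries corresponds to a corrected fraction of 25 %.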
Posted in Papers | Also tagged p2p
By admin | Published: November 21, 2009

> By Raphaël Fournier and Matthieu Latapy. P2P systems are known to host a large amount of paedophile activity. Thus, quantifying the number of paedophile users on a P2P system is crucial, for many reasons: easy access to such content is a major societal concern, policy making and law-enforcement budgeting rely on this figure and [...]
Posted in Plots | Also tagged p2p
By admin | Published: September 26, 2009

> By Christophe Berger, Clémence Magnien, Matthieu Latapy, Firas Bessadok and Phillipe Jarlov. We conduct a measurement of files available in eDonkey as follows. Our client connects to all eDonkey servers it discovers (it knows an initial list of servers and explores the set of all servers reachable from these). Then it sends every 12 [...]
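
The server discovery step described in this excerpt amounts to a graph traversal. The sketch below is one way to express it, assuming a hypothetical callback `get_known_servers(server)` that returns the servers advertised by a given server. It does not speak the eDonkey protocol and is not the authors' client.

```python
from collections import deque


def discover_servers(initial_servers, get_known_servers):
    """Breadth-first exploration of all servers reachable from an initial list.

    `get_known_servers` is a hypothetical callback standing in for the
    network request that asks one server which other servers it knows.
    """
    seen = set(initial_servers)
    queue = deque(initial_servers)
    while queue:
        server = queue.popleft()
        for neighbour in get_known_servers(server):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append(neighbour)
    return seen
```

Servers that no reachable server advertises are never found, which is the usual limitation of this kind of exploration.
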
Posted in Plots | Also tagged measurement, p2p
By admin | Published: May 9, 2009

> By Raphaël Fournier and Guillaume Valadon. On a P2P system, users submit keyword-based queries to a search engine. Some of them request paedophile content. This plot gives the distribution of the number of paedophile keywords contained in the queries sent to an eDonkey server during a ten-week experiment [1]. We plotted the number of [...]
Posted in Plots | Also tagged p2p
By admin | Published: March 16, 2009

> By Oussama Allali, Matthieu Latapy and Clémence Magnien. In Measurement of eDonkey Activity with Distributed Honeypots, we propose a new way to observe activity in a P2P system. It relies on a set of honeypots, i.e. fake peers which claim to provide some files and record the queries they receive for these files. These [...]
Posted in Plots | Also tagged honeypot, measurement, p2p
By admin | Published: March 7, 2009

> By Abdelhamid Salah Brahim, Bénédicte Le Grand and Matthieu Latapy. In the paper Ten Weeks in the Life of an eDonkey Server, we present the continuous capture, over ten weeks, of exchanges between a large eDonkey server and its clients, leading to the observation of approximately 80 million distinct clients and 275 million [...]
Posted in Plots | Also tagged p2p, spreading
By admin | Published: December 1, 2008

> By Lamia Benamara and Clémence Magnien. This plot shows the distribution of the time elapsed between two successive queries made by the same peer in a P2P system, over a period of one day. We observed 7 072 292 peers in total. A very large fraction of these time intervals (more than one third) [...]
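
As an illustration of the quantity being plotted, the sketch below computes inter-query times from a list of `(peer_id, timestamp)` records. The record format and the function name are assumptions made for the example, not the format of the authors' dataset.

```python
from collections import defaultdict


def inter_query_intervals(records):
    """Return the time gaps between successive queries of the same peer.

    `records` is an iterable of (peer_id, timestamp) pairs in any order;
    timestamps are assumed to be numbers in a common unit (e.g. seconds).
    """
    times_by_peer = defaultdict(list)
    for peer_id, timestamp in records:
        times_by_peer[peer_id].append(timestamp)

    intervals = []
    for times in times_by_peer.values():
        times.sort()
        # Pair each query with the next one from the same peer.
        intervals.extend(later - earlier for earlier, later in zip(times, times[1:]))
    return intervals
```

A peer that sent a single query contributes no interval, so the number of intervals is the number of queries minus the number of peers.
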
Posted in Plots | Also tagged dynamics, p2p
By admin | Published: October 24, 2008

> By Matthieu Latapy and Clémence Magnien. In the filenames of files exchanged in a P2P system, and in the keyword-based queries sent by users, age indications often appear in the form of a number followed by yo. The plot gives the distribution of these numbers in filenames and queries as observed in a 10 [...]
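
A pattern of this kind can be counted with a regular expression. The sketch below is an assumption about how such a count could be done, not the authors' tool: it accepts one or two digits, an optional space, and `yo` as a whole word, and the exact pattern is illustrative.

```python
import re
from collections import Counter

# One or two digits, an optional space, then "yo" as a whole word.
AGE_PATTERN = re.compile(r"\b(\d{1,2})\s?yo\b", re.IGNORECASE)


def age_distribution(strings):
    """Count how often each number appears in an '<number>yo' indication."""
    counts = Counter()
    for text in strings:
        for number in AGE_PATTERN.findall(text):
            counts[int(number)] += 1
    return counts
```

The same notation is used in unrelated contexts (for example `whisky 18yo`), so raw counts from such a pattern would need filtering before any interpretation.
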
Posted in Plots | Also tagged p2p
By admin | Published: March 2, 2008

> By Matthieu Latapy, Frédéric Aidouni and Clémence Magnien. Some (but not all!) peer-to-peer (P2P) file exchange systems are used to exchange paedophile content (as well as many other kinds of content!). The corresponding filenames often state the age of the children involved in the form xyo, where x is an integer and yo stands for years [...]
Posted in Plots | Also tagged p2p
By admin | Published: February 20, 2008

> By Matthieu Latapy and Frédéric Aidouni. In most P2P file exchange systems, the file sizes are visible without downloading the files. Here, we recorded many (22 765 093) such file sizes and plotted, for each file size (in megabytes, horizontal axis), the number of files with this size (i.e. the file size distribution). One observes [...]
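
The file size distribution described here is a count of files per size bin. The sketch below assumes sizes are given in bytes, that one megabyte is 10^6 bytes, and that sizes are rounded down to a whole megabyte. The excerpt does not state these conventions, so they are illustrative.

```python
from collections import Counter

BYTES_PER_MEGABYTE = 1_000_000  # assumption: decimal megabytes


def size_distribution_mb(sizes_in_bytes):
    """Map each size in whole megabytes to the number of files of that size."""
    return Counter(size // BYTES_PER_MEGABYTE for size in sizes_in_bytes)
```

Plotting the keys on the horizontal axis against the counts on the vertical axis gives a plot of the kind the excerpt describes.
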
Posted in Plots | Also tagged p2p