IEEE Transactions on Parallel and Distributed Systems, Volume 24, N°4, ISSN: 1045-9219
The endeavor of this work is to study the impact of content popularity in a large-scale Peer-to-Peer network, namely KAD. Based on an extensive measurement campaign, we pinpoint several deficiencies of KAD in handling popular content and provide a series of improvements to address such shortcomings. Our work reveals that keywords, which are associated to content, may become popular for two distinct reasons. First, we show that some keywords are intrinsically popular because they are common to many disparate contents: in such case we ameliorate KAD by introducing a simple mechanism that identifies stopwords. Then, we focus on keyword popularity that directly relates to popular content. We design and evaluate an adaptive load balancing mechanism that is backward compatible with the original implementation of KAD. Our scheme features the following properties: (i) it drives the process that selects the location of peers responsible to store references to objects, based on object popularity; (ii) it solves problems related to saturated peers that would otherwise inflict a significant drop in the diversity of references to objects, and (iii) if coupled with a load-aware content search procedure, it allows for a more fair and efficient usage of peer resources.
© 2012 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.