P2P caching
   HOME

TheInfoList



OR:

Peer-to-peer caching (P2P caching) is a
computer network A computer network is a set of computers sharing resources located on or provided by network nodes. The computers use common communication protocols over digital interconnections to communicate with each other. These interconnections are ...
traffic management technology used by
Internet Service Providers An Internet service provider (ISP) is an organization that provides services for accessing, using, or participating in the Internet. ISPs can be organized in various forms, such as commercial, community-owned, non-profit, or otherwise privat ...
(ISPs) to accelerate content delivered over
peer-to-peer Peer-to-peer (P2P) computing or networking is a distributed application architecture that partitions tasks or workloads between peers. Peers are equally privileged, equipotent participants in the network. They are said to form a peer-to-peer ...
(P2P) networks while reducing related bandwidth costs. P2P caching is similar in principle to the content caching long used by ISPs to accelerate Web (
HTTP The Hypertext Transfer Protocol (HTTP) is an application layer protocol in the Internet protocol suite model for distributed, collaborative, hypermedia information systems. HTTP is the foundation of data communication for the World Wide We ...
) content. P2P caching temporarily stores popular content that is flowing into an ISP's network. If the content requested by a subscriber is available from a cache, the cache satisfies the request from its temporary storage, eliminating data transfer through expensive transit links and reducing
network congestion Network congestion in data networking and queueing theory is the reduced quality of service that occurs when a network node or link is carrying more data than it can handle. Typical effects include queueing delay, packet loss or the blocking of ...
. This approach could make ISPs violate laws as P2P systems share files that infringe copyrights in significant portions. P2P content responds well to caching because it has high reuse patterns reflecting a Zipf's-like distribution. P2P communities have different Zipf's parameters which determine what fraction of files is requested multiple times. For example, one P2P community may request 75% of content multiple times while another may request only 10%. Some P2P caching devices can also accelerate HTTP video streaming traffic from
YouTube YouTube is a global online video sharing and social media platform headquartered in San Bruno, California. It was launched on February 14, 2005, by Steve Chen, Chad Hurley, and Jawed Karim. It is owned by Google, and is the second mo ...
,
Facebook Facebook is an online social media and social networking service owned by American company Meta Platforms. Founded in 2004 by Mark Zuckerberg with fellow Harvard College students and roommates Eduardo Saverin, Andrew McCollum, Dust ...
,
RapidShare RapidShare was an online file hosting service that opened in 2002. In 2009, it was among the Internet's 20 most visited websites and claimed to have 10 petabytes of files uploaded by users with the ability to handle up to three million users si ...
, MegaUpload,
Google Google LLC () is an American Multinational corporation, multinational technology company focusing on Search Engine, search engine technology, online advertising, cloud computing, software, computer software, quantum computing, e-commerce, ar ...
,
AOL Video AOL (stylized as Aol., formerly a company known as AOL Inc. and originally known as America Online) is an American web portal and online service provider based in New York City. It is a brand marketed by the current incarnation of Yahoo! Inc. ...
, MySpace and other web video-sharing sites.


How P2P caching works

P2P caching involves creating a cache or temporary storage space for P2P data, using specialized communications hardware, disk storage and associated software. This cache is placed in the ISP's network, either co-located with the Internet transit links or placed at key aggregation points or at each cable head-end. Once a P2P cache is established, the network will transparently redirect P2P traffic to the cache, which either serves the file directly or passes the request on to a remote P2P user and simultaneously caches that data for the next user. To what extent the caching is beneficial depends on how similar the content interests of ISP's customers. Due to relatively small number of content shared in P2P systems (compared to Web) and semantic, geographic, and organization interests of users sharing ratio in P2P can be significantly higher than HTTP/Web caching. P2P caching typically works with a network traffic-mitigation technology called
Deep Packet Inspection Deep packet inspection (DPI) is a type of data processing that inspects in detail the data being sent over a computer network, and may take actions such as alerting, blocking, re-routing, or logging it accordingly. Deep packet inspection is oft ...
(DPI). DPI technology is used by service providers to understand what traffic is running across their networks and to separate it and treat it for the most efficient delivery. DPI products identify and pass P2P packets to the P2P caching system so it can cache the traffic and accelerate it. Peerapp Ltd. holds the first patent for P2P caching technology, which was filed in 2000.


The P2P bandwidth problem

In 2008,
peer-to-peer Peer-to-peer (P2P) computing or networking is a distributed application architecture that partitions tasks or workloads between peers. Peers are equally privileged, equipotent participants in the network. They are said to form a peer-to-peer ...
traffic was estimated to account for 50% of all Internet traffic, and was expected to quadruple between 2008 and 2013, reaching 3.3 exabytes per month– or the equivalent of 500 million DVDs each month. However, this trend has been discontinued, as by 2016 the global P2P traffic began to lower, showing a 6% descent between 2016 and 2021. These statistics may be explained by the popularization of
Video on Demand Video on demand (VOD) is a media distribution system that allows users to access videos without a traditional video playback device and the constraints of a typical static broadcasting schedule. In the 20th century, broadcasting in the form of ...
services, which have (until the moment) used a centralized architecture for data distribution. Increasing P2P traffic has created problems for ISPs. Networks can become saturated with P2P traffic, creating congestion for other types of Internet use. The cost of P2P traffic is disproportionate to the amount of revenue ISPs make from these customers because of the flat-rate packages of bandwidth commonly sold. To prevent P2P traffic from degrading service for all subscribers, ISPs typically face three choices: * Invest in additional bandwidth and equipment. Unfortunately, increasing bandwidth often does not solve the problem, because P2P applications inherently tend to consume as much bandwidth as available. * Implement stricter byte caps, policies, or P2P traffic-shaping, limiting the speed of P2P traffic. The difficulty is that P2P packets are becoming harder and harder to identify, especially with the introduction of encryption (such as
BitTorrent protocol encryption Protocol encryption (PE), message stream encryption (MSE) or protocol header encrypt (PHE) are related features of some peer-to-peer file-sharing clients, including BitTorrent clients. They attempt to enhance privacy and confidentiality. In addit ...
). Traffic shaping can also generate negative publicity and customer reactions. * Implement a form of P2P caching. Caching releases the bandwidth demand on critical Internet links and improves the experience for all users – P2P users whose file sharing is improved through using the cache, and non-P2P users who experience better performance from networks un-congested from P2P traffic. The initial adopters of P2P caching have been ISPs in Asia, the Pacific Rim, Latin America, the Caribbean and the Middle East, whose subscribers are heavy users of P2P networks and where providing the additional bandwidth to handle P2P data is very costly due to the expense of international transit links. P2P caching is expected to become an increasingly essential technology for ISPs and MSOs (multiple system operators) worldwide, particularly with the growing popularity of P2P content among broadband subscribers and the adoption of P2P as a content-distribution strategy by mainstream content providers such as the BBC.


P2P caching implementations

* PeerApp UltraBand Media Caching Software * Corelli Gareth Tyson, Andreas Mauthe, Sebastian Kaune, Mu Mu and Thomas Plagemann. Corelli: A Peer-to-Peer Dynamic Replication Service for Supporting Latency-Dependent Content in Community Networks. {{cite web , url=http://comp.eprints.lancs.ac.uk/2044/1/MMCN09.pdf , title=Archived copy , accessdate=2012-04-26 , url-status=dead , archiveurl=https://web.archive.org/web/20150618193018/http://comp.eprints.lancs.ac.uk/2044/1/MMCN09.pdf , archivedate=2015-06-18 is a community-based P2P caching system that operates in a decentralized way across multiple peers. This allows a caching service to be realised in environments that do not possess fixed caching infrastructure, e.g. a Wireless ad hoc network. * Community Caching is a P2P community-interest-aware, distributed caching solution for structured (DHT-based) P2P systems. It alleviates the overhead due to isolating P2P communities and loss of content popularity due to aggregation of content from multiple communities.


Sources

Internet Protocol based network software Peer-to-peer file sharing Cache (computing)