Torrent file
   HOME

TheInfoList



OR:

In the BitTorrent file distribution system, a torrent file or meta-info file is a
computer file A computer file is a computer resource for recording data in a computer storage device, primarily identified by its file name. Just as words can be written to paper, so can data be written to a computer file. Files can be shared with and trans ...
that contains metadata about files and folders to be distributed, and usually also a list of the network locations of trackers, which are computers that help participants in the system find each other and form efficient distribution groups called ''swarms''. A torrent file does not contain the content to be distributed; it only contains information about those files, such as their names, folder structure, sizes, and cryptographic hash values for verifying file integrity. The term ''torrent'' may refer either to the metadata file or to the files downloaded, depending on the context. A torrent file acts like a table of contents (index) that allows computers to find information through the use of a BitTorrent client. With the help of a torrent file, one can download small parts of the original file from computers that have already downloaded it. These "peers" allow for downloading of the file in addition to, or in place of, the primary server. Torrent files are normally named with the extension ".torrent". Torrent files themselves and the method of using torrent files have been created to ease the load on central servers, as instead of sending a file to for request, it can crowd-source the bandwidth needed for the file transfer, and reduce the time needed to download large files. Many free/freeware programs and operating systems, such as the various
Linux distributions A Linux distribution (often abbreviated as distro) is an operating system made from a software collection that includes the Linux kernel and, often, a package management system. Linux users usually obtain their operating system by downloading one ...
offer a torrent download option for users seeking the aforementioned benefits. Other large downloads, such as media files, are often torrented as well.


Background

Typically, Internet access is asymmetrical, supporting greater download speeds than upload speeds, limiting the bandwidth of each download, and sometimes enforcing bandwidth caps and periods where systems are not accessible. This creates inefficiency when many people want to obtain the same set of files from a single source; the source must always be online and must have massive outbound bandwidth. The BitTorrent protocol addresses this by decentralizing the distribution, leveraging the ability of people to network "
peer-to-peer Peer-to-peer (P2P) computing or networking is a distributed application architecture that partitions tasks or workloads between peers. Peers are equally privileged, equipotent participants in the network. They are said to form a peer-to-peer ...
", among themselves. Each file to be distributed is divided into small information chunks called ''pieces''. Downloading peers achieve high download speeds by requesting multiple pieces from different computers simultaneously in the swarm. Once obtained, these pieces are usually immediately made available for download by others in the swarm. In this way, the burden on the network is spread among the downloaders, rather than concentrating at a central distribution hub or cluster. As long as all the pieces are available, peers (downloaders and uploaders) can come and go; no one peer needs to have all the chunks or to even stay connected to the swarm in order for distribution to continue among the other peers. A small torrent file is created to represent a file or folder to be shared. The torrent file acts as the key to initiating downloading of the actual content. Someone interested in receiving the shared file or folder first obtains the corresponding torrent file, either by directly downloading it or by using a
magnet link Magnet is a URI scheme that defines the format of magnet links, a de facto standard for identifying files ( URN) by their content, via cryptographic hash value rather than by their location. Although magnet links can be used in a number of co ...
. The user then opens that file in a BitTorrent client, which automates the rest of the process. In order to learn the internet locations of peers who may be sharing pieces, the client connects to the trackers named in the torrent file, and/or achieves a similar result through the use of
distributed hash table A distributed hash table (DHT) is a distributed system that provides a lookup service similar to a hash table: key–value pairs are stored in a DHT, and any participating node can efficiently retrieve the value associated with a given key. The ...
s. Then the client connects directly to the peers in order to request pieces and otherwise participate in a swarm. The client may also report progress to trackers, to help the tracker with its peer recommendations. When the client has all the pieces, the BitTorrent client assembles them into a usable form. They may also continue sharing the pieces, elevating their status to that of ''seeder'' rather than an ordinary peer.


File structure

A torrent file contains a list of files and integrity metadata about all the pieces, and optionally contains a large list of trackers. A torrent file is a bencoded dictionary with the following keys (the keys in any bencoded dictionary are lexicographically ordered): * —the URL of the tracker * —this maps to a dictionary whose keys are dependent on whether one or more files are being shared: ** —a list of dictionaries each corresponding to a file (only when multiple files are being shared). Each dictionary has the following keys: *** —size of the file in bytes. *** —a list of strings corresponding to subdirectory names, the last of which is the actual file name ** —size of the file in bytes (only when one file is being shared though) ** —suggested filename where the file is to be saved (if one file)/suggested directory name where the files are to be saved (if multiple files) ** —number of bytes per piece. This is commonly 28 KiB = 256 KiB = 262,144 B. ** —a hash list, i.e., a concatenation of each piece's
SHA-1 In cryptography, SHA-1 (Secure Hash Algorithm 1) is a cryptographically broken but still widely used hash function which takes an input and produces a 160-bit (20- byte) hash value known as a message digest – typically rendered as 40 hexadec ...
hash. As SHA-1 returns a 160-bit hash, will be a string whose length is a multiple of 20 bytes. If the torrent contains multiple files, the pieces are formed by concatenating the files in the order they appear in the dictionary (i.e., all pieces in the torrent are the full piece length except for the last piece, which may be shorter). All strings must be
UTF-8 UTF-8 is a variable-length character encoding used for electronic communication. Defined by the Unicode Standard, the name is derived from ''Unicode'' (or ''Universal Coded Character Set'') ''Transformation Format 8-bit''. UTF-8 is capable of ...
encoded, except for , which contains binary data. A torrent is uniquely identified by an ''infohash'', a
SHA-1 In cryptography, SHA-1 (Secure Hash Algorithm 1) is a cryptographically broken but still widely used hash function which takes an input and produces a 160-bit (20- byte) hash value known as a message digest – typically rendered as 40 hexadec ...
hash calculated over the contents of the dictionary in bencode form. Changes to other portions of the torrent does not affect the hash. This hash is used to identify the torrent to other peers via DHT and to the tracker. It is also used in
magnet links Magnet is a URI scheme that defines the format of magnet links, a de facto standard for identifying files ( URN) by their content, via cryptographic hash value rather than by their location. Although magnet links can be used in a number of con ...
.


Extensions

A torrent file can also contain additional metadata defined in extensions to the BitTorrent specification. These are known as "BitTorrent Enhancement Proposals." Examples of such proposals include metadata for stating who created the torrent, and when.


Draft extensions

These extensions are under consideration for standardization. Most are already widely adopted as ''de facto'' standards.


Distributed hash tables

BEP-0005 extends BitTorrent to support
distributed hash table A distributed hash table (DHT) is a distributed system that provides a lookup service similar to a hash table: key–value pairs are stored in a DHT, and any participating node can efficiently retrieve the value associated with a given key. The ...
s. A trackerless torrent dictionary does not have an key. Instead, a trackerless torrent has a key: For example, 'nodes': "127.0.0.1", 6881 your.router.node", 4804, The specification recommends that "should be set to the K closest nodes in the torrent generating client's routing table. Alternatively, the key could be set to a known good node such as one operated by the person generating the torrent."


Multiple trackers

BEP-0012 extends BitTorrent to support multiple trackers. A new key, , is placed in the top-most dictionary (i.e., with and )


HTTP seeds

BEP-0017 extends BitTorrent to support HTTP seeds, later more commonly termed "web seeds" to be inclusive of
HTTPS Hypertext Transfer Protocol Secure (HTTPS) is an extension of the Hypertext Transfer Protocol (HTTP). It is used for secure communication over a computer network, and is widely used on the Internet. In HTTPS, the communication protocol is enc ...
. A new key, , is placed in the top-most list (i.e., with and ). This key's value is a list of web addresses where torrent data can be retrieved: This feature is very commonly used by open source projects offering software downloads. Web seeds allow smart selection and simultaneous use of
mirror site Mirror sites or mirrors are replicas of other websites or any network node. The concept of mirroring applies to network services accessible through any protocol, such as HTTP or FTP. Such sites have different URLs than the original site, but hos ...
s, P2P or HTTP(S), by the client. Doing so reducing the load on the project's servers while maximizing download speed. automatically generates torrents with web seeds.


Private torrents

BEP-0027 extends BitTorrent to support private torrents. A new key, , is placed in the dictionary. This key's value is 1 if the torrent is private: Private torrents are to be used with a ''private tracker''. Such a tracker restricts access to torrents it tracks by checking the peer's IP, refusing to provide a peer list if the IP is unknown. The peer itself is usually registered to the tracker via a gated online community; the private tracker typically also keep statistics of data transfer for use in the community. Decentralized methods like DHT, PeX, are disabled to maintain the centralized control. A private torrent can be manually edited to remove the private flag, but doing so will change the info-hash (deterministically), forming a separate "swarm" of peers. On the other hand, changing the tracker list will not change the hash. The flag does not offer true privacy, instead operating as a gentlemen's agreement.


Merkle trees

BEP-0030 extends BitTorrent to support
Merkle trees In cryptography and computer science, a hash tree or Merkle tree is a tree in which every "leaf" (node) is labelled with the cryptographic hash of a data block, and every node that is not a leaf (called a ''branch'', ''inner node'', or ''inode'') ...
. The purpose is to reduce the file size of torrent files, which reduces the burden on those that serve torrent files. A torrent file using Merkle trees does not have a key in the list. Instead, such a torrent file has a key in the list. This key's value is the root hash of the Merkle hash:


Examples


Single file

A de-bencoded torrent file (with 256 KiB = 262,144 bytes) for a file (whose size is 678 301 696 bytes) might look like: Note: here would be a 51 KiB value ( \times 160 = 414080\ \mathrm).


Multiple files

A de-bencoded torrent file (with 256 KiB = 262144 B) for two files, and , might look like:


See also

*
Glossary of BitTorrent terms This is a glossary of jargon related to peer-to-peer file sharing via the BitTorrent protocol. Terms Availability :(Also known as distributed copies.) The number of full copies of a file (or set of files and directories) directly available t ...
*
Magnet links Magnet is a URI scheme that defines the format of magnet links, a de facto standard for identifying files ( URN) by their content, via cryptographic hash value rather than by their location. Although magnet links can be used in a number of con ...


References


External links


Official BitTorrent Specification

BitTorrent Definition
{{DEFAULTSORT:Torrent File BitTorrent Computer file formats