Preferential Attachment
   HOME

TheInfoList



OR:

A preferential attachment process is any of a class of processes in which some quantity, typically some form of wealth or credit, is distributed among a number of individuals or objects according to how much they already have, so that those who are already wealthy receive more than those who are not. "Preferential attachment" is only the most recent of many names that have been given to such processes. They are also referred to under the names
Yule Yule, actually Yuletide ("Yule time") is a festival observed by the historical Germanic peoples, later undergoing Christianised reformulation resulting in the now better-known Christmastide. The earliest references to Yule are by way of indig ...
process, cumulative advantage,
the rich get richer "The rich get richer and the poor get poorer" is an aphorism due to Percy Bysshe Shelley. In ''A Defence of Poetry'' (1821, not published until 1840) Shelley remarked that Utilitarianism, the promoters of utility had exemplified the saying, "To ...
, and the
Matthew effect The Matthew effect of accumulated advantage, Matthew principle, or Matthew effect, is the tendency of individuals to accrue social or economic success in proportion to their initial level of popularity, friends, wealth, etc. It is sometimes summar ...
. They are also related to
Gibrat's law Gibrat's law, sometimes called Gibrat's rule of proportionate growth or the law of proportionate effect, is a rule defined by Robert Gibrat (1904–1980) in 1931 stating that the proportional rate of growth of a firm is independent of its absolut ...
. The principal reason for scientific interest in preferential attachment is that it can, under suitable circumstances, generate
power law In statistics, a power law is a Function (mathematics), functional relationship between two quantities, where a Relative change and difference, relative change in one quantity results in a proportional relative change in the other quantity, inde ...
distributions. If preferential attachment is non-linear, measured distributions may deviate from a power law. These mechanisms may generate distributions which are approximately power law over transient periods.


Definition

A preferential attachment process is a
stochastic Stochastic (, ) refers to the property of being well described by a random probability distribution. Although stochasticity and randomness are distinct in that the former refers to a modeling approach and the latter refers to phenomena themselv ...
urn process, meaning a process in which discrete units of wealth, usually called "balls", are added in a random or partly random fashion to a set of objects or containers, usually called "urns". A preferential attachment process is an urn process in which additional balls are added continuously to the system and are distributed among the urns as an increasing function of the number of balls the urns already have. In the most commonly studied examples, the number of urns also increases continuously, although this is not a necessary condition for preferential attachment and examples have been studied with constant or even decreasing numbers of urns. A classic example of a preferential attachment process is the growth in the number of
species In biology, a species is the basic unit of classification and a taxonomic rank of an organism, as well as a unit of biodiversity. A species is often defined as the largest group of organisms in which any two individuals of the appropriate s ...
per
genus Genus ( plural genera ) is a taxonomic rank used in the biological classification of extant taxon, living and fossil organisms as well as Virus classification#ICTV classification, viruses. In the hierarchy of biological classification, genus com ...
in some higher
taxon In biology, a taxon (back-formation from ''taxonomy''; plural taxa) is a group of one or more populations of an organism or organisms seen by taxonomists to form a unit. Although neither is required, a taxon is usually known by a particular nam ...
of biotic organisms. New genera ("urns") are added to a taxon whenever a newly appearing species is considered sufficiently different from its predecessors that it does not belong in any of the current genera. New species ("balls") are added as old ones
speciate Speciation is the evolutionary process by which populations evolve to become distinct species. The biologist Orator F. Cook coined the term in 1906 for cladogenesis, the splitting of lineages, as opposed to anagenesis, phyletic evolution within ...
(i.e., split in two) and, assuming that new species belong to the same genus as their parent (except for those that start new genera), the probability that a new species is added to a genus will be proportional to the number of species the genus already has. This process, first studied by
Yule Yule, actually Yuletide ("Yule time") is a festival observed by the historical Germanic peoples, later undergoing Christianised reformulation resulting in the now better-known Christmastide. The earliest references to Yule are by way of indig ...
, is a ''
linear Linearity is the property of a mathematical relationship (''function'') that can be graphically represented as a straight line. Linearity is closely related to '' proportionality''. Examples in physics include rectilinear motion, the linear r ...
'' preferential attachment process, since the rate at which genera accrue new species is linear in the number they already have. Linear preferential attachment processes in which the number of urns increases are known to produce a distribution of balls over the urns following the so-called
Yule distribution Yule, actually Yuletide ("Yule time") is a festival observed by the historical Germanic peoples, later undergoing Christianised reformulation resulting in the now better-known Christmastide. The earliest references to Yule are by way of indig ...
. In the most general form of the process, balls are added to the system at an overall rate of ''m'' new balls for each new urn. Each newly created urn starts out with ''k''0 balls and further balls are added to urns at a rate proportional to the number ''k'' that they already have plus a constant ''a'' > −''k''0. With these definitions, the fraction ''P''(''k'') of urns having ''k'' balls in the limit of long time is given by P(k)=, for ''k'' ≥ ''k''0 (and zero otherwise), where B(''x'', ''y'') is the Euler
beta function In mathematics, the beta function, also called the Euler integral of the first kind, is a special function that is closely related to the gamma function and to binomial coefficients. It is defined by the integral : \Beta(z_1,z_2) = \int_0^1 t^( ...
: \mathrm(x,y)=, with Γ(''x'') being the standard
gamma function In mathematics, the gamma function (represented by , the capital letter gamma from the Greek alphabet) is one commonly used extension of the factorial function to complex numbers. The gamma function is defined for all complex numbers except ...
, and \gamma=2 + . The beta function behaves asymptotically as B(''x'', ''y'') ~ ''x''−''y'' for large ''x'' and fixed ''y'', which implies that for large values of ''k'' we have P(k) \propto k^. In other words, the preferential attachment process generates a " long-tailed" distribution following a Pareto distribution or
power law In statistics, a power law is a Function (mathematics), functional relationship between two quantities, where a Relative change and difference, relative change in one quantity results in a proportional relative change in the other quantity, inde ...
in its tail. This is the primary reason for the historical interest in preferential attachment: the species distribution and many other phenomena are observed empirically to follow power laws and the preferential attachment process is a leading candidate mechanism to explain this behavior. Preferential attachment is considered a possible candidate for, among other things, the distribution of the sizes of cities, the wealth of extremely wealthy individuals, the number of citations received by learned publications, and the number of links to pages on the World Wide Web. The general model described here includes many other specific models as special cases. In the species/genus example above, for instance, each genus starts out with a single species (''k''0 = 1) and gains new species in direct proportion to the number it already has (''a'' = 0), and hence ''P''(''k'') = B(''k'', ''γ'')/B(''k''0, ''γ'' − 1) with ''γ''=2 + 1/''m''. Similarly the Price model for scientific citations corresponds to the case ''k''0 = 0, ''a'' = 1 and the widely studied Barabási-Albert model corresponds to ''k''0 = ''m'', ''a'' = 0. Preferential attachment is sometimes referred to as the
Matthew effect The Matthew effect of accumulated advantage, Matthew principle, or Matthew effect, is the tendency of individuals to accrue social or economic success in proportion to their initial level of popularity, friends, wealth, etc. It is sometimes summar ...
, but the two are not precisely equivalent. The Matthew effect, first discussed by
Robert K. Merton Robert King Merton (born Meyer Robert Schkolnick; July 4, 1910 – February 23, 2003) was an American sociologist who is considered a founding father of modern sociology, and a major contributor to the subfield of criminology. He served as th ...
, is named for a passage in the
biblical The Bible (from Koine Greek , , 'the books') is a collection of religious texts or scriptures that are held to be sacred in Christianity, Judaism, Samaritanism, and many other religions. The Bible is an anthologya compilation of texts of a ...
Gospel of Matthew The Gospel of Matthew), or simply Matthew. It is most commonly abbreviated as "Matt." is the first book of the New Testament of the Bible and one of the three synoptic Gospels. It tells how Israel's Messiah, Jesus, comes to his people and for ...
: "For everyone who has will be given more, and he will have an abundance. Whoever does not have, even what he has will be taken from him." (
Matthew Matthew may refer to: * Matthew (given name) * Matthew (surname) * ''Matthew'' (ship), the replica of the ship sailed by John Cabot in 1497 * ''Matthew'' (album), a 2000 album by rapper Kool Keith * Matthew (elm cultivar), a cultivar of the Ch ...
25:29,
New International Version The New International Version (NIV) is an English translation of the Bible first published in 1978 by Biblica (formerly the International Bible Society). The ''NIV'' was created as a modern translation, by Bible scholars using the earliest an ...
.) The preferential attachment process does not incorporate the taking away part. This point may be moot, however, since the scientific insight behind the Matthew effect is in any case entirely different. Qualitatively it is intended to describe not a mechanical multiplicative effect like preferential attachment but a specific human behavior in which people are more likely to give credit to the famous than to the little known. The classic example of the Matthew effect is a scientific discovery made simultaneously by two different people, one well known and the other little known. It is claimed that under these circumstances people tend more often to credit the discovery to the well-known scientist. Thus the real-world phenomenon the Matthew effect is intended to describe is quite distinct from (though certainly related to) preferential attachment.


History

The first rigorous consideration of preferential attachment seems to be that of
Udny Yule George Udny Yule FRS (18 February 1871 – 26 June 1951), usually known as Udny Yule, was a British statistician, particularly known for the Yule distribution. Personal life Yule was born at Beech Hill, a house in Morham near Haddingto ...
in 1925, who used it to explain the power-law distribution of the number of species per genus of flowering plants. The process is sometimes called a "Yule process" in his honor. Yule was able to show that the process gave rise to a distribution with a power-law tail, but the details of his proof are, by today's standards, contorted and difficult, since the modern tools of stochastic process theory did not yet exist and he was forced to use more cumbersome methods of proof. Most modern treatments of preferential attachment make use of the
master equation In physics, chemistry and related fields, master equations are used to describe the time evolution of a system that can be modelled as being in a probabilistic combination of states at any given time and the switching between states is determine ...
method, whose use in this context was pioneered by
Simon Simon may refer to: People * Simon (given name), including a list of people and fictional characters with the given name Simon * Simon (surname), including a list of people with the surname Simon * Eugène Simon, French naturalist and the genus ...
in 1955, in work on the distribution of sizes of cities and other phenomena. The first application of preferential attachment to learned citations was given by
Price A price is the (usually not negative) quantity of payment or compensation given by one party to another in return for goods or services. In some situations, the price of production has a different name. If the product is a "good" in the c ...
in 1976. (He referred to the process as a "cumulative advantage" process.) His was also the first application of the process to the growth of a network, producing what would now be called a
scale-free network A scale-free network is a network whose degree distribution follows a power law, at least asymptotically. That is, the fraction ''P''(''k'') of nodes in the network having ''k'' connections to other nodes goes for large values of ''k'' as : P(k) ...
. It is in the context of network growth that the process is most frequently studied today. Price also promoted preferential attachment as a possible explanation for power laws in many other phenomena, including
Lotka's law Lotka's law, named after Alfred J. Lotka, is one of a variety of special applications of Zipf's law. It describes the frequency of publication by authors in any given field. It states that the number of authors making x contributions in a give ...
of scientific productivity and
Bradford's law Bradford's law is a pattern first described by Samuel C. Bradford in 1934 that estimates the exponentially diminishing returns of searching for references in science journals. One formulation is that if journals in a field are sorted by number of ...
of journal use. The application of preferential attachment to the growth of the World Wide Web was proposed by Barabási and Albert in 1999. Barabási and Albert also coined the name "preferential attachment" by which the process is best known today and suggested that the process might apply to the growth of other networks as well. For growing networks, the precise functional form of preferential attachment can be estimated by
maximum likelihood estimation In statistics, maximum likelihood estimation (MLE) is a method of estimating the parameters of an assumed probability distribution, given some observed data. This is achieved by maximizing a likelihood function so that, under the assumed statis ...
.


See also

*
Assortative mixing In the study of complex networks, assortative mixing, or assortativity, is a bias in favor of connections between network nodes with similar characteristics. In the specific case of social networks, assortative mixing is also known as homophily. ...
* Bose–Einstein condensation: a network theory approach *
Capital accumulation Capital accumulation is the dynamic that motivates the pursuit of profit, involving the investment of money or any financial asset with the goal of increasing the initial monetary value of said asset as a financial return whether in the form o ...
*
Chinese restaurant process In probability theory, the Chinese restaurant process is a discrete-time stochastic process, analogous to seating customers at tables in a restaurant. Imagine a restaurant with an infinite number of circular tables, each with infinite capacity. C ...
*
Complex network In the context of network theory, a complex network is a graph (network) with non-trivial topological features—features that do not occur in simple networks such as lattices or random graphs but often occur in networks representing real s ...
*
Double jeopardy (marketing) Double jeopardy is an empirical law in marketing where, with few exceptions, the lower-market-share brands in a market have both far fewer buyers in a time period and also lower brand loyalty. The term was originally coined by social scientist Wi ...
*
Lindy effect The Lindy effect (also known as Lindy's Law) is a theorized phenomenon by which the future life expectancy of some non-perishable things, like a technology or an idea, is proportional to their current age. Thus, the Lindy effect proposes the longe ...
* Link-centric preferential attachment *
Pitman–Yor process In probability theory, a Pitman–Yor process denoted PY(''d'', ''θ'', ''G''0), is a stochastic process whose sample path is a probability distribution. A random sample from this process is an infinite discrete probability distribution ...
*
Price's model Price's model (named after the physicist Derek J. de Solla Price) is a mathematical model for the growth of citation networks. It was the first model which generalized the Simon model to be used for networks, especially for growing networks. Price's ...
*
Proof of stake Proof-of-stake (PoS) protocols are a class of consensus mechanisms for blockchains that work by selecting validators in proportion to their quantity of holdings in the associated cryptocurrency. This is done to avoid the computational cost of ...
*
Simon model In applied probability theory, the Simon model is a class of stochastic models that results in a power-law distribution function. It was proposed by Herbert A. Simon to account for the wide range of empirical Frequency distribution, distributions f ...
* Success to the successful * Wealth condensation *
Yule–Simon distribution In probability and statistics, the Yule–Simon distribution is a discrete probability distribution named after Udny Yule and Herbert A. Simon. Simon originally called it the ''Yule distribution''. The probability mass function (pmf) of the Yu ...
*
Bibliogram A bibliogram is a graphical representation of the frequency of certain target words, usually noun phrases, in a given text. The term was introduced in 2005 by Howard D. White to name the linguistic object studied, but not previously named, in inf ...


References

{{DEFAULTSORT:Preferential Attachment