HOME

TheInfoList



OR:

Mixing patterns refer to systematic tendencies of one type of nodes in a network to connect to another type. For instance, nodes might tend to link to others that are very similar or very different. This feature is common in many
social network A social network is a social structure made up of a set of social actors (such as individuals or organizations), sets of dyadic ties, and other social interactions between actors. The social network perspective provides a set of methods for an ...
s, although it also appears sometimes in non-social networks. Mixing patterns are closely related to
assortativity Assortativity, or assortative mixing is a preference for a network's nodes to attach to others that are similar in some way. Though the specific measure of similarity may vary, network theorists often examine assortativity in terms of a node's deg ...
; however, for the purposes of this article, the term is used to refer to assortative or disassortative mixing based on real-world factors, either topological or sociological.


Types of Mixing Patterns

Mixing patterns are a characteristic of an entire network, referring to the extent for nodes to connect to other similar or different nodes. Mixing, therefore, can be classified broadly as assortative or disassortative. ''
Assortative mixing In the study of complex networks, assortative mixing, or assortativity, is a bias in favor of connections between network nodes with similar characteristics. In the specific case of social networks, assortative mixing is also known as homophily. ...
'' is the tendency for nodes to connect to like nodes, while ''disassortative mixing'' captures the opposite case in which very different nodes are connected. Obviously, the particular node characteristics involved in the process of creating a link between a pair will shape a network's mixing patterns. For instance, in a sexual relationship network, one is likely to find a preponderance of male-female links, while in a friendship network male-male and female-female networks might prevail. Examining different sets of node characteristics thus may reveal interesting communities or other structural properties of the network. In principle there are two kinds of methods used to exploit these properties. One is based on analytical calculations by using
generating function In mathematics, a generating function is a way of encoding an infinite sequence of numbers () by treating them as the coefficients of a formal power series. This series is called the generating function of the sequence. Unlike an ordinary seri ...
techniques. The other is numerical, and is based on
Monte Carlo Monte Carlo (; ; french: Monte-Carlo , or colloquially ''Monte-Carl'' ; lij, Munte Carlu ; ) is officially an administrative area of the Principality of Monaco, specifically the ward of Monte Carlo/Spélugues, where the Monte Carlo Casino is ...
simulations for the graph generation. In a study on mixing patterns in networks, M.E.J. Newman starts by classifying the node characteristics into two categories. While the number of real-world node characteristics is virtually unlimited, they tend to fall under two headings: discrete and scalar/topological. The following sections define the differences between the categories and provide examples of each. For each category, the models of assortatively mixed networks introduced by Newman are discussed in brief.


Mixing Based on Discrete Characteristics

Discrete characteristics of a node are categorical, nominal, or enumerative, and often qualitative. For instance, race, gender, and sexual orientation are commonly examined discrete characteristics. To measure the mixing of a network on discrete characteristics, Newman defines a quantity e_ to be the fraction of edges in a network that connect nodes of type ''i'' to type ''j'' (see Fig. 1). On an undirected network this quantity is symmetric in its indices e_ = e_, while on directed ones it may be asymmetric. It satisfies the sum rules \sum_,\quad\sum_,\quad\sum_, where a_ and b_ are the fractions of each type of an edge's end that is attached to nodes of type i. On undirected graphs, where there is no physical distinction between the ends of a link, i.e. the ends of edges are all of the same type, a_ = b_. Then, an '' assortativity coefficient'', a measure of the similarity's or dissimilarity's strength between two nodes on a set of discrete characteristics may be defined as: r = \frac with r_ = -\frac This formula yields r = 0 when there's no assortative mixing, since e_ = a_b_ in that case, and r = 1 when the network is perfectly assortative. If the network is perfectly disassortative, i.e. every link connects two nodes of different types, then r = r_, which lies in general in the range -1\leq r < 0. This range for r_ implies that a perfectly disassortative network is normally closer to a randomly mixed network than a perfectly assortative one is. When there are several different types of nodes, then random mixing will most often pair unlike nodes, so that the network appears to be mostly disassortative. Therefore, it is appropriate that the value r = 0 for a random network should be closer to that for the perfectly disassortative network than for the perfectly assortative one. The method of generating functions is based on the idea of figuring out the proper generating function for the distributions of our interest every time, and extract data related to the networks structure by differentiating them. Assuming that the degree distribution p_^ for nodes of type i and the value of the matrix e_ (and hence, the values of a_ and b_) are known, then we may consider the ensemble of all graphs with the specified p_^ and e_ to yield collective (macroscopic) network characteristics. In principle, the generating function for p_^ and its first moment are given by G_^(x_,...,x_) = \sum_ p_^x^, and G_^ = \frac\frac\Bigg, _, where x_ = \frac the node of type i (r_ in the number) and z_ the mean degree for nodes of this type. Now we focus on the distributions that we're interested for. The distribution of the total number of nodes reachable by following an edge that arrives at a node of type i has a generating function H_^(x) = xG_^ _^(x),...,H_^(x)/math>. Similarly, the distribution of the number of nodes reachable from a ''randomly'' chosen node of type i is generated by H_^(x) = xG_^ _^(x),...,H_^(x)/math>. Now we are in position to yield some of the network's properties. The mean number s_ of nodes reachable from a node of type i is s_ = \frac\Bigg, _ = 1 + G_^(1)\frac Furthermore, if u_ is the probability for a node of type i (reached by following a randomly chosen link in the graph) not to belong to the giant cluster, then the overall fraction S of nodes that compose this cluster is given by S = 1 - \sum_\fracG_^(u_,...,u_) The numerical simulations based on Monte Carlo techniques seem to agree with the analytical results yielded by the formulas described above.


Mixing by Scalar or Topological Characteristics

Scalar characteristics of a node are those that are quantitative. They may be continuous or discrete ordinal variables like counts. Age is perhaps the simplest example, though intelligence and raw income are other obvious possibilities. Some topological features of the network may also be used for examining mixing by scalar properties. Specifically, the degree of a node is often a highly important feature in the mixing patterns of networks. Topological scalar features are very useful, because unlike other measures, they are always available. They are sometimes used as a proxy for real-world "sociability". For measuring the assortativity of scalar variables, similar to the discrete case (see above) an assortativity coefficient can be defined. One can measure it using the standard
Pearson Correlation In statistics, the Pearson correlation coefficient (PCC, pronounced ) ― also known as Pearson's ''r'', the Pearson product-moment correlation coefficient (PPMCC), the bivariate correlation, or colloquially simply as the correlation coefficient ...
, as Newman demonstrates. In Fig. 2, for instance, a calculation of the Pearson Correlation Coefficient yields r = 0.574. This indicates a fairly strong association between the age of husbands and wives at the time of marriage. An alternative coefficient can be computed for measuring the mixing by the degree of the nodes. Newman derives the expression, which is found to be r = \frac for an undirected network. In this formula, if p_k refers to the graph's
degree distribution In the study of graphs and networks, the degree of a node in a network is the number of connections it has to other nodes and the degree distribution is the probability distribution of these degrees over the whole network. Definition The degree o ...
(i.e., the probability that a node has degree ''k'') then q_k = \frac. This refers to the ''excess degree'' of a node, or the number of other edges aside from the currently examined one. The ''z'' refers to the average degree in the network, and \sigma_q is the
standard deviation In statistics, the standard deviation is a measure of the amount of variation or dispersion of a set of values. A low standard deviation indicates that the values tend to be close to the mean (also called the expected value) of the set, while ...
of the distribution q_k. For a directed network the equivalent expression is r = \frac. This correlation is positive when nodes are assortative by degree, and negative when the network is disassortative. Thus, the measure captures an overall sense of the mixing patterns of a network. For a more in-depth analysis of this topic, see the article on
assortativity Assortativity, or assortative mixing is a preference for a network's nodes to attach to others that are similar in some way. Though the specific measure of similarity may vary, network theorists often examine assortativity in terms of a node's deg ...
. The method of generating functions is still applicable for this case too, but the functions to be calculated are rarely calculable in closed form. Thus, numerical simulations seem to be the only way to yield results of some interest. The technique used is once again the Monte Carlo one. For the case of networks with a
power-law In statistics, a power law is a functional relationship between two quantities, where a relative change in one quantity results in a proportional relative change in the other quantity, independent of the initial size of those quantities: one qua ...
degree-distribution p_\sim k^, q_ has a divergent mean, unless \tau > 3, which rarely happens so. Instead, the exponentially truncated power-law distribution p_ = \frac\ \mathrm\ k\geq 1 yields a distribution for the excess degree of the type q_\sim (k+1)^\mathrm^. The results for this case are summarized below. 1) The position of the phase transition at which a giant cluster appears moves to higher values of \kappa as the value of r decreases. That is, the more assortative a network is, the lower the edge density threshold for the giant cluster's appearance will be. 2) The size of the giant cluster in the limit of large \kappa is smaller for the assortatively mixed graph, than for the neutral and disassortative ones. 3) Assortative mixing in the network affects the network's robustness under node removal. For assortative networks, it is required to remove about ten times more than usual (usual means a neutral network) high-degree nodes to destroy the giant cluster, while the opposite is true for disassortative networks, i.e. they are more susceptible than neutral ones under removal of the high-degree nodes. The fascinating result on the dependence of the network's robustness to its node mixing may be explained as follows. According to their definition, high-degree nodes in assortative networks tend to form a core group among them. Such a core group provides robustness to the network by concentrating all the obvious target nodes together in one portion of the graph. Removing these high-degree nodes is still one of the most effective ways to destroy network connectivity, but it is less effective (compared to neutral networks) because by removing them all from the same portion of the graph we fail to attack other portions. If these other portions are themselves percolating, then a giant cluster will persist even as the highest degree nodes vanish. On the other hand, the disassortatively mixed network is particularly susceptible to removal of high-degree nodes because these nodes are strewn far apart across the network, so that attacking them is like attacking all parts of the network at once.


Examples and Applications

A common application of mixing patterns is the study of disease transmission. For instance, many studies have used mixing to study the spread of HIV/AIDS and other contagious diseases. These articles find a strong connection between Mixing patterns and the rate of disease spread. The findings can also be used to model real-world network growth, as in,{{cite journal , last=Catanzaro , first=Michele , last2=Caldarelli , first2=Guido , last3=Pietronero , first3=Luciano , title=Social network growth with assortative mixing , journal=Physica A: Statistical Mechanics and Its Applications , publisher=Elsevier BV , volume=338 , issue=1–2 , year=2004 , issn=0378-4371 , doi=10.1016/j.physa.2004.02.033 , pages=119–124, bibcode=2004PhyA..338..119C or find communities within networks.


References

Networks Systems theory