In
graph theory
In mathematics, graph theory is the study of '' graphs'', which are mathematical structures used to model pairwise relations between objects. A graph in this context is made up of '' vertices'' (also called ''nodes'' or ''points'') which are conn ...
and
computer science
Computer science is the study of computation, automation, and information. Computer science spans theoretical disciplines (such as algorithms, theory of computation, information theory, and automation) to practical disciplines (includin ...
, a dense subgraph is a
subgraph with many edges per vertex. This is formalized as follows: let be an
undirected graph
In discrete mathematics, and more specifically in graph theory, a graph is a structure amounting to a set of objects in which some pairs of the objects are in some sense "related". The objects correspond to mathematical abstractions called '' v ...
and let be a subgraph of . Then the ''density'' of is defined to be:
:
The densest subgraph problem is that of finding a subgraph of maximum density. The density of the maximally dense subgraph of a graph is sometimes referred to as its subgraph density. In 1984,
Andrew V. Goldberg
Andrew Vladislav Goldberg (born 1960) is an American computer scientist working primarily on design, analysis, and experimental evaluation of algorithms. He also worked on mechanism design, computer systems, and complexity theory. Currently he is ...
developed a
polynomial time
In computer science, the time complexity is the computational complexity that describes the amount of computer time it takes to run an algorithm. Time complexity is commonly estimated by counting the number of elementary operations performed by ...
algorithm to find the maximum density subgraph using a
max flow technique. This has been improved by Gallo, Grigoriadis and Tarjan in 1989 to run in time. A simple LP for finding the optimal solution was given by Charikar in 2000.
Subgraph density is
asymptotic
In analytic geometry, an asymptote () of a curve is a line such that the distance between the curve and the line approaches zero as one or both of the ''x'' or ''y'' coordinates tends to infinity. In projective geometry and related contexts, ...
to the related notion of
arboricity
The arboricity of an undirected graph is the minimum number of forests into which its edges can be partitioned. Equivalently it is the minimum number of spanning forests needed to cover all the edges of the graph. The Nash-Williams theorem pro ...
and to
graph degeneracy
In graph theory, a ''k''-degenerate graph is an undirected graph in which every subgraph has a vertex of degree (graph theory), degree at most ''k'': that is, some vertex in the subgraph touches ''k'' or fewer of the subgraph's edges. The degener ...
.
Densest subgraph
There are many variations on the densest subgraph problem. One of them is the densest subgraph problem, where the objective is to find the maximum density subgraph on exactly vertices. This problem generalizes the
clique problem
In computer science, the clique problem is the computational problem of finding cliques (subsets of vertices, all adjacent to each other, also called complete subgraphs) in a graph. It has several different formulations depending on which cliq ...
and is thus
NP-hard
In computational complexity theory, NP-hardness ( non-deterministic polynomial-time hardness) is the defining property of a class of problems that are informally "at least as hard as the hardest problems in NP". A simple example of an NP-hard pr ...
in general graphs. There exists a polynomial algorithm approximating the densest subgraph within a ratio of
for every
, while it does not admit an
-approximation in polynomial time unless the
exponential time hypothesis
In computational complexity theory, the exponential time hypothesis is an unproven computational hardness assumption that was formulated by . It states that satisfiability of 3-CNF Boolean formulas cannot be solved more quickly than exponential ...
is false. Under a weaker assumption that
, no
PTAS exists for the problem.
The problem remains NP-hard in
bipartite graph
In the mathematical field of graph theory, a bipartite graph (or bigraph) is a graph whose vertices can be divided into two disjoint and independent sets U and V, that is every edge connects a vertex in U to one in V. Vertex sets U and V ar ...
s and
chordal graph
In the mathematical area of graph theory, a chordal graph is one in which all cycles of four or more vertices have a ''chord'', which is an edge that is not part of the cycle but connects two vertices of the cycle. Equivalently, every induced c ...
s but is polynomial for
trees
In botany, a tree is a perennial plant with an elongated stem, or trunk, usually supporting branches and leaves. In some usages, the definition of a tree may be narrower, including only woody plants with secondary growth, plants that are ...
and
split graph
In graph theory, a branch of mathematics, a split graph is a graph in which the vertices can be partitioned into a clique and an independent set. Split graphs were first studied by , and independently introduced by .
A split graph may have m ...
s. It is open whether the problem is NP-hard or polynomial in
(proper) interval graphs and
planar graph
In graph theory, a planar graph is a graph that can be embedded in the plane, i.e., it can be drawn on the plane in such a way that its edges intersect only at their endpoints. In other words, it can be drawn in such a way that no edges cro ...
s; however, a variation of the problem in which the subgraph is required to be connected is NP-hard in planar graphs.
Densest at most subgraph
The objective of the densest at most
problem is to find the maximum density subgraph on at most
vertices. Anderson and Chellapilla showed that if there exists an
-approximation for this problem then that will lead to an
-approximation for the densest
subgraph problem.
Densest at least subgraph
The densest at least
problem is defined similarly to the densest at most
subgraph problem. The problem is NP-complete, but admits 2-approximation in polynomial time. Moreover, there is some evidence that this approximation algorithm is essentially the best possible: assuming the Small Set Expansion Hypothesis (a computational complexity assumption closely related to the
Unique Games Conjecture
In computational complexity theory, the unique games conjecture (often referred to as UGC) is a conjecture made by Subhash Khot in 2002.
The conjecture postulates that the problem of determining the approximate ''value'' of a certain type of ga ...
), then it is NP-hard to approximate the problem to within
factor for every constant
.
-clique densest subgraph
Charalampos Tsourakakis
Saint Charalampos ( grc, Ἅγιος Χαράλαμπος) (also variously Charalampas, Charalampus, Charalambos, Haralampus, Haralampos, Haralabos or Haralambos) was an early Christianity, early Christian priest in Magnesia on the Maeander, a c ...
introduced the
-clique densest subgraph problem. This variation of the densest subgraph problem aims to maximize the average number of induced
cliques
, where
is the set of
-cliques induced by
. Notice that the densest subgraph problem is obtained as a special case for
. This generalization provides an empirically successful poly-time approach for extracting large near-cliques from large-scale real-world networks.
Locally top- densest subgraph
Qin et al. introduced the problem of top-''k'' locally densest subgraphs discovery in a graph, each of which achieves the highest density in its local region in the graph: it is neither contained in any supergraph with the same or larger density, nor it contains subgraphs with density being loosely connected with the rest of the local densest subgraph. Note that the densest subgraph problem is obtained as a special case for
. The set of locally densest subgraphs in
a graph can be computed in polynomial time.
References
Further reading
*.
*.
*.
*.
*.
{{refend
Graph theory