Counting Graph Homomorphisms
Counting Graph Homomorphisms
net/publication/226755686
CITATIONS READS
190 481
5 authors, including:
Katalin - Vesztergombi
Eötvös Loránd University
58 PUBLICATIONS 1,911 CITATIONS
SEE PROFILE
All content following this page was uploaded by Katalin - Vesztergombi on 24 March 2014.
Abstract
Counting homomorphisms between graphs (often with weights) comes up in a wide
variety of areas, including extremal graph theory, properties of graph products, partition
functions in statistical physics and property testing of large graphs.
In this paper we survey recent developments in the study of homomorphism numbers, in-
cluding the characterization of the homomorphism numbers in terms of the semidefiniteness
of “connection matrices”, and some applications of this fact in extremal graph theory.
We define a distance of two graphs in terms of similarity of their global structure, which
also reflects the closeness of (appropriately scaled) homomorphism numbers into the two
graphs. We use homomorphism numbers to define convergence of a sequence of graphs, and
show that a graph sequence is convergent if and only if it is Cauchy in this distance. Every
convergent graph sequence has a limit in the form of a symmetric measurable function in
two variables. We use these notions of distance and graph limits to give a general theory
for parameter testing.
The convergence can also be characterized in terms of mappings of the graphs into fixed
small graphs, which is strongly connected to important parameters like ground state energy
in statistical physics, and to weighted maximum cut problems in computer science.
1 Introduction
For two finite graphs G and H, let hom(G, H) denote the number of homomorphisms (adjacency-
preserving mappings) from G to H. Counting homomorphisms between graphs has many inter-
esting aspects.
(a) A large part of extremal graph theory can be expressed as inequalities between various
homomorphism numbers. For example, Turán’s Theorem for triangles follows from the inequality
(due to Goodman [30]):
[email protected]; Research supported in part by OTKA grants No. TO32236, TO39210, TO42750.
¶ Department of Computer Science, Eötvös Loránd University, Pázmány Péter sétány 1/C, H-1117 Budapest,
Hungary; [email protected]
1
G0 are isomorphic (one actually needs one additional condition, the condition that both graphs
are twin-free; see Section 2.1 for the definition of this notion). In other words, let us order all
finite graphs in a sequence (F1 , F2 . . . ), and assign to each graph G its profile, the (infinite)
sequence (hom(F1 , G), hom(F2 , G) . . . ); then this sequence characterizes G. (This fact can be
used to prove, for example, the cancellation property of strong multiplication of graphs).
It is often worthwhile to normalize the homomorphism numbers, and consider the homomor-
phism densities
hom(F, G)
t(F, G) = . (2)
|V (G)||V (F )|
(Thus t(F, G) is the probability that a random map of V (F ) into V (G) is a homomor-
phism.) Instead of the profile (hom(F1 , G), hom(F2 , G), . . . ), we can consider the scaled profile
(t(F1 , G), t(F2 , G), . . . ) of the graph G. Such a normalization was first introduced in [23].
(c) Partition functions of many models in statistical mechanics can be expressed as graph
homomorphism functions. For example, let G be an n × n grid, and suppose that every node of
G (every “site”) can be in one of two states, “UP” or “DOWN”. The properties of the system
are such that no two adjacent sites can be “UP”. A “configuration” is a valid assignment of
states to each node. The number of configurations is the number of independent sets of nodes
in G, which in turn can be expressed as the number of homomorphisms of G into the graph H
consisting of two nodes, ”UP” and ”DOWN”, connected by an edge, and with an additional loop
at ”DOWN”. To define the thermodynamic functions in physical models, one needs to extend
the notion of graph homomorphism to the case when the nodes and edges of H have weights
(see Section 2.1).
(d) Suppose that G is a huge graph, and we know the numbers hom(F, G) (exactly or ap-
proximately) for small graphs F . What kind of information can be derived about the global
structure of G? The long-standing Reconstruction Conjecture is equivalent to the assertion that
it is enough to know all numbers hom(F, G) with |V (F )| < |V (G)| in order to recover the iso-
morphism type of G. The fact that this is unsolved shows the difficulty of this kind of question,
but our interest here is in the case when much less is given: hom(F, G) is only known for very
small graphs F .
(e) This is closely related to an important area of computer science called Property Testing.
In this model, we have a huge graph about which we can obtain information only by taking a
small sample of the nodes and examine the subgraph induced by them. This is equivalent to
knowing the homomorphism densities (2) for graphs F of small size. What makes the theory
of Property Testing interesting is the fact that from such meager local information nontrivial
properties and parameters of the graph can be inferred.
(f) Increasing sequences of (sparse) graphs generated by some specific random rule of growth
have recently been used to model the Internet; see [8] and references therein. What are the
limiting properties of such graph sequences? To what extent can these properties be derived from
local observation (observing a neighborhood of bounded radius of a few randomly chosen nodes?
This question can be rephrased as follows: To what extent are these properties characterized by
the homomorphism numbers of smaller graphs into the modeling sequence?
The main setup of our studies is the following. If we are given a (large, usually simple) graph
G, we may try to study its local structure by counting the homomorphisms of various “small”
graphs F into G; and we can study its global structure by counting its homomorphisms into
various small graphs H (called “softcore” weighted graphs; see Section 2.1 for a definition of
softcore). So the scheme to keep in mind is
F −→ G −→ H. (3)
According to the above discussion, in the scheme (3), the study of the local structure of G
by “probing from the left with F ” is related to property testing, while the study of the global
2
structure of G by “probing from the right with H” is related to statistical physics. As in statistical
physics, the best choice of graphs H to “probe G from the right” is not simple unweighted graphs
but weighted graphs. Furthermore, besides counting (weighted) homomorphisms into H, it is
also useful to consider maximizing the weight of such homomorphisms, which is again related to
well-studied questions both in statistical physics and graph property testing.
The number hom(F, G), as a function of F with G fixed, is a graph parameter (a function of
graphs F invariant under isomorphism). Graph parameters arising this way were characterized in
[28]. The necessary and sufficient condition involves certain matrices, called connection matrices,
associated with the graph parameter (see Section 3 for the definition): These matrices must be
positive definite and must satisfy a rank condition. The semidefiniteness condition is in fact
familiar from statistical physics, where it is called reflection positivity.
The scaled profile does not determine the graph: if we “blow up” every node into the same
number of “twins”, then we get a graph with exactly the same scaled profile. It turns out that
this is all: any two graphs with the same scaled profile are obtained from one and the same
graph by blowing up its nodes in two different ways.
Now we come to our main question: What can be said about two graphs whose scaled profiles
are approximately the same? Which properties of a graph G are determined if we only know a
few of the numbers hom(F, G), and even these are only known approximately? This question
turns out to be very interesting and it leads to a number of results, and even more open problems,
leading to quasirandom and generalized quasirandom graphs, and connecting to the ”Property
Testing” research in computer science and to statistical physics.
There is a way to measure the “distance” of two graphs so that they are close in this distance
if and only if they have approximately the same scaled profile [16]. This distance has many nice
properties. On the one hand, important parameters like the triangle density or the fraction of
edges in the maximum cut are continuous (often even Lipschitz) functions in this metric. On
the other hand, a sufficiently large random subgraph of an arbitrarily large graph will be close
to the whole graph with large probability. This fact explains several results in the theory of
Property Testing. Szemerédi’s Regularity Lemma (at least in its weaker but more effective form
due to Frieze and Kannan [29]) can be rephrased as follows: for every ε > 0, all graphs with at
2
most 22/ε nodes form an ε-net in the metric space of all graphs. In our context, an ε-net is
defined as a set of weighted graphs such that, for every graph G, there exists a graph H in the
set which has at most distance ε from G.
Once we make the set of all graphs into a metric space, we can make it complete. Is there
any combinatorial meaning of the new points in the completion? Surprisingly, the answer is in
the affirmative, and in fact in more than one way [44, 45]. These limit points can be described
as symmetric measurable functions W : [0, 1]2 → [0, 1] modulo measure-preserving transforma-
tions, as reflection positive graph parameters, as random graph models satisfying some natural
compatibility conditions, or as probability distributions of countable graphs with natural sym-
metries (see Chapter 4).
It is an important property of this completion that it is compact, so that every infinite
sequence of graphs has a convergent subsequence. Several arguments in extremal graph theory
and elsewhere can be simplified by going to the limit and thereby getting rid of remainder terms.
We conclude this introduction by mentioning two related bodies of work. There are many in-
teresting question that concern the existence of graph homomorphisms rather than their number
(an example is 4-colorability of a graph), and such questions have been studied quite exten-
sively, especially by the Czech school. These results are described in the recent book by Hell
and Nešetřil [34]. The set of all homomorphisms between two graphs can be endowed with a
topological structure, which turns out to be an important tool in the study of chromatic number.
See the book of Matoušek [51], and also the recent papers of Babson and Kozlov [5, 6].
This paper is organized as follows. In Section 2 we define homomorphism numbers, including
homomorphisms into a measurable function, and describe the basic examples. In Section 3
3
we define connection matrices, study their rank, and use their semidefiniteness to characterize
homomorphism functions. We also describe an analogous (but much more difficult) edge-coloring
version. In Section 4 we define convergence of a sequence of dense graphs, and show that they
have interesting limit objects, which can be described as measurable functions, reflection positive
graph parameters, or very natural models of finite or countable random graphs. In Section 5, we
introduce a metric on graphs that corresponds to the above notion of convergence, and show its
connection with Szemerédi’s Regularity Lemma and graph property testing. Section 6 studies
homomorphisms from large graphs, rather than into large graphs, which leads to quantities of
both combinatorial and physical interest; we show that they too can be used to characterize
convergent graph sequences. Section 7 contains some applications to extremal graph theory. In
Section 8 we conclude with describing some analogous (but less complete) results for sequences
of graphs with bounded degree.
2 Homomorphism numbers
2.1 Unweighted and weighted graphs
A graph is simple if it has no loops or parallel edges. A graph parameter is a function defined on
finite graphs, invariant under isomorphisms. We’ll talk of a simple graph parameter if it is only
defined on simple graphs. Sometimes it is convenient to think of a simple graph parameter as a
function defined on all graphs with multiple edges (but no loops) that is invariant under adding
parallel edges. A graph parameter t is called multiplicative if t(G) = t(G1 )t(G2 ) whenever G is
the disjoint union of G1 and G2 . We say that a graph parameter is normalized if its value on
K1 , the graph with one node and no edge, is 1. (Note that if a graph parameter is multiplicative
and not identically 0, then its value on K0 , the graph with no nodes and edges, is 1. The graph
parameter t(·, G) introduced in the introduction is multiplicative and normalized for every graph
G.
Recall that for two (finite) simple graphs F and G, hom(F, G) denotes the number of homo-
morphisms (adjacency preserving maps) from F to G.
A weighted graph G is a graph with a weight αi (G) associated with each node i and a weight
βij (G) associated with each edge ij. We’ll assume (unless otherwise stated) that the weights
αi (G) are positive. The weights βij (G) will be real, and most often nonnegative. If the graph G
is understood from the context, we will also use the notation αi and βij .
An edge with weight 0 will play the same role as no edge between those nodes, so we could
assume that we only consider weighted complete graphs with loops at all nodes (but this is not
always convenient). A weighted graph is called softcore if it is a complete graph with loops at
each node, and every edgeweight is strictly positive. An unweighted graph is a weighted graph
where all the nodeweights and edgeweights are 1.
For a weighted graph G, we denote by α(G) the sum of its nodeweights. Often it will be useful
to divide all nodeweights by α(G), to get a weighted graph G b in which the sum of nodeweights
is 1.
Let F and G be two weighted graphs. To every map φ : V (F ) → V (G), we assign the weight
Y £ ¤βuv (F )
homφ (F, G) = βφ(u)φ(v) (G) (4)
uv∈E(F )
4
where Y £ ¤αu (F )
αφ = αφ(u) (G) . (6)
u∈V (F )
£ ¤βuv (F )
(A little care is necessary, since the exponential βφ(u)φ(v) (G) may not be well defined;
but it well defined e.g. if the edge weights are positive. This problem will not arise in the cases
we consider.)
We’ll use this definition most often in the case when F is a simple unweighted graph, so that
Y
αφ = αφ(u) (G)
u∈V (F )
and Y
homφ (F, G) = βφ(u)φ(v) (G).
uv∈E(F )
hom(F, G) b
t(F, G) = = hom(F, G).
α(G)|V (F )|
Let F = {F1 , F2 , . . . } denote the set of (isomorphism types of) all simple finite graphs. To
every weighted graph G, we assign its hom-profile (or briefly profile), the (infinite) vector
Recall that we define the scaled profile of G as the (infinite) vector tG = (t(F1 , G), t(F2 , G), . . . ) ∈
RF . The scaled profile does not determine the graph. For example, if G is an unweighted graph
and G0 is obtained from G by replacing every node by N independent nodes, then
tG0 = tG .
More generally, let G be a weighted graph and let u, v ∈ V (G) be twins, i.e., βuw (G) = βvw (G)
for every w ∈ V (G) (note that αu (G) may be different from αv (G)). Merging twins in a
weighted graph does not change its scaled profile. Furthermore, if we multiply all nodeweights
of a weighted graph by the same positive constant, then its scaled profile does not change. If we
merge twins as long as we can, we say that we have performed twin-reduction.
Proposition 2.1 [39] If two weighted graphs have the same scaled profile, then after twin-
reduction one can be obtained from the other by multiplying all nodeweights by the same positive
scalar.
One of our main concerns will be: if we only know a bounded number of entries of the scaled
profile of a graph G, and even this only approximately, to what degree is the graph determined?
From a graph-theoretic perspective, the following variations of homomorphism functions are
perhaps more important: Let inj(F, G) denote the number of homomorphisms that are injective
on the nodes, and ind(F, G), the number of embeddings as an induced subgraph. Finally, let
surj(F, G) denote the number of homomorphisms that are surjective on the nodes.
For weighted graphs, inj(F, G) and surj(F, G) are easily defined by restricting the sum in (5)
to sums over injective and surjective maps, respectively, but the definition of ind(F, G) requires
5
some care. Here we only consider the case where F is simple, and G is a weighted graph without
loops. We then define X
ind(F, G) = αφ indφ (F, G), (7)
φ: V (F )→V (G)
where the sum goes over all injective maps from V (F ) to V (G) and
Y Y
indφ (F, G) = βφ(u)φ(v) (G) (1 − βφ(u)φ(v) (G)), (8)
uv∈E(F ) uv∈E(F )
Here we restrict (Gn ) to be a sequence of simple graphs. The reason for the subscript on T is
that, in our paper [16], we will consider a more general class of graph parameters, which will be
denoted by T . Note also that in [44], the authors used T rather than T0 to denote the smaller
class considered here.
What about hom(F1 × F2 , G)? There is no identity for this number in terms of the notions
introduced so far, but there is one if one also introduces the operation of exponentiation (which
we do not discuss here; see [37]).
For a fixed graph G, identity (9) gives an algebraic relation between the entries of its hom-
profile. It was proved by Whitney [62] that there are no other algebraic relations between these
entries valid for all graphs G. A slightly stronger result was proved in [23]: the projection of T0
to the coordinates corresponding to any finite set of connected graphs is full-dimensional. This
excludes any other kind of equations (e.g. exponential) between these numbers.
6
There are simple identities relating homomorphism numbers with the injective and induced
versions. In order to spare the reader from separate provisos for each relation, we restrict
ourselves to the case where both F and G are simple graphs. If Θ is any equivalence relation on
V (F ), then we denote by F/Θ the graph obtained by identifying nodes that belong to the same
class of Θ. Note that this may create loops and parallel edges.
We have some easy relations:
X
hom(F, G) = inj(F/Θ, G) (12)
Θ
and X
inj(F, G) = ind(F 0 , G), (13)
F 0 ⊃F
where the sum runs over graphs F 0 ⊃ F with the same node set.
From these, we can get reverse relations by Möbius inversion (or inclusion-exclusion):
X 0
ind(F, G) = (−1)|E(F )\E(F )| inj(F 0 , G), (14)
F 0 ⊃F
and X
inj(F, G) = µ(Θ)hom(F/Θ, G), (15)
Θ
where the last sum runs over equivalence relations and
k ³
Y ´
µ(Θ) = (−1)(|A|−1) (|A| − 1)! ,
A∈Θ
where d1 , . . . , dn are the degrees of G. Hence hom(Sk , G)1/(k−1) tends to the maximum degree
of G as k → ∞.
Example 2.3 (Cycles and eigenvalues) Let Ck denote the cycle on k nodes, and again let
G be any graph on n nodes. Then
n
X
hom(Ck , G) = λki , (20)
i=1
where λ1 , . . . , λn are the eigenvalues of the adjacency matrix of G. Hence hom(C2k , G)1/(2k)
tends to the largest eigenvalue of G as k → ∞.
7
Example 2.4 (Independent sets) Let H be the graph on two nodes, with an edge connecting
the two nodes and a loop at one of the nodes. Then for every simple graph G, hom(G, H) is the
number of independent sets of nodes in G.
Example 2.5 (Colorings) It is easy to see that hom(G, Kq ) is the number of colorings of the
graph G with q colors. It is well known that for a fixed G, this number is a polynomial in q,
called the chromatic polynomial. The chromatic polynomial defines a graph invariant for every
complex number q, but this cannot be expressed as the number of homomorphisms into any
graph unless q is a nonnegative integer [27, 28] (cf. Example 3.4).
It is often useful to consider homomorphisms into a fixed graph H as generalized colorings,
where the colors are the nodes of H, and every edge of H imposes a constraint on the coloring
that these two colors cannot be used at adjacent nodes.
Example 2.6 (Maximum cut) Let H denote the looped complete graph on two nodes,
weighted as follows: the non-loop edge has weight 2; all other edges and nodes have weight
1. Then for every simple graph G with n nodes,
where MaxCut(G) denotes the size of the maximum cut in G. So unless G is very sparse,
log2 hom(G, H) is a good approximation of the maximum cut in G.
Example 2.7 (Random graphs) Let G = G(n, p) be a random graph with n nodes and edge-
density p. Then for every simple graph F with k nodes,
Example 2.8 (Partition functions of the Ising model) Let G be any simple graph, and
let T > 0, h ≥ 0, and J be three real parameters. Let H be the looped complete graph on
two nodes, denoted by + and −, weighted as follows: α+ = eh/T , α− = e−h/T , β++ = β−− ,
β+− = β−+ , and β++ /β+− = e2J/T . Then hom(G, H) is the partition function of the Ising
model on the graph G at temperature T with coupling J in external magnetic field h.
(We can think of the interval [0, 1] as the set of nodes, and of the value W (x, y) as the weight
of the edge xy.) While this definition is meaningful for all graphs F , we will mostly use it for
simple graphs.
It is easy to see that for every weighted graph G, the graph parameter t(·, G) is a special case.
We may assume that V (G) = {1, . . . , n} and α(G) = 1. Define a function WG : [0, 1]2 → [0, 1]
as follows. For (x, y) ∈ [0, 1]2 , let a and b be determined by
8
and let
WG (x, y) = βab (G).
(Informally, WG is obtained by replacing the (i, j) entry in the weighted adjacency matrix of G
by a rectangle of size αi × αj , and define the function value on this square as βij .) Then
t(F, G) = t(F, WG )
Example 2.9 For an undirected simple graph F , let eul(F ) denote the number of eulerian
orientations of F (i.e., orientations in which every node has the same outdegree as indegree). By
Euler’s theorem, eul(F ) = 0 if and only if F has a node with odd degree.
It can be shown [44] that this graph parameter can be represented in the form t(·, W ), where
On the other hand, it follows e.g. from Theorem 3.6 below that eul is not of the form hom(·, G)
with any finite weighted graph G.
3 Connection matrices
3.1 The connection matrix of a graph parameter
A k-labeled graph (k ≥ 0) is a finite graph in which k nodes are labeled by 1, 2, . . . k. Two
k-labeled graphs are isomorphic, if there is a label-preserving isomorphism between them. We
denote by Kk the k-labeled complete graph on k-nodes, and by Ok , the k-labeled graph on k
nodes with no edges.
Let G1 and G2 be two k-labeled graphs. Their product G1 G2 is defined as follows: we take
their disjoint union, and then identify nodes with the same label. Clearly this multiplication is
associative and commutative. For 0-labeled graphs, this notation is in line with our notation for
disjoint union.
The following construction is central to the theory of homomorphisms functions. Let f be
any graph parameter. For every integer k ≥ 0, we define the following (infinite) matrix M (f, k).
The rows and columns are indexed by isomorphism types of k-labeled graphs. The entry in the
intersection of the row corresponding to G1 and the column corresponding to G2 is f (G1 G2 ).
We call the matrices M (f, k) the connection matrices of the graph parameter f (see Figure 1).
For a simple graph parameter, the above construction causes trouble if we get multiple edges
when gluing the two graphs. In this case, we suppress the edge multiplicities in G1 G2 when
defining the entry corresponding to the pair (G1 , G2 ).
Example 3.1 (Edges) Let e(G) = |E(G)| denote the number of edges in G. Then e(G1 G2 ) =
e(G1 ) + e(G2 ), and so M (e, k) is the sum of two matrices of rank 1. Thus M (e, k) has rank 2,
so rk(e, k) = 2 for all k.
9
k=2: ...
...
Figure 1: A small part of the connection matrix for k = 2. The matrix entries are obtained by
evaluating the parameter on the graph shown.
If we restrict e(G) to simple graphs G to get a simple graph parameter e0 , then the situation
is more complicated: we have
e0 (G1 G2 ) = e0 (G1 ) + e0 (G2 ) − e0 (G1 ∩ G2 ).
¡ ¢
Rewriting e0 (G1 ∩ G2 ) as x(G1 )T x(G2 ) where x(G) is the k2 -dimensional vector with entries
xij (G) = 1 if G contains an edge joining the labeled vertices i and j and xij (G) = 0 otherwise,
¡ ¢
we see that the matrix whose (G1 , G2 ) entry is e0 (G1 ∩G2 ) has rank k2 , implying that rk(e0 , k) ≤
¡k¢
2 + 2. One can check that this is the exact value.
Example 3.2 (Subgraphs) Let subg(G) denote the number of spanning subgraphs of G, i.e.,
subg(G) = 2e(G) . Then subg(G1 G2 ) = subg(G1 )subg(G2 ), and so M (subg, k) has rank 1. Thus
rk(subg, k) = 1 for all k.
Again, the version when we only consider simple graphs is more complicated: Let subg0 (G)
denote this simple graph parameter. Then
subg0 (G1 )subg0 (G2 )
subg0 (G1 G2 ) = .
subg0 (G1 ∩ G2 )
The first two factors do not change the rank, and the rows of the matrix given by the second
factor are determined by the edges induced by the labeled nodes, so it has only 2(2) different
k
rows, implying that rk(subg0 , k) ≤ 2(2) . Again one can check that this is the exact value.
k
Example 3.3 (Matchings) Let pmatch(G) denote the number of perfect matchings in the
graph G. It is trivial that pmatch(G) is multiplicative. We claim that its node-rank-connectivity
is exponentially bounded:
rpmatch (k)≤2k .
Let G be a k-labeled graph, let X ⊆ [k] = {1, . . . , k}, and let pmatch(G, X) denote the
number of matchings in G that match all the unlabeled nodes and the nodes with label in X,
but not any of the other labeled nodes. Then we have for any two k-labeled graphs G1 , G2
X
pmatch(G1 G2 ) = pmatch(G1 , X1 )pmatch(G2 , X2 ).
X1 ∩X2 =∅, X1 ∪X2 =[k]
10
This can be read as follows: The matrix M (pmatch, k) can be written as a product N T W N ,
where N has infinitely many rows indexed by k-labeled graphs, but only 2k columns, indexed
by subsets of [k],
NG,X = pmatch(G, X),
and W is a symmetric 2k × 2k matrix, where
(
1 if X1 = [k] \ X2 ,
WX1 ,X2 =
0 otherwise.
Hence the rank of M (pmatch, k) is at most 2k (it is not hard to see that in fact equality holds).
Example 3.4 (Chromatic polynomial) We have seen that the number of q-colorings is a
special case of homomorphism functions. This number is the evaluation of the chromatic poly-
nomial chr(G; x) at nonnegative integers q. What about evaluations at other values? It turns
out that these evaluations violate both conditions in Theorem 3.6 below [27]. For every fixed x,
this is a multiplicative graph parameter. To describe its rank-connectivity, we need the following
notation. For k, q ∈ Z+ , let Bk,q denote the number of partitions of a k-element set into at most
q parts. So Bk = Bk,k is the k-th Bell number. With this notation,
(
Bk,x if x is a positive integer,
rk(chr, k) =
Bk otherwise.
Note that this is always finite: if x is a positive integer, then it is bounded by xk , but otherwise
it grows faster than ck for every c. Similar results can be derived for the Tutte polynomial,
where the exceptional values are the hyperbolas in the Tutte plane for which (x − 1)(y − 1) is a
positive integer.
Let f be a graph parameter that is not identically 0. Then f is multiplicative if and only if
f (K0 ) = 1 (K0 is the empty graph) and rk(f, 0) = 1. Every multiplicative graph parameter f
satisfies the inequality
rk(f, k + l) ≥ rk(f, k) · rk(f, l). (21)
In the most important special case (to be discussed below) when f is a homomorphism function,
a stronger version of property (21) holds: the sequence rk(f, k) is logconvex. We do not know
if this property holds for more general graph parameters.
Finiteness of the rank connectivity function has interesting algorithmic consequences:
Theorem 3.5 [27] If r(f, k) is finite for some k, then f can be computed in polynomial time
for graphs with treewidth at most k.
Theorem 3.6 [28] The graph parameter f , defined on graphs with multiple edges but no loops,
is equal to hom(·, H) for some weighted graph H on q nodes if and only if
(a) M (f, k) is positive semidefinite and
(b) rk(f, k) ≤ q k for all k.
11
In terms of statistical physics, this theorem can be viewed as a characterization of partition
functions of models whose degrees of freedom sit on vertices (as opposed to the edge color-
ing models considered below). The property that M (f, k) is positive semidefinite is related to
the “reflection positivity” property in statistical physics, and we will call a graph parameter
reflection positive if M (f, k) is positive semidefinite for every k.
The proof of the necessity of the conditions in Theorem 3.6 is easy and it is instructive to
present it here. (The sufficiency is more involved, and the proof is based on algebraic consider-
ations.)
We need the following notation: For any k-labeled graph G and mapping φ : [k] → V (H),
let X αψ
homφ (G, H) = homψ (G, H), (22)
ψ: V (G)→V (H)
αφ
ψ extends φ
so that X
hom(G, H) = αφ homφ (G, H). (23)
φ: [k]→V (H)
The decomposition (23) writes the matrix M (f, k) as the sum of |V (H)|k matrices, one for each
mapping φ : [k] → V (H); (24) shows that these matrices are positive semidefinite and have
rank 1.
In the presence of condition (a), condition (b) in Theorem 3.6 can be replaced by the following
quite different type of condition. To formulate it, we need the notion of a quantum graph, defined
as a formal linear combination of graphs with real coefficients; but the condition concerns only
the existence of a single 2-labeled quantum graph [45]:
(c) There is a 2-labeled quantum graph g0 with the following property: if G is a 2-labeled
graph having no edge between the labeled nodes, and G0 denotes the graph obtained from G by
identifying the two labeled nodes, then f (g0 G) = f (G0 ).
In other words, attaching g0 at two nodes is effectively the same as identifying the two nodes
(this is why it is called a “contractor” in [45]).
Let us conclude with a discussion of the independence of the two conditions in Theorem 3.6.
In Example 3.3 we saw that the number of perfect matchings in a graph provides an example
for a graph parameter for which the rank of connection matrices grows simply exponentially.
This parameter is also multiplicative, so for k = 0 the connection matrix is positive semidefinite.
But it is easy to see that for k = 1, the submatrix indexed by K1 and K2 is
µ ¶
0 1
1 0
which is not positive semidefinite. Thus the number of perfect matchings cannot be represented
as a homomorphism function.
Recalling Examples 3.1 and 3.2, let us define the multigraph parameter f by f (G) =
0
1/subg0 (G) = 2−e(G ) , where G0 is obtained from G by removing duplicate edges. As in Ex-
ample 3.2, the rank of the connection matrix M (f, k) grows as 2(2) . It is further not hard
k
to check that M (f, k) is positive semidefinite. The graph parameter f is, in fact, the limit of
parameters of the form hom(·, H): take homomorphisms into a random graph H = G(n, 1/2),
with all nodeweights 1/n and all edge-weights 1. But the rank of its connection matrices is finite
but superexponential, so the parameter is not of the form hom(·, H).
This example also illustrates the importance of the condition that f is defined on graphs with
multiple edges for the validity of 3.6. Indeed, for a simple graph G (i.e., if G has no multiple
12
0
edges), f (G) = 2−e (G) can be represented as the number of homomorphisms into the graph
K1 (1/2), consisting of a single node with a loop, where the node has weight 1 and the loop has
weight 1/2.
The chromatic polynomial (Example 3.4) was another example whose connection matrices
had superexponential rank growth if the variable x was not a nonnegative integer. Here the
reflection positivity condition gives the same condition on x: the k-th connection matrix is
positive semidefinite if and only if either x is a positive integer or k ≤ x + 1. Thus M (chr, k) is
semidefinite for all k if and only if x is a positive integer.
Theorem 3.7 [39] Assume that the target graph H is twin-free. Then for every k,
rk(hom(·, H), k) is the number of orbits of the automorphism group of H on ordered k-tuples
of its nodes.
It is worthwhile to formulate two corollaries.
Corollary 3.8 Let H be a weighted graph that has no twins and no automorphisms. Then
rk(hom(·, H), k) = |V (H)|k for every k.
Swapping twins i and j is “almost” an automorphism: the only additional condition needed is
that αi = αj . In particular, for unweighted graphs the condition that there are no automorphisms
implies that there are no twins.
Our second corollary is in fact an equivalent reformulation of Theorem 3.7 in the framework
of quantum graphs. To state it, we need the following notation: given a weighted graph H,
a k-labeled quantum graph x and nodes i1 , . . . , ik ∈ V (H), we define homi1 ...ik (x, H) to be the
number of homomorphisms from x to H such that the labeled nodes are mapped into i1 . . . ik .
Corollary 3.9 Let H be a weighted graph that has no twins, and let h : V (H)k → R. Then there
exists a k-labeled quantum graph x such that homi1 ...ik (x, H) = h(i1 , . . . , ik ) for all i1 , . . . , ik ∈
V (H) if and only if h is invariant under the automorphisms of H.
loop(G) = 2−#loops
13
with loops, each loop e in the target graph H must have two weights: one which is used when a
non-loop edge of G is mapped onto e, and the other, when a loop of G is mapped onto e. With
this modification, the theorem remains valid.
The constructions and results above are in fact more general; they extend to directed graphs
and hypergraphs. One new element in the case of directed graphs is the following. For a directed
graph, homomorphism functions can be defined by the same formulas (4) and (5), except that
weights are now assigned to the directed edges of G and H. But there are (at least) two
substantially different ways to define connection matrices.
(1) The easier way is to define k-labeled digraphs similarly, and glue them together just like
we did in the undirected case. The related theorem and proof are precisely the same as above.
(2) The alternative generalization of Theorem 3.6 to directed graphs is a bit more interesting.
(Example 3.13 below is a case when this second theorem applies.) For this, we consider weighted
directed graphs in which the edgeweights can be complex. Such a graph H is called Hermitian if
for every arc uv with (complex) weight βuv , the arc vu is also present and has weight βvu = β uv .
For any directed graph D, let D∗ be the digraph obtained from D by reversing all arcs. For
two k-labeled digraphs D and D0 , let DD0 denote, as before, their union with the labeled nodes
identified. For any complex-valued digraph parameter f defined on loopless directed graphs and
for each natural number k, we define the matrix M̃ (f, k) as follows: its rows and columns are
indexed by k-labeled directed graphs, and the entry in position D1 , D2 is f (D1∗ D2 ).
Theorem 3.10 [40] Let f be a complex valued digraph parameter. Then f = hom(·, H) for
some Hermitian weighted digraph H if and only if f (K0 ) = 1 and there exists a d ≥ 0 such that,
for each k ≥ 0, M̃k is positive semidefinite and has rank at most dk .
Note that M̃k is a complex valued matrix. The condition that it is positive semidefinite
includes the condition that it is Hermitian.
There is a common formulation of these results, using semigroups; see [40] for details.
(It will be also useful to allow a single edge with no endpoints; we call this graph the circle, and
denote it by °. By definition, col(°, h) = |C|.)
We can define edge-connection matrices that are analogous to the connection matrices defined
before: Instead of gluing graphs together along nodes, we glue them together along edges. To be
1 For this reason, these models are usually called vertex models in the physics literature.
14
precise, we define a k-broken graph as a k-labeled graph in which the labeled nodes have degree
one. (It is best to think of the labeled nodes not as nodes of the graph, but rather as points
where the k edges sticking out of the rest of the graph are broken off.) We allow that both ends
of an edge be broken off.
For two k-broken graphs G1 and G2 , we define G∗1 G2 by gluing together the corresponding
broken ends of G1 and G2 . These ends are not nodes of the resulting graph any more, so G∗1 G2
is different from the graph G1 G2 we would obtain by gluing together G1 and G2 as k-labeled
graphs. One very important difference is that while G1 G2 is k-labeled, G∗1 G2 has no broken
edges any more, and so it is not k-broken. This fact leads to considerable difficulties in the
treatment of edge models.
For every graph parameter f and integer k ≥ 0, we define the edge-connection matrix M 0 (f, k)
as follows. The rows and columns are indexed by isomorphism types of k-broken graphs. The
entry in the intersection of the row corresponding to G1 and the column corresponding to G2 is
f (G∗1 G2 ). Note that for k = 0, we have M (f, 0) = M 0 (f, 0), but for other values of k, connection
and edge-connection matrices are different. We say that f is edge reflection positive, if M 0 (f, k)
is positive semidefinite for every k ≥ 0.
It is easy to see (similarly as in the case of homomorphism functions) that if h : ZC + → R+
and f = col(·, h) is an edge-coloring function, then
rk(M 0 (f, k)) ≤ |C|k ,
and M 0 (f, k) is positive semidefinite. Unlike in the case of node-connection matrices, these two
properties are not independent any more:
Proposition 3.11 [58] If f is a multiplicative graph parameter such that M 0 (f, k) is positive
semidefinite for every k ≥ 0, then f (°) is a nonnegative integer and rk(M 0 (f, k)) ≤ f (°)k .
The analogue of Theorem 3.6 is even simpler to state (but much more difficult to prove:
Theorem 3.12 [58] A graph parameter f can be represented as f (·) = col(·, h) for some edge
coloring model h if and only if it is multiplicative and edge-reflection positive.
Just as for homomorphism functions, it is natural to ask what determines the rank of con-
nection matrices of edge models. This question seems to lead to difficult algebraic questions in
group representations, and is unanswered at this time.
otherwise, the node evaluates to 0. It is easy to see that an edge-coloring has nonzero weight
only if it comes from a homomorphism, and in that case, the weight of the edge-coloring is the
same as the weight of the corresponding homomorphism.
15
It is not obvious, but it is true, that undirected edge coloring functions generalize undirected
homomorphism functions [58], at least if complex values are allowed for h. It is not clear which
(real) homomorphism functions can be obtained as edge coloring functions with a real valued h.
In the opposite direction, edge coloring models cannot be translated into node coloring mod-
els (homomorphisms) in general; but there are some nontrivial examples of important graph
parameters that are defined as edge coloring functions, but that can also be represented as ho-
momorphism functions in a nontrivial way. A general understanding of these examples would
be very interesting.
Example 3.13 (Nowhere-zero flows) Let eul(G) = 1 if G is eulerian (i.e., all nodes have even
degree), and eul(G) = 0 otherwise. To represent this function as a homomorphism function, let
µ ¶ µ ¶
1/2 1 −1
a= , B=
1/2 −1 1
It was noted by de la Harpe and Jones [33] that for the weighted graph H = (a, B) we have
hom(G, H) = eul(G).
This example can be generalized quite a bit. Let Γ be a finite abelian group and let S ⊆ Γ
be such that S is closed under inversion. For any graph G, fix an orientation of the edges. An
S-flow is an assignment of an element of S to each edge such that for each node v, the product
of elements assigned to edges entering v is the same as the product of elements assigned to
the edges leaving v. Let sflo(G) be the number of S-flows. This number is independent of the
orientation.
The choice Γ = Z2 and S = Z2 \ {0} gives the special case above (incidence function of
eulerian graphs). If Γ = S = Z2 , then sflo(G) is the number of eulerian subgraphs of G. Perhaps
the most interesting special case is when |Γ| = t and S = Γ \ {0}, which gives the number of
nowhere zero t-flows.
Surprisingly, this parameter (which is an edge coloring model) can be described as a homo-
morphism function. Let Γ∗ be the character group of Γ. Let H be the complete directed graph
(with all loops) on Γ∗ . Let αχ := 1/|Γ| for each χ ∈ Γ∗ , and let
X
βχχ0 := χ−1 (s)χ0 (s),
s∈S
Example 3.14 (Matchings revisited) We have seen that the number pmatch(G) of perfect
matchings has exponential rank connectivity but is not reflection positive, and hence it is not a
homomorphism function. However, consider the following weighted graph Hx : We take a looped
complete graph on two nodes u and v, and define
1 1
α(u) = , α(v) = − ,
x x
and
β(uu) = x + 1, β(uv) = β(vv) = 1.
16
Then the following surprising fact holds:
(this is the asymptotic number of labeled copies of F in a random graph with edge probability
p). The definition is usually formulated in terms of the number of injections (labeled copies) of
F into G, but the two differ only in lower order terms, which are swallowed by the o(1) in the
definition.
It turns out that (25) implies many other properties that are familiar from the theory of
random graphs; for example, almost all degrees are about pn, almost all codegrees are about
p2 n etc. Many of these properties characterize quasirandom graphs, and so these provide many
equivalent ways to define a quasirandom sequence [19, 59]. Quasirandomness is closely related
to Szemerédi’s lemma [56, 57]. One of the most surprising facts proved in [19] is that it is enough
to require the condition about the number of copies of F for just two graphs, namely K2 (which
just defines the edge density p) and the 4-cycle C4 . This fact can be stated and proved in a
simpler way using the “limit set” T0 defined in Section 2.1:
Theorem 4.1 If t ∈ T0 satisfies t(C4 ) = t(K2 )4 , then for every simple graph H,
In other words, t(H) is the expected profile of a random graph G(n, p) with p = t(K2 ); it is
also the profile of the weighted graph consisting of a single node and a loop with weight p (the
weight of the node does not matter).
To illustrate the power of reflection positivity, we give a proof of this theorem (the proof goes
along the lines of the original, just the details are simpler).
Proof. Let p = t(K2 ). We first prove the conclusion for stars K1,j :
t(K1,j ) = pj . (26)
Starting with t(K1,2 ), let us first consider the connection matrix M (t, 1) and its 2 × 2 submatrix
formed by the rows and columns corresponding to the graph K1 and K2 . Positivep semidefiniteness
of this matrix gives t(K1,2 ) ≥ t(K2 )2 = p2 . On the other hand t(K1,2 ) ≤ t(C4 ) = p2 by
positive semidefiniteness of the 2 × 2 submatrix of M (t, 2) indexed by K1,2 (with its endpoints
labeled) and K 2 , the empty graph on two nodes. The above two inequalities give t(K1,2 ) = p2 ,
which proves (26) for j = 2. To prove the identity for j > 2, we again consider the connection
matrix M (t, 1). By the identity we just established, its 2 × 2 submatrix formed by the rows and
columns corresponding to the graph K1 and K2 has 0 determinant. By positive semidefiniteness,
17
the corresponding two rows of the whole connection matrix M (t, 1) are proportional. But this
means that
t(K1,j+1 ) = pt(K1,j )
for every j, from which (26) follows by induction.
Next we show that for all complete bipartite graphs K2,j :
Since t(K2,2 ) = t(C4 ) = p4 by assumption, this is true for j = 2. For general j, it follows just
like (26) from the positive semidefiniteness of the matrix M (t, 2).
Now we prove the general case by a similar induction. Let us view the graph H as glued
together from a star K1,d and a graph F on one fewer nodes, along the set T of the leaves of the
star, and suppose that we know the assertion for F . Consider the matrix M (t, d) and its 2 × 2
submatrix formed by the rows and columns indexed by K d (the graph on d labeled nodes with
no edge) and K1,d (with the leaves labeled). By (26) and (27), this submatrix is singular, and
hence these two rows of the whole matrix are proportional, the second row is pd times the first.
But the graphs F and H define two elements of these rows above each other, so
Example 4.2 Let G(n, p) be a random graph on n nodes with edge-density p; the sequence
(G(n, p), n = 1, 2, . . . ) is convergent with probability 1. The limiting simple graph parameter is
given by t(F ) = p|E(F )| .
By definition, every quasirandom graph sequence with density p is also convergent, and the
homomorphism densities into it tend to the same value.
18
A generalized quasirandom graph sequence (Gn ) with model graph H (or briefly H-
quasirandom sequence) is defined by the property that for every fixed finite graph F ,
t(F, Gn ) −→ t(F, H) (n → ∞).
In other words, the number of homomorphisms of F into Gn is approximately the same as
the expected number of homomorphisms of F into a generalized random graph G(N, H) on
N = |V (Gn )| nodes.
This definition suggests that we should consider the graph H as the “limit” of the H-
quasirandom sequence. The definition of a quasirandom sequence of graphs (with edge-density
p) is equivalent to saying that the sequence converges to K1 (p). (Warning: not every convergent
sequence will have a limit of this form!)
In view of the theory of quasirandom graphs, we can ask the following two basic questions
concerning generalized quasirandom graphs:
(a) Is it enough to require the condition concerning the number of copies of F for a finite set
of graphs Fi (depending on α and β)?
(b) Is the structure of a generalized quasirandom graph Gn similar to a generalized random
graph?
To be more precise, we want that the nodes of Gn can be partitioned into q classes U1 , . . . , Uq
of sizes α1 n,. . . ,αq n so that the graph spanned by Ui is quasirandom with density βii , and the
bipartite graph formed by the edges between Ui and Uj is quasirandom with density βij .
The answer to the first two questions is in the affirmative. More precisely, the following
theorems hold.
Theorem 4.3 [43] Let H be a weighted graph with V (H) = [q], nodeweights (αi : i = 1, . . . , q)
and edgeweights (βij : i, j = 1, . . . , q). Let (Gn , n = 1, 2, . . . ) be an H-quasirandom sequence of
unweighted simple graphs. Then for every n there exists a partition V (Gn ) = {U1 , . . . , Uq ) such
that
|Ui |
(a) → αi (i = 1, . . . , q),
|V (Gn )|
(b) the subgraph of Gn induced by Ui is a quasirandom graph sequence with edge density βii ,
and
(c) the bipartite subgraph between Ui and Uj is a quasirandom bipartite graph sequence with
density βij .
Theorem 4.4 [43] Let H be a weighted graph with V (H) = [q]. A sequence (Gn , n = 1, 2, . . . )
is H-quasirandom if and only if
t(F, Gn ) −→ t(F, H) (t → ∞)
for every graph F with at most (10q)q nodes.
19
Theorem 4.5 [44] A simple graph parameter f is in T0 if and only f is normalized, multiplica-
tive and reflection positive.
W0 = {W ∈ W : 0 ≤ W ≤ 1}.
Theorem 4.6 [44] A simple graph parameter f is in T0 if and only if there is a function W ∈ W0
such that f = t(·, W ).
This function W is not unique: for example, W (1 − x, 1 − y) will define the same graph
parameter. More generally, if φ : [0, 1] → [0, 1] is a measure preserving map (not necessarily
bijective), then
W φ (x, y) = W (φ(x), φ(y))
defines the same parameter. The following theorem says that this is all: Let us call functions
W1 , W2 ∈ W equal up to measure preserving transformation if there is a third function W ∈ W
and measure preserving maps φ1 , φ2 : [0, 1] → [0, 1] such that Wi = W φi .
Theorem 4.7 [14] Two functions W1 , W2 ∈ W define the same simple graph parameter if and
only if they are equal up to measure preserving transformation.
Theorem 4.8 [44] With probability 1, the graph sequence G(n, W ) is convergent, and its limit
is the function W .
Let us define a random graph model as a distribution Gn on simple graphs on [n] , for every
n ∈ Z+ . The random graph model G(n, W ) defined above has the following three obvious
properties:
(i) The distribution of Gn is invariant under relabeling nodes;
(ii) If we delete node n from Gn , the distribution of the resulting graph is the same as the
distribution of Gn−1 ;
(iii) for every 1 < k < n, the subgraphs of G induced by [k] and {k+1, . . . , n} are independent
(as random variables).
It turns out that these three properties characterize the model G(n, W ):
Theorem 4.9 [44] A random graph model is of the form G(n, W ) for some function W ∈ W0
if and only if it satisfies conditions (i), (ii) and (iii). Furthermore, two functions W1 , W2 ∈ W0
define the same random graph model if and only if they are equal up to measure preserving
transformation.
20
We have seen that the limit of ordinary random graphs G(n, 1/2) is the function W ≡ 1/2.
It is, however, quite natural to think that the limit of ordinary random graphs should be the
Rado graph (the countable random graph). It turns out that this is also true in the following
sense: For every W ∈ W0 , we can define a countable random graph G(ω, W ) on Z+ , by choosing
an infinite sequence (X0 , X1 , . . . ) of independent uniform samples from [0, 1], and connecting i
and j with probability W (Xi , Xj ). A countable random graph model is a probability distribution
on graphs on Z+ (with the σ-algebra generated by cylinders consisting of all graph containing a
given edge).
Theorem 4.10 [48] A countable graph model is of the form G(ω, W ) if and only if it satisfies (i)
and (iii) above. Furthermore, the countable graph model G(ω, W ) determines W up to measure
preserving transformation.
Thus it is justified to say that with probability 1, G(n, 1/2) converges to the Rado graph
G(ω, 1/2). A word of caution is warranted here: as an unlabeled graph, G(ω, 1/2) is isomorphic
to G(n, 1/3) with probability 1. So viewing the Rado graph as an unlabeled graph would
not contain enough information to characterize the limit; we have to view it as a probability
distribution over graphs on a fixed countable set of nodes.
4.4.4 Examples
Example 4.11 Consider the half-graphs Hn,n : they are bipartite graphs on 2n nodes
{1, . . . , n, 10 , . . . , n0 }, where i is connected to j 0 if and only if i ≤ j 0 . It is easy to see that
this sequence is convergent. Indeed, let F be a simple graph with k nodes; we show that the
limit of t(F, Hn,n ) exists. We may assume that F is connected. If F is non-bipartite, then
t(F, Hn,n ) = 0 for all n, so suppose that F is bipartite; let V (F ) = V1 ∪ V2 be its (unique)
bipartition. Then every homomorphism of F into H preserves the 2-coloring, and so the homo-
morphisms split into two classes: those that map V1 into {1, . . . , n} and those that map it into
{10 , . . . , n0 }. By the symmetry of the half-graphs, these two classes have the same cardinality.
Now F defines a partial order P on V (F ), where u ≤ v if and only if u = v or u ∈ V1 ,
v ∈ V2 , and uv ∈ E. With respect to this partial order, 12 hom(F, Hn,n ) is just the number of
order-preserving maps from V (F ) to the chain {1, . . . , n}, and so
1
hom(F, Hn,n ) 2 hom(F, Hn,n )
2k−1 t(F, Hn,n ) = 2k−1 · =
(2n)k nk
21
graph with node set [0, 1] and edge-set {xy : x ≤ 1/2, y > 1/2, x ≤ y − 1/2}. More precisely,
the limit is given by the function
(
1, if x ≥ y + 12 or y ≥ x + 12 ,
W (x, y) =
0, otherwise.
22
This norm is closely related to `∞ → `1 norm, which can be defined by
kAk∞→1 = max |uT Av| = max uT Av; (29)
u,v∈[−1,1]n u,v∈{−1,1}n
in fact,
1
kAk∞→1 ≤ kAk¤ ≤ kAk∞→1 . (30)
4
For symmetric matrices A, the norm
kAk0¤ = max |uT Au| (31)
u∈{0,1}n
Note that we are dividing by n2 and not by |S| × |T |, so the contribution of a pair S, T is at
most |T | × |S|/n2 . Thus small sets of size o(n) play no role when measuring the distance. In
terms of the adjacency matrices A and A0 of G and G0 , respectively, this can be expressed as
1° °
°A − A0 ° .
d¤ (G, G0 ) = 2 ¤
n
Note that the definition can be extended to the case when G and G0 have edgeweights. Further-
more, by (30) and (32), we could replace the k.k¤ norm in the definition by one of the other
matrix norms defined above without distorting the distance by more than a constant factor.
We need to extend this notion to weighted graphs on the same set of nodes. Let G and G0
be weighted graphs with V (G) = V (G0 ). We assume that G and G0 both have total nodeweight
1, but the weights of individual nodes in G and G0 may be different. Then we define
X
d¤ (G, G0 ) = |αi (G) − αi (G0 )|
i
¯X¡ ¢¯¯ (33)
¯
+ max ¯ αi (G)αj (G)βij (G) − αi (G0 )αj (G0 )βij (G0 ) ¯.
S,T ⊆V (G)
i∈S
j∈T
23
If the two graphs do not both have total nodeweights of one, then we simply define the distance
d¤ in terms of the corresponding “normalized” graphs, i.e., we replace αi (G) with αi (G)/α(G),
and similarly for G0 .
where G̃ and G̃0 range over all labelings of G and G0 by 1, . . . , n, respectively (of course, we
could fix the labeling of one of the graphs).
Consider any labeling that attains the minimum in the definition of δb¤ , and identify the nodes
of G and G0 with the same label. In this case, we say that G and G0 are optimally overlaid.
and
n
X
Xiu = αu (G0 ).
i=1
We think of Xiu as the portion of node i that is mapped onto node u. We call such a matrix
X a fractional overlay of G and G0 . Let X (G, G0 ) denote the set of all fractional overlays. Note
that for every X ∈ X (G, G0 ),
0 0
n X
X n n
X n
X
Xiu = αi (G) = αu (G0 ) = 1.
i=1 u=1 i=1 u=1
(If we view αG and αG0 as probability distributions, then every X ∈ X (G, G0 ) is a coupling of
these distributions.)
For each fractional overlay, we construct the following two weighted graphs. The nodes of
G[X] are all pairs (i, u) where 1 ≤ i ≤ n and 1 ≤ u ≤ n0 . The weight of the node (i, u) is Xiu ,
and the weight of the edge ((i, u), (j, v)) is βij . The other graph G0 [X T ] is defined similarly,
except that the roles of i and u are interchanged. Now the node sets of G[X] and G0 [X T ] are
labeled by the same set of pairs (i, u), so their distances are well defined.
Thus we can define the distance of two weighted unlabeled graphs G and G0 (with total
nodeweight 1):
δ¤ (G, G0 ) = min 0 d¤ (G[X], G0 [X T ]).
X∈X (G,G )
24
We can express this distance in terms of the original graphs G and G0 by the following
formula: ¯ X ¡ ¢¯¯
¯
δ¤ (G, G0 ) = min 0 max 0 ¯ Xiu Xjv βij (G) − βuv (G0 ) ¯. (35)
X∈X (G,G ) S,T ⊆V ×V
(i,u)∈S
(j,v)∈T
Of course, this definition also applies if G and G0 have the same number of nodes; however,
it may give a different value than (34). It is proved in [16] that there is a constant c > 0 such
that
δ¤ (G, G0 ) ≤ δb¤ (G, G0 ) ≤ cδ¤ (G, G0 )1/4 (36)
(the lower bound is trivial; we do not have an example showing that the exponent 1/4 is needed
in the upper bound). While the definition of δb¤ is more straightforward, the distance δ¤ will
be easier to work with, and we will use mostly the latter distance.
A very special weighted graph is K1 (p): a single node with a loop with weight p. For the
random graph G = G(n, p), we have a.s.
Let (Gn ) be a sequence of simple graphs. It follows by standard results on quasirandom graphs
that
Proposition 5.1 [19] A sequence (Gn ) of graphs is quasirandom if and only if δ¤ (Gn , K1 (p)) →
0 as n → ∞.
The following result connects this distance to homomorphism functions.
where d is the density d = eG (U, W )/(|U | · |W |). Let twr(ε) denote the d1/ε2 e times iterated
exponential function (the “tower”). With this notation, we can state one version of the Regularity
Lemma:
Lemma 5.3 (Szemerédi Regularity Lemma) For every ε > 0 and every graph G = (V, E)
there is a partition P of V into k ≤ twr(ε) classes V1 , . . . , Vk such that
X
irregG (Vi , Vj ) ≤ ε|V |2 .
1≤i<j≤k
(While this form is perhaps easiest to state and prove, there are equivalent forms that are more
suited for applications. We refer to [35] for a survey, to [55] for connections with quasirandom
graphs, and to [47] for analytic aspects of the Regularity Lemma.)
The appearance of the tower function in the Lemma forbids practical applications (and
unfortunately this bound on the number of parts is not far from best possible, as it was shown
by Gowers [32]). A more reasonable threshold was proved by Frieze and Kannan [29], at the cost
of using a weaker measure of irregularity.
25
Given a graph G = (V, E) and a partition P = {V1 , . . . , Vk } of V , we define a weighted graph
GP on V by setting αu (GP ) = 1 and βuv (GP ) = di(u)i(v) , where i(u) is the index of the set Vi
containing u and dij = eG (Vi , Vj )/(|Vi ||Vj |). The edgeweight matrix of GP is thus obtained from
the adjacency matrix of G by replacing each entry in the block Vi × Vj by the average over the
block. In this notation, the result of Frieze and Kannan [29] can be formulated as follows:
Lemma 5.4 (Weak Regularity Lemma) For every ε > 0 and every graph G = (V, E), there
2
exists a partition P of V into k ≤ 22/ε classes such that d¤ (G, GP ) ≤ ε.
The bound on the number of partition classes is still rather large (exponential), but at least
not a tower. Frieze and Kannan show that the partition can be obtained as an “overlay” of only
1/ε2 sets, so it has a description that is polynomial in 1/ε, which in some applications leads to
polynomial time algorithms (see e.g. [2]).
The Weak Regularity Lemma immediately implies the following slight variant:
Lemma 5.5 [16] For every ε > 0 and every graph G = (V, E), there is a weighted graph H with
2
at most d21/ε e nodes such that δ¤ (G, H) ≤ ε.
We note that other versions strengthen the conclusion (of course, at the cost of replacing
the tower function by an even more huge value). Such a “super-strong” Regularity Lemma was
proved and used by Alon and Shapira [4]. It would be interesting to fit the original Regularity
Lemma or one of its applications into this framework.
Theorem 5.6 Let G1 and G2 be two graphs on the same set of nodes V , let ε = d¤ (G1 , G2 ),
δ > 0, and let S be a random k-subset of V .
(a) If k ≥ 300 2
ε2 log( δ ), then with probability at least 1 − δ,
26
8
Theorem 5.8 [16] Let G, G0 be simple graphs, and let ε > 0. Set k = d2c2 /ε e, and assume
2
that for every simple graph F on at most k nodes, we have |t(F, G) − t(F, G0 )| < 2−2k . Then
δ¤ (G, G0 ) ≤ ε.
These results allow us to characterize convergent graph sequences:
Theorem 5.9 [16] A graph sequence is convergent if and only if it is Cauchy in the δ¤ metric.
Let F denote the metric space of all finite, simple graphs with the δ¤ metric. It follows
from the above that the completion X of F can be described as follows. Consider the space of
all functions in W0 , with the distance
¯Z ¯
¯ ¯
d¤ (U, W ) = sup ¯ W (x, y)dx dy ¯.
S,T ⊆[0,1] S×T
27
Proposition 5.10 [16] A simple graph parameter is testable if and only if any of the following
equivalent conditions holds.
(a) For every convergent graph sequence (Gn ), the limit of f (Gn ) exists as n → ∞ (continuity
at infinity).
(b) For every ε > 0 there is an integer k0 such that for every k > k0 and every graph G on
at least k nodes, a random set X of k nodes of G satisfies
|f (G) − E(f (G[X]))| < ε.
(c) f is “essentially” uniformly continuous with respect to the δ¤ distance in the following
sense: For every ε > 0 there is an ε0 > 0 and a positive integer n0 so that if G1 and G2 are two
graphs with |V (Gi )| ≥ n0 and δ¤ (G1 , G2 ) < ε0 , then |f (G1 ) − f (G2 )| < ε.
(d) There exists a functional fb(W ) on W0 that is continuous in the rectangle norm, and
extends f in the sense that |fb(WG ) − f (G)| → 0 if |V (G)| → ∞.
If we want to use (c) to prove that a certain invariant is testable, then the complicated
definition of the δ¤ distance may cause a difficulty. So it is useful to show that (c) can be
replaced by a weaker condition, which consists of three special cases of (c):
Supplement 5.11 [16] The following three conditions together are also equivalent to testability:
(c.1) For every ε > 0 there is an ε0 > 0 such that if G and G0 are two simple graphs on the
same node set and d¤ (G, G0 ) ≤ ε0 then |f (G) − f (G0 )| < ε.
(c.2) For every simple graph G, f (G(m)) has a limit as m → ∞, where G(m) denotes the
graph obtained from G by replacing each node by m twins.
(c.3) f (G(m)) − f (G) → 0 if |V (G)| → ∞.
Some of the implications between conditions (a)-(d) in the Theorem are easy, some others
follow from the general theory sketched above. To illustrate the use of this theorem, let us
consider the density of the maximum cut:
eG (S, V (G) \ S)
f (G) = max ¡n¢ .
S⊆V (G)
2
This parameter is testable: this fact is nontrivial, and its first proof by Goldreich, Goldwasser
and Ron [31] was one of the first important results in Property Testing. Of the conditions above,
(a) and (b) are more or less a reformulation of testability.
Condition (c), on the other hand, is easy to verify in this case. Let ε > 0, and let G1 and
G2 be two graphs for which δ¤ (G1 , G2 ) < ε. Let us blow up the points of each graph so that
the new graphs G01 and G02 have the same number N of points and they can be overlaid so that
d¤ (G01 , G02 ) < ε. For any subset S ⊆ V (G01 ) = V (G02 ), we have
|eG01 (S, V (G) \ S) − eG02 (S, V (G) \ S)| < εN 2 ,
and hence
|f (G01 ) − f (G02 )| < ε.
To complete the proof, one must argue that |f (G0i ) − f (Gi )| is small, which is not hard (and is
not given here).
Condition (d) can also be directly verified: we can extend the definition of a maximum cut
to functions W ∈ W0 in a natural way:
X Z Z
fb(W ) = W (x, y) dx dy.
S⊆[0,1] S [0,1]\S
Then it is easy to check that this functional is continuous in the norm k.k¤ , and extends f .
28
5.4.2 Property testing
Instead of estimating a numerical parameter, we may want to determine some property of G:
Is G 3-colorable? Is it connected? Does it have a triangle? The answer will of course have
some uncertainty. A precise definition was given by Goldreich, Goldwasser and Ron [31], who
also proved several fundamental results about this problem. There are in fact several ways to
formalize this question. For this exposition, we take the following.
As for parameter testing, we specify an ε > 0 and want to find a positive integer k (depending
on ε) with the following property. We select k independent uniform random nodes of G, and
from the subgraph induced by them we compute a guess X ∈ {Y ES, N O}. Ideally, we want
that if the graph does have the property, our guess should be YES with large probability, and
if the graph does not have the property, then we should guess NO with large probability. But
this is too much to ask. Suppose that we have two graphs that can be obtained from each other
by changing a very tiny fraction of the edges, but one has the property, and the other does not.
Then a sample induced subgraph from one graph will have almost the same distribution as a
sample (of the same size) from the other, and so our guess for the two graphs will be almost the
same.
A graph property P is testable, if our guess satisfies the following: if a graph has the property
in a robust way so that changing at most εn2 edges in any way it still has the property, we must
guess YES with probability at least 1 − ε; similarly, if changing at most εn2 edges in any way
the obtained graph does not have the property, then we must guess NO with probability at least
1 − ε; in the grey area inbetween, we can guess arbitrarily. In other words, whatever we guess,
we should be able to change at most εn2 edges to make out guess right.
Remark 5.12 While changing a small number of edges is the most natural way to formalize
that there is a “nearby” graph with the property, we have seen that the rectangular distance is
often better behaved. One is tempted to define that a property is weakly testable, if for every
ε > 0 there is a k such that for every graph G on at least k nodes we can make a guess based on
a sample induced subgraph of size k such that with probability at least 1 − ε, there is a graph G0
such that d¤ (G, G0 ) < ε and our guess is right for G0 . But this notion is not very interesting due
to the fact that every graph property is weakly testable. This is an easy application of Theorem
5.7 above.
Among the many results on graph property testing, let us quote a surprisingly general recent
result of Alon and Shapira [4]. A graph property is called hereditary, if it is inherited by induced
subgraphs.
Theorem 5.14 [4] The distance from a hereditary graph property is testable.
This theorem has a reasonably short proof using graph limits, see [47].
29
where one wants to study homomorphisms from Gn into a small graph H. This naturally raises
the question whether suitably normalized homomorphism numbers hom(Gn , H) converge if Gn
is convergent from the left.
Consider a graph G on n nodes, and a softcore graph H on q nodes. (Recall that H is called
softcore if all edge weights are strictly positive.) Then log hom(G, H) typically grows like the
number of edges in G. For dense graphs, it therefore seems natural to consider the quantity
1
log hom(G, H). (37)
n2
This quantity is closely related to a weighted maximum cut problem on G. Indeed, let B =
(Bij )1≤i,j≤q be a symmetric matrix with real entries. We then define the ground state energy of
the “model” B on the graph G as
1 X
E(G, B) = 2 max Bφ(u)φ(v) . (38)
n φ: V (G)→[q]
uv∈E(G)
For large n, the quantity defined in (37) is well approximated by the ground state energy.
Lemma 6.1 Let G be an unweighted graph with n nodes, and let H be a softcore graph with
α(H) = 1. Let α = mini αH (i). Let B be the matrix of logarithms of the edgeweights of H. Then
log(1/α) log hom(G, H)
E(G, B) − ≤ ≤ E(G, B).
n n2
(Note that the upper bound on log hom(G, H)/n2 does not depend on the nodeweights of H,
and in the lower bound, only the error term does.)
Proof. Note that 2
max homφ (G, H) = en E(G,B)
.
φ: V (G)→V (H)
Thus we have
X X 2 2
hom(G, H) = αφ homφ (G, H) ≤ αφ en E(G,B)
= en E(G,B)
,
φ φ
and 2
hom(G, H) ≥ max αφ homφ (G, H) ≥ αn en E(G,B)
.
φ
From these bounds the Lemma follows. ¤
Example 6.2 As a special case, consider the graph H consisting of two nodes of weight 1/2,
with loops of weight 1 at each, and connected by an edge of weight e (the base of the natu-
ral logarithm). Then n2 E(G, H 0 ) is the size of the maximum cut in G, which we denote by
MAXCUT(G). By Lemma 6.1,
log hom(G, H) − n ≤ MAXCUT(G) ≤ log hom(G, H).
Since MAXCUT(G) ≥ |E(G)|/2, this gives a very good approximation of the maximum cut.
More generally, for a fixed H, computing E(G, H) is a weighted multiway cut problem.
Our next theorem states that convergence from the left implies convergence of the ground
state energies. Its proof uses the notion of fractional partitions, a notion we will need at
several places in this section: a fractional partition of a set V into q classes (briefly, a fractional
q-partition) is a q-tuple ρ = (ρ1 , . . . , ρq ) of functions from V to [0, 1] such that for all x ∈ V , we
have ρ1 (x) + · · · + ρq (x) = 1. (Later in this section, we will apply this definition also to the case
when V = [0, 1], when we tacitly assume that the functions ρi are measurable.) We will use the
notation Pdq for the set P of probability distributions on [q], i.e., the set of vectors a = (a1 , . . . , aq )
such that ai ≥ 0 and i ai = 1, and the notation Symq for the set of q × q symmetric matrices.
30
Theorem 6.3 [16] Let q be a positive integer, let B ∈ Symq , and let Gn be a convergent sequence
of simple graphs. Then E(Gn , B) is a convergent sequence.
As a simple illustration of the usefulness of Proposition 5.10 and its Supplement 5.11, we
sketch the proof. Indeed, let us consider the quantity
1 X
Eφ (G, B) = Bφ(u)φ(v)
n2
uv∈E(G)
where φ is a map from V (G) to [q]. Identifying these maps with partitions P = (V1 , . . . , Vq ) of
V (G) and using the definition of the d¤ metric, we immediately see that
whenever G and G0 are simple graphs on the same set of nodes, which verifies the condition (c.1)
of Supplement 5.11.
The condition (c.2) is not hard to verify either: Define the energy of a fractional q-partition
ρ of V (G) as
1 X X
Eρ (G, B) = 2 ρi (u)ρj (v)Bij . (39)
n
uv∈E(G) i,j∈[q]
Then E(G(k), B) = maxρ Eρ (G, B) where the maximum runs over all fractional partitions such
that all ρi (u)’s are multiples of 1/k. We claim that the maximum is attained for fractional
partitions which are {0, 1} valued, so that E(G(k), B) = E(G, B) for all k. It is clear from the
above description that E(G(k), B) ≥ E(G, B), so the only thing we need to show is a matching
upper bound on E(G(k), B). Consider a fractional partition ρ maximizing Eρ (G, B), and a fixed
node u ∈ V (G). Then Eρ (G, B) is a linear function of the vector (ρ1 (u), . . . , ρq (u)), implying
that the maximum over all these vectors is obtained at a vertex of the simplex Pdq . Applying
this procedure to all nodes u ∈ V (G), this gives the desired inequality E(G(k), B) ≤ E(G, B),
and hence the equality of E(G(k), B) and E(G, B) for all k. The condition (c.2) therefore holds
trivially. The verification of condition (c.3) is even easier.
Let us finally note that the limiting ground state energy of a convergent sequence can be
expressed explicitly. Indeed, for W ∈ W0 and a symmetric q × q matrix B, let
q Z
1 X
E(W, B) = max Bij ρi (x)ρj (y)W (x, y) dx dy, (40)
ρ 2 i,j=1 [0,1]2
where the maximum runs over fractional q-partitions of [0, 1]. Then we have the following
theorem.
Theorem 6.4 [16] Let (Gn ) be a convergent sequence, and let W ∈ W0 be its limit. Let H be a
softcore graph, and let B be the matrix of logarithms of the edgeweights of H. Then
log hom(Gn , H)
lim = lim E(Gn , B) = E(W, B).
n→∞ |V (Gn )|2 n→∞
31
where n is the number of nodes in G and n1 G is obtained from G by multiplying the edge weights
of G by n1 .
To discuss the convergence of the free energy, we need the notion of the entropy of fractional
partitions. Let ρ = (ρ1 , . . . , ρq ) be a fractional partition of a finite set V . Then the entropy of ρ
is defined as
q
1 XX
H(ρ) = − ρi (u) log ρi (u).
|V | i=1
u∈V
Theorem 6.5 [16] Let q be a positive integer, let H be a softcore weighted graph, and let (Gn )
b n , H) is a convergent sequence.
be a convergent sequence of simple graphs. Then P(G
If Gn converges to a function W ∈ W0 , the limiting free energy can again be expressed as
an explicit function of W . To this end, let us define the entropy of a fractional q-partition ρ of
[0, 1] as
Z 1X q
H(ρ) = − ρi (x) log ρi (x) dx,
0 i=1
where the maximum goes over all fractional q-partitions of [0, 1], hi = log αi (H) and Bij =
log βij (H).
Theorem 6.6 [16] Let (Gn ) be a convergent sequence of simple graphs, and let W ∈ W0 be its
limit. Let H be a weighted softcore graph. Then
1 1
lim log hom( Gn , H) = P(W, H).
n→∞ n n
This theorem can be proved using Proposition 5.10 and its Supplement 5.11, in a way similar
to (but more involved than) the proof of Theorem 6.3.
32
Here eG (Vi , Vj ) denotes the number of edges uv ∈ E(G) with u ∈ Vi and v ∈ Vj ; note that we
allow i = j, in this case e(Vi , Vi ) is twice the number of edges spanned by Vi .
We denote by Sbq (G) the set of factors of G with q nodes. Note that by our definition the
factors are labeled graphs, but since permuting the nodes of a factor also gives a factor, we would
not loose information by forgetting the labeling. We can consider Sbq (G) as a subset of Rq×(q+1) .
We extend these definitions to functions U ∈ W0 . Let P be a q-partition of [0, 1]; we then
define a weighted graph H(U, P) on V (H(U, P)) = [q], where node i has weight
αi (H(U, P)) = λ(Vi ),
and edge ij has weight
Z
1
βij (H(U, P)) = U (x, y) dx dy.
λ(Vi )λ(Vj ) Vi ×Vj
(If λ(Vi )λ(Vj ) = 0, then we define βij (H(U, P)) = 0.) We call the graph H(U, P) a factor of U ,
and use the symbol Sbq (U ) to denote the set of all factors of U with q nodes.
Note that the knowledge of the factors of G is enough to recover the ground state energies of
G. Indeed, in terms of the factors of G, the ground state energy defined in (38) can be expressed
as
Xq
E(G, B) = max ai aj Bij Xij , (43)
(a,X)∈Sbq (G) i,j=1
33
Lemma 6.7 [16] For every U ∈ W0 , the set Sq (U ) is the closure of Sbq (U ).
(The two sets are not equal in general.)
For every weighted graph G, Sq (G) is again a closed connected set. Obviously, Sq (G) contains
b
Sq (G), but it is not its closure in general (since the latter is a finite set). It is not hard to see
that for every weighted graph G,
Clearly Sbq (G) is a finite subset of these infinite sets. But it can be shown that it is not much
smaller:
Lemma 6.8 [16] If c is the largest nodeweight in G, then for every H ∈ Sq (G) there is an
√
H 0 ∈ Sbq (G) such that δ¤ (H, H 0 ) ≤ 4q 2 c.
Most of the time, we will work with the fractional versions, which are much easier to handle.
for every i ∈ [q]. (In other words, we prescribe the proportions of elements of S mapped onto
each i ∈ [q], as closely as possible.) For a simple graph G and a softcore graph H with node
weights one, we then introduce microcanonical homomorphism numbers
X
homa (G, H) = homφ (G, H).
φ∈aV (G)
The microcanonical ground state energies and free energies of the model B on a graph G with n
nodes are then defined as
1 X
Eba (G, B) = 2 max Bφ(i),φ(j) , (46)
n φ∈a V (G)
ij∈E(G)
and
¡ ¢
ba (G, B) = 1 log homa 1 G, H
P (47)
n n
respectively.
The ground state energy Êa contains a number of important graph parameters as special
cases. If µ ¶ µ ¶
α 1 0
q = 2, a = and B = (48)
1−α 0 0
then Ea (G, B) corresponds to the densest subgraph on α|V (G)| nodes. If
µ ¶ µ ¶
1/2 0 −1
q = 2, a = and B = (49)
1/2 −1 0
34
then −Ea (G, B) is the minimal bisection; if we replace here B by −B, then Ea (G, B) is the
maximal bisection. For q > 2, we get multiway cut problems in a similar way.
The free energy P̂a is a finer measure. For example, in the case (48), Pa (G, B) will pick
out the density of an induced subgraph that is not necessarily the maximum, but for which the
number of induced subgraphs with this density is large, at the cost of some loss in density.
The use of (45) is cumbersome and in some cases it leads to unpleasant discontinuities; it
will be much more convenient to work with a fractional version. We formulate our definition for
a weighted graph G. Then for all a ∈ Pdq and B ∈ Symq we define
1 X X
Ea (G, B) = max αu (G)αv (G)βuv (G) ρi (u)ρj (v)Bij , (50)
ρ 2
u,v∈V (G) i,j∈[q]
We also extend the notion of microcanonical ground state energy to functions. For every W ∈
W0 , a ∈ Pdq , and B ∈ Symq , we define
q Z
1 X
Ea (W, B) = max Bij ρi (x)ρj (y)W (x, y) dx dy, (51)
ρ:αρ =a 2 [0,1]2
i,j=1
where ρ ranges over fractional q-partitions of [0, 1] with αi (ρ) = ai for all i. It is easy to see that
Using a result from [2], it follows easily that for every simple graph G on n nodes and every
matrix B,
6q 3
|Eba (G, B) − Ea (G, B)| ≤ kBk∞ . (52)
n
The notion of the microcanonical free energy for functions is defined analogously: for every
W ∈ W0 , a ∈ Pdq , and B ∈ Symq , we define
³ Z ´
1X
Pa (W, B) = max H(ρ) + Bij W (x, y)ρi (x)ρj (y) dx dy , (53)
ρ:α(ρ)=a 2 i,j [0,1]2
where ρ ranges over fractional q-partitions of [0, 1] with αi (ρ) = ai for all i.
dHf (A, B) = max(sup inf d(x, y), sup inf d(x, y)).
x∈A y∈B x∈B y∈A
It turns out that the following three types of information about two graphs G, G0 are equiv-
alent: (1) G and G0 are close in the δ¤ distance; (2) Sq (G) and Sq (G0 ) are close in the dHf
distance for every q up to a certain bound, and (3) |Ea (G, B) − Ea (G0 , B)| is small for all a ∈ Pdq
and B ∈ Symq for every q up to a certain bound.
The exact statement is the following:
35
Theorem 6.9 [16] (a) For two simple graphs G, G0 and a ∈ Pdq ,
dHf 0 0
¤ (Sq (G), Sq (G )) ≤ δ¤ (G, G )
Theorem 6.10 [16] Let (Gn ) be a sequence of simple graphs with |V (Gn )| → ∞. Then the
following are equivalent:
(a) For every simple graph F , t(F, Gn ) is convergent.
(b) The sequence (Gn ) is Cauchy in the δ¤ metric.
(c) For every q ≥ 1, the sequence Sq (Gn ) is Cauchy with respect to the Hausdorff metric dHf
1 .
(d) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , the sequence Ea (Gn , B) is a Cauchy sequence.
(e) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , the sequence P ba (Gn , B) is a Cauchy sequence.
Supplement 6.11 [16] The following two conditions are also equivalent to conditions (a)–(e)
in Theorem 6.10:
(f) For every k ≥ 1 there is an nk ≥ 1 such √ that if n, m > nk , then Gn and Gm have weak
0
Szemerédi
√ k-partitions P and P with error 2/ log k such that d¤ (H(Gn , P), H(Gm , P 0 )) <
2/ log k.
(g) For every k ≥ 1 there is an nk ≥ 1 such that if n, m > nk , then Gn and Gm have strong
Szemerédi k-partitions P and P 0 with error 1/ log∗ k such that d¤ (H(Gn , P), H(Gm , P 0 )) <
1/ log∗ k.
A convergent sequence has a limit W ∈ W0 by Theorem 4.6. The conditions in Theorem
6.10 can be rephrased to characterize the convergence to this limit:
Theorem 6.12 [16] For a sequence (Gn ) of simple graphs with |V (Gn )| → ∞, and for any
W ∈ W0 , the following are equivalent:
(a) For every simple graph F , t(F, Gn ) → t(F, W ).
(b) δ¤ (Gn , W ) → 0.
(c) For every q ≥ 1, Sq (Gn ) → Sq (W ) in the Hausdorff metric dHf
1 .
(d) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , Ea (Gn , B) → Ea (W, B).
(e) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , P ba (Gn , H) → Pa (W, B).
One can define Szemerédi partitions for the limit objects W ∈ W0 (see [45]), and then
formulate analogues of (f) and (g) in Supplement 6.11 describing the convergence to the limit.
36
7 Homomorphisms and extremal graph theory
7.1 Inequalities between homomorphism numbers
We have mentioned in the introduction that many results in extremal graph theory can be
expressed as algebraic inequalities between homomorphism densities. Every algebraic inequality
that holds for all finite graphs also holds for simple graph parameters in the closure T0 (and
of course vice versa). So for example, Goodman’s Theorem (1) is equivalent to saying that for
every simple graph parameter t ∈ T0 ,
counts (by inclusion-exclusion) the number of homomorphisms of F1 into G that map the edges
in E(F2 ) \ E(F1 ) onto non-adjacent pairs. This number is nonnegative, which implies that
X
(−1)|E(F2 )\E(F )| t(F, G) ≥ 0. (57)
F1 ⊆F ⊆F2
¡ ¢
If we fix V (F ) = V with |V | = k, then t(F ) can be considered as a setfunction on k2 elements.
Then (57) can be used to show that this setfunction is supermodular, i.e., it satisfies
t(GG) ≥ t(G)2 .
37
As special cases, we mention that for the path P3 on 3 nodes,
and
t(C4 ) ≥ t(P3 )2 . (60)
Using that Nr = t(Kr , G)nr /r!, this inequality can be expressed in the following simpler form:
For every t ∈ T0 ,
t(Kr ) t(Kr+1 )
r ≤ (r − 1) + 1. (62)
t(Kr−1 ) t(Kr )
This shows that (54) is a special case, and the derivation of (54) above can be extended.
The (simplest, asymptotic) case of the Kruskal-Katona theorem,
also follows. One uses multiplicativity to write it as t(K3 )2 ≤ t(K2 )t(K2 K2 ); by monotonicity, it
suffices to prove the stronger inequality t(K3 )2 ≤ t(K2 )t(C4 ); which then follows by considering
the following submatrix of M2 : µ ¶
t(K2 ) t(K3 )
.
t(K3 ) t(C4 )
38
Kruskal-Katona
LL-Simonovits
Goodman
Bollobás
Fisher
Figure 2: The region of possible edge densities and triangle densities. (The concave arcs are
distorted to make the qualitative properties more visible.)
graph with k − 1 equal color classes. The size of the two special color classes must be optimized
to make sure that the density of triangles is minimized, subject to the given edge density. For
the interval 1/2 ≤ t(K2 ) ≤ 2/3, the optimization yields the cubic curve
9y 2 − 18xy + 8x − 3x2 + 6x3 = 0.
This case was proved by Fisher [24]; a recent proof by Razborov [54] uses methods quite closely
related to those described in this paper. The conjecture was proved in [42] if t(K2 ) was in a
small neighborhood of any one of the special values 1 − 1/k. The general case is open.
One should add that this conjecture is all that is needed to describe T00 : it is easy to see that
between two points of T00 the plane on a vertical line, the whole interval is contained in T00 .
39
8.1 Convergence for graphs with bounded degree
Let (G1 , G2 , . . . ) be a sequence of graphs whose degrees are uniformly bounded by D. We say
that this sequence is locally convergent, or just convergent, if τ (F, Gn ) tends to a limit for every
connected graph F .
There is another way to define this. Given a graph G with all degrees bounded by D, and
a positive integer r, let SG (v, r) denote the neighborhood of node v with radius r. We consider
SG (v, r) as a rooted graph (where v is its root). For fixed D and r, there is a finite number of
possible neighborhoods, and we can make a statistic of these: we denote by νG (N, r) the fraction
of nodes v ∈ V (G) for which SG (v, r) ∼ = N . Then νG is a probability distribution on all possible
r-neighborhoods.
For bounded degrees, it is easy to see that the probability distributions νGn (·, r) tend to
a limit distribution for all r > 0 if and only if Gn converges in the sense defined above. The
convergence of the probability distributions νGn (·, r) can therefore be used as an alternative
characterization of convergent sequences (G1 , G2 , . . . ) of graphs with degrees bounded by D.
This notion of convergence was introduced implicitly by Aldous [1], and explicitly by Ben-
jamini and Schramm [17]. It was extended to the case of bounded average degree by Lyons
[49]. The limit object has several descriptions, the strongest is due to Elek [21]: A continuous
graphing is an infinite graph on [0, 1] that has the following structure: we take a finite number
of continuous measure preserving involutions φ1 , . . . , φN : [0, 1] → [0, 1], and connect every
x ∈ [0, 1] to every φi (x) (i = 1, . . . , N ) such that x 6= φi (x) by an edge.
Every graphing defines a probability distribution on countable graphs with bounded degree
with a specified root: we pick a uniform random point x in [0, 1], and consider the connected
component of the graphing containing x, with root x. This yields the description of the limit by
Benjamini and Schramm.
Example 8.1 Consider the sequence of cycles Cn . It is trivial that this is convergent, but the
numbers hom(Cn , K2 ) alternate between 0 and 2.
But in a sense parity is all that goes wrong. Let us consider the subsequence C2n of even
cycles. Then for any graph H,
Xk
hom(C2n , H) = λ2n
i ,
i=1
hom(C2n , H)1/(2n) → λ1 .
1
It is perhaps more usual to take the logarithm here, to get the sequence 2n log hom(C2n , H) (in
statistical physics, this parameter is called the free energy or pressure of the H-colorings of C2n ).
This sequence is convergent for every H, and the limiting parameter is log λ1 (H).
The sequence of odd cycles behaves similarly, but the limit parameter a(H) =
1
lim 2n+1 log hom(C2n+1 , H) is a bit more complicated to describe: a(H) is the logarithm of
largest eigenvalue of its non-bipartite components (a(H) = −∞ if H is bipartite).
40
Based on this example, let us consider a convergent sequence (Gn ) of graphs with bounded
degree, and ask for which graphs H the numbers
b n , H) = 1
P(G log hom(Gn , H)
|V (Gn )|
converge.
Before discussing this further, let us rewrite the right hand side using Möbius inversion. Let
X
ψ(G, H) = (−1)|V (G)|−|V | log hom(G[V ], H).
V ⊂V (G)
Then
X X ind(F, G)
log hom(G, H) = ψ(G[V ], H) = ψ(F, H), (65)
|V (F )|!
V ⊂V (G) F
where the sum goes over all finite simple graphs F and ind(F, G) is the number of embedding of
F into G as an induced subgraph. We thus have shown that
X τind (F, G)
b
P(G, H) = ψ(F, H) (66)
|V (F )|!
F
where
1
τind (F, G) = ind(F, G). (67)
|V (G)|
Recall that the numbers ind(F, G) can be obtained from the homomorphism numbers hom(F, G)
by Möbius inversion (see (14) and (15)). If Gn is convergent in the sense that the normalized
homomorphism numbers τ (F, Gn ) are convergent for all F , then the numbers τind (F, Gn ) are
convergent as well; let τind (F, Gn ) → τ (F ). One might therefore hope that convergence of the
sequence Gn implies convergence of the free energies P(G b n , H), with the limit given by the
infinite sum
X τ (F )
P(τ, H) = ψ(F, H). (68)
|V (F )|!
F
It is clear, however, that this cannot be true in general; an easy counterexample is the example
Gn = Cn discussed above. But it turns out that we can prove the convergence under suitable
conditions on H and the maximal degree D of the graphs Gn .
Theorem 8.2 [15] If (Gn ) is a sequence of graphs with all degrees bounded by a constant D,
b n , H) has a limit for every unweighted target graph H in which
and (Gn ) is convergent,¡ then P(G
1
¢
all degrees are at least 1 − 2D |V (H)|.
The conceptually most transparent proof of the theorem (with a slightly worse constant)
proceeds by proving uniform convergence of the expansion (68) using the method of cluster
expansions. As stated, the theorem can be proven using Dobrushin’s uniqueness theorem for
uniform H-colorings on Gn .
The cluster expansion proof also gives an analogue of Theorem 8.2 for weighted graphs (see
[15]) and allows to prove that convergence from the right for weighted graphs implies convergence
from the left. In fact, we only need convergence from the right for graphs H that are small
perturbations of completely looped complete graphs with edge weights 1 to conclude that Gn is
convergent from the left.
Theorem 8.3 [15] If (Gn ) is a sequence of graphs with all degrees bounded by a constant D,
and P(Gn , H) is convergent for every weighted graph H which is a looped complete graph with
all edgeweights arbitrarily close to 1, then (Gn ) is convergent.
41
References
[1] D.J. Aldous: Tree-valued Markov chains and Poisson-Galton-Watson distributions, in: Mi-
crosurveys in Discrete Probability (D. Aldous and J. Propp, editors), DIMACS Ser. Discrete
Math. Theoret. Comput. Sci. 41 (1998) Amer. Math. Soc., Providence, RI. (1998), 1–20.
[2] N. Alon, W. Fernandez de la Vega, R. Kannan and M. Karpinski: Random sampling and
approximation of MAX-CSPs, J. Comput. System Sci. 67 (2003), 212–243.
[3] N. Alon and A. Naor: Approximating the Cut-Norm via Grothendieck’s Inequality, preprint:
http://research.microsoft.com/research/theory/naor/homepage%20files/
cutnorm.pdf
[4] N. Alon and A. Shapira, Every monotone graph property is testable, Proc. of the 37th ACM
STOC, Baltimore, ACM Press (2005), to appear:
http://www.math.tau.ac.il/∼nogaa/PDFS/MonotoneSTOC.pdf
[5] E. Babson and D. Kozlov: Complexes of graph homomorphisms, arXive:
http://lanl.arxiv.org/abs/math.CO/0310056
[6] E. Babson and D. Kozlov: Proof of the Lovász Conjecture, arXive:
http://lanl.arxiv.org/abs/math.CO/0402395
[7] A. Barabasi and R. Albert: Emergence of scaling in random networks, Science 286 (1999)
509–512.
[8] B. Bollobás and O. Riordan: Mathematical results on scale-free random graphs, in: Hand-
book of graphs and networks, 1–34, Wiley-VCH, Weinheim, 2003.
[9] M. Biskup, C. Borgs, J.T. Chayes, L. Kleinwaks and R. Kotecký: Partition function zeros at
first-order phase transitions: A general analysis, Commun. Math. Phys. 251 (2004) 79–131.
[10] B. Bollobás, O.M. Riordan, J. Spencer, and G. Tusnády: The degree sequence of a scale-free
random graph process, Random Structures and Algorithms 18 (2001), 279–290.
[11] B. Bollobás: Relations between sets of complete subgraphs, in: Combinatorics, Proc. 5th
British Comb. Conf. (ed. C.St.J.A. Nash-Williams, J. Sheehan), Utilitas Math. (1975), 79–
84.
[12] B. Bollobás, C. Borgs, J. T. Chayes, and O. Riordan: Directed scale-free graphs, in: Pro-
ceedings of the 14th ACM-SIAM Symposium on Discrete Algorithms, 132–139, 2003.
[13] C. Borgs: Statistical Physics Expansion Methods for Combinatorics and Computer Science,
CBMS lecture notes (in preparation).
[14] C. Borgs and J. Chayes and L. Lovász: Unique limits of dense graph sequences (in prepa-
ration).
[15] C. Borgs, J. Chayes, J. Kahn, L. Lovász and V.T. Sós: Convergent sequences for sparse
graphs (in preparation).
[16] C. Borgs, J. Chayes, L. Lovász, V.T. Sós and K. Vesztergombi: Convergent sequences of
dense graphs (in preparation).
[17] I. Benjamini and O. Schramm: Recurrence of Distributional Limits of Finite Planar Graphs,
Electronic J. Probab. 6 (2001), no. 23, 1–13.
42
[18] M.-D. Choi: Tricks or Treats with the Hilbert Matrix, Amer. Math. Monthly 90 (1983),
301–312.
[19] F. Chung, R.L. Graham and R.M. Wilson: Quasi-random graphs, Combinatorica 9 (1989),
345–362.
[20] C. Cooper and A. Frieze: On a general model of web graphs, Rand. Struct. Alg. 22 (2003)
311–335.
[34] P. Hell and J. Nešetřil: Graphs and Homomorphisms, Oxford University Press, 2004.
[35] J. Komlós and M. Simonovits: Szemerédi’s Regularity Lemma and its applications in graph
theory, in: Combinatorics, Paul Erdos is Eighty (D. Miklos et. al, eds.), Bolyai Society
Mathematical Studies 2 (1996), pp. 295–352.
[36] T.D. Lee and C.N. Yang, Statistical theory of equations of state and phase transitions: II.
Lattice gas and Ising model, Phys. Rev. 87 (1952) 410–419.
[37] L. Lovász: Operations with structures, Acta Math. Hung. 18 (1967), 321-328.
43
[38] L. Lovász: Direct product in locally finite categories, Acta Sci. Math. Szeged 23 (1972),
319-322.
[39] L. Lovász: The rank of connection matrices and the dimension of graph algebras, Europ.
J. Combin., to appear:
http://research.microsoft.com/∼lovasz/homdim.pdf
[40] L. Lovász and A. Schrijver: Graph parameters and semigroup functions (manuscript).
[41] L. Lovász and M. Simonovits: On the number of complete subgraphs of a graph (M. Si-
monovits), in: Combinatorics, Proc. 5th British Comb. Conf. (ed. C.St.J.A.Nash-Williams,
J.Sheehan), Utilitas Math. (1976) , 439-441.
[42] L. Lovász and M. Simonovits: On the number of complete subgraphs of a graph II, in:
Studies in Pure Math., To the memory of P. Turán (ed. P. Erdös), Akadémiai Kiadó (1983),
459-495.
[43] L. Lovász and V.T. Sós: Generalized quasirandom graphs, preprint:
http://research.microsoft.com/∼lovasz/quasirandom4.pdf
[44] L. Lovász and B. Szegedy: Limits of dense graph sequences, Microsoft Research Technical
Report MSR-TR-2004-79:
ftp://ftp.research.microsoft.com/pub/tr/TR-2004-79.pdf
[45] L. Lovász and B. Szegedy: Szemerédi’s Lemma for the analyst, Microsoft Research Technical
Report MSR-TR-2005-90:
ftp://ftp.research.microsoft.com/pub/tr/TR-2005-90.pdf
[46] L. Lovász and B. Szegedy: Contractors and connectors of graph algebras Microsoft Research
Technical Report TR-2005-91:
ftp://ftp.research.microsoft.com/pub/tr/TR-2005-91.pdf
[47] L. Lovász and B. Szegedy: Graph limits and testing hereditary graph properties, Microsoft
Research Technical Report MSR-TR-2005-110:
ftp://ftp.research.microsoft.com/pub/tr/TR-2005-110.pdf
[48] L. Lovász and B. Szegedy: Moments of 2-variable functions (in preparation).
[49] R. Lyons: Asymptotic Enumeration of Spanning Trees, preprint:
http://mypage.iu.edu/∼rdlyons/pdf/est.pdf
[50] H. Mahmound, R. Smythe and J. Szymański: On the structure of plane-oriented trees and
their branches. Random Struct. Alg. 3 (1993) 255–266.
[51] J. Matoušek: Using the Borsuk-Ulam Theorem: Lectures on Topological Methods in Com-
binatorics and Geometry Springer, 2003.
[52] J.W. Moon, L. Moser: On a problem of Turn, Magyar Tud. Akad. Mat. Kutat Int. Kzl. 7
(1962), 283–286.
[53] D. J. de S. Price: A general theory of bibliometric and other cumulative advantage processes.
J. Amer. Soc. Inform. Sci., 27 (1976) 292–306.
[54] A. Razborov (unpublished).
44
View publication stats
[55] M. Simonovits and V.T. Sós: Szemerdi’s partition and quasirandomness, Random Structures
Algorithms 2 (1991), 1–10.
[56] M. Simonovits and V.T. Sós: Hereditary extended properties, quasi-random graphs and
induced subgraphs, Combinatorics, Probability and Computing 12 (2003), 319–344.
[57] M. Simonovits and V.T. Sós: Hereditarily extended properties, quasi-random graphs and
not necessarily induced subgraphs. Combinatorica 17 (1997), 577–596.
45