0% found this document useful (0 votes)

72 views46 pages

Counting Graph Homomorphisms

This document discusses the counting of graph homomorphisms and their applications across various fields such as extremal graph theory and statistical physics. It introduces concepts like homomorphism numbers, connection matrices, and the convergence of graph sequences, providing a framework for analyzing graph structures and properties. The paper also explores the implications of these concepts in property testing and the characterization of graphs based on their homomorphism profiles.

Uploaded by

Francisco Alan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views46 pages

Counting Graph Homomorphisms

Uploaded by

Francisco Alan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 46

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/226755686

Counting Graph Homomorphisms

Chapter · January 2006

DOI: 10.1007/3-540-33700-8_18

CITATIONS READS

190 481

5 authors, including:

Jennifer Chayes Lovász László

University of California, Berkeley Alfréd Rényi Institute of Mathematics
148 PUBLICATIONS 8,570 CITATIONS 507 PUBLICATIONS 48,868 CITATIONS

SEE PROFILE SEE PROFILE

Katalin - Vesztergombi
Eötvös Loránd University
58 PUBLICATIONS 1,911 CITATIONS

SEE PROFILE

All content following this page was uploaded by Katalin - Vesztergombi on 24 March 2014.

The user has requested enhancement of the downloaded file.

Counting Graph Homomorphisms
Christian Borgs∗, Jennifer Chayes†, László Lovász‡,
Vera T. Sós§, Katalin Vesztergombi¶
February 2006

Abstract
Counting homomorphisms between graphs (often with weights) comes up in a wide
variety of areas, including extremal graph theory, properties of graph products, partition
functions in statistical physics and property testing of large graphs.
In this paper we survey recent developments in the study of homomorphism numbers, in-
cluding the characterization of the homomorphism numbers in terms of the semidefiniteness
of “connection matrices”, and some applications of this fact in extremal graph theory.
We define a distance of two graphs in terms of similarity of their global structure, which
also reflects the closeness of (appropriately scaled) homomorphism numbers into the two
graphs. We use homomorphism numbers to define convergence of a sequence of graphs, and
show that a graph sequence is convergent if and only if it is Cauchy in this distance. Every
convergent graph sequence has a limit in the form of a symmetric measurable function in
two variables. We use these notions of distance and graph limits to give a general theory
for parameter testing.
The convergence can also be characterized in terms of mappings of the graphs into fixed
small graphs, which is strongly connected to important parameters like ground state energy
in statistical physics, and to weighted maximum cut problems in computer science.

1 Introduction
For two finite graphs G and H, let hom(G, H) denote the number of homomorphisms (adjacency-
preserving mappings) from G to H. Counting homomorphisms between graphs has many inter-
esting aspects.
(a) A large part of extremal graph theory can be expressed as inequalities between various
homomorphism numbers. For example, Turán’s Theorem for triangles follows from the inequality
(due to Goodman [30]):

hom(K1 , H)hom(K3 , H) ≥ hom(K2 , H)(2hom(K2 , H) − hom(K1 , H)2 ). (1)

One may wish to obtain a characterization of such inequalities.

(b) Homomorphism numbers characterize graphs: it was proved in [37] that if two graphs G
and G0 have the property that hom(F, G) = hom(F, G0 ) for every finite graph F , then G and
∗ Microsoft Research, One Microsoft Way, Redmond, WA 98052; [email protected]
† Microsoft Research, One Microsoft Way, Redmond, WA 98052; [email protected]
‡ Microsoft Research, One Microsoft Way, Redmond, WA 98052; [email protected]
§ Alfréd Rényi Institute of Mathematics, Budapest, Reáltanoda U. 13-15., H-1053 Budapest, Hungary;

[email protected]; Research supported in part by OTKA grants No. TO32236, TO39210, TO42750.
¶ Department of Computer Science, Eötvös Loránd University, Pázmány Péter sétány 1/C, H-1117 Budapest,

Hungary; [email protected]

1
G0 are isomorphic (one actually needs one additional condition, the condition that both graphs
are twin-free; see Section 2.1 for the definition of this notion). In other words, let us order all
finite graphs in a sequence (F1 , F2 . . . ), and assign to each graph G its profile, the (infinite)
sequence (hom(F1 , G), hom(F2 , G) . . . ); then this sequence characterizes G. (This fact can be
used to prove, for example, the cancellation property of strong multiplication of graphs).
It is often worthwhile to normalize the homomorphism numbers, and consider the homomor-
phism densities
hom(F, G)
t(F, G) = . (2)
|V (G)||V (F )|
(Thus t(F, G) is the probability that a random map of V (F ) into V (G) is a homomor-
phism.) Instead of the profile (hom(F1 , G), hom(F2 , G), . . . ), we can consider the scaled profile
(t(F1 , G), t(F2 , G), . . . ) of the graph G. Such a normalization was first introduced in [23].
(c) Partition functions of many models in statistical mechanics can be expressed as graph
homomorphism functions. For example, let G be an n × n grid, and suppose that every node of
G (every “site”) can be in one of two states, “UP” or “DOWN”. The properties of the system
are such that no two adjacent sites can be “UP”. A “configuration” is a valid assignment of
states to each node. The number of configurations is the number of independent sets of nodes
in G, which in turn can be expressed as the number of homomorphisms of G into the graph H
consisting of two nodes, ”UP” and ”DOWN”, connected by an edge, and with an additional loop
at ”DOWN”. To define the thermodynamic functions in physical models, one needs to extend
the notion of graph homomorphism to the case when the nodes and edges of H have weights
(see Section 2.1).
(d) Suppose that G is a huge graph, and we know the numbers hom(F, G) (exactly or ap-
proximately) for small graphs F . What kind of information can be derived about the global
structure of G? The long-standing Reconstruction Conjecture is equivalent to the assertion that
it is enough to know all numbers hom(F, G) with |V (F )| < |V (G)| in order to recover the iso-
morphism type of G. The fact that this is unsolved shows the difficulty of this kind of question,
but our interest here is in the case when much less is given: hom(F, G) is only known for very
small graphs F .
(e) This is closely related to an important area of computer science called Property Testing.
In this model, we have a huge graph about which we can obtain information only by taking a
small sample of the nodes and examine the subgraph induced by them. This is equivalent to
knowing the homomorphism densities (2) for graphs F of small size. What makes the theory
of Property Testing interesting is the fact that from such meager local information nontrivial
properties and parameters of the graph can be inferred.
(f) Increasing sequences of (sparse) graphs generated by some specific random rule of growth
have recently been used to model the Internet; see [8] and references therein. What are the
limiting properties of such graph sequences? To what extent can these properties be derived from
local observation (observing a neighborhood of bounded radius of a few randomly chosen nodes?
This question can be rephrased as follows: To what extent are these properties characterized by
the homomorphism numbers of smaller graphs into the modeling sequence?
The main setup of our studies is the following. If we are given a (large, usually simple) graph
G, we may try to study its local structure by counting the homomorphisms of various “small”
graphs F into G; and we can study its global structure by counting its homomorphisms into
various small graphs H (called “softcore” weighted graphs; see Section 2.1 for a definition of
softcore). So the scheme to keep in mind is

F −→ G −→ H. (3)

According to the above discussion, in the scheme (3), the study of the local structure of G
by “probing from the left with F ” is related to property testing, while the study of the global

2
structure of G by “probing from the right with H” is related to statistical physics. As in statistical
physics, the best choice of graphs H to “probe G from the right” is not simple unweighted graphs
but weighted graphs. Furthermore, besides counting (weighted) homomorphisms into H, it is
also useful to consider maximizing the weight of such homomorphisms, which is again related to
well-studied questions both in statistical physics and graph property testing.
The number hom(F, G), as a function of F with G fixed, is a graph parameter (a function of
graphs F invariant under isomorphism). Graph parameters arising this way were characterized in
[28]. The necessary and sufficient condition involves certain matrices, called connection matrices,
associated with the graph parameter (see Section 3 for the definition): These matrices must be
positive definite and must satisfy a rank condition. The semidefiniteness condition is in fact
familiar from statistical physics, where it is called reflection positivity.
The scaled profile does not determine the graph: if we “blow up” every node into the same
number of “twins”, then we get a graph with exactly the same scaled profile. It turns out that
this is all: any two graphs with the same scaled profile are obtained from one and the same
graph by blowing up its nodes in two different ways.
Now we come to our main question: What can be said about two graphs whose scaled profiles
are approximately the same? Which properties of a graph G are determined if we only know a
few of the numbers hom(F, G), and even these are only known approximately? This question
turns out to be very interesting and it leads to a number of results, and even more open problems,
leading to quasirandom and generalized quasirandom graphs, and connecting to the ”Property
Testing” research in computer science and to statistical physics.
There is a way to measure the “distance” of two graphs so that they are close in this distance
if and only if they have approximately the same scaled profile [16]. This distance has many nice
properties. On the one hand, important parameters like the triangle density or the fraction of
edges in the maximum cut are continuous (often even Lipschitz) functions in this metric. On
the other hand, a sufficiently large random subgraph of an arbitrarily large graph will be close
to the whole graph with large probability. This fact explains several results in the theory of
Property Testing. Szemerédi’s Regularity Lemma (at least in its weaker but more effective form
due to Frieze and Kannan [29]) can be rephrased as follows: for every ε > 0, all graphs with at
2
most 22/ε nodes form an ε-net in the metric space of all graphs. In our context, an ε-net is
defined as a set of weighted graphs such that, for every graph G, there exists a graph H in the
set which has at most distance ε from G.
Once we make the set of all graphs into a metric space, we can make it complete. Is there
any combinatorial meaning of the new points in the completion? Surprisingly, the answer is in
the affirmative, and in fact in more than one way [44, 45]. These limit points can be described
as symmetric measurable functions W : [0, 1]2 → [0, 1] modulo measure-preserving transforma-
tions, as reflection positive graph parameters, as random graph models satisfying some natural
compatibility conditions, or as probability distributions of countable graphs with natural sym-
metries (see Chapter 4).
It is an important property of this completion that it is compact, so that every infinite
sequence of graphs has a convergent subsequence. Several arguments in extremal graph theory
and elsewhere can be simplified by going to the limit and thereby getting rid of remainder terms.
We conclude this introduction by mentioning two related bodies of work. There are many in-
teresting question that concern the existence of graph homomorphisms rather than their number
(an example is 4-colorability of a graph), and such questions have been studied quite exten-
sively, especially by the Czech school. These results are described in the recent book by Hell
and Nešetřil [34]. The set of all homomorphisms between two graphs can be endowed with a
topological structure, which turns out to be an important tool in the study of chromatic number.
See the book of Matoušek [51], and also the recent papers of Babson and Kozlov [5, 6].
This paper is organized as follows. In Section 2 we define homomorphism numbers, including
homomorphisms into a measurable function, and describe the basic examples. In Section 3

3
we define connection matrices, study their rank, and use their semidefiniteness to characterize
homomorphism functions. We also describe an analogous (but much more difficult) edge-coloring
version. In Section 4 we define convergence of a sequence of dense graphs, and show that they
have interesting limit objects, which can be described as measurable functions, reflection positive
graph parameters, or very natural models of finite or countable random graphs. In Section 5, we
introduce a metric on graphs that corresponds to the above notion of convergence, and show its
connection with Szemerédi’s Regularity Lemma and graph property testing. Section 6 studies
homomorphisms from large graphs, rather than into large graphs, which leads to quantities of
both combinatorial and physical interest; we show that they too can be used to characterize
convergent graph sequences. Section 7 contains some applications to extremal graph theory. In
Section 8 we conclude with describing some analogous (but less complete) results for sequences
of graphs with bounded degree.

2 Homomorphism numbers
2.1 Unweighted and weighted graphs
A graph is simple if it has no loops or parallel edges. A graph parameter is a function defined on
finite graphs, invariant under isomorphisms. We’ll talk of a simple graph parameter if it is only
defined on simple graphs. Sometimes it is convenient to think of a simple graph parameter as a
function defined on all graphs with multiple edges (but no loops) that is invariant under adding
parallel edges. A graph parameter t is called multiplicative if t(G) = t(G1 )t(G2 ) whenever G is
the disjoint union of G1 and G2 . We say that a graph parameter is normalized if its value on
K1 , the graph with one node and no edge, is 1. (Note that if a graph parameter is multiplicative
and not identically 0, then its value on K0 , the graph with no nodes and edges, is 1. The graph
parameter t(·, G) introduced in the introduction is multiplicative and normalized for every graph
G.
Recall that for two (finite) simple graphs F and G, hom(F, G) denotes the number of homo-
morphisms (adjacency preserving maps) from F to G.
A weighted graph G is a graph with a weight αi (G) associated with each node i and a weight
βij (G) associated with each edge ij. We’ll assume (unless otherwise stated) that the weights
αi (G) are positive. The weights βij (G) will be real, and most often nonnegative. If the graph G
is understood from the context, we will also use the notation αi and βij .
An edge with weight 0 will play the same role as no edge between those nodes, so we could
assume that we only consider weighted complete graphs with loops at all nodes (but this is not
always convenient). A weighted graph is called softcore if it is a complete graph with loops at
each node, and every edgeweight is strictly positive. An unweighted graph is a weighted graph
where all the nodeweights and edgeweights are 1.
For a weighted graph G, we denote by α(G) the sum of its nodeweights. Often it will be useful
to divide all nodeweights by α(G), to get a weighted graph G b in which the sum of nodeweights
is 1.
Let F and G be two weighted graphs. To every map φ : V (F ) → V (G), we assign the weight
Y £ ¤βuv (F )
homφ (F, G) = βφ(u)φ(v) (G) (4)
uv∈E(F )

(here 00 = 1). We then define

X
hom(F, G) = αφ homφ (F, G), (5)
φ: V (F )→V (G)

4
where Y £ ¤αu (F )
αφ = αφ(u) (G) . (6)
u∈V (F )
£ ¤βuv (F )
(A little care is necessary, since the exponential βφ(u)φ(v) (G) may not be well defined;
but it well defined e.g. if the edge weights are positive. This problem will not arise in the cases
we consider.)
We’ll use this definition most often in the case when F is a simple unweighted graph, so that
Y
αφ = αφ(u) (G)
u∈V (F )

and Y
homφ (F, G) = βφ(u)φ(v) (G).
uv∈E(F )

An interesting case is when, in addition, α(G) = 1. Then the nodeweights in G define a

probability distribution on V (G), and the coefficient αφ is the probability of the random map φ
(if the images of the nodes of F are chosen independently). So hom(F, G) is the expectation of
homφ (F, G).
We can extend homomorphism densities (2) to the case when G is a weighted graph with
nodeweights αi and edgeweights βij by replacing |V (G)| by α(G):

hom(F, G) b
t(F, G) = = hom(F, G).
α(G)|V (F )|

Let F = {F1 , F2 , . . . } denote the set of (isomorphism types of) all simple finite graphs. To
every weighted graph G, we assign its hom-profile (or briefly profile), the (infinite) vector

hG = (hom(F1 , G), hom(F2 , G), . . . ) ∈ RF .

Recall that we define the scaled profile of G as the (infinite) vector tG = (t(F1 , G), t(F2 , G), . . . ) ∈
RF . The scaled profile does not determine the graph. For example, if G is an unweighted graph
and G0 is obtained from G by replacing every node by N independent nodes, then

tG0 = tG .

More generally, let G be a weighted graph and let u, v ∈ V (G) be twins, i.e., βuw (G) = βvw (G)
for every w ∈ V (G) (note that αu (G) may be different from αv (G)). Merging twins in a
weighted graph does not change its scaled profile. Furthermore, if we multiply all nodeweights
of a weighted graph by the same positive constant, then its scaled profile does not change. If we
merge twins as long as we can, we say that we have performed twin-reduction.

Proposition 2.1 [39] If two weighted graphs have the same scaled profile, then after twin-
reduction one can be obtained from the other by multiplying all nodeweights by the same positive
scalar.
One of our main concerns will be: if we only know a bounded number of entries of the scaled
profile of a graph G, and even this only approximately, to what degree is the graph determined?
From a graph-theoretic perspective, the following variations of homomorphism functions are
perhaps more important: Let inj(F, G) denote the number of homomorphisms that are injective
on the nodes, and ind(F, G), the number of embeddings as an induced subgraph. Finally, let
surj(F, G) denote the number of homomorphisms that are surjective on the nodes.
For weighted graphs, inj(F, G) and surj(F, G) are easily defined by restricting the sum in (5)
to sums over injective and surjective maps, respectively, but the definition of ind(F, G) requires

5
some care. Here we only consider the case where F is simple, and G is a weighted graph without
loops. We then define X
ind(F, G) = αφ indφ (F, G), (7)
φ: V (F )→V (G)

where the sum goes over all injective maps from V (F ) to V (G) and
Y Y
indφ (F, G) = βφ(u)φ(v) (G) (1 − βφ(u)φ(v) (G)), (8)
uv∈E(F ) uv∈E(F )

with F denoting the complement of F (again defined to be a simple graph).

Analogously to the hom-profile, we can define the inj-profile and ind-profile of a graph G. It
follows from the identities to be discussed below that any of the profiles determines the others. It
is obvious that the ind-profile determines the graph (look at the largest non-zero entry). It follows
that the inj-profile and hom-profile of a graph also determine the graph (up to isomorphism) [37].
For future reference, we also define the set of simple graph parameters which is the pointwise
closure of the set of all hom-profiles:

T0 = {t(·) : ∃ (Gn ) s.t. t(F ) = lim t(F, Gn ) ∀ finite F }.

n→∞

Here we restrict (Gn ) to be a sequence of simple graphs. The reason for the subscript on T is
that, in our paper [16], we will consider a more general class of graph parameters, which will be
denoted by T . Note also that in [44], the authors used T rather than T0 to denote the smaller
class considered here.

2.2 Simple properties

There are some simple identities that hold for homomorphism numbers. If F is the disjoint union
of two graphs F1 and F2 , then

hom(F, G) = hom(F1 , G)hom(F2 , G). (9)

If F is connected, and G is the disjoint union of two graphs G1 and G2 ,then

hom(F, G) = hom(F, G1 ) + hom(F, G2 ). (10)

So in a sense it is enough to study homomorphisms between connected graphs.

For two simple graphs G1 , G2 , we define their categorial product G1 × G2 as the graph with
node set V (G1 ) × V (G2 ), in which (i1 , i2 ) is connected to (j1 , j2 ) (i1 , j1 ∈ V (G1 ), i2 , j2 ∈ V (G2 ))
if and only if i1 j1 ∈ E(G1 ) and i2 j2 ∈ E(G2 ). The definition can be extended to weighted graphs
(possibly with loops) by defining the weight of the node (i1 , i2 ) as the product of the weights of
the nodes i1 and i2 , and the weight of the edge (i1 , i2 )(j1 , j2 ) as the product of the weights of
the edges i1 j1 and i2 j2 . For this product, we have the identity

hom(F, G1 × G2 ) = hom(F, G1 ) · hom(F, G2 ). (11)

What about hom(F1 × F2 , G)? There is no identity for this number in terms of the notions
introduced so far, but there is one if one also introduces the operation of exponentiation (which
we do not discuss here; see [37]).
For a fixed graph G, identity (9) gives an algebraic relation between the entries of its hom-
profile. It was proved by Whitney [62] that there are no other algebraic relations between these
entries valid for all graphs G. A slightly stronger result was proved in [23]: the projection of T0
to the coordinates corresponding to any finite set of connected graphs is full-dimensional. This
excludes any other kind of equations (e.g. exponential) between these numbers.

6
There are simple identities relating homomorphism numbers with the injective and induced
versions. In order to spare the reader from separate provisos for each relation, we restrict
ourselves to the case where both F and G are simple graphs. If Θ is any equivalence relation on
V (F ), then we denote by F/Θ the graph obtained by identifying nodes that belong to the same
class of Θ. Note that this may create loops and parallel edges.
We have some easy relations:
X
hom(F, G) = inj(F/Θ, G) (12)
Θ

and X
inj(F, G) = ind(F 0 , G), (13)
F 0 ⊃F
where the sum runs over graphs F 0 ⊃ F with the same node set.
From these, we can get reverse relations by Möbius inversion (or inclusion-exclusion):
X 0
ind(F, G) = (−1)|E(F )\E(F )| inj(F 0 , G), (14)
F 0 ⊃F

and X
inj(F, G) = µ(Θ)hom(F/Θ, G), (15)
Θ
where the last sum runs over equivalence relations and
k ³
Y ´
µ(Θ) = (−1)(|A|−1) (|A| − 1)! ,
A∈Θ

with the product running over all classes A ∈ Θ.

We have the following relations describing complementation (as an operation from simple
graphs to simple graphs):
ind(F, G) = ind(F , G), (16)
X
|E(F 0 )| 0
inj(F, G) = (−1) inj(F , G) (17)
F 0 ⊂F
and X 0
hom(F, G) = (−1)|E(F )| hom(F 0 , G). (18)
F 0 ⊂F

2.3 Examples of homomorphism functions

Example 2.2 (Stars and degrees) Let Sk denote the star with k nodes. Then for any graph
G on n nodes,
n
X
hom(Sk , G) = dk−1
i , (19)
i=1

where d1 , . . . , dn are the degrees of G. Hence hom(Sk , G)1/(k−1) tends to the maximum degree
of G as k → ∞.

Example 2.3 (Cycles and eigenvalues) Let Ck denote the cycle on k nodes, and again let
G be any graph on n nodes. Then
n
X
hom(Ck , G) = λki , (20)
i=1

where λ1 , . . . , λn are the eigenvalues of the adjacency matrix of G. Hence hom(C2k , G)1/(2k)
tends to the largest eigenvalue of G as k → ∞.

7
Example 2.4 (Independent sets) Let H be the graph on two nodes, with an edge connecting
the two nodes and a loop at one of the nodes. Then for every simple graph G, hom(G, H) is the
number of independent sets of nodes in G.

Example 2.5 (Colorings) It is easy to see that hom(G, Kq ) is the number of colorings of the
graph G with q colors. It is well known that for a fixed G, this number is a polynomial in q,
called the chromatic polynomial. The chromatic polynomial defines a graph invariant for every
complex number q, but this cannot be expressed as the number of homomorphisms into any
graph unless q is a nonnegative integer [27, 28] (cf. Example 3.4).
It is often useful to consider homomorphisms into a fixed graph H as generalized colorings,
where the colors are the nodes of H, and every edge of H imposes a constraint on the coloring
that these two colors cannot be used at adjacent nodes.

Example 2.6 (Maximum cut) Let H denote the looped complete graph on two nodes,
weighted as follows: the non-loop edge has weight 2; all other edges and nodes have weight
1. Then for every simple graph G with n nodes,

log2 hom(G, H) − n ≤ MaxCut(G) ≤ log2 hom(G, H).

where MaxCut(G) denotes the size of the maximum cut in G. So unless G is very sparse,
log2 hom(G, H) is a good approximation of the maximum cut in G.

Example 2.7 (Random graphs) Let G = G(n, p) be a random graph with n nodes and edge-
density p. Then for every simple graph F with k nodes,

E(hom(F, G)) = (1 + o(1))nk p|E(F )| (n → ∞).

By a straightforward application of high concentration results, it follows that hom(F, G) is very

close to its expectation with large probability.

Example 2.8 (Partition functions of the Ising model) Let G be any simple graph, and
let T > 0, h ≥ 0, and J be three real parameters. Let H be the looped complete graph on
two nodes, denoted by + and −, weighted as follows: α+ = eh/T , α− = e−h/T , β++ = β−− ,
β+− = β−+ , and β++ /β+− = e2J/T . Then hom(G, H) is the partition function of the Ising
model on the graph G at temperature T with coupling J in external magnetic field h.

2.4 Homomorphisms into measurable functions

The following definition from [28] (which will play an important role later on) generalizes the
homomorphism function. Every bounded function W : [0, 1]2 → R defines a graph parameter
as follows: For a finite graph F on k nodes, let
Z Y
t(F, W ) = W (xi , xj ) dx1 . . . dxk .
[0,1]k ij∈E(F )

(We can think of the interval [0, 1] as the set of nodes, and of the value W (x, y) as the weight
of the edge xy.) While this definition is meaningful for all graphs F , we will mostly use it for
simple graphs.
It is easy to see that for every weighted graph G, the graph parameter t(·, G) is a special case.
We may assume that V (G) = {1, . . . , n} and α(G) = 1. Define a function WG : [0, 1]2 → [0, 1]
as follows. For (x, y) ∈ [0, 1]2 , let a and b be determined by

α1 (G) + · · · + αa−1 (G) ≤ x < α1 (G) + · · · + αa (G),

α1 (G) + · · · + αb−1 (G) ≤ y < α1 (G) + · · · + αb (G),

8
and let
WG (x, y) = βab (G).
(Informally, WG is obtained by replacing the (i, j) entry in the weighted adjacency matrix of G
by a rectangle of size αi × αj , and define the function value on this square as βij .) Then

t(F, G) = t(F, WG )

for every finite simple graph G.

Example 2.9 For an undirected simple graph F , let eul(F ) denote the number of eulerian
orientations of F (i.e., orientations in which every node has the same outdegree as indegree). By
Euler’s theorem, eul(F ) = 0 if and only if F has a node with odd degree.
It can be shown [44] that this graph parameter can be represented in the form t(·, W ), where

W (x, y) = 2 cos(2π(x − y)).

On the other hand, it follows e.g. from Theorem 3.6 below that eul is not of the form hom(·, G)
with any finite weighted graph G.

3 Connection matrices
3.1 The connection matrix of a graph parameter
A k-labeled graph (k ≥ 0) is a finite graph in which k nodes are labeled by 1, 2, . . . k. Two
k-labeled graphs are isomorphic, if there is a label-preserving isomorphism between them. We
denote by Kk the k-labeled complete graph on k-nodes, and by Ok , the k-labeled graph on k
nodes with no edges.
Let G1 and G2 be two k-labeled graphs. Their product G1 G2 is defined as follows: we take
their disjoint union, and then identify nodes with the same label. Clearly this multiplication is
associative and commutative. For 0-labeled graphs, this notation is in line with our notation for
disjoint union.
The following construction is central to the theory of homomorphisms functions. Let f be
any graph parameter. For every integer k ≥ 0, we define the following (infinite) matrix M (f, k).
The rows and columns are indexed by isomorphism types of k-labeled graphs. The entry in the
intersection of the row corresponding to G1 and the column corresponding to G2 is f (G1 G2 ).
We call the matrices M (f, k) the connection matrices of the graph parameter f (see Figure 1).
For a simple graph parameter, the above construction causes trouble if we get multiple edges
when gluing the two graphs. In this case, we suppress the edge multiplicities in G1 G2 when
defining the entry corresponding to the pair (G1 , G2 ).

3.2 The rank of connection matrices

Connection matrices of a graph parameter are infinite matrices and their rank may be infinite.
However, the rank is quite often finite, and if so, this fact has interesting consequences. Let us
denote by rk(f, k) the rank of the k-th connection matrix of the graph parameter f .
We start with several examples of graph parameters for which the rank of connection matrices
is finite. The most important case for us will be when the graph parameter is defined as hom(·, H)
for some fixed weighted graph H. This will be discussed in detail in the next section.

Example 3.1 (Edges) Let e(G) = |E(G)| denote the number of edges in G. Then e(G1 G2 ) =
e(G1 ) + e(G2 ), and so M (e, k) is the sum of two matrices of rank 1. Thus M (e, k) has rank 2,
so rk(e, k) = 2 for all k.

9
k=2: ...

...

Figure 1: A small part of the connection matrix for k = 2. The matrix entries are obtained by
evaluating the parameter on the graph shown.

If we restrict e(G) to simple graphs G to get a simple graph parameter e0 , then the situation
is more complicated: we have
e0 (G1 G2 ) = e0 (G1 ) + e0 (G2 ) − e0 (G1 ∩ G2 ).
¡ ¢
Rewriting e0 (G1 ∩ G2 ) as x(G1 )T x(G2 ) where x(G) is the k2 -dimensional vector with entries
xij (G) = 1 if G contains an edge joining the labeled vertices i and j and xij (G) = 0 otherwise,
¡ ¢
we see that the matrix whose (G1 , G2 ) entry is e0 (G1 ∩G2 ) has rank k2 , implying that rk(e0 , k) ≤
¡k¢
2 + 2. One can check that this is the exact value.

Example 3.2 (Subgraphs) Let subg(G) denote the number of spanning subgraphs of G, i.e.,
subg(G) = 2e(G) . Then subg(G1 G2 ) = subg(G1 )subg(G2 ), and so M (subg, k) has rank 1. Thus
rk(subg, k) = 1 for all k.
Again, the version when we only consider simple graphs is more complicated: Let subg0 (G)
denote this simple graph parameter. Then
subg0 (G1 )subg0 (G2 )
subg0 (G1 G2 ) = .
subg0 (G1 ∩ G2 )
The first two factors do not change the rank, and the rows of the matrix given by the second
factor are determined by the edges induced by the labeled nodes, so it has only 2(2) different
k

rows, implying that rk(subg0 , k) ≤ 2(2) . Again one can check that this is the exact value.
k

Example 3.3 (Matchings) Let pmatch(G) denote the number of perfect matchings in the
graph G. It is trivial that pmatch(G) is multiplicative. We claim that its node-rank-connectivity
is exponentially bounded:
rpmatch (k)≤2k .
Let G be a k-labeled graph, let X ⊆ [k] = {1, . . . , k}, and let pmatch(G, X) denote the
number of matchings in G that match all the unlabeled nodes and the nodes with label in X,
but not any of the other labeled nodes. Then we have for any two k-labeled graphs G1 , G2
X
pmatch(G1 G2 ) = pmatch(G1 , X1 )pmatch(G2 , X2 ).
X1 ∩X2 =∅, X1 ∪X2 =[k]

10
This can be read as follows: The matrix M (pmatch, k) can be written as a product N T W N ,
where N has infinitely many rows indexed by k-labeled graphs, but only 2k columns, indexed
by subsets of [k],
NG,X = pmatch(G, X),
and W is a symmetric 2k × 2k matrix, where
(
1 if X1 = [k] \ X2 ,
WX1 ,X2 =
0 otherwise.

Hence the rank of M (pmatch, k) is at most 2k (it is not hard to see that in fact equality holds).

Example 3.4 (Chromatic polynomial) We have seen that the number of q-colorings is a
special case of homomorphism functions. This number is the evaluation of the chromatic poly-
nomial chr(G; x) at nonnegative integers q. What about evaluations at other values? It turns
out that these evaluations violate both conditions in Theorem 3.6 below [27]. For every fixed x,
this is a multiplicative graph parameter. To describe its rank-connectivity, we need the following
notation. For k, q ∈ Z+ , let Bk,q denote the number of partitions of a k-element set into at most
q parts. So Bk = Bk,k is the k-th Bell number. With this notation,
(
Bk,x if x is a positive integer,
rk(chr, k) =
Bk otherwise.

Note that this is always finite: if x is a positive integer, then it is bounded by xk , but otherwise
it grows faster than ck for every c. Similar results can be derived for the Tutte polynomial,
where the exceptional values are the hyperbolas in the Tutte plane for which (x − 1)(y − 1) is a
positive integer.
Let f be a graph parameter that is not identically 0. Then f is multiplicative if and only if
f (K0 ) = 1 (K0 is the empty graph) and rk(f, 0) = 1. Every multiplicative graph parameter f
satisfies the inequality
rk(f, k + l) ≥ rk(f, k) · rk(f, l). (21)
In the most important special case (to be discussed below) when f is a homomorphism function,
a stronger version of property (21) holds: the sequence rk(f, k) is logconvex. We do not know
if this property holds for more general graph parameters.
Finiteness of the rank connectivity function has interesting algorithmic consequences:

Theorem 3.5 [27] If r(f, k) is finite for some k, then f can be computed in polynomial time
for graphs with treewidth at most k.

3.3 Connection matrices of homomorphisms

Homomorphism functions, which are our main concern, provide the most important class of
graph parameters for which connection matrices have finite rank. In fact, connection matrices
can be used to characterize these parameters, as the following theorem of Freedman, Lovász and
Schrijver shows.

Theorem 3.6 [28] The graph parameter f , defined on graphs with multiple edges but no loops,
is equal to hom(·, H) for some weighted graph H on q nodes if and only if
(a) M (f, k) is positive semidefinite and
(b) rk(f, k) ≤ q k for all k.

11
In terms of statistical physics, this theorem can be viewed as a characterization of partition
functions of models whose degrees of freedom sit on vertices (as opposed to the edge color-
ing models considered below). The property that M (f, k) is positive semidefinite is related to
the “reflection positivity” property in statistical physics, and we will call a graph parameter
reflection positive if M (f, k) is positive semidefinite for every k.
The proof of the necessity of the conditions in Theorem 3.6 is easy and it is instructive to
present it here. (The sufficiency is more involved, and the proof is based on algebraic consider-
ations.)
We need the following notation: For any k-labeled graph G and mapping φ : [k] → V (H),
let X αψ
homφ (G, H) = homψ (G, H), (22)
ψ: V (G)→V (H)
αφ
ψ extends φ

so that X
hom(G, H) = αφ homφ (G, H). (23)
φ: [k]→V (H)

For any two k-labeled graph G1 and G2 ,

homφ (G1 G2 , H) = homφ (G1 , H)homφ (G2 , H). (24)

The decomposition (23) writes the matrix M (f, k) as the sum of |V (H)|k matrices, one for each
mapping φ : [k] → V (H); (24) shows that these matrices are positive semidefinite and have
rank 1.
In the presence of condition (a), condition (b) in Theorem 3.6 can be replaced by the following
quite different type of condition. To formulate it, we need the notion of a quantum graph, defined
as a formal linear combination of graphs with real coefficients; but the condition concerns only
the existence of a single 2-labeled quantum graph [45]:
(c) There is a 2-labeled quantum graph g0 with the following property: if G is a 2-labeled
graph having no edge between the labeled nodes, and G0 denotes the graph obtained from G by
identifying the two labeled nodes, then f (g0 G) = f (G0 ).
In other words, attaching g0 at two nodes is effectively the same as identifying the two nodes
(this is why it is called a “contractor” in [45]).
Let us conclude with a discussion of the independence of the two conditions in Theorem 3.6.
In Example 3.3 we saw that the number of perfect matchings in a graph provides an example
for a graph parameter for which the rank of connection matrices grows simply exponentially.
This parameter is also multiplicative, so for k = 0 the connection matrix is positive semidefinite.
But it is easy to see that for k = 1, the submatrix indexed by K1 and K2 is
µ ¶
0 1
1 0

which is not positive semidefinite. Thus the number of perfect matchings cannot be represented
as a homomorphism function.
Recalling Examples 3.1 and 3.2, let us define the multigraph parameter f by f (G) =
0
1/subg0 (G) = 2−e(G ) , where G0 is obtained from G by removing duplicate edges. As in Ex-
ample 3.2, the rank of the connection matrix M (f, k) grows as 2(2) . It is further not hard
k

to check that M (f, k) is positive semidefinite. The graph parameter f is, in fact, the limit of
parameters of the form hom(·, H): take homomorphisms into a random graph H = G(n, 1/2),
with all nodeweights 1/n and all edge-weights 1. But the rank of its connection matrices is finite
but superexponential, so the parameter is not of the form hom(·, H).
This example also illustrates the importance of the condition that f is defined on graphs with
multiple edges for the validity of 3.6. Indeed, for a simple graph G (i.e., if G has no multiple

12
0
edges), f (G) = 2−e (G) can be represented as the number of homomorphisms into the graph
K1 (1/2), consisting of a single node with a loop, where the node has weight 1 and the loop has
weight 1/2.
The chromatic polynomial (Example 3.4) was another example whose connection matrices
had superexponential rank growth if the variable x was not a nonnegative integer. Here the
reflection positivity condition gives the same condition on x: the k-th connection matrix is
positive semidefinite if and only if either x is a positive integer or k ≤ x + 1. Thus M (chr, k) is
semidefinite for all k if and only if x is a positive integer.

3.4 The exact rank of connection matrices for homomorphisms

How good is the upper bound on the rank given in Theorem 3.6? It can be proved [39] that
equality holds in the “generic” case.
One reason why the upper bound is not always reached are twins. As remarked earlier,
twin-reduction in H does not change the numbers hom(G, H), and so it does not change the
connection matrices (but of course it decreases the upper bounds |V (H)|k ). So we may assume
that H is twin-free.
The second reason for rank loss in connection matrices is that if H has a proper automorphism
(a permutation of the nodes that preserves both the nodeweights and edgeweights), then in
formula (22), any two terms defined by a mapping φ : [k] → V (H) and φσ (σ ∈ Aut(H)) are
equal, so the sum of all such terms is still rank 1. So the rank of M (hom(·, H), k) is at most the
number of orbits of the automorphism group of H on ordered k-tuples of its nodes.

Theorem 3.7 [39] Assume that the target graph H is twin-free. Then for every k,
rk(hom(·, H), k) is the number of orbits of the automorphism group of H on ordered k-tuples
of its nodes.
It is worthwhile to formulate two corollaries.

Corollary 3.8 Let H be a weighted graph that has no twins and no automorphisms. Then
rk(hom(·, H), k) = |V (H)|k for every k.
Swapping twins i and j is “almost” an automorphism: the only additional condition needed is
that αi = αj . In particular, for unweighted graphs the condition that there are no automorphisms
implies that there are no twins.
Our second corollary is in fact an equivalent reformulation of Theorem 3.7 in the framework
of quantum graphs. To state it, we need the following notation: given a weighted graph H,
a k-labeled quantum graph x and nodes i1 , . . . , ik ∈ V (H), we define homi1 ...ik (x, H) to be the
number of homomorphisms from x to H such that the labeled nodes are mapped into i1 . . . ik .

Corollary 3.9 Let H be a weighted graph that has no twins, and let h : V (H)k → R. Then there
exists a k-labeled quantum graph x such that homi1 ...ik (x, H) = h(i1 , . . . , ik ) for all i1 , . . . , ik ∈
V (H) if and only if h is invariant under the automorphisms of H.

3.5 Extensions: directed graphs, hypergraphs, semigroups

In Theorem 3.6 we allowed parallel edges in the graphs G, but no loops. Indeed, the represen-
tation theorem is false if G can have loops: it is not hard to check that the graph parameter

loop(G) = 2−#loops

cannot be represented as a homomorphism function, even though its connection matrix

M (loop, k) is positive semidefinite and has rank 1. To get a representation theorem for graphs

13
with loops, each loop e in the target graph H must have two weights: one which is used when a
non-loop edge of G is mapped onto e, and the other, when a loop of G is mapped onto e. With
this modification, the theorem remains valid.
The constructions and results above are in fact more general; they extend to directed graphs
and hypergraphs. One new element in the case of directed graphs is the following. For a directed
graph, homomorphism functions can be defined by the same formulas (4) and (5), except that
weights are now assigned to the directed edges of G and H. But there are (at least) two
substantially different ways to define connection matrices.
(1) The easier way is to define k-labeled digraphs similarly, and glue them together just like
we did in the undirected case. The related theorem and proof are precisely the same as above.
(2) The alternative generalization of Theorem 3.6 to directed graphs is a bit more interesting.
(Example 3.13 below is a case when this second theorem applies.) For this, we consider weighted
directed graphs in which the edgeweights can be complex. Such a graph H is called Hermitian if
for every arc uv with (complex) weight βuv , the arc vu is also present and has weight βvu = β uv .
For any directed graph D, let D∗ be the digraph obtained from D by reversing all arcs. For
two k-labeled digraphs D and D0 , let DD0 denote, as before, their union with the labeled nodes
identified. For any complex-valued digraph parameter f defined on loopless directed graphs and
for each natural number k, we define the matrix M̃ (f, k) as follows: its rows and columns are
indexed by k-labeled directed graphs, and the entry in position D1 , D2 is f (D1∗ D2 ).

Theorem 3.10 [40] Let f be a complex valued digraph parameter. Then f = hom(·, H) for
some Hermitian weighted digraph H if and only if f (K0 ) = 1 and there exists a d ≥ 0 such that,
for each k ≥ 0, M̃k is positive semidefinite and has rank at most dk .
Note that M̃k is a complex valued matrix. The condition that it is positive semidefinite
includes the condition that it is Hermitian.
There is a common formulation of these results, using semigroups; see [40] for details.

3.6 Edge coloring models

Let G be a finite graph. An edge coloring model or edge model is determined by a finite set C
and a mapping h : ZC + → R+ , which we call the node evaluation function. Here C is the set
of possible edge colors; for any coloring of the edges, we think of h(a) as the value of a node
incident with a(c) edges with the color c (c ∈ C). In terms of statistical physics, an edge coloring
is a state of the system, and log h(a) is the contribution of a node (incident with a(c) edges with
the color c) to the energy of the state. 1
To be more precise, for an edge-coloring φ : E(G) → C and node v, let aφ,v (c) denote the
number of edges e incident with v with φ(e) = c. So aφ,v ∈ ZC + is the “local view” of node v.
The weight of the assignment φ is defined by
Y
w(φ) = h(aφ,v ),
v∈V (G)

and the edge coloring parameter, by

X
col(G, h) = w(φ).
φ: E(G)→C

(It will be also useful to allow a single edge with no endpoints; we call this graph the circle, and
denote it by °. By definition, col(°, h) = |C|.)
We can define edge-connection matrices that are analogous to the connection matrices defined
before: Instead of gluing graphs together along nodes, we glue them together along edges. To be
1 For this reason, these models are usually called vertex models in the physics literature.

14
precise, we define a k-broken graph as a k-labeled graph in which the labeled nodes have degree
one. (It is best to think of the labeled nodes not as nodes of the graph, but rather as points
where the k edges sticking out of the rest of the graph are broken off.) We allow that both ends
of an edge be broken off.
For two k-broken graphs G1 and G2 , we define G∗1 G2 by gluing together the corresponding
broken ends of G1 and G2 . These ends are not nodes of the resulting graph any more, so G∗1 G2
is different from the graph G1 G2 we would obtain by gluing together G1 and G2 as k-labeled
graphs. One very important difference is that while G1 G2 is k-labeled, G∗1 G2 has no broken
edges any more, and so it is not k-broken. This fact leads to considerable difficulties in the
treatment of edge models.
For every graph parameter f and integer k ≥ 0, we define the edge-connection matrix M 0 (f, k)
as follows. The rows and columns are indexed by isomorphism types of k-broken graphs. The
entry in the intersection of the row corresponding to G1 and the column corresponding to G2 is
f (G∗1 G2 ). Note that for k = 0, we have M (f, 0) = M 0 (f, 0), but for other values of k, connection
and edge-connection matrices are different. We say that f is edge reflection positive, if M 0 (f, k)
is positive semidefinite for every k ≥ 0.
It is easy to see (similarly as in the case of homomorphism functions) that if h : ZC + → R+
and f = col(·, h) is an edge-coloring function, then
rk(M 0 (f, k)) ≤ |C|k ,
and M 0 (f, k) is positive semidefinite. Unlike in the case of node-connection matrices, these two
properties are not independent any more:

Proposition 3.11 [58] If f is a multiplicative graph parameter such that M 0 (f, k) is positive
semidefinite for every k ≥ 0, then f (°) is a nonnegative integer and rk(M 0 (f, k)) ≤ f (°)k .
The analogue of Theorem 3.6 is even simpler to state (but much more difficult to prove:

Theorem 3.12 [58] A graph parameter f can be represented as f (·) = col(·, h) for some edge
coloring model h if and only if it is multiplicative and edge-reflection positive.
Just as for homomorphism functions, it is natural to ask what determines the rank of con-
nection matrices of edge models. This question seems to lead to difficult algebraic questions in
group representations, and is unanswered at this time.

3.7 Edge colorings and homomorphisms

The connection between homomorphism functions and edge coloring functions seems to go farther
than analogy, but it is not well understood.
In one direction, edge coloring functions are more general than homomorphism functions.
This connection is easy in the directed case. We can generalize the edge coloring model to
directed graphs in the obvious way. It is easy to see that the directed edge model is more general
than the directed homomorphism model: If we are given a pair H = (a, B) (a ∈ Rq+ , B ∈ Cq×q ,
with entries αc and βuv , resp.), then to every homomorphism φ : V (G) → [q] we can assign an
edge-coloring in which the edge ij is colored with the pair ψ(ij) = (φ(i), φ(j)). The evaluation
function at a node v is given as follows: if there is a color c such that all the outgoing edges have
the same first color c and all the incoming edges have the same second color c, then the value is
Y
αc βψ(uv)1 ,ψ(uv)2 ;
u: uv∈E(G)

otherwise, the node evaluates to 0. It is easy to see that an edge-coloring has nonzero weight
only if it comes from a homomorphism, and in that case, the weight of the edge-coloring is the
same as the weight of the corresponding homomorphism.

15
It is not obvious, but it is true, that undirected edge coloring functions generalize undirected
homomorphism functions [58], at least if complex values are allowed for h. It is not clear which
(real) homomorphism functions can be obtained as edge coloring functions with a real valued h.
In the opposite direction, edge coloring models cannot be translated into node coloring mod-
els (homomorphisms) in general; but there are some nontrivial examples of important graph
parameters that are defined as edge coloring functions, but that can also be represented as ho-
momorphism functions in a nontrivial way. A general understanding of these examples would
be very interesting.

Example 3.13 (Nowhere-zero flows) Let eul(G) = 1 if G is eulerian (i.e., all nodes have even
degree), and eul(G) = 0 otherwise. To represent this function as a homomorphism function, let
µ ¶ µ ¶
1/2 1 −1
a= , B=
1/2 −1 1

It was noted by de la Harpe and Jones [33] that for the weighted graph H = (a, B) we have
hom(G, H) = eul(G).
This example can be generalized quite a bit. Let Γ be a finite abelian group and let S ⊆ Γ
be such that S is closed under inversion. For any graph G, fix an orientation of the edges. An
S-flow is an assignment of an element of S to each edge such that for each node v, the product
of elements assigned to edges entering v is the same as the product of elements assigned to
the edges leaving v. Let sflo(G) be the number of S-flows. This number is independent of the
orientation.
The choice Γ = Z2 and S = Z2 \ {0} gives the special case above (incidence function of
eulerian graphs). If Γ = S = Z2 , then sflo(G) is the number of eulerian subgraphs of G. Perhaps
the most interesting special case is when |Γ| = t and S = Γ \ {0}, which gives the number of
nowhere zero t-flows.
Surprisingly, this parameter (which is an edge coloring model) can be described as a homo-
morphism function. Let Γ∗ be the character group of Γ. Let H be the complete directed graph
(with all loops) on Γ∗ . Let αχ := 1/|Γ| for each χ ∈ Γ∗ , and let
X
βχχ0 := χ−1 (s)χ0 (s),
s∈S

for χ, χ0 ∈ Γ∗ . Using arguments related to duality transformations of models in statistical physics

(see e.g. [22] and references therein) one can show [28] that this weighted graph H represents
sflo in the sense that sflo(·) = hom(·, H). The condition on S that it is closed under inversion
can be dropped if we use homomorphisms of directed graphs (Section 3.5).
In statistical physics, negative—and more generally, complex—nodeweights correspond to
complex magnetic fields, which arise in the well-known Lee-Yang theory of phase transitions
[9, 36, 60]. The next example [58] shows that it is also interesting in our context to extend the
definition of weighted graphs by allowing negative nodeweights (a direction we will not pursue
here except for this example).

Example 3.14 (Matchings revisited) We have seen that the number pmatch(G) of perfect
matchings has exponential rank connectivity but is not reflection positive, and hence it is not a
homomorphism function. However, consider the following weighted graph Hx : We take a looped
complete graph on two nodes u and v, and define
1 1
α(u) = , α(v) = − ,
x x
and
β(uu) = x + 1, β(uv) = β(vv) = 1.

16
Then the following surprising fact holds:

lim hom(G, Hx ) = pmatch(G)

x→0

for every graph G.

4 Convergence and limit

4.1 Quasi-random graphs
Quasirandom (also called pseudorandom) graphs were introduced by Thomason [59] and Chung,
Graham and Wilson [19]. These graphs have many properties that true random graphs have.
A sequence (Gn : n = 1, 2, . . . ) of graphs is called quasirandom with density p (where 0 <
p < 1), if for every simple finite graph F ,

t(F, Gn ) = (1 + o(1))p|E(F )| . (25)

(this is the asymptotic number of labeled copies of F in a random graph with edge probability
p). The definition is usually formulated in terms of the number of injections (labeled copies) of
F into G, but the two differ only in lower order terms, which are swallowed by the o(1) in the
definition.
It turns out that (25) implies many other properties that are familiar from the theory of
random graphs; for example, almost all degrees are about pn, almost all codegrees are about
p2 n etc. Many of these properties characterize quasirandom graphs, and so these provide many
equivalent ways to define a quasirandom sequence [19, 59]. Quasirandomness is closely related
to Szemerédi’s lemma [56, 57]. One of the most surprising facts proved in [19] is that it is enough
to require the condition about the number of copies of F for just two graphs, namely K2 (which
just defines the edge density p) and the 4-cycle C4 . This fact can be stated and proved in a
simpler way using the “limit set” T0 defined in Section 2.1:

Theorem 4.1 If t ∈ T0 satisfies t(C4 ) = t(K2 )4 , then for every simple graph H,

t(H) = t(K2 )|E(H)| .

In other words, t(H) is the expected profile of a random graph G(n, p) with p = t(K2 ); it is
also the profile of the weighted graph consisting of a single node and a loop with weight p (the
weight of the node does not matter).
To illustrate the power of reflection positivity, we give a proof of this theorem (the proof goes
along the lines of the original, just the details are simpler).
Proof. Let p = t(K2 ). We first prove the conclusion for stars K1,j :

t(K1,j ) = pj . (26)

Starting with t(K1,2 ), let us first consider the connection matrix M (t, 1) and its 2 × 2 submatrix
formed by the rows and columns corresponding to the graph K1 and K2 . Positivep semidefiniteness
of this matrix gives t(K1,2 ) ≥ t(K2 )2 = p2 . On the other hand t(K1,2 ) ≤ t(C4 ) = p2 by
positive semidefiniteness of the 2 × 2 submatrix of M (t, 2) indexed by K1,2 (with its endpoints
labeled) and K 2 , the empty graph on two nodes. The above two inequalities give t(K1,2 ) = p2 ,
which proves (26) for j = 2. To prove the identity for j > 2, we again consider the connection
matrix M (t, 1). By the identity we just established, its 2 × 2 submatrix formed by the rows and
columns corresponding to the graph K1 and K2 has 0 determinant. By positive semidefiniteness,

17
the corresponding two rows of the whole connection matrix M (t, 1) are proportional. But this
means that
t(K1,j+1 ) = pt(K1,j )
for every j, from which (26) follows by induction.
Next we show that for all complete bipartite graphs K2,j :

t(K2,j ) = p2j . (27)

Since t(K2,2 ) = t(C4 ) = p4 by assumption, this is true for j = 2. For general j, it follows just
like (26) from the positive semidefiniteness of the matrix M (t, 2).
Now we prove the general case by a similar induction. Let us view the graph H as glued
together from a star K1,d and a graph F on one fewer nodes, along the set T of the leaves of the
star, and suppose that we know the assertion for F . Consider the matrix M (t, d) and its 2 × 2
submatrix formed by the rows and columns indexed by K d (the graph on d labeled nodes with
no edge) and K1,d (with the leaves labeled). By (26) and (27), this submatrix is singular, and
hence these two rows of the whole matrix are proportional, the second row is pd times the first.
But the graphs F and H define two elements of these rows above each other, so

t(H) = pd t(F ) = pd p|E(F )| = p|E(H)| ,

which proves the theorem. ¤

4.2 Convergent sequences

Let (Gn ) be a sequence of unweighted simple graphs, and assume again that |V (Gn )| = n. We
say that this sequence is convergent, if the sequence t(F, Gn ) has a limit for every simple graph
F . Note that it would be enough to assume this for connected graphs F .
In the definition, we could replace the homomorphism function by the number of embeddings
(injective homomorphisms), with appropriate normalization. Indeed, the difference between the
number of homomorphisms and embeddings is the number of non-injective homomorphisms,
which is of lower order, so it tends to 0 when divided by n|V (F )| .
We could also replace the homomorphism function by the number of embeddings as induced
subgraphs. Indeed, the number of embeddings can be obtained by summing the numbers of
induced embeddings over all supergraphs (on the same set of nodes). Conversely, the number of
induced embeddings can be expressed in terms of the numbers of embeddings of supergraphs by
inclusion-exclusion.

Example 4.2 Let G(n, p) be a random graph on n nodes with edge-density p; the sequence
(G(n, p), n = 1, 2, . . . ) is convergent with probability 1. The limiting simple graph parameter is
given by t(F ) = p|E(F )| .
By definition, every quasirandom graph sequence with density p is also convergent, and the
homomorphism densities into it tend to the same value.

4.3 Finite limits, a.k.a. generalized quasirandom graphs

A generalized random graph G(n; H) is defined by the number n of its nodes and by a weighted
“model” graph H. We assume that V (H) = [q] and set αi = αi (H) (i = 1, . . . , q) and βij =
βij (H) (i, j = 1, . . . , q). We partition [n] into q classes V1 , . . . , Vq , by putting each u ∈ [n] into
Vi with probability αi and connect each pair u ∈ Vi and v ∈ Vj with probability βij (all these
decision are made independently).

18
A generalized quasirandom graph sequence (Gn ) with model graph H (or briefly H-
quasirandom sequence) is defined by the property that for every fixed finite graph F ,
t(F, Gn ) −→ t(F, H) (n → ∞).
In other words, the number of homomorphisms of F into Gn is approximately the same as
the expected number of homomorphisms of F into a generalized random graph G(N, H) on
N = |V (Gn )| nodes.
This definition suggests that we should consider the graph H as the “limit” of the H-
quasirandom sequence. The definition of a quasirandom sequence of graphs (with edge-density
p) is equivalent to saying that the sequence converges to K1 (p). (Warning: not every convergent
sequence will have a limit of this form!)
In view of the theory of quasirandom graphs, we can ask the following two basic questions
concerning generalized quasirandom graphs:
(a) Is it enough to require the condition concerning the number of copies of F for a finite set
of graphs Fi (depending on α and β)?
(b) Is the structure of a generalized quasirandom graph Gn similar to a generalized random
graph?
To be more precise, we want that the nodes of Gn can be partitioned into q classes U1 , . . . , Uq
of sizes α1 n,. . . ,αq n so that the graph spanned by Ui is quasirandom with density βii , and the
bipartite graph formed by the edges between Ui and Uj is quasirandom with density βij .
The answer to the first two questions is in the affirmative. More precisely, the following
theorems hold.

Theorem 4.3 [43] Let H be a weighted graph with V (H) = [q], nodeweights (αi : i = 1, . . . , q)
and edgeweights (βij : i, j = 1, . . . , q). Let (Gn , n = 1, 2, . . . ) be an H-quasirandom sequence of
unweighted simple graphs. Then for every n there exists a partition V (Gn ) = {U1 , . . . , Uq ) such
that
|Ui |
(a) → αi (i = 1, . . . , q),
|V (Gn )|
(b) the subgraph of Gn induced by Ui is a quasirandom graph sequence with edge density βii ,
and
(c) the bipartite subgraph between Ui and Uj is a quasirandom bipartite graph sequence with
density βij .

Theorem 4.4 [43] Let H be a weighted graph with V (H) = [q]. A sequence (Gn , n = 1, 2, . . . )
is H-quasirandom if and only if
t(F, Gn ) −→ t(F, H) (t → ∞)
for every graph F with at most (10q)q nodes.

4.4 The general limit object

4.4.1 Limits as reflection positive parameters
We now turn to describing limits of general convergent graph sequences.
Let Te0 be the set of homomorphism density functions t(·, G) defined on simple (unweighted)
graphs, where G is any simple unweighted target graph. Let T0 be the set of all graph parameters
that are pointwise limits of graph parameters in Te0 (i.e., its closure in the product topology on
RF ). It is not hard to see that T0 would not change if we allowed weighted target graphs with
edge weights between 0 and 1.
The characterization of homomorphism functions (Theorem 3.6) extends to the limit, at least
for simple graphs:

19
Theorem 4.5 [44] A simple graph parameter f is in T0 if and only f is normalized, multiplica-
tive and reflection positive.

4.4.2 Limits as measurable functions

Graph parameters in the set T0 can be represented as homomorphism functions into measurable
functions [44]. Let W denote the set of all bounded measurable functions W : [0, 1]2 → R such
that W (x, y) = W (y, x) for all x, y ∈ [0, 1]. We also introduce the set

W0 = {W ∈ W : 0 ≤ W ≤ 1}.

Theorem 4.6 [44] A simple graph parameter f is in T0 if and only if there is a function W ∈ W0
such that f = t(·, W ).
This function W is not unique: for example, W (1 − x, 1 − y) will define the same graph
parameter. More generally, if φ : [0, 1] → [0, 1] is a measure preserving map (not necessarily
bijective), then
W φ (x, y) = W (φ(x), φ(y))
defines the same parameter. The following theorem says that this is all: Let us call functions
W1 , W2 ∈ W equal up to measure preserving transformation if there is a third function W ∈ W
and measure preserving maps φ1 , φ2 : [0, 1] → [0, 1] such that Wi = W φi .

Theorem 4.7 [14] Two functions W1 , W2 ∈ W define the same simple graph parameter if and
only if they are equal up to measure preserving transformation.

4.4.3 Limits as distributions over finite and countable graphs

Given any function W ∈ W0 and an integer n > 0, we can generate a random graph G(n, W ),
called a W -random graph, on node set [n] as follows. We generate n independent numbers
X1 , . . . , Xn from the uniform distribution on [0, 1], and then connect nodes i and j with proba-
bility W (Xi , Xj ).
As a special case, if W is the identically p function, we get “ordinary” random graphs G(n, p).
More generally, if W = WH for a (finite) weighted graph H, then G(n, WH ) is a quasirandom
graph with model H.

Theorem 4.8 [44] With probability 1, the graph sequence G(n, W ) is convergent, and its limit
is the function W .
Let us define a random graph model as a distribution Gn on simple graphs on [n] , for every
n ∈ Z+ . The random graph model G(n, W ) defined above has the following three obvious
properties:
(i) The distribution of Gn is invariant under relabeling nodes;
(ii) If we delete node n from Gn , the distribution of the resulting graph is the same as the
distribution of Gn−1 ;
(iii) for every 1 < k < n, the subgraphs of G induced by [k] and {k+1, . . . , n} are independent
(as random variables).
It turns out that these three properties characterize the model G(n, W ):

Theorem 4.9 [44] A random graph model is of the form G(n, W ) for some function W ∈ W0
if and only if it satisfies conditions (i), (ii) and (iii). Furthermore, two functions W1 , W2 ∈ W0
define the same random graph model if and only if they are equal up to measure preserving
transformation.

20
We have seen that the limit of ordinary random graphs G(n, 1/2) is the function W ≡ 1/2.
It is, however, quite natural to think that the limit of ordinary random graphs should be the
Rado graph (the countable random graph). It turns out that this is also true in the following
sense: For every W ∈ W0 , we can define a countable random graph G(ω, W ) on Z+ , by choosing
an infinite sequence (X0 , X1 , . . . ) of independent uniform samples from [0, 1], and connecting i
and j with probability W (Xi , Xj ). A countable random graph model is a probability distribution
on graphs on Z+ (with the σ-algebra generated by cylinders consisting of all graph containing a
given edge).

Theorem 4.10 [48] A countable graph model is of the form G(ω, W ) if and only if it satisfies (i)
and (iii) above. Furthermore, the countable graph model G(ω, W ) determines W up to measure
preserving transformation.
Thus it is justified to say that with probability 1, G(n, 1/2) converges to the Rado graph
G(ω, 1/2). A word of caution is warranted here: as an unlabeled graph, G(ω, 1/2) is isomorphic
to G(n, 1/3) with probability 1. So viewing the Rado graph as an unlabeled graph would
not contain enough information to characterize the limit; we have to view it as a probability
distribution over graphs on a fixed countable set of nodes.

4.4.4 Examples
Example 4.11 Consider the half-graphs Hn,n : they are bipartite graphs on 2n nodes
{1, . . . , n, 10 , . . . , n0 }, where i is connected to j 0 if and only if i ≤ j 0 . It is easy to see that
this sequence is convergent. Indeed, let F be a simple graph with k nodes; we show that the
limit of t(F, Hn,n ) exists. We may assume that F is connected. If F is non-bipartite, then
t(F, Hn,n ) = 0 for all n, so suppose that F is bipartite; let V (F ) = V1 ∪ V2 be its (unique)
bipartition. Then every homomorphism of F into H preserves the 2-coloring, and so the homo-
morphisms split into two classes: those that map V1 into {1, . . . , n} and those that map it into
{10 , . . . , n0 }. By the symmetry of the half-graphs, these two classes have the same cardinality.
Now F defines a partial order P on V (F ), where u ≤ v if and only if u = v or u ∈ V1 ,
v ∈ V2 , and uv ∈ E. With respect to this partial order, 12 hom(F, Hn,n ) is just the number of
order-preserving maps from V (F ) to the chain {1, . . . , n}, and so
1
hom(F, Hn,n ) 2 hom(F, Hn,n )
2k−1 t(F, Hn,n ) = 2k−1 · =
(2n)k nk

is the probability that a random map of V (F ) into {1, . . . , n} is order-preserving. As n → ∞, the

fraction of non-injective maps tends to 0, and hence it is easy to see that 2k−1 t(F, Hn,n ) tends
to a number 2k−1 t(F ), which is the probability that a random ordering of V (F ) is compatible
with P . In other words, k!2k−1 t(F ) is the number of linear extensions of P .
However, the half-graphs do not converge to any finite weighted graph. To see this, let Sk
denote the star on k nodes, and consider the (infinite) matrix M defined Mk,l = t(Sk+l−1 ). If
t(F ) = t(F, G0 ) for some finite weighted graph G0 , then it follows from the characterization of
homomorphism functions in [28] that this matrix has rank at most |V (G0 )|; on the other hand,
it is easy to compute that
1
Mk,l = k+l−2 ,
2 (k + l − 1)
and this matrix (up to row and column scaling, the Hilbert matrix) has infinite rank (see e.g
[18]).
One can say, however, that in the limit, we are considering order-preserving maps of the
poset P into the interval [0, 1]; equivalently, adjacency-preserving maps of F into the infinite

21
graph with node set [0, 1] and edge-set {xy : x ≤ 1/2, y > 1/2, x ≤ y − 1/2}. More precisely,
the limit is given by the function
(
1, if x ≥ y + 12 or y ≥ x + 12 ,
W (x, y) =
0, otherwise.

Example 4.12 (Preferential attachment graphs) Preferential attachment as a paradigm

of generating power law distributions goes back to Yule [61] in 1923. In the context of networks
and graph theory, the idea is usually credited to Barabasi and Albert [7], even though it can
be already found in [53]. For the simplest case of preferential attachment trees, the rigorous
mathematical analysis of these models was first carried out in [50], while the rigorous analysis
of the more general model of Barabasi and Albert was first carried out in [10]. For more general
models of undirected preferential attachment graphs see [20], and for models with directed edges,
see [12]. All these model are sparse models with bounded average degree.
Here we define a preferential attachment graph PAG(n, m) as the random graph with n nodes
and m edges obtained by the following procedure. Fix a set of n nodes, and let v1 . . . vn be any
ordering of the nodes. We extend this sequence one by one by picking an element of the current
sequence randomly and uniformly, and append a copy of it at the end. We repeat this until 2m
further elements have been added. So we get a sequence v1 . . . vn vn+1 . . . vn+2m .
Now we construct G by connecting nodes vn+2k−1 and vn+2k for k = 1, 2, . . . , m, to get
G(n, m). (Note that G may have multiple edges and loops, which we have to live with for the
time being).
Another way of describing this construction is to view it as adding edges one by one, where
the probability of adding an edge connecting u and v is proportional to the product of the
“degrees”. To be more precise, the probability that the (k + 1)-st edge connects u and v is


 2(d(u) + 1)(d(v) + 1)
 if u 6= v,
(n + 2k)(n + 2k + 1)

 (d(u) + 1)(d(u) + 2)
 if u = v,
(n + 2k)(n + 2k + 1)
where d(u) is the current degree of the node (adding 1 to the degree is needed to start the
procedure at all; adding 2 to the second factor in the case when u = v makes everything come
out nicer).
It can be shown that the limit of preferential attachment graphs PAG(n, cn2 ), with probability
1, is the function Wc (x, y) = c(log x)(log y). It is interesting to note that the graphs G(n, Wc )
form another (different) sequence of random graphs tending to the same limit Wc with probability
1.

5 The metric space of graphs

5.1 Distances of graphs
In this section, we assume that the graph G (which we probe by F from the left and by H from
the right) is dense, i.e., the number of edges of G is Ω(n2 ), where n is the number of nodes (the
results are valid but mostly vacuous for sparse graphs). We’ll discuss analogous questions for
sparse graphs (specifically, for graphs with bounded degree) in Section 8.

5.1.1 Matrix norms

For an n × n matrix A, the rectangle norm (also called the cut norm) is defined as
kAk¤ = max |uT Av|. (28)
u,v∈{0,1}n

22
This norm is closely related to `∞ → `1 norm, which can be defined by
kAk∞→1 = max |uT Av| = max uT Av; (29)
u,v∈[−1,1]n u,v∈{−1,1}n

in fact,
1
kAk∞→1 ≤ kAk¤ ≤ kAk∞→1 . (30)
4
For symmetric matrices A, the norm
kAk0¤ = max |uT Au| (31)
u∈{0,1}n

is simpler to define and is also equivalent to the rectangle norm:

kAk0¤ ≤ kAk¤ ≤ 2kAk0¤ . (32)
A constant factor approximation of the rectangle norm can be computed in polynomial time,
using semidefinite optimization and Grothendieck’s inequality in functional analysis (see [3]).

5.1.2 Labeled graphs on the same set of nodes

Let G and G0 be two graphs on the same set of n nodes. We want to define a notion of distance
between them that reflects structural similarity. A first attempt is to define
1
d1 (G, G0 ) = |E(G)4E(G0 )|.
n2
(Here the division by n2 is just a convenience, so that the distance of two graphs is always
between 0 and 1.) However, this notion is too restrictive: For example, the distance of two
random graphs with the same density is of constant order (with large probability), even though
two random graphs are structurally very similar.
For our purposes, the following distance function will be more useful. For a graph G and
sets S, T ⊆ V (G), let eG (S, T ) denote the number of edges in G with one endnode in S and the
other in T (the endnodes may also belong to S ∩ T ; so eG (S, S) is twice the number of edges
spanned by S). We define:
1
d¤ (G, G0 ) = max |eG (S, T ) − eG0 (S, T )|.
n2 S,T ⊆V (G)

Note that we are dividing by n2 and not by |S| × |T |, so the contribution of a pair S, T is at
most |T | × |S|/n2 . Thus small sets of size o(n) play no role when measuring the distance. In
terms of the adjacency matrices A and A0 of G and G0 , respectively, this can be expressed as
1° °
°A − A0 ° .
d¤ (G, G0 ) = 2 ¤
n
Note that the definition can be extended to the case when G and G0 have edgeweights. Further-
more, by (30) and (32), we could replace the k.k¤ norm in the definition by one of the other
matrix norms defined above without distorting the distance by more than a constant factor.
We need to extend this notion to weighted graphs on the same set of nodes. Let G and G0
be weighted graphs with V (G) = V (G0 ). We assume that G and G0 both have total nodeweight
1, but the weights of individual nodes in G and G0 may be different. Then we define
X
d¤ (G, G0 ) = |αi (G) − αi (G0 )|
i
¯X¡ ¢¯¯ (33)
¯
+ max ¯ αi (G)αj (G)βij (G) − αi (G0 )αj (G0 )βij (G0 ) ¯.
S,T ⊆V (G)
i∈S
j∈T

23
If the two graphs do not both have total nodeweights of one, then we simply define the distance
d¤ in terms of the corresponding “normalized” graphs, i.e., we replace αi (G) with αi (G)/α(G),
and similarly for G0 .

5.1.3 Unlabeled graphs with the same number of nodes

Now assume that G and G0 are unlabeled unweighted graphs on n nodes. It is natural to define

δb¤ (G, G0 ) = min d¤ (G̃, G̃0 ), (34)

G̃,G̃0

where G̃ and G̃0 range over all labelings of G and G0 by 1, . . . , n, respectively (of course, we
could fix the labeling of one of the graphs).
Consider any labeling that attains the minimum in the definition of δb¤ , and identify the nodes
of G and G0 with the same label. In this case, we say that G and G0 are optimally overlaid.

5.1.4 Unlabeled graphs with different number of nodes

To define the distance (in any of the above senses) of two unlabeled graphs with different number
of nodes, say G with n nodes and G0 with n0 nodes, a first idea is to blow up each node of G
into n0 nodes, and each node of G0 into n nodes, so that both graphs now will have nn0 nodes.
An improved version of this idea is to match up the nodes “fractionally”. This also allows us to
extend the notion of distance to weighted graphs.
Let G and G0 be weighted graphs with (say) V (G) = [n], V (G0 ) = [n0 ], and assume that the
sum of nodeweights is 1 (just scale the nodeweights of each graph). Let X be a nonnegative
n × n0 matrix such that
Xn0
Xiu = αi (G)
u=1

and
n
X
Xiu = αu (G0 ).
i=1

We think of Xiu as the portion of node i that is mapped onto node u. We call such a matrix
X a fractional overlay of G and G0 . Let X (G, G0 ) denote the set of all fractional overlays. Note
that for every X ∈ X (G, G0 ),
0 0
n X
X n n
X n
X
Xiu = αi (G) = αu (G0 ) = 1.
i=1 u=1 i=1 u=1

(If we view αG and αG0 as probability distributions, then every X ∈ X (G, G0 ) is a coupling of
these distributions.)
For each fractional overlay, we construct the following two weighted graphs. The nodes of
G[X] are all pairs (i, u) where 1 ≤ i ≤ n and 1 ≤ u ≤ n0 . The weight of the node (i, u) is Xiu ,
and the weight of the edge ((i, u), (j, v)) is βij . The other graph G0 [X T ] is defined similarly,
except that the roles of i and u are interchanged. Now the node sets of G[X] and G0 [X T ] are
labeled by the same set of pairs (i, u), so their distances are well defined.
Thus we can define the distance of two weighted unlabeled graphs G and G0 (with total
nodeweight 1):
δ¤ (G, G0 ) = min 0 d¤ (G[X], G0 [X T ]).
X∈X (G,G )

24
We can express this distance in terms of the original graphs G and G0 by the following
formula: ¯ X ¡ ¢¯¯
¯
δ¤ (G, G0 ) = min 0 max 0 ¯ Xiu Xjv βij (G) − βuv (G0 ) ¯. (35)
X∈X (G,G ) S,T ⊆V ×V
(i,u)∈S
(j,v)∈T

Of course, this definition also applies if G and G0 have the same number of nodes; however,
it may give a different value than (34). It is proved in [16] that there is a constant c > 0 such
that
δ¤ (G, G0 ) ≤ δb¤ (G, G0 ) ≤ cδ¤ (G, G0 )1/4 (36)
(the lower bound is trivial; we do not have an example showing that the exponent 1/4 is needed
in the upper bound). While the definition of δb¤ is more straightforward, the distance δ¤ will
be easier to work with, and we will use mostly the latter distance.
A very special weighted graph is K1 (p): a single node with a loop with weight p. For the
random graph G = G(n, p), we have a.s.

δ¤ (G, K1 (p)) −→ 0 (n → ∞).

Let (Gn ) be a sequence of simple graphs. It follows by standard results on quasirandom graphs
that

Proposition 5.1 [19] A sequence (Gn ) of graphs is quasirandom if and only if δ¤ (Gn , K1 (p)) →
0 as n → ∞.
The following result connects this distance to homomorphism functions.

Lemma 5.2 [44] For any three simple graphs F , G and G0

|t(F, G) − t(F, G0 )| ≤ |E(F )| · δ¤ (G, G0 ).

5.2 Szemerédi’s Lemma

For a graph G = (V, E) and two subsets U, W ⊂ V we define the “irregularity” of the pair U, W
as the quantity ¯ ¯
irregG (U, V ) = max ¯eG (X, Y ) − d|X| · |Y |¯,
X⊆U,Y ⊆V

where d is the density d = eG (U, W )/(|U | · |W |). Let twr(ε) denote the d1/ε2 e times iterated
exponential function (the “tower”). With this notation, we can state one version of the Regularity
Lemma:

Lemma 5.3 (Szemerédi Regularity Lemma) For every ε > 0 and every graph G = (V, E)
there is a partition P of V into k ≤ twr(ε) classes V1 , . . . , Vk such that
X
irregG (Vi , Vj ) ≤ ε|V |2 .
1≤i<j≤k

(While this form is perhaps easiest to state and prove, there are equivalent forms that are more
suited for applications. We refer to [35] for a survey, to [55] for connections with quasirandom
graphs, and to [47] for analytic aspects of the Regularity Lemma.)
The appearance of the tower function in the Lemma forbids practical applications (and
unfortunately this bound on the number of parts is not far from best possible, as it was shown
by Gowers [32]). A more reasonable threshold was proved by Frieze and Kannan [29], at the cost
of using a weaker measure of irregularity.

25
Given a graph G = (V, E) and a partition P = {V1 , . . . , Vk } of V , we define a weighted graph
GP on V by setting αu (GP ) = 1 and βuv (GP ) = di(u)i(v) , where i(u) is the index of the set Vi
containing u and dij = eG (Vi , Vj )/(|Vi ||Vj |). The edgeweight matrix of GP is thus obtained from
the adjacency matrix of G by replacing each entry in the block Vi × Vj by the average over the
block. In this notation, the result of Frieze and Kannan [29] can be formulated as follows:

Lemma 5.4 (Weak Regularity Lemma) For every ε > 0 and every graph G = (V, E), there
2
exists a partition P of V into k ≤ 22/ε classes such that d¤ (G, GP ) ≤ ε.
The bound on the number of partition classes is still rather large (exponential), but at least
not a tower. Frieze and Kannan show that the partition can be obtained as an “overlay” of only
1/ε2 sets, so it has a description that is polynomial in 1/ε, which in some applications leads to
polynomial time algorithms (see e.g. [2]).
The Weak Regularity Lemma immediately implies the following slight variant:

Lemma 5.5 [16] For every ε > 0 and every graph G = (V, E), there is a weighted graph H with
2
at most d21/ε e nodes such that δ¤ (G, H) ≤ ε.

We note that other versions strengthen the conclusion (of course, at the cost of replacing
the tower function by an even more huge value). Such a “super-strong” Regularity Lemma was
proved and used by Alon and Shapira [4]. It would be interesting to fit the original Regularity
Lemma or one of its applications into this framework.

5.3 Sampling from a graph

The following important fact connecting sampling and graph distance follows from a result in
[2]; see also [16] for a simple proof of (a):

Theorem 5.6 Let G1 and G2 be two graphs on the same set of nodes V , let ε = d¤ (G1 , G2 ),
δ > 0, and let S be a random k-subset of V .
(a) If k ≥ 300 2
ε2 log( δ ), then with probability at least 1 − δ,

2−10 ε ≤ d¤ (G1 [S], G2 [S]) ≤ 4ε1/4 .

(b) If k ≥ 1010 log(2/ε)/(ε4 δ 5 ), then with probability at least 1 − δ,

ε
2−10 ε ≤ d¤ (G1 [S], G2 [S]) ≤ 107 √ .
δ
Using this bound and the (weak) Regularity Lemma, it is not hard to prove the following
theorem, which is the key to several further results.
8
Theorem 5.7 [16] Let G be a (possibly weighted) graph, ε > 0 and k ≥ 2c1 /ε . Let S be a
random subset of V (G) of size k. Then with probability at least 1 − ε, we have δ¤ (G, G[S]) < ε.
Informally, if we take a sample of k points, and blow up each node of this subgraph into
|V (G)|/k twins, then the resulting graph can be overlayed with G so that the d¤ -distance will
be small.
This theorem can be thought of as a strengthening of the (weak) Regularity Lemma in
two directions. First, it says that the approximating weighted graph can be required to be
unweighted; second, that it can be obtained just by drawing a random sample.
From this theorem, it is easy to deduce the following converse of Lemma 5.2:

26
8
Theorem 5.8 [16] Let G, G0 be simple graphs, and let ε > 0. Set k = d2c2 /ε e, and assume
2
that for every simple graph F on at most k nodes, we have |t(F, G) − t(F, G0 )| < 2−2k . Then
δ¤ (G, G0 ) ≤ ε.
These results allow us to characterize convergent graph sequences:

Theorem 5.9 [16] A graph sequence is convergent if and only if it is Cauchy in the δ¤ metric.
Let F denote the metric space of all finite, simple graphs with the δ¤ metric. It follows
from the above that the completion X of F can be described as follows. Consider the space of
all functions in W0 , with the distance
¯Z ¯
¯ ¯
d¤ (U, W ) = sup ¯ W (x, y)dx dy ¯.
S,T ⊆[0,1] S×T

Define a new metric by

δ¤ (U, W ) = inf d¤ (U φ , W ψ ),
where φ and ψ range over all measure preserving maps φ, ψ : [0, 1] → [0, 1]. Then the elements
of X can be obtained by identifying functions that are at distance 0, and the δ¤ metric between
these classes extends the δ¤ metric on graphs.
The above results give various other descriptions of this completion: for example, X is
isomorphic to T0 with the metric
1
δ(t1 , t2 ) = sup |t1 (F ) − t2 (F )| (t1 , t2 ∈ T0 ).
F |E(F )|

Szemerédi’s Lemma can be used to show that X is compact.

5.4 Testing huge graphs

Imagine that we have a huge graph G; this graph is so large that we cannot describe it completely
in any way. All we can do is sample a bounded number of nodes of G and look at the subgraph
that is induced by them. What can we learn about G?
There are two related, but slightly different ways of asking this question.

5.4.1 Parameter testing

Parameter testing is easier to state. We may want to determine some parameter of G; say
what is the edge density? How large is the density of the maximum cut? Of course, we will
not be able to determine the exact value of this parameter; the best we can hope for is that if
we take a sufficiently large sample, we can find the approximate value of the parameter with
large probability. To be precise, a graph parameter f is testable, if for every ε > 0 there is
a positive integer k such that if G is a graph with at least k nodes and we select a set X of
k independent uniform random nodes of G, then from the subgraph induced by them we can
compute an estimate fe(G[X]) of f such that

P(|f (G) − fe(G[X])| > ε) < ε.

It is an easy observation that we can always use fe(G[X]) = f (G[X]).

Using the notions of graph distance and convergence introduced above, we can give a number
of characterizations of testable parameters.

27
Proposition 5.10 [16] A simple graph parameter is testable if and only if any of the following
equivalent conditions holds.
(a) For every convergent graph sequence (Gn ), the limit of f (Gn ) exists as n → ∞ (continuity
at infinity).
(b) For every ε > 0 there is an integer k0 such that for every k > k0 and every graph G on
at least k nodes, a random set X of k nodes of G satisfies
|f (G) − E(f (G[X]))| < ε.

(c) f is “essentially” uniformly continuous with respect to the δ¤ distance in the following
sense: For every ε > 0 there is an ε0 > 0 and a positive integer n0 so that if G1 and G2 are two
graphs with |V (Gi )| ≥ n0 and δ¤ (G1 , G2 ) < ε0 , then |f (G1 ) − f (G2 )| < ε.
(d) There exists a functional fb(W ) on W0 that is continuous in the rectangle norm, and
extends f in the sense that |fb(WG ) − f (G)| → 0 if |V (G)| → ∞.
If we want to use (c) to prove that a certain invariant is testable, then the complicated
definition of the δ¤ distance may cause a difficulty. So it is useful to show that (c) can be
replaced by a weaker condition, which consists of three special cases of (c):

Supplement 5.11 [16] The following three conditions together are also equivalent to testability:
(c.1) For every ε > 0 there is an ε0 > 0 such that if G and G0 are two simple graphs on the
same node set and d¤ (G, G0 ) ≤ ε0 then |f (G) − f (G0 )| < ε.
(c.2) For every simple graph G, f (G(m)) has a limit as m → ∞, where G(m) denotes the
graph obtained from G by replacing each node by m twins.
(c.3) f (G(m)) − f (G) → 0 if |V (G)| → ∞.
Some of the implications between conditions (a)-(d) in the Theorem are easy, some others
follow from the general theory sketched above. To illustrate the use of this theorem, let us
consider the density of the maximum cut:
eG (S, V (G) \ S)
f (G) = max ¡n¢ .
S⊆V (G)
2

This parameter is testable: this fact is nontrivial, and its first proof by Goldreich, Goldwasser
and Ron [31] was one of the first important results in Property Testing. Of the conditions above,
(a) and (b) are more or less a reformulation of testability.
Condition (c), on the other hand, is easy to verify in this case. Let ε > 0, and let G1 and
G2 be two graphs for which δ¤ (G1 , G2 ) < ε. Let us blow up the points of each graph so that
the new graphs G01 and G02 have the same number N of points and they can be overlaid so that
d¤ (G01 , G02 ) < ε. For any subset S ⊆ V (G01 ) = V (G02 ), we have
|eG01 (S, V (G) \ S) − eG02 (S, V (G) \ S)| < εN 2 ,
and hence
|f (G01 ) − f (G02 )| < ε.
To complete the proof, one must argue that |f (G0i ) − f (Gi )| is small, which is not hard (and is
not given here).
Condition (d) can also be directly verified: we can extend the definition of a maximum cut
to functions W ∈ W0 in a natural way:
X Z Z
fb(W ) = W (x, y) dx dy.
S⊆[0,1] S [0,1]\S

Then it is easy to check that this functional is continuous in the norm k.k¤ , and extends f .

28
5.4.2 Property testing
Instead of estimating a numerical parameter, we may want to determine some property of G:
Is G 3-colorable? Is it connected? Does it have a triangle? The answer will of course have
some uncertainty. A precise definition was given by Goldreich, Goldwasser and Ron [31], who
also proved several fundamental results about this problem. There are in fact several ways to
formalize this question. For this exposition, we take the following.
As for parameter testing, we specify an ε > 0 and want to find a positive integer k (depending
on ε) with the following property. We select k independent uniform random nodes of G, and
from the subgraph induced by them we compute a guess X ∈ {Y ES, N O}. Ideally, we want
that if the graph does have the property, our guess should be YES with large probability, and
if the graph does not have the property, then we should guess NO with large probability. But
this is too much to ask. Suppose that we have two graphs that can be obtained from each other
by changing a very tiny fraction of the edges, but one has the property, and the other does not.
Then a sample induced subgraph from one graph will have almost the same distribution as a
sample (of the same size) from the other, and so our guess for the two graphs will be almost the
same.
A graph property P is testable, if our guess satisfies the following: if a graph has the property
in a robust way so that changing at most εn2 edges in any way it still has the property, we must
guess YES with probability at least 1 − ε; similarly, if changing at most εn2 edges in any way
the obtained graph does not have the property, then we must guess NO with probability at least
1 − ε; in the grey area inbetween, we can guess arbitrarily. In other words, whatever we guess,
we should be able to change at most εn2 edges to make out guess right.

Remark 5.12 While changing a small number of edges is the most natural way to formalize
that there is a “nearby” graph with the property, we have seen that the rectangular distance is
often better behaved. One is tempted to define that a property is weakly testable, if for every
ε > 0 there is a k such that for every graph G on at least k nodes we can make a guess based on
a sample induced subgraph of size k such that with probability at least 1 − ε, there is a graph G0
such that d¤ (G, G0 ) < ε and our guess is right for G0 . But this notion is not very interesting due
to the fact that every graph property is weakly testable. This is an easy application of Theorem
5.7 above.
Among the many results on graph property testing, let us quote a surprisingly general recent
result of Alon and Shapira [4]. A graph property is called hereditary, if it is inherited by induced
subgraphs.

Theorem 5.13 [4] Every hereditary graph property is testable.

They in fact obtain a stronger result, which can be cast in the framework of parameter
testing. For a hereditary graph property P, define the distance from the property as d(G, P) =
D(G, P)/|V (G)|2 , where D(G, P) is the minimum number of edges we need to change in G to
obtain a graph with property P. Alon and Shapira proved:

Theorem 5.14 [4] The distance from a hereditary graph property is testable.
This theorem has a reasonably short proof using graph limits, see [47].

6 Partitions and homomorphisms into small graphs

6.1 Ground state energy
In Section 4 we defined convergence of a graph sequence Gn in terms of the homomorphism num-
bers from small graphs F to Gn (“convergence from the left”). But there are many applications

29
where one wants to study homomorphisms from Gn into a small graph H. This naturally raises
the question whether suitably normalized homomorphism numbers hom(Gn , H) converge if Gn
is convergent from the left.
Consider a graph G on n nodes, and a softcore graph H on q nodes. (Recall that H is called
softcore if all edge weights are strictly positive.) Then log hom(G, H) typically grows like the
number of edges in G. For dense graphs, it therefore seems natural to consider the quantity
1
log hom(G, H). (37)
n2
This quantity is closely related to a weighted maximum cut problem on G. Indeed, let B =
(Bij )1≤i,j≤q be a symmetric matrix with real entries. We then define the ground state energy of
the “model” B on the graph G as
1 X
E(G, B) = 2 max Bφ(u)φ(v) . (38)
n φ: V (G)→[q]
uv∈E(G)

For large n, the quantity defined in (37) is well approximated by the ground state energy.

Lemma 6.1 Let G be an unweighted graph with n nodes, and let H be a softcore graph with
α(H) = 1. Let α = mini αH (i). Let B be the matrix of logarithms of the edgeweights of H. Then
log(1/α) log hom(G, H)
E(G, B) − ≤ ≤ E(G, B).
n n2
(Note that the upper bound on log hom(G, H)/n2 does not depend on the nodeweights of H,
and in the lower bound, only the error term does.)
Proof. Note that 2
max homφ (G, H) = en E(G,B)
.
φ: V (G)→V (H)

Thus we have
X X 2 2
hom(G, H) = αφ homφ (G, H) ≤ αφ en E(G,B)
= en E(G,B)
,
φ φ

and 2
hom(G, H) ≥ max αφ homφ (G, H) ≥ αn en E(G,B)
.
φ
From these bounds the Lemma follows. ¤

Example 6.2 As a special case, consider the graph H consisting of two nodes of weight 1/2,
with loops of weight 1 at each, and connected by an edge of weight e (the base of the natu-
ral logarithm). Then n2 E(G, H 0 ) is the size of the maximum cut in G, which we denote by
MAXCUT(G). By Lemma 6.1,
log hom(G, H) − n ≤ MAXCUT(G) ≤ log hom(G, H).
Since MAXCUT(G) ≥ |E(G)|/2, this gives a very good approximation of the maximum cut.
More generally, for a fixed H, computing E(G, H) is a weighted multiway cut problem.
Our next theorem states that convergence from the left implies convergence of the ground
state energies. Its proof uses the notion of fractional partitions, a notion we will need at
several places in this section: a fractional partition of a set V into q classes (briefly, a fractional
q-partition) is a q-tuple ρ = (ρ1 , . . . , ρq ) of functions from V to [0, 1] such that for all x ∈ V , we
have ρ1 (x) + · · · + ρq (x) = 1. (Later in this section, we will apply this definition also to the case
when V = [0, 1], when we tacitly assume that the functions ρi are measurable.) We will use the
notation Pdq for the set P of probability distributions on [q], i.e., the set of vectors a = (a1 , . . . , aq )
such that ai ≥ 0 and i ai = 1, and the notation Symq for the set of q × q symmetric matrices.

30
Theorem 6.3 [16] Let q be a positive integer, let B ∈ Symq , and let Gn be a convergent sequence
of simple graphs. Then E(Gn , B) is a convergent sequence.
As a simple illustration of the usefulness of Proposition 5.10 and its Supplement 5.11, we
sketch the proof. Indeed, let us consider the quantity
1 X
Eφ (G, B) = Bφ(u)φ(v)
n2
uv∈E(G)

where φ is a map from V (G) to [q]. Identifying these maps with partitions P = (V1 , . . . , Vq ) of
V (G) and using the definition of the d¤ metric, we immediately see that

|Eφ (G, B) − Eφ (G0 , B)| ≤ q 2 kBk∞ d¤ (G, G0 )

whenever G and G0 are simple graphs on the same set of nodes, which verifies the condition (c.1)
of Supplement 5.11.
The condition (c.2) is not hard to verify either: Define the energy of a fractional q-partition
ρ of V (G) as
1 X X
Eρ (G, B) = 2 ρi (u)ρj (v)Bij . (39)
n
uv∈E(G) i,j∈[q]

Then E(G(k), B) = maxρ Eρ (G, B) where the maximum runs over all fractional partitions such
that all ρi (u)’s are multiples of 1/k. We claim that the maximum is attained for fractional
partitions which are {0, 1} valued, so that E(G(k), B) = E(G, B) for all k. It is clear from the
above description that E(G(k), B) ≥ E(G, B), so the only thing we need to show is a matching
upper bound on E(G(k), B). Consider a fractional partition ρ maximizing Eρ (G, B), and a fixed
node u ∈ V (G). Then Eρ (G, B) is a linear function of the vector (ρ1 (u), . . . , ρq (u)), implying
that the maximum over all these vectors is obtained at a vertex of the simplex Pdq . Applying
this procedure to all nodes u ∈ V (G), this gives the desired inequality E(G(k), B) ≤ E(G, B),
and hence the equality of E(G(k), B) and E(G, B) for all k. The condition (c.2) therefore holds
trivially. The verification of condition (c.3) is even easier.
Let us finally note that the limiting ground state energy of a convergent sequence can be
expressed explicitly. Indeed, for W ∈ W0 and a symmetric q × q matrix B, let
q Z
1 X
E(W, B) = max Bij ρi (x)ρj (y)W (x, y) dx dy, (40)
ρ 2 i,j=1 [0,1]2

where the maximum runs over fractional q-partitions of [0, 1]. Then we have the following
theorem.

Theorem 6.4 [16] Let (Gn ) be a convergent sequence, and let W ∈ W0 be its limit. Let H be a
softcore graph, and let B be the matrix of logarithms of the edgeweights of H. Then

log hom(Gn , H)
lim = lim E(Gn , B) = E(W, B).
n→∞ |V (Gn )|2 n→∞

6.2 Entropy and free energy

In addition to the ground state energy and the closely related quantity (37), we will consider the
so-called “pressure” or “free energy” of the model H on G, defined as
1 ¡1 ¢
b
P(G, H) = log hom G, H (41)
n n

31
where n is the number of nodes in G and n1 G is obtained from G by multiplying the edge weights
of G by n1 .
To discuss the convergence of the free energy, we need the notion of the entropy of fractional
partitions. Let ρ = (ρ1 , . . . , ρq ) be a fractional partition of a finite set V . Then the entropy of ρ
is defined as
q
1 XX
H(ρ) = − ρi (u) log ρi (u).
|V | i=1
u∈V

Theorem 6.5 [16] Let q be a positive integer, let H be a softcore weighted graph, and let (Gn )
b n , H) is a convergent sequence.
be a convergent sequence of simple graphs. Then P(G
If Gn converges to a function W ∈ W0 , the limiting free energy can again be expressed as
an explicit function of W . To this end, let us define the entropy of a fractional q-partition ρ of
[0, 1] as
Z 1X q
H(ρ) = − ρi (x) log ρi (x) dx,
0 i=1

and the entropy of the H colorings of the limit graph W as

³ X Z 1X
Z ´
P(W, H) = max H(ρ) + hi ρi (x) dx + Bij W (x, y)ρi (x)ρj (y) dx dy , (42)
ρ
i [0,1] 2 i,j [0,1]2

where the maximum goes over all fractional q-partitions of [0, 1], hi = log αi (H) and Bij =
log βij (H).

Theorem 6.6 [16] Let (Gn ) be a convergent sequence of simple graphs, and let W ∈ W0 be its
limit. Let H be a weighted softcore graph. Then
1 1
lim log hom( Gn , H) = P(W, H).
n→∞ n n
This theorem can be proved using Proposition 5.10 and its Supplement 5.11, in a way similar
to (but more involved than) the proof of Theorem 6.3.

6.3 Factor graphs

Let G be a weighted graph and let P = (V1 , . . . , Vq ) be a partition of V (G). The factor graph
(or briefly factor) H(G, P) is the weighted graph on [q] with nodeweights
P
α(G[Vi ]) αu (G)
αi (H(G, P)) = = u∈Vi ,
α(G) α(G))
and edges weights P
u∈Vi ,v∈Vj αu (G)αv (G)βuv (G)
βij (H(G, P)) = .
α(G[Vi ])α(G[Vj ])
Note that H(G, P) is invariant under scaling the nodeweights of G.
In the special case when G is unweighted, this definition specializes to
|Vi |
αi (H(G, P)) = ,
|V (G)|
and
eG (Vi , Vj )
βij (H(G, P)) = .
|Vi | · |Vj |

32
Here eG (Vi , Vj ) denotes the number of edges uv ∈ E(G) with u ∈ Vi and v ∈ Vj ; note that we
allow i = j, in this case e(Vi , Vi ) is twice the number of edges spanned by Vi .
We denote by Sbq (G) the set of factors of G with q nodes. Note that by our definition the
factors are labeled graphs, but since permuting the nodes of a factor also gives a factor, we would
not loose information by forgetting the labeling. We can consider Sbq (G) as a subset of Rq×(q+1) .
We extend these definitions to functions U ∈ W0 . Let P be a q-partition of [0, 1]; we then
define a weighted graph H(U, P) on V (H(U, P)) = [q], where node i has weight
αi (H(U, P)) = λ(Vi ),
and edge ij has weight
Z
1
βij (H(U, P)) = U (x, y) dx dy.
λ(Vi )λ(Vj ) Vi ×Vj

(If λ(Vi )λ(Vj ) = 0, then we define βij (H(U, P)) = 0.) We call the graph H(U, P) a factor of U ,
and use the symbol Sbq (U ) to denote the set of all factors of U with q nodes.
Note that the knowledge of the factors of G is enough to recover the ground state energies of
G. Indeed, in terms of the factors of G, the ground state energy defined in (38) can be expressed
as
Xq
E(G, B) = max ai aj Bij Xij , (43)
(a,X)∈Sbq (G) i,j=1

where a = (a1 , . . . , aq ) and X = (Xij )1≤i,j≤q .

6.4 Fractional factor graphs

The set Sbq (G) is typically a very large finite set, which makes it difficult to work with. It will
be convenient to introduce a fractional version of factors.
Let G be a weighted graph. For every fractional partition ρ = (ρ1 , . . . , ρq ) of V (G), we define
the fractional factor Gρ as the graph with nodeweights
P
u∈V (G) ρi (u)αu (G)
αi (Gρ ) = ,
α(G)
and edgeweights P
u,v∈V ρi (u)ρj (v)αu (G)αv (G)βuv (G)
βij (Gρ ) = .
αi (Gρ )αj (Gρ )
Let Sq (G) denote the set of all fractional factors of G with q nodes. Then Sq (G) is a closed set;
it is not convex in general.
We also extend these notions to functions. Let U ∈ W0 and let ρ = (ρ1 , . . . , ρq ) be a fractional
R1
partition of [0, 1]. Set αi (ρ) = 0 ρi (x) dx. Then we define
αi (Uρ ) = αi (ρ),
and Z
1
βij (Uρ ) = ρi (x)ρj (y)U (x, y) dx dy.
αi (ρ)αj (ρ) [0,1]2
Let Sq (U ) denote the set of all fractional factors of U with q nodes. For a weighted graph G
and a vector a ∈ Pdq , let Bba (G) denote the set of all weighted adjacency matrices of all factors
of G with nodeweights a1 , . . . , aq . So Sq (G) is the set of all pairs (a, B) with B ∈ Ba (G). The
sets Ba (G), Bba (U ) and Ba (U ) are defined analogously.
We clearly have that 0 ≤ βij (Uρ ) ≤ 1 whenever U ∈ W0 . Furthermore, it is not hard to see
that for U ∈ W0 , the set Sq (U ) is closed in the obvious topology of weighted labeled graphs on
q nodes.

33
Lemma 6.7 [16] For every U ∈ W0 , the set Sq (U ) is the closure of Sbq (U ).
(The two sets are not equal in general.)
For every weighted graph G, Sq (G) is again a closed connected set. Obviously, Sq (G) contains
b
Sq (G), but it is not its closure in general (since the latter is a finite set). It is not hard to see
that for every weighted graph G,

Sq (G) = Sq (WG ) = Sbq (WG ). (44)

Clearly Sbq (G) is a finite subset of these infinite sets. But it can be shown that it is not much
smaller:

Lemma 6.8 [16] If c is the largest nodeweight in G, then for every H ∈ Sq (G) there is an
√
H 0 ∈ Sbq (G) such that δ¤ (H, H 0 ) ≤ 4q 2 c.
Most of the time, we will work with the fractional versions, which are much easier to handle.

6.5 Microcanonical ground state energy, a.k.a. multiway cut problems

We have seen that convergence of the sequence (Gn ) implies convergence of the free energies
and ground state energies. But the converse does not hold. To get convergence of (Gn ) from
the convergence of the ground state energies, we will need a finer measure than ground state
energy, where the sizes of the partition classes are also taken into account (in physics terms, this
could be called the “microcanonical version” of the ground state energy). We will see that this
quantity also contains a number of frequently studied graph parameters.
For a finite set S, q ≥ 1 and a ∈ Pdq , let aS denote the set of all maps φ : S → [q] such that

|φ−1 (i) − ai |S|| ≤ 1 (45)

for every i ∈ [q]. (In other words, we prescribe the proportions of elements of S mapped onto
each i ∈ [q], as closely as possible.) For a simple graph G and a softcore graph H with node
weights one, we then introduce microcanonical homomorphism numbers
X
homa (G, H) = homφ (G, H).
φ∈aV (G)

The microcanonical ground state energies and free energies of the model B on a graph G with n
nodes are then defined as
1 X
Eba (G, B) = 2 max Bφ(i),φ(j) , (46)
n φ∈a V (G)
ij∈E(G)

and
¡ ¢
ba (G, B) = 1 log homa 1 G, H
P (47)
n n
respectively.
The ground state energy Êa contains a number of important graph parameters as special
cases. If µ ¶ µ ¶
α 1 0
q = 2, a = and B = (48)
1−α 0 0
then Ea (G, B) corresponds to the densest subgraph on α|V (G)| nodes. If
µ ¶ µ ¶
1/2 0 −1
q = 2, a = and B = (49)
1/2 −1 0

34
then −Ea (G, B) is the minimal bisection; if we replace here B by −B, then Ea (G, B) is the
maximal bisection. For q > 2, we get multiway cut problems in a similar way.
The free energy P̂a is a finer measure. For example, in the case (48), Pa (G, B) will pick
out the density of an induced subgraph that is not necessarily the maximum, but for which the
number of induced subgraphs with this density is large, at the cost of some loss in density.
The use of (45) is cumbersome and in some cases it leads to unpleasant discontinuities; it
will be much more convenient to work with a fractional version. We formulate our definition for
a weighted graph G. Then for all a ∈ Pdq and B ∈ Symq we define

1 X X
Ea (G, B) = max αu (G)αv (G)βuv (G) ρi (u)ρj (v)Bij , (50)
ρ 2
u,v∈V (G) i,j∈[q]

where ρ ranges over all fractional q-partitions of V (G) such that

X
αv (G)ρi (v) = ai .
u∈V (G)

We also extend the notion of microcanonical ground state energy to functions. For every W ∈
W0 , a ∈ Pdq , and B ∈ Symq , we define
q Z
1 X
Ea (W, B) = max Bij ρi (x)ρj (y)W (x, y) dx dy, (51)
ρ:αρ =a 2 [0,1]2
i,j=1

where ρ ranges over fractional q-partitions of [0, 1] with αi (ρ) = ai for all i. It is easy to see that

Ea (G, B) = Ea (WG , B).

Using a result from [2], it follows easily that for every simple graph G on n nodes and every
matrix B,
6q 3
|Eba (G, B) − Ea (G, B)| ≤ kBk∞ . (52)
n
The notion of the microcanonical free energy for functions is defined analogously: for every
W ∈ W0 , a ∈ Pdq , and B ∈ Symq , we define
³ Z ´
1X
Pa (W, B) = max H(ρ) + Bij W (x, y)ρi (x)ρj (y) dx dy , (53)
ρ:α(ρ)=a 2 i,j [0,1]2

where ρ ranges over fractional q-partitions of [0, 1] with αi (ρ) = ai for all i.

6.6 Convergence from the right

Our goal is to give a characterization of convergent graph sequences in terms of partition infor-
mation, specifically the sets Sq (G) and the functions Ea (G, B). Recall that for two sets A, B in
a metric space (X, d), the Hausdorff distance is defined as

dHf (A, B) = max(sup inf d(x, y), sup inf d(x, y)).
x∈A y∈B x∈B y∈A

It turns out that the following three types of information about two graphs G, G0 are equiv-
alent: (1) G and G0 are close in the δ¤ distance; (2) Sq (G) and Sq (G0 ) are close in the dHf
distance for every q up to a certain bound, and (3) |Ea (G, B) − Ea (G0 , B)| is small for all a ∈ Pdq
and B ∈ Symq for every q up to a certain bound.
The exact statement is the following:

35
Theorem 6.9 [16] (a) For two simple graphs G, G0 and a ∈ Pdq ,
dHf 0 0
¤ (Sq (G), Sq (G )) ≤ δ¤ (G, G )

(b) For two simple graphs G, G0 , a ∈ Pdq and B ∈ Symq ,

|Ea (G, B) − Ea (G0 , B)| ≤ q 2 δ¤

Hf
(Sq (G), Sq (G0 )).
(c) Let G, G0 be two simple graphs, and suppose that
ε2
|Ea (G, B) − Ea (G0 , B)| ≤ c3
q2
2
for all q ≤ 4 · 2c4 /ε , a ∈ Pdq and B ∈ Symq . Then δ¤ (G, G0 ) < ε.
Similar statements hold for the microcanonical free energy, see [16].

6.7 Summary: convergence criteria

The following theorem summarizes several equivalent conditions for a sequence of (dense) graphs
to be convergent:

Theorem 6.10 [16] Let (Gn ) be a sequence of simple graphs with |V (Gn )| → ∞. Then the
following are equivalent:
(a) For every simple graph F , t(F, Gn ) is convergent.
(b) The sequence (Gn ) is Cauchy in the δ¤ metric.
(c) For every q ≥ 1, the sequence Sq (Gn ) is Cauchy with respect to the Hausdorff metric dHf
1 .
(d) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , the sequence Ea (Gn , B) is a Cauchy sequence.
(e) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , the sequence P ba (Gn , B) is a Cauchy sequence.

We can also characterize convergent graph sequences in terms of Szemerédi partitions.

Supplement 6.11 [16] The following two conditions are also equivalent to conditions (a)–(e)
in Theorem 6.10:
(f) For every k ≥ 1 there is an nk ≥ 1 such √ that if n, m > nk , then Gn and Gm have weak
0
Szemerédi
√ k-partitions P and P with error 2/ log k such that d¤ (H(Gn , P), H(Gm , P 0 )) <
2/ log k.
(g) For every k ≥ 1 there is an nk ≥ 1 such that if n, m > nk , then Gn and Gm have strong
Szemerédi k-partitions P and P 0 with error 1/ log∗ k such that d¤ (H(Gn , P), H(Gm , P 0 )) <
1/ log∗ k.
A convergent sequence has a limit W ∈ W0 by Theorem 4.6. The conditions in Theorem
6.10 can be rephrased to characterize the convergence to this limit:

Theorem 6.12 [16] For a sequence (Gn ) of simple graphs with |V (Gn )| → ∞, and for any
W ∈ W0 , the following are equivalent:
(a) For every simple graph F , t(F, Gn ) → t(F, W ).
(b) δ¤ (Gn , W ) → 0.
(c) For every q ≥ 1, Sq (Gn ) → Sq (W ) in the Hausdorff metric dHf
1 .
(d) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , Ea (Gn , B) → Ea (W, B).
(e) For every q ≥ 1, a ∈ Pdq and B ∈ Symq , P ba (Gn , H) → Pa (W, B).

One can define Szemerédi partitions for the limit objects W ∈ W0 (see [45]), and then
formulate analogues of (f) and (g) in Supplement 6.11 describing the convergence to the limit.

36
7 Homomorphisms and extremal graph theory
7.1 Inequalities between homomorphism numbers
We have mentioned in the introduction that many results in extremal graph theory can be
expressed as algebraic inequalities between homomorphism densities. Every algebraic inequality
that holds for all finite graphs also holds for simple graph parameters in the closure T0 (and
of course vice versa). So for example, Goodman’s Theorem (1) is equivalent to saying that for
every simple graph parameter t ∈ T0 ,

t(K3 ) ≥ t(K2 )(2t(K2 ) − 1). (54)

By multiplicativity, every algebraic inequality is equivalent to a linear inequality. For example,

(1) is equivalent to
t(K3 ) ≥ 2t(K2 K2 ) − t(K2 ).
The positive semidefiniteness of the connection matrix (reflection positivity) implies linear and
nonlinear inequalities between the values of a simple graph parameters t ∈ T0 for “small” graphs.
In fact, Theorem 4.5 says that every (say, algebraic) inequality between the values of simple graph
parameters t ∈ T0 is a consequence of multiplicativity and reflection positivity.
Let us describe some of these derivations. Fix a simple graph parameter t ∈ T0 . If v = (vG )
is any real vector with finite support, indexed by k-labeled graphs, then

v T M (t, k)v ≥ 0 (55)

gives a linear inequality between the values t(F ).

Probably all linear inequalities can be obtained by taking nonnegative linear combinations
of inequalities (55). However, the (infinitely many) inequalities (55) and their consequences are
not so easy to understand, and we formulate some special inequalities.
Trivial linear inequalities are that if F1 is a subgraph of F2 (not necessarily on the same set
of nodes), then
t(F1 ) ≥ t(F2 ). (56)
For any two graphs F1 ⊆ F2 on the same set of nodes, the expression
X
(−1)|E(F2 )\E(F )| hom(F, G)
F1 ⊆F ⊆F2

counts (by inclusion-exclusion) the number of homomorphisms of F1 into G that map the edges
in E(F2 ) \ E(F1 ) onto non-adjacent pairs. This number is nonnegative, which implies that
X
(−1)|E(F2 )\E(F )| t(F, G) ≥ 0. (57)
F1 ⊆F ⊆F2

¡ ¢
If we fix V (F ) = V with |V | = k, then t(F ) can be considered as a setfunction on k2 elements.
Then (57) can be used to show that this setfunction is supermodular, i.e., it satisfies

t(F1 ∪ F2 ) + t(F1 ∩ F2 ) ≥ t(F1 ) ∩ t(F2 ). (58)

We leave it to the reader as an exercise to derive these inequalities from (55).

Semidefiniteness of the connection matrix can also be formulated in terms of nonlinear in-
equalities (nonnegativity of certain determinants). One of these is worth mentioning. Let G be
a k-labeled graph, then for t ∈ T0 , we have

t(GG) ≥ t(G)2 .

37
As special cases, we mention that for the path P3 on 3 nodes,

t(P3 ) ≥ t(K2 )2 (59)

and
t(C4 ) ≥ t(P3 )2 . (60)

7.2 Re-proving some results in extremal graph theory

Let us start with deriving (54). By (57),

t(K3 ) − 2t(P3 ) + t(K2 K1 ) ≥ 0,

Using multiplicativity, we have t(K2 + K1 ) = t(K2 )t(K1 ) = t(K2 ). Furthermore, by (59) we

have t(P3 ) ≥ t(K2 )2 . This implies (54).
We can derive the following theorem of Moon and Moser [52] in a similar way: Let G be a
graph with n nodes, and let Nr denote the number of complete subgraphs with r nodes. Then
µ ¶
Nr+1 1 Nr
≥ 2 r2 −n . (61)
Nr r −1 Nr−1

Using that Nr = t(Kr , G)nr /r!, this inequality can be expressed in the following simpler form:
For every t ∈ T0 ,
t(Kr ) t(Kr+1 )
r ≤ (r − 1) + 1. (62)
t(Kr−1 ) t(Kr )
This shows that (54) is a special case, and the derivation of (54) above can be extended.
The (simplest, asymptotic) case of the Kruskal-Katona theorem,

t(K3 )2 ≤ t(K2 )3 , (63)

also follows. One uses multiplicativity to write it as t(K3 )2 ≤ t(K2 )t(K2 K2 ); by monotonicity, it
suffices to prove the stronger inequality t(K3 )2 ≤ t(K2 )t(C4 ); which then follows by considering
the following submatrix of M2 : µ ¶
t(K2 ) t(K3 )
.
t(K3 ) t(C4 )

7.3 A 2-dimensional projection

Perhaps the most basic question about inequalities between homomorphism densities concerns
edges and triangles. What are the possible pairs (t(K2 , G), t(K3 , G))? If we disregard number
theoretic conditions like these numbers must be rational, we can ask: What are the possible
pairs (t(K2 ), t(K3 )), where t ∈ T0 ? In other words, what is the projection T00 of T0 onto the
2-dimensional plane determined by the density of edges and density of triangles?
The answer to this questions turns out quite complicated and is only partially solved. Figure
2 describes the set T00 . The upper boundary curve is given the edge-density is given by the
Kruskal-Katona Theorem (63):
x3 = y 2
(and this is tight for all densities). The lower bound is more complicated. Inequality (54) gives a
parabola that is a lower bound. However, this is only tight for special values of the edge-density:
t(K2 ) = 1 − 1/k, k = 1, 2, . . . . For these values, a Turán graph (complete k-partite graph with
equal color classes) gives equality. Bollobás [11] proved that in the intervals between these values,
the lower boundary is above the chord. A conjecture for the curve was formulated in [42]: An
extremal graph in the interval 1 − 1/k ≤ t(K2 ) ≤ 1 − 1/(k + 1) is a complete (k + 1)-partite

38
Kruskal-Katona

LL-Simonovits

Goodman

0 1/2 2/3 3/4 1

Bollobás
Fisher

Figure 2: The region of possible edge densities and triangle densities. (The concave arcs are
distorted to make the qualitative properties more visible.)

graph with k − 1 equal color classes. The size of the two special color classes must be optimized
to make sure that the density of triangles is minimized, subject to the given edge density. For
the interval 1/2 ≤ t(K2 ) ≤ 2/3, the optimization yields the cubic curve
9y 2 − 18xy + 8x − 3x2 + 6x3 = 0.
This case was proved by Fisher [24]; a recent proof by Razborov [54] uses methods quite closely
related to those described in this paper. The conjecture was proved in [42] if t(K2 ) was in a
small neighborhood of any one of the special values 1 − 1/k. The general case is open.
One should add that this conjecture is all that is needed to describe T00 : it is easy to see that
between two points of T00 the plane on a vertical line, the whole interval is contained in T00 .

8 Graphs with bounded degree

Fix a positive integer D, and consider (as the middle graph G in (3) only graphs in which all
degrees are at most D. The normalization t(F, G) we used before does not make sense any more,
since (if G is large) the probability that a random mapping from a given F is a homomorphism
is very small.
How to normalize hom(F, G) in this case? Let n = |V (G)| and k = |V (F )|. If F is not con-
nected, then hom(F, G) is just the product of the numbers hom(F 0 , G), where F 0 is a connected
component of F , so we may restrict our attention to the case when F is connected; then there
is the following rather obvious upper bound:
hom(F, G) ≤ n · Dk−1 .
So it makes sense to consider
hom(F, G)
τ (F, G) = (64)
n
(Since we consider D and F fixed, it does not seem to help to divide by Dk−1 .)

39
8.1 Convergence for graphs with bounded degree
Let (G1 , G2 , . . . ) be a sequence of graphs whose degrees are uniformly bounded by D. We say
that this sequence is locally convergent, or just convergent, if τ (F, Gn ) tends to a limit for every
connected graph F .
There is another way to define this. Given a graph G with all degrees bounded by D, and
a positive integer r, let SG (v, r) denote the neighborhood of node v with radius r. We consider
SG (v, r) as a rooted graph (where v is its root). For fixed D and r, there is a finite number of
possible neighborhoods, and we can make a statistic of these: we denote by νG (N, r) the fraction
of nodes v ∈ V (G) for which SG (v, r) ∼ = N . Then νG is a probability distribution on all possible
r-neighborhoods.
For bounded degrees, it is easy to see that the probability distributions νGn (·, r) tend to
a limit distribution for all r > 0 if and only if Gn converges in the sense defined above. The
convergence of the probability distributions νGn (·, r) can therefore be used as an alternative
characterization of convergent sequences (G1 , G2 , . . . ) of graphs with degrees bounded by D.
This notion of convergence was introduced implicitly by Aldous [1], and explicitly by Ben-
jamini and Schramm [17]. It was extended to the case of bounded average degree by Lyons
[49]. The limit object has several descriptions, the strongest is due to Elek [21]: A continuous
graphing is an infinite graph on [0, 1] that has the following structure: we take a finite number
of continuous measure preserving involutions φ1 , . . . , φN : [0, 1] → [0, 1], and connect every
x ∈ [0, 1] to every φi (x) (i = 1, . . . , N ) such that x 6= φi (x) by an edge.
Every graphing defines a probability distribution on countable graphs with bounded degree
with a specified root: we pick a uniform random point x in [0, 1], and consider the connected
component of the graphing containing x, with root x. This yields the description of the limit by
Benjamini and Schramm.

8.2 Left and right convergence

The theory of convergence of graphs with bounded degree is much less satisfactory than the
analogous theory of dense graphs described above. In particular, the “right” notion of distance
and a powerful analogue of Szemerédi’s Lemma are missing. But there are some nontrivial facts,
relating homomorphisms into a large graph to homomorphisms from it, which can be thought of
as analogues of some results in Section 6.
Let us start with an example.

Example 8.1 Consider the sequence of cycles Cn . It is trivial that this is convergent, but the
numbers hom(Cn , K2 ) alternate between 0 and 2.
But in a sense parity is all that goes wrong. Let us consider the subsequence C2n of even
cycles. Then for any graph H,
Xk
hom(C2n , H) = λ2n
i ,
i=1

where λ1 ≥ · · · ≥ λk are the eigenvalues of H. Hence

hom(C2n , H)1/(2n) → λ1 .
1
It is perhaps more usual to take the logarithm here, to get the sequence 2n log hom(C2n , H) (in
statistical physics, this parameter is called the free energy or pressure of the H-colorings of C2n ).
This sequence is convergent for every H, and the limiting parameter is log λ1 (H).
The sequence of odd cycles behaves similarly, but the limit parameter a(H) =
1
lim 2n+1 log hom(C2n+1 , H) is a bit more complicated to describe: a(H) is the logarithm of
largest eigenvalue of its non-bipartite components (a(H) = −∞ if H is bipartite).

40
Based on this example, let us consider a convergent sequence (Gn ) of graphs with bounded
degree, and ask for which graphs H the numbers

b n , H) = 1
P(G log hom(Gn , H)
|V (Gn )|
converge.
Before discussing this further, let us rewrite the right hand side using Möbius inversion. Let
X
ψ(G, H) = (−1)|V (G)|−|V | log hom(G[V ], H).
V ⊂V (G)

Then
X X ind(F, G)
log hom(G, H) = ψ(G[V ], H) = ψ(F, H), (65)
|V (F )|!
V ⊂V (G) F

where the sum goes over all finite simple graphs F and ind(F, G) is the number of embedding of
F into G as an induced subgraph. We thus have shown that
X τind (F, G)
b
P(G, H) = ψ(F, H) (66)
|V (F )|!
F

where
1
τind (F, G) = ind(F, G). (67)
|V (G)|
Recall that the numbers ind(F, G) can be obtained from the homomorphism numbers hom(F, G)
by Möbius inversion (see (14) and (15)). If Gn is convergent in the sense that the normalized
homomorphism numbers τ (F, Gn ) are convergent for all F , then the numbers τind (F, Gn ) are
convergent as well; let τind (F, Gn ) → τ (F ). One might therefore hope that convergence of the
sequence Gn implies convergence of the free energies P(G b n , H), with the limit given by the
infinite sum
X τ (F )
P(τ, H) = ψ(F, H). (68)
|V (F )|!
F
It is clear, however, that this cannot be true in general; an easy counterexample is the example
Gn = Cn discussed above. But it turns out that we can prove the convergence under suitable
conditions on H and the maximal degree D of the graphs Gn .

Theorem 8.2 [15] If (Gn ) is a sequence of graphs with all degrees bounded by a constant D,
b n , H) has a limit for every unweighted target graph H in which
and (Gn ) is convergent,¡ then P(G
1
¢
all degrees are at least 1 − 2D |V (H)|.
The conceptually most transparent proof of the theorem (with a slightly worse constant)
proceeds by proving uniform convergence of the expansion (68) using the method of cluster
expansions. As stated, the theorem can be proven using Dobrushin’s uniqueness theorem for
uniform H-colorings on Gn .
The cluster expansion proof also gives an analogue of Theorem 8.2 for weighted graphs (see
[15]) and allows to prove that convergence from the right for weighted graphs implies convergence
from the left. In fact, we only need convergence from the right for graphs H that are small
perturbations of completely looped complete graphs with edge weights 1 to conclude that Gn is
convergent from the left.

Theorem 8.3 [15] If (Gn ) is a sequence of graphs with all degrees bounded by a constant D,
and P(Gn , H) is convergent for every weighted graph H which is a looped complete graph with
all edgeweights arbitrarily close to 1, then (Gn ) is convergent.

41
References
[1] D.J. Aldous: Tree-valued Markov chains and Poisson-Galton-Watson distributions, in: Mi-
crosurveys in Discrete Probability (D. Aldous and J. Propp, editors), DIMACS Ser. Discrete
Math. Theoret. Comput. Sci. 41 (1998) Amer. Math. Soc., Providence, RI. (1998), 1–20.

[2] N. Alon, W. Fernandez de la Vega, R. Kannan and M. Karpinski: Random sampling and
approximation of MAX-CSPs, J. Comput. System Sci. 67 (2003), 212–243.
[3] N. Alon and A. Naor: Approximating the Cut-Norm via Grothendieck’s Inequality, preprint:
http://research.microsoft.com/research/theory/naor/homepage%20files/
cutnorm.pdf

[4] N. Alon and A. Shapira, Every monotone graph property is testable, Proc. of the 37th ACM
STOC, Baltimore, ACM Press (2005), to appear:
http://www.math.tau.ac.il/∼nogaa/PDFS/MonotoneSTOC.pdf
[5] E. Babson and D. Kozlov: Complexes of graph homomorphisms, arXive:
http://lanl.arxiv.org/abs/math.CO/0310056
[6] E. Babson and D. Kozlov: Proof of the Lovász Conjecture, arXive:
http://lanl.arxiv.org/abs/math.CO/0402395
[7] A. Barabasi and R. Albert: Emergence of scaling in random networks, Science 286 (1999)
509–512.
[8] B. Bollobás and O. Riordan: Mathematical results on scale-free random graphs, in: Hand-
book of graphs and networks, 1–34, Wiley-VCH, Weinheim, 2003.

[9] M. Biskup, C. Borgs, J.T. Chayes, L. Kleinwaks and R. Kotecký: Partition function zeros at
first-order phase transitions: A general analysis, Commun. Math. Phys. 251 (2004) 79–131.

[10] B. Bollobás, O.M. Riordan, J. Spencer, and G. Tusnády: The degree sequence of a scale-free
random graph process, Random Structures and Algorithms 18 (2001), 279–290.

[11] B. Bollobás: Relations between sets of complete subgraphs, in: Combinatorics, Proc. 5th
British Comb. Conf. (ed. C.St.J.A. Nash-Williams, J. Sheehan), Utilitas Math. (1975), 79–
84.
[12] B. Bollobás, C. Borgs, J. T. Chayes, and O. Riordan: Directed scale-free graphs, in: Pro-
ceedings of the 14th ACM-SIAM Symposium on Discrete Algorithms, 132–139, 2003.
[13] C. Borgs: Statistical Physics Expansion Methods for Combinatorics and Computer Science,
CBMS lecture notes (in preparation).
[14] C. Borgs and J. Chayes and L. Lovász: Unique limits of dense graph sequences (in prepa-
ration).
[15] C. Borgs, J. Chayes, J. Kahn, L. Lovász and V.T. Sós: Convergent sequences for sparse
graphs (in preparation).
[16] C. Borgs, J. Chayes, L. Lovász, V.T. Sós and K. Vesztergombi: Convergent sequences of
dense graphs (in preparation).
[17] I. Benjamini and O. Schramm: Recurrence of Distributional Limits of Finite Planar Graphs,
Electronic J. Probab. 6 (2001), no. 23, 1–13.

42
[18] M.-D. Choi: Tricks or Treats with the Hilbert Matrix, Amer. Math. Monthly 90 (1983),
301–312.
[19] F. Chung, R.L. Graham and R.M. Wilson: Quasi-random graphs, Combinatorica 9 (1989),
345–362.

[20] C. Cooper and A. Frieze: On a general model of web graphs, Rand. Struct. Alg. 22 (2003)
311–335.

[21] G. Elek: Graphings and graph sequences (preprint).

[22] K. Drühl and H. Wagner: Algebraic formulation of duality transforms for abelian lattice
models, Anals of Physics 141 (1982), 225–253.
[23] P. Erdös and L. Lovász, J. Spencer: Strong independence of graphcopy functions, in: Graph
Theory and Related Topics, Academic Press (1979), 165-172.
[24] D.C. Fisher: Lower bounds on the number of triangles in a graph, J. Graph Theory 13
(1989), 505–512.
[25] D.C. Fisher and J. Ryan: Conjectures on the number of complete subgraphs, in: Proc. of
the 20-th Southeastern Conf. on Comb., Graph Theory, and Computing, Congr. Numer. 70
(1990), 217–219.
[26] D.C. Fisher and A. Solow: Dependence polynomials, Discrete Math. 82 (1990), 251–258.

[27] M. Freedman and L. Lovász, D. Welsh (unpublished).

[28] M. Freedman, L. Lovász and A. Schrijver: Reflection positivity, rank connectivity, and
homomorphism of graphs (MSR Tech Report # MSR-TR-2004-41).
ftp://ftp.research.microsoft.com/pub/tr/TR-2004-41.pdf
[29] A. Frieze and R. Kannan: Quick approximation to matrices and applications, Combinatorica
19 (1999), 175–220.
[30] A.W. Goodman: On sets of aquaintences and strangers at any party, Amer. Math. Monthly
66 (1959) 778–783.
[31] O. Goldreich, S. Goldwasser and D. Ron: Property testing and its connection to learning
and approximation, J. ACM 45 (1998), 653–750.
[32] W.T. Gowers: Lower bounds of tower type for Szemerédi’s Uniformity Lemma, Geom. Func.
Anal. 7 (1997), 322–337.
[33] P. de la Harpe and V.F.R. Jones: Graph Invariants Related to Statistical Mechanical Mod-
els: Examples and Problems, Journal of Combinatorial Theory B 57 (1993), 207–227.

[34] P. Hell and J. Nešetřil: Graphs and Homomorphisms, Oxford University Press, 2004.
[35] J. Komlós and M. Simonovits: Szemerédi’s Regularity Lemma and its applications in graph
theory, in: Combinatorics, Paul Erdos is Eighty (D. Miklos et. al, eds.), Bolyai Society
Mathematical Studies 2 (1996), pp. 295–352.
[36] T.D. Lee and C.N. Yang, Statistical theory of equations of state and phase transitions: II.
Lattice gas and Ising model, Phys. Rev. 87 (1952) 410–419.
[37] L. Lovász: Operations with structures, Acta Math. Hung. 18 (1967), 321-328.

43
[38] L. Lovász: Direct product in locally finite categories, Acta Sci. Math. Szeged 23 (1972),
319-322.
[39] L. Lovász: The rank of connection matrices and the dimension of graph algebras, Europ.
J. Combin., to appear:
http://research.microsoft.com/∼lovasz/homdim.pdf
[40] L. Lovász and A. Schrijver: Graph parameters and semigroup functions (manuscript).
[41] L. Lovász and M. Simonovits: On the number of complete subgraphs of a graph (M. Si-
monovits), in: Combinatorics, Proc. 5th British Comb. Conf. (ed. C.St.J.A.Nash-Williams,
J.Sheehan), Utilitas Math. (1976) , 439-441.
[42] L. Lovász and M. Simonovits: On the number of complete subgraphs of a graph II, in:
Studies in Pure Math., To the memory of P. Turán (ed. P. Erdös), Akadémiai Kiadó (1983),
459-495.
[43] L. Lovász and V.T. Sós: Generalized quasirandom graphs, preprint:
http://research.microsoft.com/∼lovasz/quasirandom4.pdf

[44] L. Lovász and B. Szegedy: Limits of dense graph sequences, Microsoft Research Technical
Report MSR-TR-2004-79:
ftp://ftp.research.microsoft.com/pub/tr/TR-2004-79.pdf

[45] L. Lovász and B. Szegedy: Szemerédi’s Lemma for the analyst, Microsoft Research Technical
Report MSR-TR-2005-90:
ftp://ftp.research.microsoft.com/pub/tr/TR-2005-90.pdf
[46] L. Lovász and B. Szegedy: Contractors and connectors of graph algebras Microsoft Research
Technical Report TR-2005-91:
ftp://ftp.research.microsoft.com/pub/tr/TR-2005-91.pdf
[47] L. Lovász and B. Szegedy: Graph limits and testing hereditary graph properties, Microsoft
Research Technical Report MSR-TR-2005-110:
ftp://ftp.research.microsoft.com/pub/tr/TR-2005-110.pdf
[48] L. Lovász and B. Szegedy: Moments of 2-variable functions (in preparation).
[49] R. Lyons: Asymptotic Enumeration of Spanning Trees, preprint:
http://mypage.iu.edu/∼rdlyons/pdf/est.pdf
[50] H. Mahmound, R. Smythe and J. Szymański: On the structure of plane-oriented trees and
their branches. Random Struct. Alg. 3 (1993) 255–266.

[51] J. Matoušek: Using the Borsuk-Ulam Theorem: Lectures on Topological Methods in Com-
binatorics and Geometry Springer, 2003.
[52] J.W. Moon, L. Moser: On a problem of Turn, Magyar Tud. Akad. Mat. Kutat Int. Kzl. 7
(1962), 283–286.
[53] D. J. de S. Price: A general theory of bibliometric and other cumulative advantage processes.
J. Amer. Soc. Inform. Sci., 27 (1976) 292–306.
[54] A. Razborov (unpublished).

44
View publication stats

[55] M. Simonovits and V.T. Sós: Szemerdi’s partition and quasirandomness, Random Structures
Algorithms 2 (1991), 1–10.
[56] M. Simonovits and V.T. Sós: Hereditary extended properties, quasi-random graphs and
induced subgraphs, Combinatorics, Probability and Computing 12 (2003), 319–344.

[57] M. Simonovits and V.T. Sós: Hereditarily extended properties, quasi-random graphs and
not necessarily induced subgraphs. Combinatorica 17 (1997), 577–596.

[58] B. Szegedy: Edge models and reflection positivity, preprint:

http://arxiv.org/abs/math.CO/0505035
[59] A. Thomason: Pseudorandom graphs, in: Random graphs ’85 North-Holland Math. Stud.
144, North-Holland, Amsterdam, 1987, 307–331.
[60] C.N. Yang and T.D. Lee: Statistical theory of equations of state and phase transitions:
I. Theory of condensation, Phys. Rev. 87 (1952), 404–409.
[61] G. U. Yule: A mathematical theory of evolution, based on the conclusions of Dr. J. C.
Willis. Philos. Trans. Roy. Soc. London, Ser. B 213 (1924) 21–87.
[62] H. Whitney: The coloring of graphs, Ann. of Math. 33 (1932), 688–718.

AMS Subject Classification: Primary 05C99, Secondary 05C35, 05C80, 82B99

Keywords: Graph homomorphism, partition function, extremal graphs, Szemerédi Regularity
Lemma, connection matrix, convergent graph sequence, limits of graphs.

Energymetabolism Chinchilla
100% (4)
Energymetabolism Chinchilla
7 pages
2015 Turbine Day-Final
100% (4)
2015 Turbine Day-Final
217 pages
Licensure Examination For Teachers Reviewer (Part 1)
100% (1)
Licensure Examination For Teachers Reviewer (Part 1)
11 pages
RTU Specification for SCADA Systems
100% (1)
RTU Specification for SCADA Systems
18 pages
Earn Money Typing Online
100% (3)
Earn Money Typing Online
37 pages
Design of Radial Gate Using Rectangular 2
100% (1)
Design of Radial Gate Using Rectangular 2
55 pages
Enderlein and Pleomorphism
No ratings yet
Enderlein and Pleomorphism
1 page
List of Is Codes: SL No.1 To SL No.7 Are Design of Gates
No ratings yet
List of Is Codes: SL No.1 To SL No.7 Are Design of Gates
3 pages
Reddy J.N. - Solution Manual For Introduction To Finite Element Analysis-MGH (2005)
No ratings yet
Reddy J.N. - Solution Manual For Introduction To Finite Element Analysis-MGH (2005)
13 pages
MLGS Ii
No ratings yet
MLGS Ii
505 pages
FIITJEE Admission Test Broucher
No ratings yet
FIITJEE Admission Test Broucher
76 pages
Chapter 1.
No ratings yet
Chapter 1.
6 pages
On Not Teaching Greek
100% (1)
On Not Teaching Greek
7 pages
Domination in SVNIG
No ratings yet
Domination in SVNIG
12 pages
SOW102-Doing Social Research, 2nd Edition-Therese Baker-1994 - (Learnclax - Com) - Pages-200-235
No ratings yet
SOW102-Doing Social Research, 2nd Edition-Therese Baker-1994 - (Learnclax - Com) - Pages-200-235
36 pages
All RE
100% (3)
All RE
98 pages
Cadenas, Bandas y Piñones
No ratings yet
Cadenas, Bandas y Piñones
0 pages
2016 KDD AsymmetricTransitivityPreservingGraphEmbedding
No ratings yet
2016 KDD AsymmetricTransitivityPreservingGraphEmbedding
11 pages
On Distinguishing Graphs and Cost Number Using Automorphism Representations
No ratings yet
On Distinguishing Graphs and Cost Number Using Automorphism Representations
23 pages
Inventory Management and Control System
No ratings yet
Inventory Management and Control System
88 pages
Graph Theory and Decomposition
No ratings yet
Graph Theory and Decomposition
49 pages
Split System Air Conditioners Manual
No ratings yet
Split System Air Conditioners Manual
20 pages
Frogs of Borneo
No ratings yet
Frogs of Borneo
2 pages
Large Networks and Graph Limits
100% (2)
Large Networks and Graph Limits
487 pages
DASC16 Windfieldestimationthroughautonomousquadcopteravionics XIANG
No ratings yet
DASC16 Windfieldestimationthroughautonomousquadcopteravionics XIANG
7 pages
Manufacturing's Cyber-Physical Shift
No ratings yet
Manufacturing's Cyber-Physical Shift
29 pages
2010-Facilitating Preservice Teachers Development of T
No ratings yet
2010-Facilitating Preservice Teachers Development of T
246 pages
Amatrixofcommutinggraphsofnon Abeliangroups
No ratings yet
Amatrixofcommutinggraphsofnon Abeliangroups
18 pages
Graph Cover Algorithms for Experts
No ratings yet
Graph Cover Algorithms for Experts
50 pages
7717 Full
No ratings yet
7717 Full
6 pages
Implications of Future Price Trends and Interannual Resource Uncertainty On Firm Solar Power Delivery With Photovoltaic Overbuilding and Battery Storage-3
No ratings yet
Implications of Future Price Trends and Interannual Resource Uncertainty On Firm Solar Power Delivery With Photovoltaic Overbuilding and Battery Storage-3
14 pages
Galois Module Structure of Some Elementary P-Abeli
No ratings yet
Galois Module Structure of Some Elementary P-Abeli
15 pages
Flip T FINAL-Revised-V15
No ratings yet
Flip T FINAL-Revised-V15
21 pages
Distributed-Lag Models: Dynamic Effects of Temporary and Permanent Changes
No ratings yet
Distributed-Lag Models: Dynamic Effects of Temporary and Permanent Changes
20 pages
195 2017 Jocaaa
No ratings yet
195 2017 Jocaaa
10 pages
CHE 1000-E LEARNING - BALANCING REDOX REACTIONS
No ratings yet
CHE 1000-E LEARNING - BALANCING REDOX REACTIONS
17 pages
BishopIndependence 20
No ratings yet
BishopIndependence 20
16 pages
Dimensionality Reduction by Learning An Invariant Mapping
No ratings yet
Dimensionality Reduction by Learning An Invariant Mapping
9 pages
Meteorological Instruments: MODEL 85000
No ratings yet
Meteorological Instruments: MODEL 85000
16 pages
Create Gantt Chart and Cash Flow Using Excel With A File
No ratings yet
Create Gantt Chart and Cash Flow Using Excel With A File
6 pages
Auer
No ratings yet
Auer
13 pages
Lozano Fernandez Etal 2019 GBE
No ratings yet
Lozano Fernandez Etal 2019 GBE
17 pages
Particle-In-Cell Simulation Codes in High Performa
No ratings yet
Particle-In-Cell Simulation Codes in High Performa
24 pages
ST62T00CM6 TR
No ratings yet
ST62T00CM6 TR
100 pages
242 248V9N2CT
No ratings yet
242 248V9N2CT
8 pages
Fox 2020
No ratings yet
Fox 2020
15 pages
Quantum Algorithm
No ratings yet
Quantum Algorithm
13 pages
Quenching TMS - 2007 LK
No ratings yet
Quenching TMS - 2007 LK
7 pages
Recognizing Building Groups For Generalization A Comparative Study
No ratings yet
Recognizing Building Groups For Generalization A Comparative Study
19 pages
NL Comparisonofpermeabilitytestingmethods399-402
No ratings yet
NL Comparisonofpermeabilitytestingmethods399-402
5 pages
2013 K.D. Dearn
No ratings yet
2013 K.D. Dearn
12 pages
Harrington 1 Ton Hand Chain Hoist OM Manual
No ratings yet
Harrington 1 Ton Hand Chain Hoist OM Manual
55 pages
Chen Et Al 2009
No ratings yet
Chen Et Al 2009
7 pages
A New Model For Learning in Graph Domains
No ratings yet
A New Model For Learning in Graph Domains
7 pages
5013 Authentication and Authorization Orchestrator For Microservice Based Software Architectures v2
No ratings yet
5013 Authentication and Authorization Orchestrator For Microservice Based Software Architectures v2
6 pages
Gibbs Helmholtz ChemTex
No ratings yet
Gibbs Helmholtz ChemTex
4 pages
Database Programming With PL/SQL 2-3: Practice Activities: Recognizing Data Types
No ratings yet
Database Programming With PL/SQL 2-3: Practice Activities: Recognizing Data Types
3 pages
Wind and Sheet Erosion Investigations in Sample Areas in The Nyírség
No ratings yet
Wind and Sheet Erosion Investigations in Sample Areas in The Nyírség
8 pages
ChemCatChem Earlyview
No ratings yet
ChemCatChem Earlyview
9 pages
2016JOM LVAE Neural Model 10.1007 s11837-016-2043-6
No ratings yet
2016JOM LVAE Neural Model 10.1007 s11837-016-2043-6
14 pages
Mechanical Propertiesof AISI1045 Steel Subjectedto Combined Loadsof Tensionand Torsion
No ratings yet
Mechanical Propertiesof AISI1045 Steel Subjectedto Combined Loadsof Tensionand Torsion
15 pages
Mitrovic AIMS 2024b
No ratings yet
Mitrovic AIMS 2024b
19 pages
Applied Surface Science 255
No ratings yet
Applied Surface Science 255
9 pages
Georganti Single Channel Distance Estimation
No ratings yet
Georganti Single Channel Distance Estimation
12 pages
Urban Planning with Cellular Data
No ratings yet
Urban Planning with Cellular Data
9 pages
Domination Integrity and Efficient Fuzzy Graphs: Neural Computing and Applications July 2020
No ratings yet
Domination Integrity and Efficient Fuzzy Graphs: Neural Computing and Applications July 2020
12 pages
Scientific Papers Series B. Horticulture
No ratings yet
Scientific Papers Series B. Horticulture
10 pages
Barnoskyetal 2011
No ratings yet
Barnoskyetal 2011
8 pages
Enter Sandbox Android Sandbox Comparison
No ratings yet
Enter Sandbox Android Sandbox Comparison
11 pages
Mengjiao Guo Thesis PDF
No ratings yet
Mengjiao Guo Thesis PDF
152 pages
Capillary Pressure Experiments With Single Drops and Bubbles
No ratings yet
Capillary Pressure Experiments With Single Drops and Bubbles
43 pages
ML Reconstructs Coronary Artery Stress
No ratings yet
ML Reconstructs Coronary Artery Stress
20 pages
High-Order Method for Curved Domains
No ratings yet
High-Order Method for Curved Domains
48 pages
Shimshekgenesisi CRE2002
No ratings yet
Shimshekgenesisi CRE2002
9 pages
Designing A Connectivist Flipped Classroom Platform Using Unified Modeling Language
No ratings yet
Designing A Connectivist Flipped Classroom Platform Using Unified Modeling Language
19 pages
Multi-Class Generative Adversarial Networks With T
No ratings yet
Multi-Class Generative Adversarial Networks With T
16 pages
Cross-Cultural Variations in Consumer Behavior
No ratings yet
Cross-Cultural Variations in Consumer Behavior
24 pages
Approximate Nearest Neighbour Search With The Fukunaga and Narendra Algorithm and Its Application To Chromosome Classification
No ratings yet
Approximate Nearest Neighbour Search With The Fukunaga and Narendra Algorithm and Its Application To Chromosome Classification
8 pages
OB - Product Design - Eng
No ratings yet
OB - Product Design - Eng
29 pages
National Cultural Policy
No ratings yet
National Cultural Policy
58 pages
CHAMELEON A Hierarchical Clustering Algorithm Usin
No ratings yet
CHAMELEON A Hierarchical Clustering Algorithm Usin
23 pages
Antimicrobial Nano-Silver Non-Woven Polyethylene Terephthalate Fabric Via An Atmospheric Pressure Plasma Deposition
No ratings yet
Antimicrobial Nano-Silver Non-Woven Polyethylene Terephthalate Fabric Via An Atmospheric Pressure Plasma Deposition
11 pages
Pythagorean Neutrosophic Planar Graphs With An Application in Decision-Making
No ratings yet
Pythagorean Neutrosophic Planar Graphs With An Application in Decision-Making
20 pages
Experiment Explanation - Grade 7
No ratings yet
Experiment Explanation - Grade 7
5 pages
FPA-21 PG 70 ABV
No ratings yet
FPA-21 PG 70 ABV
1 page
Laplace Transform Homotopy Perturbation Method For The Approximation of Variational Problems PDF
No ratings yet
Laplace Transform Homotopy Perturbation Method For The Approximation of Variational Problems PDF
34 pages
Python Datatypes
No ratings yet
Python Datatypes
6 pages
Module 11
No ratings yet
Module 11
5 pages
Melan PDF
No ratings yet
Melan PDF
31 pages