arXiv:1704.03913v2 [cs.SI] 5 Jan 2018 n oeigcmlxpyia,sca,informational, [ social, systems physical, biological and complex modeling and eaoi ewrsaeognzdb esl connected e.g., densely by system, [ organized modules larger are a networks within metabolic connected operating highly [ units from future arises functional the clustering in local collaborate cases, to other likely collab- [ more mutual other same are a each orator the with cite on scientists to networks, be likely co-authorship to more likely hence more and in topic are appearing publication references same two the networks, citation to as in likely known works: more process indi- are a two closure friend themselves, where common friends triangles a become of share formation ap- who the clusters viduals networks, evolutionary to social local due in by pear example, explained For ap- [ to be processes. cliques edges can of or clustering tendency clusters such the small is through- in domains networks pear these of of trait all recurring out a sparse, typically l [ gle offiin esrstepoaiiyo rai lsr in closure triadic of probability the measures coefficient h lseigceceti enda h rcinof fraction the triangles. as of defined or terms paths, is in length-2 coefficient cluster network clustering a The of edges which ∗ ‡ † [email protected] [email protected] [email protected] ewrsaeafnaetlto o understanding for tool fundamental a are Networks The 3 , 8 [ lseigcoefficient clustering (Fig. ] nttt o opttoa n ahmtclEngineering Mathematical and Computational for Institute 2 , 7 ]. 4 .Smlrtidccoue cu nohrnet- other in occur closures triadic Similar ]. h tutr fra-ol ewrsfo eea domains. several from networks clusteri real-world higher-order of use structure we Finally, the models. c clustering graph comp higher-order random o of about generalization edges properties natural several the a derive how are proba of coefficients view closure clustering comprehensive order the more measure a well net provide not that the and is coefficients in structures clustering network netw triangle order higher-order complex a such understanding induces to to c i.e., respect crucial clustering are closed, the triangles is by beyond path quantified typically length-2 is a clustering the of udmna rpryo ope ewrsi h ednyf tendency the is networks complex of property fundamental A 1 row , .INTRODUCTION I. wedges optrSineDprmn,Safr nvriy Stanfo University, Stanford Department, Science Computer eateto optrSine onl nvriy Ithac University, Cornell Science, Computer of Department C 2 1 .I te od,teclustering the words, other In ). .Atog uhntok are networks such Although ]. htaecoe ihatrian- a with closed are that , unie h xetto extent the quantifies ihrodrcutrn nnetworks in clustering Higher-order 2 , 3 .I aycases, many In ]. Dtd aur ,2018) 8, January (Dated: 5 utnR Benson R. Austin n in and ] triadic ueLeskovec Jure 6 .In ]. a Yin Hao fsc ihrodrsrcue a o enwl under- well been quantified. not nor has stood structures higher-order such of lseigi ewrsb esrn h omlzdfre- normalized the measuring by networks in clustering [ models cer- to null respect tain with networks real-world many in triangles nteajcn deeitt oma ( an form to exist edge adjacent the in nwr soito n rti-rti neato net- interaction protein-protein [ works and association word in efficient daeteg,ie,an i.e., edge, adjacent ℓ otesrcueadfnto fcmlxntok [ networks complex of function and crucial 11 are structure cliques the larger Moreover, to as triangle. such probabil- structure—the structures simple in- closure higher-order one the is just measures coefficient of it ity clustering as the restrictive However, herently network. the as coefficients The clustering higher-order probabilities. of expansion Overview clique 1. FIG. C C C − ∗ 2 4 3 tnodUiest,Safr,C,935 USA 94305, CA, Stanford, University, Stanford , ee epoieafaeokt uniyhigher-order quantify to framework a provide we Here, .Freape -lqe eelcmuiystructure community reveal 4-cliques example, For ]. osbeegsbtenthe between edges possible 1 ecet n nlz hmudrcommon under them analyze and oefficients .Satwt .Fn najcn de3 hc o an for Check 3. edge adjacent an Find 2. with Start 1. ‡ h rdtoa lseigcecet We coefficient. clustering traditional the f an † ecet hc stepoaiiythat probability the is which oefficient, 12 gcecet ogi e nihsinto insights new gain to coefficients ng nesod eew nrdc higher- introduce we Here understood. C rs n h lseigbhvo with behavior clustering the and orks, ℓ iiyo ihrodrntokcliques network higher-order of bility ok oee,hge-re cliques higher-order However, work. ciu ofr an form to -clique n lqe fszs57aemr rqetthan frequent more are 5–7 sizes of cliques and ] ℓ esrstepoaiiyta an that probability the measures e ewrscutr u higher- Our cluster. networks lex regst lse.Teextent The cluster. to edges or d A 40,USA 94305, CA, rd, ,N,180 USA 14850, NY, a, 13 .Hwvr h xeto clustering of extent the However, ]. ℓ wde scoe,maigta the that meaning closed, is -wedge, ℓ ciu n h usd node outside the and -clique ℓ wde( -wedge ℓ hodrcutrn co- clustering th-order ℓ 1)-clique. + ℓ ciu n an and -clique ℓ 1)-clique + 9 – 2 quency at which higher-order cliques are closed, which umn 3) [8, 16]. The novelty of our interpretation of the we call higher-order clustering coefficients. We derive our clustering coefficient is considering it as a form of clique higher-order clustering coefficients by extending a novel expansion, rather than as the closure of a length-2 path, interpretation of the classical clustering coefficient as a which is key to our generalizations in the next section. form of clique expansion (Fig. 1). We then derive sev- Formally, the classical global clustering coefficient is eral properties about higher-order clustering coefficients 6 K3 and analyze them under the Gn,p and small-world null C = | |, (1) models. W | | Using our theoretical analysis as a guide, we analyze where K is the set of 3-cliques (triangles), W is the the higher-order clustering behavior of real-world net- 3 set of wedges, and the coefficient 6 comes from the fact works from a variety of domains. We find that each that each 3-clique closes 6 wedges—the 6 ordered pairs domain of networks has its own higher-order clustering of edges in the triangle. pattern, which the traditional clustering coefficient does We can also reinterpret the local clustering coeffi- not show on its own. Conventional wisdom in network cient [3] in this way. In this case, each wedge again science posits that practically all real-world networks ex- consists of a 2-clique and adjacent edge (Fig. 1, row C , hibit clustering; however, we find that not all networks 2 column 2), and we call the unique node in the intersection exhibit higher-order clustering. More specifically, once of the 2-clique and adjacent edge the center of the wedge. we control for the clustering as measured by the classical The local clustering clustering coefficient of a node u is clustering coefficient, some networks do not show signifi- the fraction of wedges centered at u that are closed: cant clustering in terms of higher-order cliques. In addi- tion to the theoretical properties and empirical findings 2 K (u) C(u)= | 3 |, (2) exhibited in this paper, our related work also demon- W (u) strates a connection between higher-order clustering and | | community detection [14]. where K3(u) is the set of 3-cliques containing u and W (u) is the set of wedges with center u (if W (u) = 0, we say that C(u) is undefined). The average| clustering| coeffi- II. DERIVATION OF HIGHER-ORDER cient C¯ is the mean of the local clustering coefficients, CLUSTERING COEFFICIENTS
¯ 1 In this section, we derive our higher-order clustering C = C(u), (3) V e coefficients and some of their basic properties. We first | | uX∈V present an alternative interpretation of the classical clus- e tering coefficient and then show how this novel interpre- where V is the set of nodes in the network where the local clustering coefficient is defined. tation seamlessly generalizes to arrive at our definition e of higher-order clustering coefficients. We then provide some probabilistic interpretations of higher-order clus- tering coefficients that will be useful for our subsequent B. Generalizing to higher-order clustering analysis. coefficients
Our alternative interpretation of the clustering coef- A. Alternative interpretation of the classical ficient, described above as a form of clique expansion, clustering coefficient leads to a natural generalization to higher-order cliques. Instead of expanding 2-cliques to 3-cliques, we expand Here we give an alternative interpretation of the clus- ℓ-cliques to (ℓ + 1)-cliques (Fig. 1, rows C3 and C4). For- mally, we define an ℓ-wedge to consist of an ℓ-clique and tering coefficient that will later allow us to generalize it and quantify clustering of higher-order network struc- an adjacent edge for ℓ 2. Then we define the global ℓth- order clustering coefficient≥ C as the fraction of ℓ-wedges tures (this interpretation is summarized in Fig. 1). Our ℓ interpretation is based on a notion of clique expansion. that are closed, meaning that they induce an (ℓ+1)-clique in the network. We can write this as First, we consider a 2-clique K in a graph G (that is, 2 a single edge K; see Fig. 1, row C2, column 1). Next, (ℓ + ℓ) Kℓ+1 we expand the clique K by considering any edge e adja- Cℓ = | |, (4) Wℓ cent to K, i.e., e and K share exactly one node (Fig. 1, | | row C2, column 2). This expanded subgraph forms a where Kℓ+1 is the set of (ℓ + 1)-cliques, and Wℓ is the set wedge, i.e., a length-2 path. The classical global cluster- of ℓ-wedges. The coefficient ℓ2 + ℓ comes from the fact ing coefficient C of G (sometimes called the transitivity that each (ℓ + 1)-clique closes that many wedges: each of G [15]) is then defined as the fraction of wedges that (ℓ + 1)-clique contains ℓ +1 ℓ-cliques, and each ℓ-clique are closed, meaning that the 2-clique and adjacent edge contains ℓ nodes which may serve as the center of an ℓ- induce a (2 + 1)-clique, or a triangle (Fig. 1, row C2, col- wedge. Note that the classical definition of the global 3 clustering coefficient given in Eq. 1 is equivalent to the C. Probabilistic interpretations of higher-order definition in Eq. 4 when ℓ = 2. clustering coefficients We also define higher-order local clustering coefficients: To facilitate understanding of higher-order clustering ℓ Kℓ+1(u) coefficients and to aid our analysis in Section III, we Cℓ(u)= | |, (5) Wℓ(u) present a few probabilistic interpretations of the quanti- | | ties. First, we can interpret Cℓ(u) as the probability that where Kℓ+1(u) is the set of (ℓ + 1)-cliques containing a wedge w chosen uniformly at random from all wedges node u, Wℓ(u) is the set of ℓ-wedges with center u (where centered at u is closed: the center is the unique node in the intersection of the ℓ-clique and adjacent edge comprising the wedge; see Cℓ(u)= P [w Kℓ+1(u)] . (10) Fig. 1), and the coefficient ℓ comes from the fact that ∈ each (ℓ+1)-clique containing u closes that many ℓ-wedges The variant of this interpretation for the classical clus- in W (u). The ℓth-order clustering coefficient of a node ℓ tering case of ℓ = 2 has been useful for graph algorithm is defined for any node that is the center of at least one development [19]. ℓ-wedge, and the average ℓth-order clustering coefficient is the mean of the local clustering coefficients: For the next probabilistic interpretation, it is useful to analyze the structure of the 1-hop neighborhood graph N (u) of a given node u (not containing node u). The ¯ 1 1 Cℓ = Cℓ(u), (6) vertex set of N1(u) is the set of all nodes adjacent to u, Vℓ Xe | | u∈Vℓ and the edge set consists of all edges between neighbors e of u, i.e., (v, w) (u, v), (u, w), (v, w) E , where E is { | ∈ } where Vℓ is the set of nodes that are the centers of at the edge set of the graph. least one ℓ-wedge. Any ℓ-clique in G containing node u corresponds to a e To understand how to compute higher-order clustering unique (ℓ 1)-clique in N (u), and specifically for ℓ = 2, − 1 coefficients, we substitute the following useful identity any edge (u, v) corresponds to a node v in N1(u). There- fore, each ℓ-wedge centered at u corresponds to an (ℓ 1)- − Wℓ(u) = Kℓ(u) (du ℓ + 1), (7) clique K and one of the du ℓ + 1 nodes outside K (i.e., | | | | · − in N (u) K). Thus, Eq. 8 can− be re-written as 1 \ where du is the degree of node u, into Eq. 5 to get ℓ Kℓ(N1(u)) ℓ K (u) · | | , (11) C (u)= · | ℓ+1 | . (8) (du ℓ + 1) Kℓ−1(N1(u)) ℓ (d ℓ + 1) K (u) − · | | u − · | ℓ | where Kk(N1(u)) denotes the number of k-cliques in From Eq. 8, it is easy to see that we can compute all N1(u). local ℓth-order clustering coefficients by enumerating all If we uniformly at random select an (ℓ 1)-clique K (ℓ + 1)-cliques and ℓ-cliques in the graph. The compu- from N (u) and then also uniformly at random− select a tational complexity of the algorithm is thus bounded by 1 node v from N (u) outside of this clique, then C (u) is the time to enumerate (ℓ+1)-cliques and ℓ-cliques. Using 1 ℓ the probability that these ℓ nodes form an ℓ-clique: the Chiba and Nishizeki algorithm [17], the complexity is O(ℓaℓ−2m), where a is the arboricity of the graph, and C (u)= P [K v K (N (u))] . (12) m is the number of edges. The arboricity a may be as ℓ ∪{ } ∈ ℓ 1 large as √m, so this algorithm is only guaranteed to take polynomial time if ℓ is a constant. In general, determin- Moreover, if we condition on observing an ℓ-clique from ing if there exists a single clique with at least ℓ nodes is this sampling procedure, then the ℓ-clique itself is se- NP-complete [18]. lected uniformly at random from all ℓ-cliques in N1(u). For the global clustering coefficient, note that Therefore, Cℓ−1(u) Cℓ(u) is the probability that an (ℓ 1)-clique and two· nodes selected uniformly at ran- − W = W (u) . (9) dom from N1(u) form an (ℓ + 1)-clique. Applying this | ℓ| | ℓ | uX∈V recursively gives
ℓ Thus, it suffices to enumerate ℓ-cliques (to compute Wℓ K (N (u)) | | C (u)= | ℓ 1 | . (13) using Eq. 7) and to count the total number of ℓ-cliques. j du In practice, we use the Chiba and Nishizeki to enumerate jY=2 ℓ cliques and simultaneously compute Cℓ and Cℓ(u) for all nodes u. This suffices for our clustering analysis with In other words, the product of the higher-order local clus- ℓ =2, 3, 4 on networks with over a hundred million edges tering coefficients of node u up to order ℓ is the ℓ-clique in Section IV. density amongst u’s neighbors. 4
Proof. Clearly, 0 C (u) if the local clustering coefficient ≤ ℓ u u u is well defined. This bound is tight when N1(u) is (ℓ 1)- partite, as in the middle column of Fig. 2. In the (ℓ −1)- ℓ−2 − partite case, C2(u)= ℓ−1 . By removing edges from this extremal case in a sufficiently large graph, we can make d 1 2 1 d− ℓ−2 C2(u) 1 ≈ 4 4 ≈ 2(d − 1) 2 d− 4 C2(u) arbitrarily close to any value in [0, ℓ−1 ]. To derive the upper bound, consider the 1-hop neigh- d − 4 1 borhood N1(u), and let C3(u) 1 0 ≈ 2d − 4 2 Kℓ(N1(u)) d − 6 1 δℓ(N1(u)) = | d | (15) C4(u) 1 0 ≈ u 2d − 6 2 ℓ FIG. 2. Example 1-hop neighborhoods of a node u with degree denote the ℓ-clique density of N1(u). The Kruskal- d with different higher-order clustering. Left: For cliques, Katona theorem [20, 21] implies that Cℓ(u) = 1 for any ℓ. Middle: If u’s neighbors form a complete ℓ/(ℓ−1) bipartite graph, C2(u) is constant while Cℓ(u) = 0, ℓ ≥ 3. δℓ(N1(u)) [δℓ−1(N1(u))] Right: If half of u’s neighbors form a star and half form a ≤ δ (N (u)) [δ (N (u))](ℓ−1)/2. clique with u, then Cℓ(u) ≈ pC2(u), which is the upper ℓ−1 1 ≤ 2 1 bound in Proposition 1. Combining this with Eq. 8 gives
1 C (u) [δ (N (u))] ℓ−1 δ (N (u)) = C (u), III. THEORETICAL ANALYSIS AND ℓ ≤ ℓ−1 1 ≤ 2 1 2 HIGHER-ORDER CLUSTERING IN RANDOM p p GRAPH MODELS where the last equality uses the fact that C2(u) is the edge density of N1(u). The upper bound becomes tight when N1(u) consists We now provide some theoretical analysis of our of a clique and isolated nodes (Fig. 2, right) and the higher-order clustering coefficients. We first give some neighborhood is sufficiently large. Specifically, let N1(u) extremal bounds on the values that higher-order clus- consist of a clique of size c and b isolated nodes. When tering coefficients can take given the value of the tradi- ℓ = 2, tional (second-order) clustering coefficient. After, we an- alyze the values of higher-order clustering coefficients in c (c 1)c c 2 two common random graph models—the G and small- C (u)= 2 = − n,p ℓ c+b (c + b 1)(c + b) → c + b world models. The theory from this section will be a 2 − useful guide for interpreting the clustering behavior of and by Eq. 11, when 3 ℓ c, real-world networks in Section IV. ≤ ≤ c ℓ ℓ c ℓ +1 c Cℓ(u)= · = − . (c + b ℓ + 1) c c + b ℓ +1 → c + b A. Extremal bounds − · ℓ−1 − By adjusting the ratio c/(b + c) in N1(u), we can con- We first analyze the relationships between local higher- struct a family of graphs such that C2(u) takes any value order clustering coefficients of different orders. Our tech- in the interval [0, 1] as du and Cℓ(u) C2(u) as nical result is Proposition 1, which provides essentially → ∞ → du . p tight lower and upper bounds for higher-order local clus- → ∞ tering coefficients in terms of the traditional local cluster- The second part of the result requires the neighbor- ing coefficient. The main ideas of the proof are illustrated hoods to be sufficiently large in order to reach the upper in Fig. 2. bound. However, we will see later that in some real-world data, there are nodes u for which C3(u) is close to the Proposition 1. For any fixed ℓ 3, ≥ upper bound C2(u) for several values of C2(u). Next, we analyzep higher-order clustering coefficients 0 Cℓ(u) C2(u). (14) in two common random graph models: the Erd˝os-R´enyi ≤ ≤ p model with edge probability p (i.e., the Gn,p model [22]) Moreover, and the small-world model [3]. 1. There exists a finite graph G with a node u such that the lower bound is tight and C2(u) is within ǫ ℓ−2 B. Analysis for the model of any prescribed value in [0, ℓ−1 ]. Gn,p 2. There exists a finite graph G with a node u such that Cℓ(u) is within ǫ of the upper bound for any Now, we analyze higher-order clustering coefficients prescribed value of C (u) [0, 1]. 2 ∈ in classical Erd˝os-R´enyi random graph model, where 5 each edge exists independently with probability p (i.e., Proof. Similar to the proof of Proposition 3, we look at the Gn,p model [22]). We implicitly assume that ℓ is the conditional expectation over Wℓ(u) > 0: small in the following analysis so that there should be E [C (u) C (u), W (u) > 0] at least one ℓ-wedge in the graph (with high probabil- G ℓ | 2 ℓ ity and n large, there is no clique of size greater than = E E [C (u) C (u), W (u)] G Wℓ(u)>0 ℓ | 2 ℓ (2 + ǫ) log n/ log(1/p) for any ǫ> 0 [23]). Therefore, the = E E 1 P [w closed C (u)] . global and local clustering coefficients are well-defined. G Wℓ(u)>0 |W (u)| w∈Wℓ(u) 2 h h ℓ | ii In the Gn,p model, we first observe that any ℓ-wedge P du is closed if and only if the ℓ 1 possible edges between Now, note that N1(u) has m = C2(u) edges. Know- − · 2 the ℓ-clique and the outside node in the adjacent edge ing that w W (u) accounts for ℓ−1 of these edges. By ∈ ℓ 2 exist to form an (ℓ + 1)-clique. Each of the ℓ 1 edges symmetry, the other q = m ℓ−1 edges appear in any of exist independently with probability p in the G − model, − 2 n,p du ℓ− 1 the remaining r = 2 2 pairs of nodes uniformly which means that the higher-order clustering coefficients −r should scale as pℓ−1. We formalize this in the following at random. There are q ways to place these edges, of r−ℓ+1 proposition. which q−ℓ+1 would close the wedge w. Thus, Proposition 2. Let G be a random graph drawn from P [w is closed C2(u)] the Gn,p model. For constant ℓ, r−ℓ+1 | ℓ−1 (q−ℓ+1) (r−ℓ+1)!q! (q−ℓ+2)(q−ℓ+3)···q 1. EG [Cℓ]= p = r = (q−ℓ+1)!r! = (r−ℓ+2)(r−ℓ+3)···r . ℓ−1 (q) 2. EG [Cℓ(u) Wℓ(u) > 0] = p for any node u E ¯ | ℓ−1 3. G Cℓ = p Now, for any small nonnegative integer k, Proof. We prove the first part by conditioning on the set du ℓ−1 q k C2(u)·( 2 )−( 2 )−k of ℓ-wedges, Wℓ: − = du ℓ−1 r k ( 2 )−( 2 )−k − E E E ℓ−1 [Cℓ]= G [ Wℓ [Cℓ Wℓ]] ( 2 )+k = C (u) (1 C (u)) −1 | 2 2 du − ℓ −k 1 − − ( 2 ) ( 2 ) = EG EW P [w is closed] ℓ |Wℓ| w∈Wℓ 2 h h ii = C2(u) (1 C2(u)) O(1/d ). P − − · u = E E 1 pℓ−1 G Wℓ |W | w∈Wℓ h h ℓ ii (Recall that ℓ is constant by assumption, so the big-O no- ℓ−1 P = EG p tation is appropriate). The above expression approaches (C (u))ℓ−1 when C (u) 1 as well as when d . = pℓ−1. 2 2 → u → ∞ Proposition 3 says that even if the second-order local As noted above, the second equality is well defined (with high probability) for small ℓ. The third equality comes clustering coefficient is large, the ℓth-order clustering co- efficient will still decay exponentially in ℓ, at least in the from the fact that any ℓ-wedge is closed if and only if the ℓ 1 possible edges between the ℓ-clique and the outside limit as du grows large. By examining higher-order clique closures, this allows us to distinguish between nodes u node− in the adjacent edge exist to form an (ℓ + 1)-clique. whose neighborhoods are “dense but random” (C (u) is The proof of the second part is essentially the same, 2 large but C (u) (C (u))ℓ−1) or “dense and structured” except we condition over the set of possible cases where ℓ 2 (C (u) is large and≈ C (u) > (C (u))ℓ−1). Only the latter W (u) > 0. 2 ℓ 2 ℓ case exhibits higher-order clustering. We use this in our Recall that V is the set of nodes at the center of at analysis of real-world networks in Section IV. least one ℓ-wedge. To prove the third part, we take the e conditional expectation over V and use our result from the second part. C. Analysis for the small-world model e The above results say that the global, local, and av- erage ℓth order clustering coefficients decrease exponen- We also study higher-order clustering in the small- tially in ℓ. It turns out that if we also condition on world random graph model [3]. The model begins with a the second-order clustering coefficient having some fixed ring network where each node connects to its 2k nearest value, then the higher-order clustering coefficients still neighbors. Then, for each node u and each of the k edges decay exponentially in ℓ for the Gn,p model. This will be (u, v) with v following u clockwise in the ring, the edge useful for interpreting the distribution of local clustering is rewired to (u, w) with probability p, where w is chosen coefficients on real-world networks. uniformly at random. With no rewiring (p = 0) and k n, it is known that Proposition 3. ≪ Let G be a random graph drawn from C¯2 3/4 [3]. As p increases, the average clustering coef- the G model. Then for constant ℓ, ≈ n,p ficient C¯2 slightly decreases until a phase transition near ¯ E [C (u) C (u), W (u) > 0] p =0.1, where C2 decays to 0 [3] (also see Fig. 3). Here, G ℓ | 2 ℓ we generalize these results for higher-order clustering co- ℓ−1 = C (u) (1 C (u)) O(1/d2 ) (C (u))ℓ−1. efficients. 2 − − 2 · u ≈ 2 6
0.7 If we ignore lower-order terms k and note that t = O(k), 0.6 we get
ℓ−3 0.5 K (u) = k (2k−t)t + O(kℓ−3) | ℓ | t=1 (ℓ−3)! P h i 0.4 = 1 k (2ktℓ−3 tℓ−2)+ O(kℓ−2). (ℓ−3)! t=1 − − − 0.3 1 P kℓ 2 kℓ 1 ℓ−2 = (ℓ−3)! 2k ℓ−2 ℓ−1 + O(k ), h · − i 0.2 -C2 ℓ ℓ−1 ℓ−2 Avg. clust. coeff. = (ℓ−1)! k + O(k ). -C3 · 0.1 -C4 0.0 -3 -2 -1 0 10 10 10 10 Proposition 4 shows that, when p = 0, C¯ℓ decreases Rewiring probability (p) as ℓ increases. Furthermore, via simulation, we observe the same behavior as for C¯2 when adjusting the rewiring probability p (Fig. 3). Regardless of ℓ, the phase tran- FIG. 3. Average higher-order clustering coefficient C¯ℓ as a sition happens near p = 0.1. Essentially, once there is function of rewiring probability p in small-world networks for enough rewiring, all local clique structure is lost, and ℓ = 2, 3, 4 (n = 20, 000, k = 5). Proposition 4 shows that the clustering at all orders is lost. This is partly a conse- ℓth-order clustering coefficient when p = 0 predicts that the quence of Proposition 1, which says that Cℓ(u) 0 as clustering should decrease modestly as ℓ increases. C (u) 0 for any ℓ. → 2 → Proposition 4. In the small-world model without rewiring (p =0), IV. EXPERIMENTAL RESULTS ON REAL-WORLD NETWORKS C¯ (ℓ + 1)/(2ℓ) ℓ → for any constant ℓ 2 as k and n while We now analyze the higher-order clustering of real- 2k