Graph Embedding and Classification Via Simplicial Complexes

Total Page:16

File Type:pdf, Size:1020Kb

Graph Embedding and Classification Via Simplicial Complexes algorithms Article (Hyper)Graph Embedding and Classification via Simplicial Complexes Alessio Martino 1,* , Alessandro Giuliani 2 and Antonello Rizzi 1 1 Department of Information Engineering, Electronics and Telecommunications, University of Rome “La Sapienza”, Via Eudossiana 18, 00184 Rome, Italy; [email protected] 2 Department of Environment and Health, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy; [email protected] * Correspondence: [email protected]; Tel.: +39-06-4458-5745 Received: 26 September 2019; Accepted: 24 October 2019; Published: 25 October 2019 Abstract: This paper investigates a novel graph embedding procedure based on simplicial complexes. Inherited from algebraic topology, simplicial complexes are collections of increasing-order simplices (e.g., points, lines, triangles, tetrahedrons) which can be interpreted as possibly meaningful substructures (i.e., information granules) on the top of which an embedding space can be built by means of symbolic histograms. In the embedding space, any Euclidean pattern recognition system can be used, possibly equipped with feature selection capabilities in order to select the most informative symbols. The selected symbols can be analysed by field-experts in order to extract further knowledge about the process to be modelled by the learning system, hence the proposed modelling strategy can be considered as a grey-box. The proposed embedding has been tested on thirty benchmark datasets for graph classification and, further, we propose two real-world applications, namely predicting proteins’ enzymatic function and solubility propensity starting from their 3D structure in order to give an example of the knowledge discovery phase which can be carried out starting from the proposed embedding strategy. Keywords: granular computing; embedding spaces; graph embedding; topological data analysis; simplicial complexes; computational biology; protein contact networks; complex networks; complex systems 1. Introduction Graphs are powerful data structures that can capture topological and semantic information from data. This is one of the main reasons they are commonly used for modelling several real-world systems in fields such as biology and chemistry [1–8], social networks [9], telecommunication networks [10,11] and natural language processing [12–14]. However, solving pattern recognition problems in structured domains such as graphs pose additional challenges. Indeed, many structured domains are also non-metric in nature [15–17] and patterns lack any geometrical interpretation. In brief, an input space is said to be non-metric if pairwise dissimilarities between patterns lying in such space do not satisfy the properties of a metric (non-negativity, identity, symmetry and triangle inequality) [17,18]. In the literature, several strategies can be found in order to perform pattern recognition tasks in structured domains [17], namely: • feature generation and feature engineering, where numerical features are ad-hoc extracted from the input patterns Algorithms 2019, 12, 223; doi:10.3390/a12110223 www.mdpi.com/journal/algorithms Algorithms 2019, 12, 223 2 of 21 • ad-hoc dissimilarities in the input space, where custom dissimilarity measures (e.g., edit distances [19–22]) are designed in order to directly process patterns in the input space (without moving towards Euclidean spaces) • dissimilarity representations [18,23], where each pattern is described by the pairwise distances with other patterns or with respect to a properly chosen subset of pivotal training patterns [23–26] • kernel methods, where the mapping between the original input space and the Euclidean space exploits positive-definite kernel functions [27–32] • embedding via information granulation. As the latter is concerned, embedding techniques are gaining more and more attention especially since the breakthrough of Granular Computing [33,34]. In short, Granular Computing is a human-inspired information processing paradigm which aims at the extraction of meaningful entities (information granules) arising from both the problem at hand and the data representation. The challenge with Granular Computing-based pattern recognition systems is that there are different levels of granularity according to which a given system can be observed [35–37]; nonetheless, one shall choose a suitable level of granularity for the problem at hand. These information granules are usually extracted in a data-driven manner and describe data aggregates, namely data which are similar according to structural and/or functional similarity [15–17]. Data clustering, for example, is a promising tool for extracting information granules [38], especially when clustering algorithms can be equipped with ad-hoc dissimilarity measures in order to deal with structured data [17,39–42]. Indeed, several works focused on extracting information granules via motifs clustering (see e.g., Refs. [43–47]), where a proper granulation module is in charge of extracting and clustering sub-structures (i.e., sub-graphs). The resulting clusters can be considered as information granules and the clusters’ representatives form an alphabet on the top of which the embedding procedure is performed thanks to the symbolic histograms approach [46]: let M be the size of the alphabet, each input pattern is transformed into an M-length integer-valued feature vector whose ith element contains the number of occurrences of the ith alphabet member within the pattern itself. Thanks to the embedding, the problem is moved towards a metric (Euclidean) space and plain pattern recognition algorithms can be used without alterations. The symbols extraction and alphabet synthesis is crucial in granulation-based classifiers: the resulting embedding space must preserve (the vast majority of) the original input space properties (e.g., the more different two objects drawn from the input space are, the more distant they must appear in the embedding space.) [17,18]. Also, for the sake of modelling complexity, the size of the alphabet must be as small as possible or, specifically, the set of resulting alphabet symbols should be small, yet informative. This aspect is crucial since Granular Computing-based pattern recognition systems aim to be human interpretable: the resulting set of symbols forming the alphabet, hence pivotal for the embedding space, should allow field experts to gather further insights for the problem at hand [17]. The aim of this paper is to investigate a novel procedure for extracting meaningful information granules thanks to simplicial complexes. Conversely to network motifs and graphlets, simplicial complexes are able to capture the multi-scale/higher-order organisation in complex networks [48,49], overcoming the main limitation offered by ‘plain graphs’; that is, they only considers pairwise relations, whereas simplicial complexes (and hypergraphs, in general) also consider multi-way relations. On the top of simplicial complexes, an embedding space is built for pattern recognition purposes. In order to show the effectiveness of the proposed embedding procedure, a set of thirty open-access datasets for graph classification has been considered. Furthermore, the proposed technique has been benchmarked against two suitable competitors and a null-model for statistical assessment. In order to stress the knowledge discovery phase offered by Granular Computing-based classifiers, additional experiments are proposed. Specifically, starting from real-world proteomic data, two problems will be addressed regarding the possibility to predict the enzymatic function and the solution/folding propensity starting from proteins’ folded 3D-structure. This paper is organised as follows: in Section2 the approach at the basis of work is presented by giving a brief overview of simplicial complexes (Section 2.1) before diving into the proper embedding Algorithms 2019, 12, 223 3 of 21 procedure (Section 2.2); in Section3 the results over benchmark datasets (Section 3.1) and real-world problems (Section 3.2) are shown. Section4 remarks the interpretability of the proposed model and, finally, Section5 concludes the paper, remarking future directions. 2. Information Granulation and Classification Systems 2.1. An Introduction to Simplicial Complexes Let P be a set of points in a multi-dimensional space equipped with a notion of distance d(·, ·) and let X be the topological space enclosing P. The topological space X can be described by means of its simplices, that are multidimensional objects of different order (dimension) drawn from P. Formally, a k-simplex (simplex of order k) is a set of k + 1 points drawn from P, for example, 0-dimensional simplices correspond to points, 1-dimensional simplices correspond to lines, 2-dimensional simplices correspond to triangles, 3-dimensional simplices correspond to tetrahedrons and so on for higher-dimensional simplices. Every non-empty subset of the (k + 1) vertices of a k-simplex is a face of the simplex: a face is itself a simplex. Simplicial complexes [50,51] are properly constructed finite collections of simplices that are closed with respect to inclusions of the faces: if a given simplex s belongs to a given simplicial complex S, then all faces of s also belong to S. The order (dimension) of the simplicial complex is the maximum order of any of its simplices. A graph G = (V, E), where E is the set of edges and V is the set of vertices, is also commonly-known as “1-skeleton” or “simplicial complex of order 1” since the only entities
Recommended publications
  • Spectral Gap Bounds for the Simplicial Laplacian and an Application To
    Spectral gap bounds for the simplicial Laplacian and an application to random complexes Samir Shukla,∗ D. Yogeshwaran† Abstract In this article, we derive two spectral gap bounds for the reduced Laplacian of a general simplicial complex. Our two bounds are proven by comparing a simplicial complex in two different ways with a larger complex and with the corresponding clique complex respectively. Both of these bounds generalize the result of Aharoni et al. (2005) [1] which is valid only for clique complexes. As an application, we investigate the thresholds for vanishing of cohomology of the neighborhood complex of the Erdös- Rényi random graph. We improve the upper bound derived in Kahle (2007) [15] by a logarithmic factor using our spectral gap bounds and we also improve the lower bound via finer probabilistic estimates than those in Kahle (2007) [15]. Keywords: spectral bounds, Laplacian, cohomology, neighborhood complex, Erdös-Rényi random graphs. AMS MSC 2010: 05C80. 05C50. 05E45. 55U10. 1 Introduction Let G be a graph with vertex set V (G) (often to be abbreviated as V ) and let L(G) denote the (unnormalized) Laplacian of G. Let 0 = λ1(G) ≤ λ2(G) ≤ . ≤ λ|V |(G) denote the eigenvalues of L(G) in ascending order. Here, the second smallest eigenvalue λ2(G) is called the spectral gap. The clique complex of a graph G is the simplicial complex whose simplices are all subsets σ ⊂ V which spans a complete subgraph of G. We shall denote the kth k arXiv:1810.10934v2 [math.CO] 9 Oct 2019 reduced cohomology of a simplicial complex X by H (X).
    [Show full text]
  • Message Passing Simplicial Networks
    Weisfeiler and Lehman Go Topological: Message Passing Simplicial Networks Cristian Bodnar * 1 Fabrizio Frasca * 2 3 Yu Guang Wang * 4 5 6 Nina Otter 7 Guido Montufar´ * 4 7 Pietro Lio` 1 Michael M. Bronstein 2 3 v0 Abstract v6 v10 v9 The pairwise interaction paradigm of graph ma- v chine learning has predominantly governed the 8 v v modelling of relational systems. However, graphs 1 5 v3 alone cannot capture the multi-level interactions v v present in many complex systems and the expres- 4 7 v2 sive power of such schemes was proven to be lim- ited. To overcome these limitations, we propose Figure 1. Message Passing with upper and boundary adjacencies Message Passing Simplicial Networks (MPSNs), illustrated for vertex v2 and edge (v5; v7) in the complex. a class of models that perform message passing on simplicial complexes (SCs). To theoretically anal- 1. Introduction yse the expressivity of our model we introduce a Simplicial Weisfeiler-Lehman (SWL) colouring Graphs are among the most common abstractions for com- procedure for distinguishing non-isomorphic SCs. plex systems of relations and interactions, arising in a broad We relate the power of SWL to the problem of range of fields from social science to high energy physics. distinguishing non-isomorphic graphs and show Graph neural networks (GNNs), which trace their origins that SWL and MPSNs are strictly more power- to the 1990s (Sperduti, 1994; Goller & Kuchler, 1996; Gori ful than the WL test and not less powerful than et al., 2005; Scarselli et al., 2009; Bruna et al., 2014; Li et al., the 3-WL test.
    [Show full text]
  • Some Properties of Glued Graphs at Complete Clone
    Sci.Int.(Lahore),27(1),39-47,2014 ISSN 1013-5316; CODEN: SINTE 8 39 SOME PROPERTIES OF GLUED GRAPHS AT COMPLETE CLONE IN THE VIEW OF ALGEBRAIC COMBINATORICS Seyyede Masoome Seyyedi , Farhad Rahmati Faculty of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran. Corresponding Author: Email: [email protected] ABSTRACT: A glued graph at complete clone is obtained from combining two graphs by identifying edges of of each original graph. We investigate how to change some properties such as height, big height, Krull dimension, Betti numbers by gluing of two graphs at complete clone. We give a sufficient and necessary condition so that the glued graph of two Cohen-Macaulay chordal graphs at complete clone is a Cohen-Macaulay graph. Moreover, we present the conditions that the edge ideal of gluing of two graphs at complete clone has linear resolution whenever the edge ideals of original graphs have linear resolution. We show when gluing of two independence complexes, line graphs, complement graphs can be expressed as independence complex, line graph and complement of gluing of two graphs. Key Words: Glued graph, Height, Big height, Krull dimension, Projective dimension, Linear resolution, Betti number, Cohen-Macaulay. INTRODUCTION The concept edge ideal was first introduced by Villarreal in Our paper is organized as follows. In section 2, we give an [23], that is, let be a simple (no loops or multiple edges) explicit formula for computing the height of glued graph at graph on the vertex set and the edge set complete clone. Also, we present a lower bound for the big .
    [Show full text]
  • Topological Graph Persistence
    CAIM ISSN 2038-0909 Commun. Appl. Ind. Math. 11 (1), 2020, 72–87 DOI: 10.2478/caim-2020-0005 Research Article Topological graph persistence Mattia G. Bergomi1*, Massimo Ferri2*, Lorenzo Zuffi2 1Veos Digital, Milan, Italy 2ARCES and Dept. of Mathematics, Univ. of Bologna, Italy *Email address for correspondence: [email protected] Communicated by Alessandro Marani Received on 04 28, 2020. Accepted on 11 09, 2020. Abstract Graphs are a basic tool in modern data representation. The richness of the topological information contained in a graph goes far beyond its mere interpretation as a one-dimensional simplicial complex. We show how topological constructions can be used to gain information otherwise concealed by the low-dimensional nature of graphs. We do this by extending previous work in homological persistence, and proposing novel graph-theoretical constructions. Beyond cliques, we use independent sets, neighborhoods, enclaveless sets and a Ramsey-inspired extended persistence. Keywords: Clique, independent set, neighborhood, enclaveless set, Ramsey AMS subject classification: 68R10, 05C10 1. Introduction Currently data are produced massively and rapidly. A large part of these data is either naturally organized, or can be represented, as graphs or networks. Recently, topological persistence has proved to be an invaluable tool for the exploration and understanding of these data types. Albeit graphs can be considered as topological objects per se, given their low dimensionality, only limited information can be obtained by studying their topology. It is possible to grasp more information by superposing higher-dimensional topological structures to a given graph. Many research lines, for ex- ample, focused on building n-dimensional complexes from graphs by considering their (n+1)-cliques (see, e.g., section 1.1).
    [Show full text]
  • Persistent Homology of Unweighted Complex Networks Via Discrete
    www.nature.com/scientificreports OPEN Persistent homology of unweighted complex networks via discrete Morse theory Received: 17 January 2019 Harish Kannan1, Emil Saucan2,3, Indrava Roy1 & Areejit Samal 1,4 Accepted: 6 September 2019 Topological data analysis can reveal higher-order structure beyond pairwise connections between Published: xx xx xxxx vertices in complex networks. We present a new method based on discrete Morse theory to study topological properties of unweighted and undirected networks using persistent homology. Leveraging on the features of discrete Morse theory, our method not only captures the topology of the clique complex of such graphs via the concept of critical simplices, but also achieves close to the theoretical minimum number of critical simplices in several analyzed model and real networks. This leads to a reduced fltration scheme based on the subsequence of the corresponding critical weights, thereby leading to a signifcant increase in computational efciency. We have employed our fltration scheme to explore the persistent homology of several model and real-world networks. In particular, we show that our method can detect diferences in the higher-order structure of networks, and the corresponding persistence diagrams can be used to distinguish between diferent model networks. In summary, our method based on discrete Morse theory further increases the applicability of persistent homology to investigate the global topology of complex networks. In recent years, the feld of topological data analysis (TDA) has rapidly grown to provide a set of powerful tools to analyze various important features of data1. In this context, persistent homology has played a key role in bring- ing TDA to the fore of modern data analysis.
    [Show full text]
  • Arxiv:2006.02870V1 [Cs.SI] 4 Jun 2020
    The why, how, and when of representations for complex systems Leo Torres Ann S. Blevins [email protected] [email protected] Network Science Institute, Department of Bioengineering, Northeastern University University of Pennsylvania Danielle S. Bassett Tina Eliassi-Rad [email protected] [email protected] Department of Bioengineering, Network Science Institute and University of Pennsylvania Khoury College of Computer Sciences, Northeastern University June 5, 2020 arXiv:2006.02870v1 [cs.SI] 4 Jun 2020 1 Contents 1 Introduction 4 1.1 Definitions . .5 2 Dependencies by the system, for the system 6 2.1 Subset dependencies . .7 2.2 Temporal dependencies . .8 2.3 Spatial dependencies . 10 2.4 External sources of dependencies . 11 3 Formal representations of complex systems 12 3.1 Graphs . 13 3.2 Simplicial Complexes . 13 3.3 Hypergraphs . 15 3.4 Variations . 15 3.5 Encoding system dependencies . 18 4 Mathematical relationships between formalisms 21 5 Methods suitable for each representation 24 5.1 Methods for graphs . 24 5.2 Methods for simplicial complexes . 25 5.3 Methods for hypergraphs . 27 5.4 Methods and dependencies . 28 6 Examples 29 6.1 Coauthorship . 29 6.2 Email communications . 32 7 Applications 35 8 Discussion and Conclusion 36 9 Acknowledgments 38 10 Citation diversity statement 38 2 Abstract Complex systems thinking is applied to a wide variety of domains, from neuroscience to computer science and economics. The wide variety of implementations has resulted in two key challenges: the progenation of many domain-specific strategies that are seldom revisited or questioned, and the siloing of ideas within a domain due to inconsistency of complex systems language.
    [Show full text]
  • FACE VECTORS of FLAG COMPLEXES 1. Introduction We
    FACE VECTORS OF FLAG COMPLEXES ANDY FROHMADER Abstract. A conjecture of Kalai and Eckhoff that the face vector of an arbi- trary flag complex is also the face vector of some particular balanced complex is verified. 1. Introduction We begin by introducing the main result. Precise definitions and statements of some related theorems are deferred to later sections. The main object of our study is the class of flag complexes. A simplicial complex is a flag complex if all of its minimal non-faces are two element sets. Equivalently, if all of the edges of a potential face of a flag complex are in the complex, then that face must also be in the complex. Flag complexes are closely related to graphs. Given a graph G, define its clique complex C = C(G) as the simplicial complex whose vertex set is the vertex set of G, and whose faces are the cliques of G. The clique complex of any graph is itself a flag complex, as for a subset of vertices of a graph to not form a clique, two of them must not form an edge. Conversely, any flag complex is the clique complex of its 1-skeleton. The Kruskal-Katona theorem [6, 5] characterizes the face vectors of simplicial complexes as being precisely the integer vectors whose coordinates satisfy some par- ticular bounds. The graphs of the \rev-lex" complexes which attain these bounds invariably have a clique on all but one of the vertices of the complex, and sometimes even on all of the vertices. Since the bounds of the Kruskal-Katona theorem hold for all simplicial com- plexes, they must in particular hold for flag complexes.
    [Show full text]
  • Percolation and Stochastic Topology NSF/CBMS Conference
    Lecture 9: Percolation and stochastic topology NSF/CBMS Conference Sayan Mukherjee Departments of Statistical Science, Computer Science, Mathematics Duke University www.stat.duke.edu/ sayan ⇠ May 31, 2016 Stochastic topology “I predict a new subject of statistical topology. Rather than count the number of holes, Betti numbers, etc., one will be more interested in the distribution of such objects on noncompact manifolds as one goes out to infinity,” Isadore Singer. Random graphs Erd˝os-R´enyi G G(n, p) is a draw of a graph G from the graph distribution. ⇠ Consider p(n)andn and ask questions about thresholds of !1 graph properties. Erd˝os-R´enyi random graph model G(n, p) is the probability space of graphs on vertex set [n]= 1,...n with the probability of each edge p independently. { } Consider p(n)andn and ask questions about thresholds of !1 graph properties. Erd˝os-R´enyi random graph model G(n, p) is the probability space of graphs on vertex set [n]= 1,...n with the probability of each edge p independently. { } G G(n, p) is a draw of a graph G from the graph distribution. ⇠ Erd˝os-R´enyi random graph model G(n, p) is the probability space of graphs on vertex set [n]= 1,...n with the probability of each edge p independently. { } G G(n, p) is a draw of a graph G from the graph distribution. ⇠ Consider p(n)andn and ask questions about thresholds of !1 graph properties. Matthew Kahle (Ohio State University) Random topology and geometry AMS Short Course Matthew Kahle (Ohio State University) Random topology and geometry AMS Short Course Matthew
    [Show full text]
  • Matroid Representation of Clique Complexes∗
    Matroid representation of clique complexes¤ x Kenji Kashiwabaray Yoshio Okamotoz Takeaki Uno{ Abstract In this paper, we approach the quality of a greedy algorithm for the maximum weighted clique problem from the viewpoint of matroid theory. More precisely, we consider the clique complex of a graph (the collection of all cliques of the graph) which is also called a flag complex, and investigate the minimum number k such that the clique complex of a given graph can be represented as the intersection of k matroids. This number k can be regarded as a measure of “how complex a graph is with respect to the maximum weighted clique problem” since a greedy algorithm is a k-approximation algorithm for this problem. For any k > 0, we characterize graphs whose clique complexes can be represented as the intersection of k matroids. As a consequence, we can see that the class of clique complexes is the same as the class of the intersections of partition matroids. Moreover, we determine how many matroids are necessary and sufficient for the representation of all graphs with n vertices. This number turns out to be n - 1. Other related investigations are also given. Keywords: Abstract simplicial complex, Clique complex, Flag complex, Independence system, Matroid intersec- tion, Partition matroid 1 Introduction An independence system is a family of subsets of a nonempty finite set such that all subsets of a member of the family are also members of the family. A lot of combinatorial optimization problems can be seen as optimization problems on the corresponding independence systems. For example, in the minimum cost spanning tree problem, we want to find a maximal set with minimum total weight in the collection of all forests of a given graph, which is an independence system.
    [Show full text]
  • The Intersection of a Matroid and a Simplicial Complex 1
    TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 358, Number 11, November 2006, Pages 4895–4917 S 0002-9947(06)03833-5 Article electronically published on June 19, 2006 THE INTERSECTION OF A MATROID AND A SIMPLICIAL COMPLEX RON AHARONI AND ELI BERGER Abstract. A classical theorem of Edmonds provides a min-max formula re- lating the maximal size of a set in the intersection of two matroids to a “cov- ering” parameter. We generalize this theorem, replacing one of the matroids by a general simplicial complex. One application is a solution of the case r = 3 of a matroidal version of Ryser’s conjecture. Another is an upper bound on the minimal number of sets belonging to the intersection of two matroids, needed to cover their common ground set. This, in turn, is used to derive a weakened version of a conjecture of Rota. Bounds are also found on the dual parameter—the maximal number of disjoint sets, all spanning in each of two given matroids. We study in detail the case in which the complex is the complex of independent sets of a graph, and prove generalizations of known results on “independent systems of representatives” (which are the special case in which the matroid is a partition matroid). In particular, we define a no- tion of k-matroidal colorability of a graph, and prove a fractional version of a conjecture, that every graph G is 2∆(G)-matroidally colorable. The methods used are mostly topological. 1. Introduction The point of departure of this paper is a notion which has been recently developed and studied, that of an independent system of representatives (ISR), which is a generalization of the notion of a system of distinct representatives (SDR).
    [Show full text]
  • Structural Parameters in Combinatorial Objects
    DISS. ETH No. 15901, 2005 Structural Parameters in Combinatorial Objects A dissertation submitted to the Swiss Federal Institute of Technology Zurich¨ for the degree of Doctor of Technical Sciences presented by Yoshio Okamoto Master of Systems Science, the University of Tokyo, Japan born July 22, 1976 in Hekinan, Japan citizen of Japan submitted to Prof. Emo Welzl, ETH Zurich,¨ examiner Prof. Komei Fukuda, ETH Zurich,¨ co-examiner ii iii Acknowledgements First of all, I would like to express my sincere gratitude to my supervi- sor Emo Welzl. It was a great opportunity for me to work at his teach- ing/research group in Zurich. His persistent encouragement and sup- port were really helpful, and his way of looking at problems, way of presenting materials, and everything were very exciting to me. He was also a speaker of the Berlin/Zurich Joint Graduate Program “Combina- torics, Geometry, and Combinatorics” (CGC) financed by ETH Zurich and German Science Foundation. Without this program, I would not have imagined how I completed the study. My sincere gratitude also goes to Komei Fukuda who kindly ad- mitted to be a co-referee. Indeed, he was the first person who I made a contact when I tried to move to Zurich. At that time we did not know each other; Nevertheless he was so kind that he gave a lot of informa- tion and encouraged me to come to Zurich. Some of the chapters in the thesis are based on joint papers with the following people: Vladimir De˘ıneko, Michael Hoffmann, Kenji Kashi- wabara, Masataka Nakamura, Takeaki Uno and Gerhard Woeginger.
    [Show full text]
  • TOPOLOGY of RANDOM D-CLIQUE COMPLEXES 3
    TOPOLOGY OF RANDOM d-CLIQUE COMPLEXES DEMET TAYLAN Abstract. For a simplicial complex X, the d-clique complex ∆d(X) is the simplicial complex having all subsets of vertices whose (d + 1)-subsets are contained by X as its α −1 d+1 −1 faces. We prove that if p = n , with α< max{ k−d+1 , − k } or α> 2k+2 , then the k-th (d) ( d ) reduced homology group of the random d-clique complex ∆d(Gd(n,p)) is asymptotically −1 −1 (d+1)(k+1) −1 almost surely vanishing, and if t <α< t+1 where t = ( (d+1)(k+1) − k ) , then the ( d+1 ) ( +1) (kd + d − 1)-st reduced homology group of ∆d(Gd(n,p)) is asymptotically almost surely nonvanishing. This provides a partial answer to a question posed by Eric Babson. 1. Introduction logn One of the famous results in random graph theory establishes that p = n is a threshold for the connectivity of Erd˝os and R´enyi random graphs [5]. A 2-dimensional analogue of Erd˝os and R´enyi’s result was obtained by Linial-Meshulam in [7]. Meshulam-Wallach, in [10], presented a d-dimensional analogue for d ≥ 3. In particular, Linial-Meshulam- dlogn Wallach theorem yields that p = n is the threshold for the vanishing of the (d − 1)-st homology of the random simplicial complex Gd(n, p) with coefficients in a finite abelian group. Here, the random simplicial complex Gd(n, p) is a d-dimensional simplicial complex on [n] with a full (d − 1)-dimensional skeleton and with d-dimensional faces are chosen independently each with probability p.
    [Show full text]