Measuring Network Centrality Using Hypergraphs

Total Page:16

File Type:pdf, Size:1020Kb

Measuring Network Centrality Using Hypergraphs Measuring Network Centrality Using Hypergraphs Sanjukta Roy Balaraman Ravindran Chennai Mathematical Institute Indian Institute of Technology Madras [email protected] [email protected] ABSTRACT real- world systems. For example, if we model a collabora- Networks abstracted as graph lose some information related tion network as a simple graph, we would have to reduce the to the super-dyadic relation among the nodes. We find nat- interaction into a set of two way interactions, and if there ural occurrence of hyperedges in co-authorship, co-citation, are multiple actors collaborating on the same project, the social networks, e-mail networks, weblog networks etc. Treat- representation will be a clique on all those actors. As we ing these networks as hypergraph preserves the super-dyadic will show, we lose information about the set of actors col- relations. But the primal graph or Gaifmann graph associ- laborating on the same project in the clique construction. A ated with hypergraphs converts each hyperedge to a clique natural way to model these interaction is through a hyper- losing again the n-ary relationship among nodes. We aim to graph. In a simple graph, an edge is represented by a pair measure Shapley Value based centrality on these networks of vertices, whereas an hyperedge is a non-empty subset of without losing the super-dyadic information. For this pur- vertices. pose, we use co-operative games on single graph representa- Multi-way interactions, known as super-dyadic relations, tion of a hypergraph such that Shapley value can be com- characterize many real world applications and there has been puted efficiently[1]. We propose several methods to generate much interest lately in modeling such relations, especially simpler graphs from hypergraphs and study the efficacy of using hypergraphs [3, 4, 5, 6]. While there have been several the centrality scores computed on these constructions. approaches proposed for using various graph properties, such as connectivity, centrality, modularity, etc., for modeling in- teractions between actors, the extensions of these notions to Categories and Subject Descriptors hypergraphs is still an active area of research. The typical G.2.2 [Hypergraphs]: Network Centrality approach is to construct a simple graph from the hypergraph and compute the properties on the regular graph. The ap- Keywords proaches differ in the construction of the simple graph. A hypergraph of tag co-occurrences has been used for de- Hypergraph, Shapley value, Centrality tecting communities using betweenness centrality [3]. Puzis et al. [4] used betweenness as a centrality measure on hy- 1. INTRODUCTION pergraphs and gave an efficient approach using Brandes al- The study of networks represents an important area of gorithm. Zhou et al. [7] pointed out the information loss multidisciplinary research involving Physics, Mathematics, occurs when hypergraphs are treated as simple graphs and Chemistry, Biology, Social sciences, and Information sci- utilised hypergraphs to device clustering technique. Kather- ences. These systems are commonly represented using sim- ine Faust [5] used non-dyadic affiliation relationships which ple graphs in which the nodes represent the objects under in- allowed to quantify the centrality of a subset of actors or a vestigation, e.g., people, proteins, molecules, computer sys- subset of events. She calculated an actor's centrality as a tems etc., and the edges represent interactions between the function of the centrality of hyperedges of events to which nodes. But, there are occasions in which interactions in- the actor belongs and calculated the centrality of events as a volve more than two actors. For example, deliberations that function of centrality of the actors. Hypergraphs have been take place in a committee are multi-way interactions [2]. In used to understand team assembly mechanisms to determine such circumstances, using simple graphs to represent com- collaboration [8, 9, 10]. Jain et al. [11] studied Indian Rail plex networks does not provide complete description of the network using hypergraph. Of particular interest to us in this work are centrality mea- sures. A centrality measure identifies nodes that are in the Permission to make digital or hard copies of all or part of this work for `center'of a network. In practice, identifying exactly what personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear we mean by `center'depends on the application of interest. this notice and the full citation on the first page. Copyrights for components In this work, we mainly look at this from the view point of of this work owned by others than ACM must be honored. Abstracting with information propagation. Independent Cascade and Linear credit is permitted. To copy otherwise, or republish, to post on servers or to Threshold are the two information diffusion models proposed redistribute to lists, requires prior specific permission and/or a fee. Request in [12]. To find a seed set of k nodes that maximizes influ- permissions from [email protected]. CODS ’15, March 18 - 21, 2015, Bangalore, India ence propagation is in general a hard problem and [12] gave Copyright 2015 ACM 978-1-4503-3436-5/15/03 $15.00. an (1 − 1=e − ) approximation under the assumption that http://dx.doi.org/10.1145/2732587.2732595 the activation functions are submodular. We use the diffu- 2 i j sion models of [12] to define measures of effectiveness of the different centrality scores that we investigate. 1 There have been some earlier attempts at computing cen- trality scores on hypergraphs. Faust [5] looked at a hy- 1 2 pergraph as a bipartite graph[6] between actors and events 3 when computing closeness centrality. Puzis et al.[4] essen- tially converted each hyperedge to a clique and computed be- tweenness centrality on the modified graph efficiently. Rep- l k resenting each hyperedge as a clique converts the hypergraph 1 to a general graph. We call this conversion as clique con- struction. Clique construction is the method that is widely Figure 1: wt-clique construction for H and H : used for computations on hypergraph. But in this work we 1 2 W (H ) and W (H ) are the same argue that the clique construction loses the non-dyadic in- 1 2 formation represented by the hyperedges. Another, slightly different way of looking at the hypergraph is using the edge multiplicity between two nodes as the weight on the edge sider a cooperative game with transferable utilities, (N; ν), between those nodes in the clique construction. This con- where N = 1; 2; : : : ; n is the set of players and ν : 2N !R is version we call as wt-clique construction. a characteristic function, that assigns a value to each coali- tion (subset of N) , with ν(φ) = 0. The Shapley value (SV) The following is a toy example where we look at a small of player i 2 N, is network so that we can easily measure influence of each node. 1 X This example shows that the traditional computations on SV (i) = : ν(S (π) [ fig) − ν(S (π)) (1) n! i i hypergraphs indeed give away some information. π2Ω Example 1. where Ω is the set of all permutations over N , and Si(π) is the set of players appearing before the ith player in permu- Let H = (V; E ) be the hypergraph with V = fi; j; k; lg 1 1 tation π. and We use the same game (Topk Game) that is described in E = ffi; jg; fi; kg; fj; kg; fi; k; lg; fi; j; kg; fj; lgg and H = 1 2 [13]. Then we extend the game for multi-hop on weighted (V; E ) be another hypergraph with 2 graphs as done in Topk Game with weights in [1]. We mea- E = ffi; j; kg; fi; j; k; lg; fi; kgg. 2 sure influence of a node using Shapley Value (SV) of the These two hypergraphs can be thought of as two collab- node as in [13, 1]. This computation is fast on graphs. As oration networks where i; j; k; l are authors and each edge shown in [1], it is linear in number of edges and vertices. We represents a document the authors have written. For exam- give an equally efficient algorithm for computing ranking on ple, in the first network (given by H ), the first edge i; j 1 hypergraphs. We compute these ranking using clique con- represents a document written by author i and j together. struction; wt-clique construction and three of our proposed Notice here, in both the networks each author has written a methods and compare the results using diffusion models. document with every one else but the number of times two The main contributions of this work are: (1) Different authors have written a document together is different. For methods of reducing hypergraphs to simpler forms for the example, in the second network (given by H ) i has written 2 purposes of centrality computations. These reductions could one document with l and three documents with k. Let us possibly yield more efficient mechanisms for computing other assume we want to measure the centrality of nodes in these network properties; (2) Computation of game-theoretic cen- two networks to calculate fluency of information propaga- trality measures on super-dyadic relations. These have been tion between two authors. Our philosophy is that it must shown to be efficient to compute on simple graphs. And, this be easy to propagate information from k to i than from l to extension enables us to compute centrality scores on large i. super-dyadic networks; and (3) Empirical study of how cen- If we convert the hyperedges of H and H to clique (clique 1 2 trality scores are distributed over the nodes of the network construction) for both H and H , we get the same graph, 1 2 under different graph constructions.
Recommended publications
  • Practical Parallel Hypergraph Algorithms
    Practical Parallel Hypergraph Algorithms Julian Shun [email protected] MIT CSAIL Abstract v While there has been signicant work on parallel graph pro- 0 cessing, there has been very surprisingly little work on high- e0 performance hypergraph processing. This paper presents v0 v1 v1 a collection of ecient parallel algorithms for hypergraph processing, including algorithms for betweenness central- e1 ity, maximal independent set, k-core decomposition, hyper- v2 trees, hyperpaths, connected components, PageRank, and v2 v3 e single-source shortest paths. For these problems, we either 2 provide new parallel algorithms or more ecient implemen- v3 tations than prior work. Furthermore, our algorithms are theoretically-ecient in terms of work and depth. To imple- (a) Hypergraph (b) Bipartite representation ment our algorithms, we extend the Ligra graph processing Figure 1. An example hypergraph representing the groups framework to support hypergraphs, and our implementa- , , , , , , and , , and its bipartite repre- { 0 1 2} { 1 2 3} { 0 3} tions benet from graph optimizations including switching sentation. between sparse and dense traversals based on the frontier size, edge-aware parallelization, using buckets to prioritize processing of vertices, and compression. Our experiments represented as hyperedges, can contain an arbitrary number on a 72-core machine and show that our algorithms obtain of vertices. Hyperedges correspond to group relationships excellent parallel speedups, and are signicantly faster than among vertices (e.g., a community in a social network). An algorithms in existing hypergraph processing frameworks. example of a hypergraph is shown in Figure 1a. CCS Concepts • Computing methodologies → Paral- Hypergraphs have been shown to enable richer analy- lel algorithms; Shared memory algorithms.
    [Show full text]
  • Networkx: Network Analysis with Python
    NetworkX: Network Analysis with Python Salvatore Scellato Full tutorial presented at the XXX SunBelt Conference “NetworkX introduction: Hacking social networks using the Python programming language” by Aric Hagberg & Drew Conway Outline 1. Introduction to NetworkX 2. Getting started with Python and NetworkX 3. Basic network analysis 4. Writing your own code 5. You are ready for your project! 1. Introduction to NetworkX. Introduction to NetworkX - network analysis Vast amounts of network data are being generated and collected • Sociology: web pages, mobile phones, social networks • Technology: Internet routers, vehicular flows, power grids How can we analyze this networks? Introduction to NetworkX - Python awesomeness Introduction to NetworkX “Python package for the creation, manipulation and study of the structure, dynamics and functions of complex networks.” • Data structures for representing many types of networks, or graphs • Nodes can be any (hashable) Python object, edges can contain arbitrary data • Flexibility ideal for representing networks found in many different fields • Easy to install on multiple platforms • Online up-to-date documentation • First public release in April 2005 Introduction to NetworkX - design requirements • Tool to study the structure and dynamics of social, biological, and infrastructure networks • Ease-of-use and rapid development in a collaborative, multidisciplinary environment • Easy to learn, easy to teach • Open-source tool base that can easily grow in a multidisciplinary environment with non-expert users
    [Show full text]
  • Measuring Homophily
    Measuring Homophily Matteo Cristani, Diana Fogoroasi, and Claudio Tomazzoli University of Verona fmatteo.cristani, diana.fogoroasi.studenti, [email protected] Abstract. Social Network Analysis is employed widely as a means to compute the probability that a given message flows through a social net- work. This approach is mainly grounded upon the correct usage of three basic graph- theoretic measures: degree centrality, closeness centrality and betweeness centrality. We show that, in general, those indices are not adapt to foresee the flow of a given message, that depends upon indices based on the sharing of interests and the trust about depth in knowledge of a topic. We provide new definitions for measures that over- come the drawbacks of general indices discussed above, using Semantic Social Network Analysis, and show experimental results that show that with these measures we have a different understanding of a social network compared to standard measures. 1 Introduction Social Networks are considered, on the current panorama of web applications, as the principal virtual space for online communication. Therefore, it is of strong relevance for practical applications to understand how strong a member of the network is with respect to the others. Traditionally, sociological investigations have dealt with problems of defining properties of the users that can value their relevance (sometimes their impor- tance, that can be considered different, the first denoting the ability to emerge, and the second the relevance perceived by the others). Scholars have developed several measures and studied how to compute them in different types of graphs, used as models for social networks. This field of research has been named Social Network Analysis.
    [Show full text]
  • Modeling Customer Preferences Using Multidimensional Network Analysis in Engineering Design
    Modeling customer preferences using multidimensional network analysis in engineering design Mingxian Wang1, Wei Chen1, Yun Huang2, Noshir S. Contractor2 and Yan Fu3 1 Department of Mechanical Engineering, Northwestern University, Evanston, IL 60208, USA 2 Science of Networks in Communities, Northwestern University, Evanston, IL 60208, USA 3 Global Data Insight and Analytics, Ford Motor Company, Dearborn, MI 48121, USA Abstract Motivated by overcoming the existing utility-based choice modeling approaches, we present a novel conceptual framework of multidimensional network analysis (MNA) for modeling customer preferences in supporting design decisions. In the proposed multidimensional customer–product network (MCPN), customer–product interactions are viewed as a socio-technical system where separate entities of `customers' and `products' are simultaneously modeled as two layers of a network, and multiple types of relations, such as consideration and purchase, product associations, and customer social interactions, are considered. We first introduce a unidimensional network where aggregated customer preferences and product similarities are analyzed to inform designers about the implied product competitions and market segments. We then extend the network to a multidimensional structure where customer social interactions are introduced for evaluating social influence on heterogeneous product preferences. Beyond the traditional descriptive analysis used in network analysis, we employ the exponential random graph model (ERGM) as a unified statistical
    [Show full text]
  • Multidimensional Network Analysis
    Universita` degli Studi di Pisa Dipartimento di Informatica Dottorato di Ricerca in Informatica Ph.D. Thesis Multidimensional Network Analysis Michele Coscia Supervisor Supervisor Fosca Giannotti Dino Pedreschi May 9, 2012 Abstract This thesis is focused on the study of multidimensional networks. A multidimensional network is a network in which among the nodes there may be multiple different qualitative and quantitative relations. Traditionally, complex network analysis has focused on networks with only one kind of relation. Even with this constraint, monodimensional networks posed many analytic challenges, being representations of ubiquitous complex systems in nature. However, it is a matter of common experience that the constraint of considering only one single relation at a time limits the set of real world phenomena that can be represented with complex networks. When multiple different relations act at the same time, traditional complex network analysis cannot provide suitable an- alytic tools. To provide the suitable tools for this scenario is exactly the aim of this thesis: the creation and study of a Multidimensional Network Analysis, to extend the toolbox of complex network analysis and grasp the complexity of real world phenomena. The urgency and need for a multidimensional network analysis is here presented, along with an empirical proof of the ubiquity of this multifaceted reality in different complex networks, and some related works that in the last two years were proposed in this novel setting, yet to be systematically defined. Then, we tackle the foundations of the multidimensional setting at different levels, both by looking at the basic exten- sions of the known model and by developing novel algorithms and frameworks for well-understood and useful problems, such as community discovery (our main case study), temporal analysis, link prediction and more.
    [Show full text]
  • Multilevel Hypergraph Partitioning with Vertex Weights Revisited
    Multilevel Hypergraph Partitioning with Vertex Weights Revisited Tobias Heuer ! Karlsruhe Institute of Technology, Karlsruhe, Germany Nikolai Maas ! Karlsruhe Institute of Technology, Karlsruhe, Germany Sebastian Schlag ! Karlsruhe Institute of Technology, Karlsruhe, Germany Abstract The balanced hypergraph partitioning problem (HGP) is to partition the vertex set of a hypergraph into k disjoint blocks of bounded weight, while minimizing an objective function defined on the hyperedges. Whereas real-world applications often use vertex and edge weights to accurately model the underlying problem, the HGP research community commonly works with unweighted instances. In this paper, we argue that, in the presence of vertex weights, current balance constraint definitions either yield infeasible partitioning problems or allow unnecessarily large imbalances and propose a new definition that overcomes these problems. We show that state-of-the-art hypergraph partitioners often struggle considerably with weighted instances and tight balance constraints (even with our new balance definition). Thus, we present a recursive-bipartitioning technique that isable to reliably compute balanced (and hence feasible) solutions. The proposed method balances the partition by pre-assigning a small subset of the heaviest vertices to the two blocks of each bipartition (using an algorithm originally developed for the job scheduling problem) and optimizes the actual partitioning objective on the remaining vertices. We integrate our algorithm into the multilevel hypergraph
    [Show full text]
  • On Decomposing a Hypergraph Into K Connected Sub-Hypergraphs
    Egervary´ Research Group on Combinatorial Optimization Technical reportS TR-2001-02. Published by the Egrerv´aryResearch Group, P´azm´any P. s´et´any 1/C, H{1117, Budapest, Hungary. Web site: www.cs.elte.hu/egres . ISSN 1587{4451. On decomposing a hypergraph into k connected sub-hypergraphs Andr´asFrank, Tam´asKir´aly,and Matthias Kriesell February 2001 Revised July 2001 EGRES Technical Report No. 2001-02 1 On decomposing a hypergraph into k connected sub-hypergraphs Andr´asFrank?, Tam´asKir´aly??, and Matthias Kriesell??? Abstract By applying the matroid partition theorem of J. Edmonds [1] to a hyper- graphic generalization of graphic matroids, due to M. Lorea [3], we obtain a gen- eralization of Tutte's disjoint trees theorem for hypergraphs. As a corollary, we prove for positive integers k and q that every (kq)-edge-connected hypergraph of rank q can be decomposed into k connected sub-hypergraphs, a well-known result for q = 2. Another by-product is a connectivity-type sufficient condition for the existence of k edge-disjoint Steiner trees in a bipartite graph. Keywords: Hypergraph; Matroid; Steiner tree 1 Introduction An undirected graph G = (V; E) is called connected if there is an edge connecting X and V X for every nonempty, proper subset X of V . Connectivity of a graph is equivalent− to the existence of a spanning tree. As a connected graph on t nodes contains at least t 1 edges, one has the following alternative characterization of connectivity. − Proposition 1.1. A graph G = (V; E) is connected if and only if the number of edges connecting distinct classes of is at least t 1 for every partition := V1;V2;:::;Vt of V into non-empty subsets.P − P f g ?Department of Operations Research, E¨otv¨osUniversity, Kecskem´etiu.
    [Show full text]
  • Hypergraph Packing and Sparse Bipartite Ramsey Numbers
    Hypergraph packing and sparse bipartite Ramsey numbers David Conlon ∗ Abstract We prove that there exists a constant c such that, for any integer ∆, the Ramsey number of a bipartite graph on n vertices with maximum degree ∆ is less than 2c∆n. A probabilistic argument due to Graham, R¨odland Ruci´nskiimplies that this result is essentially sharp, up to the constant c in the exponent. Our proof hinges upon a quantitative form of a hypergraph packing result of R¨odl,Ruci´nskiand Taraz. 1 Introduction For a graph G, the Ramsey number r(G) is defined to be the smallest natural number n such that, in any two-colouring of the edges of Kn, there exists a monochromatic copy of G. That these numbers exist was proven by Ramsey [19] and rediscovered independently by Erd}osand Szekeres [10]. Whereas the original focus was on finding the Ramsey numbers of complete graphs, in which case it is known that t t p2 r(Kt) 4 ; ≤ ≤ the field has broadened considerably over the years. One of the most famous results in the area to date is the theorem, due to Chvat´al,R¨odl,Szemer´ediand Trotter [7], that if a graph G, on n vertices, has maximum degree ∆, then r(G) c(∆)n; ≤ where c(∆) is just some appropriate constant depending only on ∆. Their proof makes use of the regularity lemma and because of this the bound it gives on the constant c(∆) is (and is necessarily [11]) very bad. The situation was improved somewhat by Eaton [8], who proved, by using a variant of the regularity lemma, that the function c(∆) may be taken to be of the form 22c∆ .
    [Show full text]
  • A Geometric Approach to Betweenness
    A GEOMETRIC APPROACH TO BETWEENNESS y z BENNY CHOR AND MADHU SUDAN Abstract An input to the betweenness problem contains m constraints over n real variables p oints Each constraint consists of three p oints where one of the p oint is sp ecied to lie inside the interval dened by the other two The order of the other two p oints ie which one is the largest and which one is the smallest is not sp ecied This problem comes up in questions related to physical mapping in molecular biology In Opatrny showed that the problem of deciding whether the n p oints can b e totally ordered while satisfying the m b etweenness constraints is NPcomplete Furthermore the problem is MAX SNP complete and for every nding a total order that satises at least of the m constraints is NPhard even if all the constraints are satisable It is easy to nd an ordering of the p oints that satises of the m constraints eg by cho osing the ordering at random This pap er presents a p olynomial time algorithm that either determines that there is no feasible solution or nds a total order that satises at least of the m constraints The algorithm translates the problem into a set of quadratic inequalities and solves a semidenite relaxation of n them in R The n solution p oints are then pro jected on a random line through the origin The claimed p erformance guarantee is shown using simple geometric prop erties of the SDP solution Key words Approximation algorithm Semidenite programming NPcompleteness Compu tational biology AMS sub ject classications Q Q Intro duction An input
    [Show full text]
  • Hypernetwork Science: from Multidimensional Networks to Computational Topology∗
    Hypernetwork Science: From Multidimensional Networks to Computational Topology∗ Cliff A. Joslyn,y Sinan Aksoy,z Tiffany J. Callahan,x Lawrence E. Hunter,x Brett Jefferson,z Brenda Praggastis,y Emilie A.H. Purvine,y Ignacio J. Tripodix March, 2020 Abstract cations, and physical infrastructure often afford a rep- resentation as such a set of entities with binary rela- As data structures and mathematical objects used tionships, and hence may be analyzed utilizing graph for complex systems modeling, hypergraphs sit nicely theoretical methods. poised between on the one hand the world of net- Graph models benefit from simplicity and a degree of work models, and on the other that of higher-order universality. But as abstract mathematical objects, mathematical abstractions from algebra, lattice the- graphs are limited to representing pairwise relation- ory, and topology. They are able to represent com- ships between entities, while real-world phenomena plex systems interactions more faithfully than graphs in these systems can be rich in multi-way relation- and networks, while also being some of the simplest ships involving interactions among more than two classes of systems representing topological structures entities, dependencies between more than two vari- as collections of multidimensional objects connected ables, or properties of collections of more than two in a particular pattern. In this paper we discuss objects. Representing group interactions is not possi- the role of (undirected) hypergraphs in the science ble in graphs natively, but rather requires either more of complex networks, and provide a mathematical complex mathematical objects, or coding schemes like overview of the core concepts needed for hypernet- “reification” or semantic labeling in bipartite graphs.
    [Show full text]
  • Approximating Cumulative Pebbling Cost Is Unique Games Hard
    Approximating Cumulative Pebbling Cost Is Unique Games Hard Jeremiah Blocki Department of Computer Science, Purdue University, West Lafayette, IN, USA https://www.cs.purdue.edu/homes/jblocki [email protected] Seunghoon Lee Department of Computer Science, Purdue University, West Lafayette, IN, USA https://www.cs.purdue.edu/homes/lee2856 [email protected] Samson Zhou School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA https://samsonzhou.github.io/ [email protected] Abstract P The cumulative pebbling complexity of a directed acyclic graph G is defined as cc(G) = minP i |Pi|, where the minimum is taken over all legal (parallel) black pebblings of G and |Pi| denotes the number of pebbles on the graph during round i. Intuitively, cc(G) captures the amortized Space-Time complexity of pebbling m copies of G in parallel. The cumulative pebbling complexity of a graph G is of particular interest in the field of cryptography as cc(G) is tightly related to the amortized Area-Time complexity of the Data-Independent Memory-Hard Function (iMHF) fG,H [7] defined using a constant indegree directed acyclic graph (DAG) G and a random oracle H(·). A secure iMHF should have amortized Space-Time complexity as high as possible, e.g., to deter brute-force password attacker who wants to find x such that fG,H (x) = h. Thus, to analyze the (in)security of a candidate iMHF fG,H , it is crucial to estimate the value cc(G) but currently, upper and lower bounds for leading iMHF candidates differ by several orders of magnitude. Blocki and Zhou recently showed that it is NP-Hard to compute cc(G), but their techniques do not even rule out an efficient (1 + ε)-approximation algorithm for any constant ε > 0.
    [Show full text]
  • A Framework for Transductive and Inductive Learning on Hypergraphs
    Deep Hyperedges: a Framework for Transductive and Inductive Learning on Hypergraphs Josh Payne IBM T. J. Watson Research Center Yorktown Heights, NY 10598 [email protected] Abstract From social networks to protein complexes to disease genomes to visual data, hypergraphs are everywhere. However, the scope of research studying deep learning on hypergraphs is still quite sparse and nascent, as there has not yet existed an effective, unified framework for using hyperedge and vertex embeddings jointly in the hypergraph context, despite a large body of prior work that has shown the utility of deep learning over graphs and sets. Building upon these recent advances, we propose Deep Hyperedges (DHE), a modular framework that jointly uses contextual and permutation-invariant vertex membership properties of hyperedges in hypergraphs to perform classification and regression in transductive and inductive learning settings. In our experiments, we use a novel random walk procedure and show that our model achieves and, in most cases, surpasses state-of-the-art performance on benchmark datasets. Additionally, we study our framework’s performance on a variety of diverse, non-standard hypergraph datasets and propose several avenues of future work to further enhance DHE. 1 Introduction and Background As data becomes more plentiful, we find ourselves with access to increasingly complex networks that can be used to express many different types of information. Hypergraphs have been used to recommend music [1], model cellular and protein-protein interaction networks [2, 3], classify images and perform 3D object recognition [4, 5], diagnose Alzheimer’s disease [6], analyze social relationships [7, 8], and classify gene expression [9]—the list goes on.
    [Show full text]