Closeness Centrality

Closeness Centrality

Social Network Analysis Day 1 Graph theory, Centrality, Clustering, Transitivity Graph Theoretic Concepts • In this section we will cover: – ReachaBility/Connectednes – Definitions s – Terminology • Connectivity, flows – Adjacency – Isolates, Pendants, Centers – Density concepts – Components, bi-components • E.g, – Walk Lengths, distance Completeness • Geodesic distance – Walks, trails, paths – Independent paths – Cycles, Trees – Cutpoints, bridges What is a Graph? • G = (V,E) – A graph is a set of vertices and edges • Vertices, sometimes called nodes, are the actors or entities between which relationships exist – People – Organizations • Edges, sometimes called relations or lines, are the behavior/interaction/relationship of interest – Communicates with – Trusts – Uses Kinds of Network Data Complete Ego 1- Bonni Bo mode e b Biff Bill Be ty t Betsy Dr. Web MD 2- Jones Bill mode PDR Merc k manu al Patient Mo Jane m 1-mode1-mode Complete Complete NetworkNetwork Information flow within virtual group Data collected by Cross 2-mode Complete Network Data compiled from newspaper society pages by Davis, Gardner & Gardner Bipartite graphs • Used to represent 2- mode data • Nodes can be partitioned into two sets (corresponding to modes) • Ties occur only between sets, not within Complete Network Data vs. Complete Graph • The term “Complete Network Data” refers to collecting data for/from all actors (vertices) on the graph – The opposite if Ego-Network or Ego-Centric Network data, in which data is collected only from the perspective an individual (the ego) • The term “Complete Graph” refers to a graph where every edge that could exist in the graph, does: – For all i, j (j>i), v(i,j) = 1 Complete Complete Network Graph Data LEE STEVE GER Y BRAZEY BERT STEVE RUSS BERT BRAZEY LEE GERY RUSS EgoEgo NetworkNetwork AnalysisAnalysis Mainstream Ego Network Social Science Networks Analysis data perspective • Combine the perspective of network • Combineanalysis the with perspective the data of of mainstream network analysissocial withscience the data of mainstream social science 1-mode Ego Network Carter Administration meetings Data courtesy of Michael Year 1 Link Year 4 2-mode Ego Network Undirected Graphs • An undirected graph G(V,E) (often referred to simply as a graph or a simple graph) consists of … – Set of nodes|vertices V representing actors – Set of lines|links|edges E representing ties among pairs of actors • An edge is an unordered pair of nodes (u,v) • Nodes u and v adjacent if (u,v) Î E • So E is suBset of set of all pairs of nodes • Drawn without arrow heads – Sometimes with dual arrow heads • Used to represent logically symmetric social relations – In communication with; attending same meeting as Directed vs. Undirected Ties • Undirected relations – Attended meeting with – Communicates daily with • Directed relations – Lent money to • Logically vs empirically directed ties Bonnie – Empirically, even un- Bob directed relations can Biff be non-symmetric due to measurement error Betty Betsy Directed Graphs (Digraphs) • Digraph G(V,E) consists of … – Set of nodes V – Set of directed arcs E • An arc is an ordered pair of nodes (u,v) • (u,v) Î E indicates u sends arc to v • (u,v) Î E does not imply that (v,u) Î E • Ties drawn with arrow heads, which can be in both directions • Represent logically non-symmetric or anti- symmetric social relations – Lends money to Transpose Adjacency matrix • In directed graphs, interchanging rows/columns of adjacency matrix effectively reverses the direction & meaning of ties Mary Bill John Larry Mary Bill John Larry Mary 0 1 0 1 Mary 0 1 0 1 Bill 1 0 0 1 Bill 1 0 1 0 John 0 1 0 0 John 0 0 0 1 Larry 1 0 1 0 Larry 1 1 0 0 Gives money to Gets money from bill bill mary mary john john larry larry Valued Digraphs (vigraphs) • A valued digraph G(V,E,W) consists of … – Set of nodes V – Set of directed arcs E • An arc is an ordered pair of 3.72 nodes (u,v) • (u,v) ∈ E indicates u sends 2.9 5.28 arc to v 0.1 • (u,v) ∈ E does not imply that 3.2 (v,u) ∈ E 1.2 5.1 – Mapping W of arcs to real values 3.5 • Values can represent such things as 8.9 – Strength of relationship 9.1 1.5 – Information capacity of tie – Rates of flow or traffic across tie – Distances between nodes – Probabilities of passing on information – Frequency of interaction Valued Adjacency Matrix Dichotomized • The diagram below uses solid lines to Jim Jill Jen Joe represent the adjacency matrix, while the numbers along the solid line (and Jim - 1 0 1 dotted lines where necessary) Jill 1 - 1 0 represent the proximity matrix. • In this particular case, one can derive Jen 0 1 - 1 the adjacency matrix by dichotomizing Joe 1 0 1 - the proximity matrix on a condition of pij <= 3. Distances btw offices Jill Jim Jill Jen Joe 1 3 Jim - 3 9 2 Jen Jim Jill 3 - 1 15 9 Jen 9 1 - 3 15 Joe 2 15 3 - 3 2 Joe Node-related concepts • Degree BRAZE – Y The number of ties incident LEE upon a node – In a digraph, we have indegree STEV EVANDER BER E (number of arcs to a node) and T outdegree (number of arcs from BILL a node) RUS GERY S MICHAEL • Pendant HARRY – A node connected to a DON JOH component through only N one edge or arc HOLLY • A node with degree 1 • Example: John PAULINE PAM PAT • Isolate CAROL – A node which is a component ANN on its own JENNIE • E.g., Evander Graph traversals BRAZEY LEE • Walk – Any unrestricted traversing of vertices STEVE across edges (Russ-Steve-Bert-Lee-Steve) BERT • Trail BILL – A walk restricted by not repeating an edge GERY or arc, although vertices can be revisited RUSS (Steve-Bert-Lee-Steve-Russ) MICHAEL • Path HARRY – A trail restricted by not revisiting any vertex (Steve- DON Lee-Bert-Russ) JOHN • Geodesic Path HOLLY – The shortest path(s) between two vertices (Steve- Russ-John is shortest path from Steve to John) PAULINE • Cycle PAM PAT – A cycle is in all ways just like a path except that it ends where it begins CAROL – Aside from endpoints, cycles do not repeat nodes ANN – E.g. Brazey-Lee-Bert-Steve-Brazey JENNIE LengthLength && DistanceDistance • Length of a path (or any 10 • Length of a path (or 10 12 walk)any is the walk) number is the of 12 links numberit has of links it has 1111 8 8 • The• GeodesicThe Geodesic Distance Distance 9 9 (aka graph-theoretic(aka graph-theoretic distance)distance) between between two two 22 7 7 nodesnodes is the is lengththe length of theof the shortest path shortest path 3 1 – Distance from 5 to 8 is 2, 3 1 6 – Distancebecause from the 5 shortestto 8 is 2, 6 becausepath the (5- shortest1-8) has two pathlinks (5-1-8) has two links 4 5 4 5 Geodesic Distance Matrix a b c d e f g 0 1 2 3 2 3 4 a 1 0 1 2 1 2 3 b c 2 1 0 1 1 2 3 d 3 2 1 0 2 3 4 e 2 1 1 2 0 1 2 f 3 2 2 3 1 0 1 g 4 3 3 4 2 1 0 SubgraphsSubgraphs • Set of nodes bc • Set of nodes – Is just a set of nodes b c – Is just a set of nodes d • A subgraph a d • A subgraph a – Is set –of Isnodes set of together nodes together with ties amongwith them ties among them f f ee • An induced• An induced subgraphsubgraph – Subgraph defined by a set – Subgraph defined by a set of bcb c nodes of nodes – Like pulling the nodes and – Like pulling the nodes and d d ties out of the original a a ties out graphof the original graph f e f e Subgraph induced by Subgraphconsidering induced the set by{a,b,c,f,e} considering the set {a,b,c,f,e} Components • Maximal sets of nodes in which every node can reach every other by some path (no matter how long) • A graph is connected if it has just one component It is relations (types of tie) that define different networks, not components. A network that has two components remains one (disconnected) network. Components in Directed Graphs • Strong component – There is a directed path from each member of the component to every other • Weak component – There is an undirected path (a weak path) from every member of the component to every other – Is like ignoring the direction of ties – driving the wrong way if you have to A graph with 1 Weak Component & 4 Strong BILL components HARRY DON MICHAEL HOLLY PAT GERY LEE STEVE CAROL JENNIE BRAZEY PAM RUSS BERT JOHN ANN PAULINE Cutpoints and Bridges • Cutpoint s – If a tie is a bridge, at least one of A node which, e its endpoints must be a if deleted, q cutpoint would b increase the g m number of d a components p f • Bridge l h i – A tie that, if j c removed, would k increase n the number r of o components Data Representation • Adjacency matrix • Edgelist • Adjacency/node list data representation – adjacency matrix ! Represen)ng!edges!(who!is!adjacent!to! whom)!as!a!matrix! ! Aij!=!1!if!node!i!has!an!edge!to!node!j! !!!=!0!if!node$i$does!not!have!an!edge!to!j! ! Aii!=!0!unless!the!network!has!self`loops! ! Aij!=!Aji!if!the!network!is!undirected,! or!if!i!and$j!share!a!reciprocated!edge! data representation – adjacency matrix 2( 0! 0! 0! 0! 0! 3( 1( 0! 0! 1! 1! 0! A!=! 0! 1! 0! 1! 0! 0! 0! 0! 0! 1! 5( 4( 1! 1! 0! 0! 0! Issues: 1.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    124 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us