From Las Vegas to Monte Carlo and Back: Sampling Cycles in Graphs

From Las Vegas to Monte Carlo and Back: Sampling Cycles in Graphs

From Las Vegas to Monte Carlo and back: Sampling cycles in graphs Konstantin Klemm SEHRLASSIGER¨ LEHRSASS--EL¨ for Bioinformatics University of Leipzig 1 Agenda • Cycles in graphs: Why? What? Sampling? • Method I: Las Vegas • Method II: Monte Carlo • Robust cycle bases 2 Why care about cycles? (1) Chemical Ring Perception 3 Why care about cycles? (2) Analysis of chemical reaction networks (Io's ath- mosphere) 4 Why care about cycles? (3) • Protein interaction networks • Internet graph • Social networks • : : : Aim: Detailed comparison between network models and empiri- cal networks with respect to presence / absence of cycles 5 Sampling • Interested in average value of a cycle property f(C) 1 hfi = X f(C) jfcyclesgj C2fcyclesg with f(C) = jCj or f(C) = δjCj;h or : : : • exhaustive enumeration of fcyclesg not feasible • approximate hfi by summing over representative, randomly selected subset S ⊂ fcyclesg • How do we generate S then? 6 Las Vegas • Sampling method based on self-avoiding random walk [Rozenfeld et al. (2004), cond-mat/0403536] • probably motivated by the movie \Lost in Las Vegas" (though the authors do not say explicitly) 1. Choose starting vertex s. 2. Hop to randomly chosen neighbour, avoiding previously vis- ited vertices except s. 3. Repeat 2. unless reaching s again or getting stuck 7 Las Vegas { trouble • Number of cycles of length h in complete graph KN N! W (h) = (2h)−1 (N − h)! • For N = 100, W (100)=W (3) ≈ 10150 • Flat cycle length distribution in KN from Rozenfeld method 1 p(h) = (N − 2) • Undersampling of long cycles 8 Las Vegas | results 100 12 10 h* Nh 8 10 10 100 1000 N 4 10 0 20 40 60 80 100 120 140 h Generalized random graphs ("static model") with N = 100; 200; 400; 800, hki = 2, β = 0:5 Leaving Las Vegas : : : 9 Monte Carlo | summing cycles • Sum of two cycles yields new cycle: + = + = • (generalized) cycle: subgraph, all degrees even • simple cycle: connected subgraph, all degrees = 2. 10 Monte Carlo | cycle space (0,0,0,0) (0,0,0,1) (0,0,1,0) (0,0,1,1) (0,1,0,0) (0,1,0,1) (0,1,1,0) (0,1,1,1) (1,0,0,0) (1,0,0,1) (1,0,1,0) (1,0,1,1) (1,1,0,0) (1,1,0,1) (1,1,1,0) (1,1,1,1) • cycle space: contains all (generalized) cycles • finite-dimensional vector space, has cycle basis Linear Algebra kicks butt!!! 11 Monte Carlo | algorithm 1. Generate cycle basis of simple cycles B1; B2; : : : ; Bν 2. Set current cycle C := 0 (empty cycle) 3. (Propose) Draw random index i 2 f1; 2; : : : ; νg 4. (Accept) If C + Bi is simple or null cycle, set C := C + Bi (Reject) Otherwise, leave C unchanged 5. Resume at 3. (or stop after desired number of iterations) 12 Monte Carlo | preliminary results -2 10 -4 10 -6 10 normalized distribution -8 10 0 100 200 300 400 cycle length h Generalized random graphs ("static model") with N = 100; 200; 400; 800, hki = 2, β = 0:5 13 Monte Carlo | properties • detailed balance | ok • adjacent simple cycles equiprobable • extensions easy, e.g. Metropolis with Energy := cycle length • But: ergodicity ?!? 14 Monte Carlo | trouble = + + + = • reachability of cycles depends on choice of cycle basis • long cycles particularly difficult to reach | less space for "maneuvers" 15 Robust cycle bases Kainen (2000): A cycle basis B is robust if for every [simple] cycle Z there is a linear ordering of the subset C(G; B; Z) such that, as each element in the resulting sequence is added to form the sum Z, it intersects the sum of those preceding it in a nontrivial path. In this case, the partial sums must be cycles. A cycle basis is called cyclically robust when the sum of the new cycle and those that went before remains a cycle. Relevance here: basis (cyclically) robust ) ergodic Monte Carlo 16 Robust cycle bases | known results • planar graphs: planar basis, basis cycles are outlines of faces in a planar embedding • complete graphs (Kainen): pick arbitrary vertex x, basis cy- cles are all triangles containing x • slightly more general: graphs spanned by a star (argument analogous to complete graphs) • No general criterion for existence of (cyclically) robust bases 17 Summary / Outlook • Naive approaches tend to give bad statistics due to under- sampling of long cycles • Cycle space method is powerful if ergodicity can be ensured. • Still lost and hopping through Las Vegas? • Escape by (1) finding robust cycle bases, and/or (2) considering move sets beyond cycle bases Co-starring: Peter F. Stadler 18.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    18 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us