Coding Theory of Permutations/Multipermutations and in the LEAN Theorem Prover

Total Page:16

File Type:pdf, Size:1020Kb

Coding Theory of Permutations/Multipermutations and in the LEAN Theorem Prover Coding Theory of Permutations/Multipermutations and in the LEAN Theorem Prover May 2018 Chiba University Graduate School of Science Division of Department of Mathematics and Informatics Justin Kong for Lukela i I.ABSTRACT This thesis investigates problems in Coding theory, a fundamental field of study for the reliable transmission or storage of data. Contributions are in two general areas, permutation/multipermutation codes and coding theory formalization. Linear Codes are well studied, but less is known about nonlinear codes. Permutation and multipermutation codes are types of nonlinear codes. Formal- ization involves the precise statement of mathematical theorems in a specified language in such a way that the veracity of these theorems may be verified algorithmically. Although coding theory is a field of study well-suited for formalization, work in coding theory formalization has been relatively limited. In this thesis we advance the study of permutation and multipermutation codes in three ways. First, we extend an LP (linear programming) decoding method originally designed for permutation codes in the Euclidean metric to permutation codes in the Kendall tau metric. Second, we provide new methods for calculating Ulam permutation sphere sizes. The results are used to present new bounds on the maximum possible size of permutation codes in the Ulam metric, including the fundamental theorem that nontrivial perfect Ulam permutation codes do not exist. Third, new methods for calculating Ulam multipermutation sphere sizes for certain parameters are provided as well as resulting bounds on the maximum possible code size. In this thesis we advance the study of formalization in coding theory in two ways. Contri- butions are made using the Lean theorem proof assistant, a software designed specifically for formalization. The first contribution is a set of files that formalize relevant basic mathematical definitions and theorems as well as the famous Repetition codes and Hamming (7,4) code. The second contribution is a set of files that formalize Levenshtein codes and related definitions and theorems. Levenshtein codes are a type of nonlinear deletion and/or insertion correcting code. ii II.ACKNOWLEDGEMENTS Firstly, I would like to thank my Lord and Savior Jesus Christ, who has lead me throughout my journey of education and without whom I cannot so much as breathe. I am grateful for the love and support of my family, especially my wife Keiko who has been a constant encouragement. I am indebted greatly to my advisor, Dr. Manabu Hagiwara, who has patiently guided me step by step throughout my post-graduate studies. I am constantly amazed by his mathematical prowess and insight, and simultaneous wisdom and kindness. iii CONTENTS i Abstract i ii Acknowledgements ii iii List of Figures vi iv List of Tables vii 1 Introduction 1 1-A Contributions . .1 1-B Organization of Thesis . .3 2 Preliminaries and Literature Overview 4 2-A Introduction . .4 2-B The Idea of Coding . .4 2-C Minimum Distance in Coding Theory . .6 2-D Permutation/Multipermutation Codes Literature Review . .7 2-E Formalization in Coding Theory Literature Review . .9 2-F Conclusion . 10 3 Kendall tau LP-decodable Permutation Codes 11 3-A Introduction . 11 3-B Extending LP decoding Methods . 12 3-C Minimum Weight . 18 3-D Weight Enumerator Polynomials . 20 3-E Parabolic Subgroups and Reflection Subgroups . 27 3-F Conclusion . 35 4 Ulam Permutation Codes 36 4-A Introduction . 36 iv 4-B Preliminaries and Notation . 37 4-C Permutation Ulam Sphere Size . 44 4-D Nonexistence of Nontrivial Perfect Ulam Permutation Codes . 48 4-E Conclusion . 54 5 Ulam Multipermutation Codes 55 5-A Introduction . 55 5-B Multipermutation Ulam Sphere Size and Duplication Sets . 55 5-C Min/Max Spheres and Code Size Bounds . 68 5-C1 Non-Binary Case . 68 5-C2 Binary Case – Cut Location Maximizing Sphere Size . 72 5-D binary case – number of cuts maximizing sphere size . 76 5-E Conclusion . 84 6 Formalization of Coding Theory in Lean 85 6-A Introduction . 85 6-B About Lean . 87 6-C File access and overview . 88 6-D Error correcting systems definition and examples . 94 6-E Formalization of Levenshtein distance . 98 6-F Formalization of Insertion/Deletion spheres . 100 6-G Conclusion . 106 7 Conclusion 106 Appendix A: Consolidation for Compact Constraints 108 Appendix B 119 Appendix C 121 Appendix D 122 v Appendix E 124 Appendix F 125 Appendix G 128 Appendix H 132 References 135 vi III.LIST OF FIGURES 1 Weak Bruhat ordering for S4 .............................. 33 2 Translocation illustration . 42 3 Young diagram and SYT . 45 4 Constructing q5 from q4 (when n = 30)........................ 78 5 Constructing q4 from q3 (when n = 34)........................ 79 6 File dependencies . 88 7 Consolidation . 116 Pc−1 2 2 8 Diagram of i=1 qc−1(i) − qc (i) .......................... 130 2 2 9 Diagram of qc−1(i) − qc (i) ............................... 130 vii IV.LIST OF TABLES I Kendall tau, Euclidean distances . 14 II Euclidean, Kendall tau distances from (1; 0; 2; 3) and [1; 0; 2; 3] ........... 15 III Kendall tau, Euclidean weights . 16 IV Kendall tau, Euclidean weights of C5 ......................... 19 V Permutation Ulam sphere sizes . 37 VI Theoretical limit on maximum Ulam permutation code size . 37 VII Non-feasibility of perfect t-error correcting codes . 52 VIII Multipermutation Ulam Sphere Sizes . 55 IX Theoretical limits on maximum Ulam multipermutation code size . 56 X Maximum sphere size verses bounded value . 83 XI Positivity, Homogeneous Constraints . 113 XII Row-sum (Column-sum) and Weak Row-sum (Column-sum) Constraints . 113 XIII Doubly Stochastic Constraint, Quasi-Homogeneous Constraint, Merged Constraint . 113 1 1. INTRODUCTION In the information age that we live in, data is communicated and stored through a plethora of digital devices. Without the advent of coding theory, reliable communication and data storage would not be possible. Error-correcting codes, explained in chapter 2, are utilized constantly to combat the inherent noise of communication channels. The study of these codes is called coding theory. Codes have also been improving since their advent in the late 1940’s and early 1950’s. With their improvement communication becomes faster and data storage denser. In fact, current codes can approach known limits to what is possible for certain communication schemes. However, the quest to improve codes and to expand their underlying theory is not over by any means. A. Contributions The main contributions of this thesis can be divided into two large categories: 1) contributions to coding theory in the area of permutation/multipermutation codes and 2) contributions to coding theory in the area of formalization. We begin by providing some context for each contribution before stating the actual results of each category. The first category of contribution is to the field of permutation and multipermuation codes. Permutation and multipermutation codes are types of nonlinear codes. Linear codes are well studied and largely well-understood because of their close relation to linear algebra. On the other hand, there remains much mystery in the field of nonlinear codes. Depending on the physical scenario, nonlinear codes may demonstrate superior performance over linear codes. Hence it is worthwhile to devote some energy to research in this area. One category of nonlinear codes are permutation and multipermutation codes. Certain prop- erties, discussed in later chapters, make permutation and multipermutation codes attractive can- didates for use in applications such as flash memory devices. There are several questions that must be answered before this is possible, and it is also necessary to generally expand the theory of permutation and multipermutation codes. 2 Our main contributions to permutation/multipermutation codes are threefold. We state them here, although some explanation of the terminology of these statements is found in subsequent chapters. First, we extend a decoding method to a new class of permutation codes, calling them Kendall tau LP-decodable permutation codes. Second, we provide new ways of calculating Ulam permutation sphere sizes, and use this to prove the fundamental bound on maximal code size that perfect Ulam permutation codes do not exist. Third, we provide new ways of calculating Ulam multipermutation sphere sizes, and use this to prove new bounds on the maximal code size of Ulam multipermutation codes. These results were partially published in [37, 51, 52]. The second category of contribution is to the field of formalization in coding theory. Math- ematical formalization involves the precise statement of mathematical theorems in a specified language in such a way that the veracity of these theorems may be verified algorithmically. When an error-correcting code fails to perform its purported function, errors in communication may result. Moreover, the study of coding theory is often complex and proofs of the properties of a particular error-correcting code may be difficult to understand or verify for non-specialists in coding theory. Even among coding theorists, the vast quantity of coding schemes can sometimes lead to miscommunication. In the future, formalization may become a valuable tool for verifying properties of codes. It may also be a unifying force, as formalization removes the ambiguity of language. Current work in the area of the formalization of coding theory is limited. Libraries of coding theory are necessary to aid future work, since any definitions or lemmas that are formalized in a proof- assistant can be subsequently applied in future formalization. As our main contribution to formalization in coding theory, we provide, to the best of our knowledge, the first coding theory library for the Lean theorem proof assistant. The library is called “Cotoleta” (COding Theory Over the LEan Theorem proof Assistant). The Lean theorem proof assistant is a free software released by Microsoft Research in cooperation with Carnegie Mellon University. Our library includes a couple of key components.
Recommended publications
  • Hook Formulas for Skew Shapes Iii. Multivariate and Product Formulas
    HOOK FORMULAS FOR SKEW SHAPES III. MULTIVARIATE AND PRODUCT FORMULAS ALEJANDRO H. MORALES?, IGOR PAK?, AND GRETA PANOVAy Abstract. We give new product formulas for the number of standard Young tableaux of certain skew shapes and for the principal evaluation of certain Schubert polynomials. These are proved by utilizing symmetries for evaluations of factorial Schur functions, extensively studied in the first two papers in the series [MPP1, MPP2]. We also apply our technology to obtain determinantal and product formulas for the partition function of certain weighted lozenge tilings, and give various probabilistic and asymptotic applications. 1. Introduction 1.1. Foreword. It is a truth universally acknowledged, that a combinatorial theory is often judged not by its intrinsic beauty but by the examples and applications. Fair or not, this attitude is historically grounded and generally accepted. While eternally challenging, this helps to keep the area lively, widely accessible, and growing in unexpected directions. There are two notable types of examples and applications one can think of: artistic and scientific (cf. [Gow]). The former are unexpected results in the area which are both beautiful and mysterious. The fact of their discovery is the main application, even if they can be later shown by a more direct argument. The latter are results which represent a definitive progress in the area, unattainable by other means. To paraphrase Struik, this is \something to take home", rather than to simply admire (see [Rota]). While the line is often blurred, examples of both types are highly desirable, with the best examples being both artistic and scientific. This paper is a third in a series and continues our study of the Naruse hook-length for- mula (NHLF), its generalizations and applications.
    [Show full text]
  • SCHUR-WEYL DUALITY Contents Introduction 1 1. Representation
    SCHUR-WEYL DUALITY JAMES STEVENS Contents Introduction 1 1. Representation Theory of Finite Groups 2 1.1. Preliminaries 2 1.2. Group Algebra 4 1.3. Character Theory 5 2. Irreducible Representations of the Symmetric Group 8 2.1. Specht Modules 8 2.2. Dimension Formulas 11 2.3. The RSK-Correspondence 12 3. Schur-Weyl Duality 13 3.1. Representations of Lie Groups and Lie Algebras 13 3.2. Schur-Weyl Duality for GL(V ) 15 3.3. Schur Functors and Algebraic Representations 16 3.4. Other Cases of Schur-Weyl Duality 17 Appendix A. Semisimple Algebras and Double Centralizer Theorem 19 Acknowledgments 20 References 21 Introduction. In this paper, we build up to one of the remarkable results in representation theory called Schur-Weyl Duality. It connects the irreducible rep- resentations of the symmetric group to irreducible algebraic representations of the general linear group of a complex vector space. We do so in three sections: (1) In Section 1, we develop some of the general theory of representations of finite groups. In particular, we have a subsection on character theory. We will see that the simple notion of a character has tremendous consequences that would be very difficult to show otherwise. Also, we introduce the group algebra which will be vital in Section 2. (2) In Section 2, we narrow our focus down to irreducible representations of the symmetric group. We will show that the irreducible representations of Sn up to isomorphism are in bijection with partitions of n via a construc- tion through certain elements of the group algebra.
    [Show full text]
  • Honors Thesis in Determinant Formulas for Counting Linear Extensions of Tree Posets
    HONORS THESIS IN DETERMINANT FORMULAS FOR COUNTING LINEAR EXTENSIONS OF TREE POSETS STEFAN GROSSER 1. Introduction Partially ordered sets are a fundamental mathematical structure as they formalize the concept of ordering a set of objects. One important question to ask is, given a partially ordered set P , how many ways can you fill in the missing information to create a total ordering on the same set? Put another way, how many ways are there to order the elements of P respecting the order of the poset? Any one way to perform this is known as a linear extension of P . Linear extensions act as a measure of complexity of a poset. The more linear extensions there are, the more information there is to fill in the partial order. Numerous mathematical fields find linear extensions to be a useful tool; this includes representation theory on the symmetric group, enumerative combinatorics, and algorithms. Thus there is a strong motivation to be able to calculate or enumerate linear extensions of posets. However, given an arbitrary partially ordered set, it is known to be very difficult to compute the number of linear extensions. The problem is in fact #P -complete (very hard) [BW91b]. However, for very specific families of posets, there are fast ways to compute this number. Several families of posets have what is known as a product formula for the number of linear extensions. Two such examples are Young diagrams and rooted trees. Both of these families have what is known as the hook-length formula for the number of linear extensions.
    [Show full text]
  • [Math.CO] 25 Apr 2018 Hoe 1.1 Theorem Hlegn.Te Aebe Evl Tde Nbt Combina Both in Studied the Heavily to Been Nection Have Question They These Challenging
    ON THE LARGEST KRONECKER AND LITTLEWOOD–RICHARDSON COEFFICIENTS IGOR PAK⋆, GRETA PANOVA† AND DAMIR YELIUSSIZOV⋆ Abstract. We give new bounds and asymptotic estimates for Kronecker and Littlewood– Richardson coefficients. Notably, we resolve Stanley’s questions on the shape of partitions attaining the largest Kronecker and Littlewood–Richardson coefficients. We apply the results to asymptotics of the number of standard Young tableaux of skew shapes. 1. Introduction What is largest dimension D(n) of an irreducible representation of Sn? Which partitions correspond to the largest representations? These questions are both classical and surprisingly challenging. They have been heavily studied in both combinatorics and probability, in con- nection to the longest increasing subsequences. We refer to [AD, BDJ, Rom] for the history of this problem. In algebraic combinatorics and related fields, the Kronecker and Littlewood–Richardson (LR ) coefficients play a crucial role. They are the structure constants in the ring of char- − acters of Sn and GLN (C), respectively: χµ χν = g(λ, µ, ν) χλ and s s = cλ s . · µ · ν µ,ν λ ν X Xλ These coefficients have been intensely studied from combinatorial (see e.g. [S4, vL]), geometric (see e.g. [Ful]), probabilistic (see [Bia]), and computational point of view (see e.g. [B¨ur, BI]). Yet, relatively little is known about the asymptotics of these coefficients (see 7.1). In recent years, Stanley computed asymptotic of the largest Kronecker and LR–coefficients:§ Theorem 1.1 (Stanley [S5, S6]). We have: O(√n) ( ) max max max g(λ, µ, ν) = √n! e− , arXiv:1804.04693v2 [math.CO] 25 Apr 2018 ∗ λ n µ n ν n ⊢ ⊢ ⊢ λ n/2 O(√n) ( ) max max max max cµ,ν = 2 − .
    [Show full text]
  • Asymptotics of the Number of Standard Young Tableaux of Skew Shape
    ASYMPTOTICS OF THE NUMBER OF STANDARD YOUNG TABLEAUX OF SKEW SHAPE ALEJANDRO H. MORALES?, IGOR PAK? AND GRETA PANOVAy Abstract. We give new bounds and asymptotic estimates on the number of standard Young tableaux of skew shape in a variety of special cases. Our approach is based on Naruse's hook-length formula. We also compare our bounds with the existing bounds on the numbers of linear extensions of the corresponding posets. 1. Introduction The classical hook-length formula (HLF) allows one to compute the number f λ = j SYT(λ)j of standard Young tableaux of a given shape [FRT]. This formula had profound applications in Enumer- ative and Algebraic Combinatorics, Discrete Probability, Representation Theory and other fields (see e.g. [Rom, Sag, S3]). Specifically, the HLF allows to derive asymptotics for f λ for various families of \large" partitions λ. This was famously used to compute the diagram of a random representation of λ 2 Sn with respect to the Plancherel measure (f ) =n!, see [LS, VK] (see also [Bia, S4]). For skew shapes λ/µ, little is known about the asymptotics, since there is no multiplicative formula for f λ/µ = j SYT(λ/µ)j. For large n = jλ/µj, the asymptotics are known for few special families of skew shapes (see [AR]), and for fixed µ (see [OO, RV, S2]). In this paper we show that the \naive HLF" give good approximations for f λ/µ in many special cases of interest. Formally, let λ be a partition of n. Denote by f λ = j SYT(λ)j the number of standard Young tableaux of shape λ.
    [Show full text]
  • Part III Representation Theory Lectured by Stuart Martin, Lent Term 2013
    Part III Representation Theory Lectured by Stuart Martin, Lent Term 2013 Please send any comments and corrections to [email protected]. Contents 0 Preliminaries 2 1 Semisimple algebras 4 2 Irreducible modules for the symmetric group 8 3 Standard basis of Specht modules 12 4 Character formula 14 5 The hook length formula 21 6 Multilinear algebra and algebraic geometry 24 Interlude: some reminders about affine varieties 29 7 Schur{Weyl duality 31 8 Tensor decomposition 34 9 Polynomial and rational representations of GL(V ) 37 Interlude: some reminders about characters of GLn(C) 40 10 Weyl character formula 41 11 Introduction to invariant theory and first examples 44 12 First fundamental theorem of invariant theory 50 Example sheet 1 53 Example sheet 2 57 Last updated Thursday 21 March 2013 1 0 Preliminaries In this course we will study representations of (a) (finite) symmetric groups (b) (infinite) general linear groups all over C, and apply the theory to \classical" invariant theory. We require no previous knowledge, except for some commutative algebra, ordinary representation theory and algebraic geometry, as outlined below. Commutative algebra References: Part III course, Atiyah{Macdonald. It is assumed that you will know about rings, modules, homomorphisms, quotients and chain conditions (the ascending chain condition, ACC, and descending chain condition, DCC). Recall: a chain, or filtration, of submodules of a module M is a sequence (Mj : 0 j n) such that ≤ ≤ M = M0 > M1 > > Mn > 0 ··· The length of the chain is n, the number of links. A composition series is a maximal chain; equivalently, each quotient Mj−1=Mj is irreducible.
    [Show full text]
  • Arxiv:Math/0512035V1 [Math.CO] 1 Dec 2005
    Increasing and Decreasing Subsequences of Permutations and Their Variants Richard P. Stanley Department of Mathematics, Massachusetts Institute of Technology Cambridge, MA 02139, USA [email protected] version of 29 November 2005 Abstract.We survey the theory of increasing and decreasing subsequences of permuta- tions. Enumeration problems in this area are closely related to the RSK algorithm. The asymptotic behavior of the expected value of the length is(w) of the longest increasing subsequence of a permutation w of 1, 2,...,n was obtained by Vershik-Kerov and (al- most) by Logan-Shepp. The entire limiting distribution of is(w) was then determined by Baik, Deift, and Johansson. These techniques can be applied to other classes of permuta- tions, such as involutions, and are related to the distribution of eigenvalues of elements of the classical groups. A number of generalizations and variations of increasing/decreasing subsequences are discussed, including the theory of pattern avoidance, unimodal and alternating subsequences, and crossings and nestings of matchings and set partitions. 1. Introduction Let Sn denote the symmetric group of all permutations of [n] := 1, 2,...,n . We write { } permutations w Sn as words, i.e., w = a1a2 an, where w(i) = ai. An increasing ∈ · · · subsequence of w is a subsequence ai ai satisfying ai < <ai , and similarly for 1 · · · k 1 · · · k decreasing subsequence. For instance, if w = 5642713, then 567 is an increasing subse- quence and 543 is a decreasing subsequence. Let is(w) (respectively, ds(w)) denote the length (number of terms) of the longest increasing (respectively, decreasing) subsequence of w.
    [Show full text]
  • Modular Nekrasov–Okounkov Formulas
    MODULAR NEKRASOV{OKOUNKOV FORMULAS ADAM WALSH AND S. OLE WARNAAR To Christian Krattenthaler on the occasion of his 60th birthday Abstract. Using Littlewood's map, which decomposes a partition into its r-core and r- quotient, Han and Ji have shown that many well-known hook-length formulas admit modular analogues. In this paper we present a variant of the Han{Ji `multiplication theorem' based on a new analogue of Littlewood's decomposition. We discuss several applications to hook- length formulas, one of which leads us to conjecture a modular analogue of the q; t-Nekrasov{ Okounkov formula. 1. Introduction Hook-length formulas abound in combinatorics and representation theory. Perhaps the most famous example is the formula for f λ, the number of standard Young tableaux of shape λ, which was discovered in 1954 by Frame, Robinson and Thrall [22]. If λ is a partition of n and H (λ) denotes the multiset of hook-lengths of λ (see Section 3 for notation and definitions), then λ n! (1.1) f = Q : h2H (λ) h A much more recent identity in the spirit of (1.1) is the Nekrasov{Okounkov formula. It was discovered independently by Nekrasov and Okounkov [44] in their work on random partitions and Seiberg{Witten theory, and by Westbury [56] in his work on universal characters for sln. The form in which the formula is commonly stated is that of Nekrasov and Okounkov (see [44, Equation (6.12)]) X Y z Y (1.2) T jλj 1 − = (1 − T k)z−1 h2 λ2P h2H (λ) k>1 (where P is the set of all partitions) rather than Westbury's hook-length formula for the D'Arcais polynomials Pn(z), defined by the expansion [17] 1 Y 1 X = P (z)T n; (1 − T k)z n k>1 n=0 and implied by Propositions 6.1 and 6.2 of his paper [56].
    [Show full text]
  • ASYMPTOTICS of Q-PLANCHEREL MEASURES 3
    ASYMPTOTICS OF q-PLANCHEREL MEASURES VALENTIN FÉRAY AND PIERRE-LOÏC MÉLIOT Abstract. In this paper, we are interested in the asymptotic size of the rows and columns of a random Young diagram under a natural deformation of the Plancherel measure coming from Hecke algebras. The first lines of such diagrams are typically of order n, so it does not fit in the context of the work of P. Biane and P. Śniady. Using the theory of polynomial functions on Young diagrams of Kerov and Olshanski, we are able to compute explicitly the first- and second-order asymptotics of the length of the first rows. Our method also works for other measures, for example those coming from Schur-Weyl representations. 1. Introduction 1.1. Background. Given a combinatorial family of planar objects, it is natural to ask if, with a good rescaling, there is a limiting continuous object when the size tends to infinity. In the case of Young diagrams endowed with Plancherel measure, this question has been solved in 1977 independently by A. Vershik and S. Kerov [KV77] and B. Logan and L. Shepp [LS77]. V. Ivanov and G. Olshanski [IO02], using unpublished notes of S. Kerov, have been able to describe the fluctuations of diagrams around this limit shape. The Plancherel measure is natural when we look at Young diagrams of a given size n (the set of which will be denoted Yn), because it is related to the regular representation of Sn. But there are many other interesting measures on Young diagrams and we can look for a limiting continuous object in these cases as well.
    [Show full text]
  • Hook-Length Formulas for Skew Shapes
    HOOK FORMULAS FOR SKEW SHAPES I. q-ANALOGUES AND BIJECTIONS ? ? ALEJANDRO H. MORALES , IGOR PAK , AND GRETA PANOVAy Abstract. The celebrated hook-length formula gives a product formula for the number of standard Young tableaux of a straight shape. In 2014, Naruse announced a more general formula for the number of standard Young tableaux of skew shapes as a positive sum over excited diagrams of products of hook-lengths. We give an algebraic and a combinatorial proof of Naruse's formula, by using factorial Schur functions and a generalization of the Hillman{Grassl correspondence, respectively. The main new results are two different q-analogues of Naruse's formula: for the skew Schur functions, and for counting reverse plane partitions of skew shapes. We establish explicit bijections between these objects and families of integer arrays with certain nonzero entries, which also proves the second formula. 1. Introduction 1.1. Foreword. The classical hook-length formula (HLF) for the number of standard Young tableaux (SYT) of a Young diagram, is a beautiful result in enumerative combinatorics that is both mysterious and extremely well studied. In a way it is a perfect formula { highly nontrivial, clean, concise and generalizing several others (binomial coefficients, Catalan numbers, etc.) The HLF was discovered by Frame, Robinson and Thrall [FRT] in 1954, and by now it has numerous proofs: probabilistic, bijective, inductive, analytic, geometric, etc. (see x10.3). Arguably, each of these proofs does not really explain the HLF on a deeper level, but rather tells a different story, leading to new generalizations and interesting connections to other areas.
    [Show full text]
  • Sampling Young Tableaux and Contingency Tables
    Sampling Young Tableaux and Contingency Tables Samuel King Advised by Daniel Stefankoviˇcandˇ Sevak Mkrtchyan Based on work previously completed with Mandar Juvekar 1 Contingency Tables and Markov Chains The problem motivating this work is that of sampling contingency tables. A contingency table A is a matrix with nonnegative integer entries whose rows and columns sum to some specified values. In other words, given vectors m n r = (ri)i=1 and c = (cj)j=1 of positive integers, a contingency table with row sums r and column sums c is some A 2 Matm×n(N) such that n m X X ri = Ai;t and cj = At;j (1) t=1 t=1 for each i 2 [m] and j 2 [n] (we use the notation [k] = f1; : : : ; kg). Notice that P P it must be that ri = cj for such a contingency table to possibly exist. We are motivated by the problem of sampling uniformly from Ω(r; c), the space of all contingency tables with row sums r and column sums c. Throughout the majority of this discussion, the primary method of sam- pling will be using Markov chains. The definitions and main results deal- ing with Markov chains are primarily acquired from Mark Jerrum's textbook Counting, Sampling and Integrating: Algorithms and Complexity [9]. A (finite- space) Markov chain is a sequence of random variables X0;:::;Xt;::: with values in a finite state space Ω such that Pr(Xt = a j X0 = a0;X1 = a1;:::;Xt−1 = at−1) = Pr(Xt = a j Xt−1 = at−1) (2) for all t 2 N and a0; : : : ; at 2 Ω.
    [Show full text]
  • Hook-Length Formulas for Skew Shapes
    HOOK FORMULAS FOR SKEW SHAPES II. COMBINATORIAL PROOFS AND ENUMERATIVE APPLICATIONS ? ? ALEJANDRO H. MORALES , IGOR PAK , AND GRETA PANOVAy Abstract. The Naruse hook-length formula is a recent general formula for the number of stan- dard Young tableaux of skew shapes, given as a positive sum over excited diagrams of products of hook-lengths. In [MPP1] we gave two different q-analogues of Naruse's formula: for the skew Schur functions, and for counting reverse plane partitions of skew shapes. In this paper we give an elemen- tary proof of Naruse's formula based on the case of border strips. For special border strips, we obtain curious new formulas for the Euler and q-Euler numbers in terms of certain Dyck path summations. 1. Introduction In Enumerative Combinatorics, when it comes to fundamental results, one proof is rarely enough, and one is often on the prowl for a better, more elegant or more direct proof. In fact, there is a wide belief in multitude of \proofs from the Book", rather than a singular best approach. The reasons are both cultural and mathematical: different proofs elucidate different aspects of the underlying combinatorial objects and lead to different extensions and generalizations. The story of this series of papers is on our effort to understand and generalize the Naruse hook- length formula (NHLF) for the number of standard Young tableaux of a skew shape in terms of excited diagrams. In our previous paper [MPP1], we gave two q-analogues of the NHLF, the first with an algebraic proof and the second with a bijective proof.
    [Show full text]