Regular Polytope Networks Federico Pernici, Matteo Bruni, Claudio Baecchi and Alberto Del Bimbo, Member, IEEE

Total Page:16

File Type:pdf, Size:1020Kb

Regular Polytope Networks Federico Pernici, Matteo Bruni, Claudio Baecchi and Alberto Del Bimbo, Member, IEEE 1 Regular Polytope Networks Federico Pernici, Matteo Bruni, Claudio Baecchi and Alberto Del Bimbo, Member, IEEE, Abstract—Neural networks are widely used as a model for ℝ푑 ℝ푑 ℝ푑 classification in a large variety of tasks. Typically, a learnable transformation (i.e. the classifier) is placed at the end of such models returning a value for each class used for classification. 퐰 This transformation plays an important role in determining how 퐰푖 푖 the generated features change during the learning process. In 퐰푖 this work, we argue that this transformation not only can be fixed (i.e. set as non-trainable) with no loss of accuracy and with a reduction in memory usage, but it can also be used 퐟 퐟 퐟 to learn stationary and maximally separated embeddings. We show that the stationarity of the embedding and its maximal separated representation can be theoretically justified by setting DCNN DCNN DCNN the weights of the fixed classifier to values taken from the coordinate vertices of the three regular polytopes available in d R , namely: the d-Simplex, the d-Cube and the d-Orthoplex. These regular polytopes have the maximal amount of symmetry Fig. 1. Regular Polytope Networks (RePoNet). The fixed classifiers derived d that can be exploited to generate stationary features angularly from the three regular polytopes available in R with d ≥ 5 are shown. centered around their corresponding fixed weights. Our approach From left: the d-Simplex, the d-Cube and the d-Orthoplex fixed classifier. improves and broadens the concept of a fixed classifier, recently The trainable parameters wi of the classifier are replaced with fixed values proposed in [1], to a larger class of fixed classifier models. Experi- taken from the coordinate vertices of a regular polytope (shown in red). mental results confirm the theoretical analysis, the generalization capability, the faster convergence and the improved performance of the proposed method. Code will be publicly available. All these works seem to suggest that the final fully con- Index Terms—Deep Neural Networks, Fixed classifiers, Inter- nected layer used for classification is somewhat redundant and nal feature representation. does not have a primary role in learning and generalization. In this paper we show that a special set of fixed classifica- tion layers has a key role in modeling the internal feature I. INTRODUCTION representation of DCNNs, while ensuring little or no loss in EEP Convolutional Neural Networks (DCNNs) have classification accuracy and a significant reduction in memory D achieved state-of-the-art performance on a variety of usage. tasks [2], [3] and have revolutionized Computer Vision in both In DCNNs the internal feature representation for an input classification [4], [5] and representation [6], [7]. In DCNNs, sample is the feature vector f generated by the penultimate both representation and classification are typically jointly layer, while the last layer (i.e. the classifier) outputs score learned in a single network. The classification layer placed values according to the inner product as: at the end of such models transforms the d-dimension of the network internal feature representation to the K-dimension of > zi = w f (1) the output class probabilities. Despite the large number of i · trainable parameters that this layer adds to the model (i.e. for each class i, where wi is the weight vector of the classifier arXiv:2103.15632v1 [cs.LG] 29 Mar 2021 d K), it has been verified that its removal only causes a slight for the class i. To evaluate the loss, the scores are further increase× in error [8]. Moreover, the most recent architectures normalized into probabilities via the softmax function [17]. tend to avoid the use of fully connected layers [9] [10] [11]. Since the values of z can be also expressed as z = w> f = It is also well known that DCNNs can be trained to perform i i i w f cos(θ), where θ is the angle between w and f·, the metric learning without the explicit use of a classification layer i i scorejj jj jjforjj the correct label with respect to the other labels is [12] [13] [14]. In particular, it has been shown that excluding obtained by optimizing the length of the vectors w , f from learning the parameters of the classification layer causes i and the angle θ they are forming. This simple formulationjj jj jj ofjj little or no decline in performance while allowing a reduction the final classifier provides the intuitive explanation of how in the number of trainable parameters [1]. Fixed classifiers also feature vector directions and weight vector directions align have an important role in the theoretical convergence analysis simultaneously with each other at training time so that their of training models with batch-norm [15]. Very recently it has average angle is made as small as possible. If the parameters been shown that DCNNs with a fixed classifier and batch- w of the classifier in Eq. 1 are fixed (i.e. set as non- norm in each layer establish a principle of equivalence between i trainable), only the feature vector directions can align toward different learning rate schedules [16]. the classifier weight vector directions and not the opposite. MICC, Media Integration and Communication Center, University of Flo- Therefore, weights can be regarded as fixed angular references rence, Dipartimento di Ingegneria dell’Informazione Firenze, Italy. to which features align. 2 2π 2π 3 3 2π 2π θ π θ π 1 1 2π 2π 0 0 0 20 40 60 80 100 120 140 0 20 40 60 80 100 120 140 epochs epochs (a) (b) (c) (d) Fig. 2. Feature learning on the MNIST dataset in a 2D embedding space. Fig. (a) and Fig. (c) show the 2D features learned by RePoNet and by a standard trainable classifier respectively. Fig. (b) and Fig. (d) show the training evolution of the classifier weights (dashed) and their corresponding class feature means (solid) respectively. Both are expressed according to their angles. Although the two methods achieve the same classification accuracy, features in the proposed method are both stationary and maximally separated. According to this, we obtain a precise result on the spatio- regular polygons in R2 and 5 regular polyhedra in R3, there temporal statistical properties of the generated features during are only three regular polytopes in Rd with d 5, namely the the learning phase. Supported by the empirical evidence in [1] d-Simplex, the d-Cube and the d-Orthoplex. Having≥ different we show that not only the final classifier of a DCNN can be set symmetry, geometry and topology, each regular polytope will as non-trainable with no loss of accuracy and with a significant reflect its properties into the classifier and the embedding space reduction in memory usage, but that an appropriate set of which it defines. Fig. 1 illustrates the three basic architectures values assigned to its weights allows learning a maximally defined by the proposed approach termed Regular Polytope separated and strictly stationary embedding while training. Networks (RePoNet). Fig. 2 provides a first glance at our That is, the features generated by the Stochastic Gradient main result in a 2D embedding space. Specifically, the main Descent (SGD) optimization have constant mean and are angu- evidence from Fig. 2(a) and 2(b) is that the features learned larly centered around their corresponding fixed class weights. by RePoNet remain aligned with their corresponding fixed Constant known mean implies that features cannot have non- weights and maximally exploit the available representation constant trends while learning. Maximally separated features space directly from the beginning of the training phase. and their stationarity are obtained by setting the classifier We apply our method to multiple vision datasets showing weights according to values following a highly symmetrical that it is possible to generate stationary and maximally sepa- configuration in the embedding space. rated features without affecting the generalization performance DCNN models with trainable classifiers are typically con- of DCNN models and with a significant reduction in GPU vergent and therefore, after a sufficient learning time has memory usage at training time. A preliminary exploration of elapsed, some form of stationarity in the learned features can this work was presented in [18], [19]. still be achieved. However, until that time, it is not possible to know where the features will be projected by the learned II. RELATED WORK model in the embedding space. An advantage of the approach Fixed Classifier. Empirical evidence shows that convolu- proposed in this paper is that it allows to define (and therefore tional neural networks with a fixed classification layer (i.e. to know in advance) where the features will be projected not subject to learning) initialized by random numbers does before starting the learning process. not worsen the performance on the CIFAR-10 dataset [20]. A Our result can be understood by looking at the basic recent paper [1] explores in more detail the idea of excluding functionality of the final classifier in a DCNN. The main role from learning the parameters wi in Eq.1. The work shows that of a trainable classifier is to dynamically adjust the decision a fixed classifier causes little or no reduction in classification boundaries to learn class feature representations. When the performance for common datasets while allowing a significant classifier is set as non-trainable this dynamic adjustment capa- reduction in trainable parameters, especially when the number bility is no longer available and it is automatically demanded of classes is large. Setting the last layer as not trainable also to all the previous layers. Specifically, the work [1] reports reduces the computational complexity for training as well as empirical evidence that the expressive power of DCNN models the communication cost in distributed learning.
Recommended publications
  • Geometry of Generalized Permutohedra
    Geometry of Generalized Permutohedra by Jeffrey Samuel Doker A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Mathematics in the Graduate Division of the University of California, Berkeley Committee in charge: Federico Ardila, Co-chair Lior Pachter, Co-chair Matthias Beck Bernd Sturmfels Lauren Williams Satish Rao Fall 2011 Geometry of Generalized Permutohedra Copyright 2011 by Jeffrey Samuel Doker 1 Abstract Geometry of Generalized Permutohedra by Jeffrey Samuel Doker Doctor of Philosophy in Mathematics University of California, Berkeley Federico Ardila and Lior Pachter, Co-chairs We study generalized permutohedra and some of the geometric properties they exhibit. We decompose matroid polytopes (and several related polytopes) into signed Minkowski sums of simplices and compute their volumes. We define the associahedron and multiplihe- dron in terms of trees and show them to be generalized permutohedra. We also generalize the multiplihedron to a broader class of generalized permutohedra, and describe their face lattices, vertices, and volumes. A family of interesting polynomials that we call composition polynomials arises from the study of multiplihedra, and we analyze several of their surprising properties. Finally, we look at generalized permutohedra of different root systems and study the Minkowski sums of faces of the crosspolytope. i To Joe and Sue ii Contents List of Figures iii 1 Introduction 1 2 Matroid polytopes and their volumes 3 2.1 Introduction . .3 2.2 Matroid polytopes are generalized permutohedra . .4 2.3 The volume of a matroid polytope . .8 2.4 Independent set polytopes . 11 2.5 Truncation flag matroids . 14 3 Geometry and generalizations of multiplihedra 18 3.1 Introduction .
    [Show full text]
  • Two Poset Polytopes
    Discrete Comput Geom 1:9-23 (1986) G eometrv)i.~.reh, ~ ( :*mllmlati~ml © l~fi $1~ter-Vtrlq New Yorklu¢. t¢ Two Poset Polytopes Richard P. Stanley* Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139 Abstract. Two convex polytopes, called the order polytope d)(P) and chain polytope <~(P), are associated with a finite poset P. There is a close interplay between the combinatorial structure of P and the geometric structure of E~(P). For instance, the order polynomial fl(P, m) of P and Ehrhart poly- nomial i(~9(P),m) of O(P) are related by f~(P,m+l)=i(d)(P),m). A "transfer map" then allows us to transfer properties of O(P) to W(P). In particular, we transfer known inequalities involving linear extensions of P to some new inequalities. I. The Order Polytope Our aim is to investigate two convex polytopes associated with a finite partially ordered set (poset) P. The first of these, which we call the "order polytope" and denote by O(P), has been the subject of considerable scrutiny, both explicit and implicit, Much of what we say about the order polytope will be essentially a review of well-known results, albeit ones scattered throughout the literature, sometimes in a rather obscure form. The second polytope, called the "chain polytope" and denoted if(P), seems never to have been previously considered per se. It is a special case of the vertex-packing polytope of a graph (see Section 2) but has many special properties not in general valid or meaningful for graphs.
    [Show full text]
  • Abstract Regular Polytopes
    ENCYCLOPEDIA OF MATHEMATICS AND ITS APPLICATIONS Abstract Regular Polytopes PETER MMULLEN University College London EGON SCHULTE Northeastern University PUBLISHED BY THE PRESS SYNDICATE OF THE UNIVERSITY OF CAMBRIDGE The Pitt Building, Trumpington Street, Cambridge, United Kingdom CAMBRIDGE UNIVERSITY PRESS The Edinburgh Building, Cambridge CB2 2RU, UK 40 West 20th Street, New York, NY 10011-4211, USA 477 Williamstown Road, Port Melbourne, VIC 3207, Australia Ruiz de Alarc´on 13, 28014 Madrid, Spain Dock House, The Waterfront, Cape Town 8001, South Africa http://www.cambridge.org c Peter McMullen and Egon Schulte 2002 This book is in copyright. Subject to statutory exception and to the provisions of relevant collective licensing agreements, no reproduction of any part may take place without the written permission of Cambridge University Press. First published 2002 Printed in the United States of America Typeface Times New Roman 10/12.5 pt. System LATEX2ε [] A catalog record for this book is available from the British Library. Library of Congress Cataloging in Publication Data McMullen, Peter, 1955– Abstract regular polytopes / Peter McMullen, Egon Schulte. p. cm. – (Encyclopedia of mathematics and its applications) Includes bibliographical references and index. ISBN 0-521-81496-0 1. Polytopes. I. Schulte, Egon, 1955– II. Title. III. Series. QA691 .M395 2002 516.35 – dc21 2002017391 ISBN 0 521 81496 0 hardback Contents Preface page xiii 1 Classical Regular Polytopes 1 1A The Historical Background 1 1B Regular Convex Polytopes 7 1C Extensions
    [Show full text]
  • 1 Lifts of Polytopes
    Lecture 5: Lifts of polytopes and non-negative rank CSE 599S: Entropy optimality, Winter 2016 Instructor: James R. Lee Last updated: January 24, 2016 1 Lifts of polytopes 1.1 Polytopes and inequalities Recall that the convex hull of a subset X n is defined by ⊆ conv X λx + 1 λ x0 : x; x0 X; λ 0; 1 : ( ) f ( − ) 2 2 [ ]g A d-dimensional convex polytope P d is the convex hull of a finite set of points in d: ⊆ P conv x1;:::; xk (f g) d for some x1;:::; xk . 2 Every polytope has a dual representation: It is a closed and bounded set defined by a family of linear inequalities P x d : Ax 6 b f 2 g for some matrix A m d. 2 × Let us define a measure of complexity for P: Define γ P to be the smallest number m such that for some C s d ; y s ; A m d ; b m, we have ( ) 2 × 2 2 × 2 P x d : Cx y and Ax 6 b : f 2 g In other words, this is the minimum number of inequalities needed to describe P. If P is full- dimensional, then this is precisely the number of facets of P (a facet is a maximal proper face of P). Thinking of γ P as a measure of complexity makes sense from the point of view of optimization: Interior point( methods) can efficiently optimize linear functions over P (to arbitrary accuracy) in time that is polynomial in γ P . ( ) 1.2 Lifts of polytopes Many simple polytopes require a large number of inequalities to describe.
    [Show full text]
  • Uniform Polychora
    BRIDGES Mathematical Connections in Art, Music, and Science Uniform Polychora Jonathan Bowers 11448 Lori Ln Tyler, TX 75709 E-mail: [email protected] Abstract Like polyhedra, polychora are beautiful aesthetic structures - with one difference - polychora are four dimensional. Although they are beyond human comprehension to visualize, one can look at various projections or cross sections which are three dimensional and usually very intricate, these make outstanding pieces of art both in model form or in computer graphics. Polygons and polyhedra have been known since ancient times, but little study has gone into the next dimension - until recently. Definitions A polychoron is basically a four dimensional "polyhedron" in the same since that a polyhedron is a three dimensional "polygon". To be more precise - a polychoron is a 4-dimensional "solid" bounded by cells with the following criteria: 1) each cell is adjacent to only one other cell for each face, 2) no subset of cells fits criteria 1, 3) no two adjacent cells are corealmic. If criteria 1 fails, then the figure is degenerate. The word "polychoron" was invented by George Olshevsky with the following construction: poly = many and choron = rooms or cells. A polytope (polyhedron, polychoron, etc.) is uniform if it is vertex transitive and it's facets are uniform (a uniform polygon is a regular polygon). Degenerate figures can also be uniform under the same conditions. A vertex figure is the figure representing the shape and "solid" angle of the vertices, ex: the vertex figure of a cube is a triangle with edge length of the square root of 2.
    [Show full text]
  • Arxiv:Math/9906062V1 [Math.MG] 10 Jun 1999 Udo Udmna Eerh(Rn 96-01-00166)
    Embedding the graphs of regular tilings and star-honeycombs into the graphs of hypercubes and cubic lattices ∗ Michel DEZA CNRS and Ecole Normale Sup´erieure, Paris, France Mikhail SHTOGRIN Steklov Mathematical Institute, 117966 Moscow GSP-1, Russia Abstract We review the regular tilings of d-sphere, Euclidean d-space, hyperbolic d-space and Coxeter’s regular hyperbolic honeycombs (with infinite or star-shaped cells or vertex figures) with respect of possible embedding, isometric up to a scale, of their skeletons into a m-cube or m-dimensional cubic lattice. In section 2 the last remaining 2-dimensional case is decided: for any odd m ≥ 7, star-honeycombs m m {m, 2 } are embeddable while { 2 ,m} are not (unique case of non-embedding for dimension 2). As a spherical analogue of those honeycombs, we enumerate, in section 3, 36 Riemann surfaces representing all nine regular polyhedra on the sphere. In section 4, non-embeddability of all remaining star-honeycombs (on 3-sphere and hyperbolic 4-space) is proved. In the last section 5, all cases of embedding for dimension d> 2 are identified. Besides hyper-simplices and hyper-octahedra, they are exactly those with bipartite skeleton: hyper-cubes, cubic lattices and 8, 2, 1 tilings of hyperbolic 3-, 4-, 5-space (only two, {4, 3, 5} and {4, 3, 3, 5}, of those 11 have compact both, facets and vertex figures). 1 Introduction arXiv:math/9906062v1 [math.MG] 10 Jun 1999 We say that given tiling (or honeycomb) T has a l1-graph and embeds up to scale λ into m-cube Hm (or, if the graph is infinite, into cubic lattice Zm ), if there exists a mapping f of the vertex-set of the skeleton graph of T into the vertex-set of Hm (or Zm) such that λdT (vi, vj)= ||f(vi), f(vj)||l1 = X |fk(vi) − fk(vj)| for all vertices vi, vj, 1≤k≤m ∗This work was supported by the Volkswagen-Stiftung (RiP-program at Oberwolfach) and Russian fund of fundamental research (grant 96-01-00166).
    [Show full text]
  • Petrie Schemes
    Canad. J. Math. Vol. 57 (4), 2005 pp. 844–870 Petrie Schemes Gordon Williams Abstract. Petrie polygons, especially as they arise in the study of regular polytopes and Coxeter groups, have been studied by geometers and group theorists since the early part of the twentieth century. An open question is the determination of which polyhedra possess Petrie polygons that are simple closed curves. The current work explores combinatorial structures in abstract polytopes, called Petrie schemes, that generalize the notion of a Petrie polygon. It is established that all of the regular convex polytopes and honeycombs in Euclidean spaces, as well as all of the Grunbaum–Dress¨ polyhedra, pos- sess Petrie schemes that are not self-intersecting and thus have Petrie polygons that are simple closed curves. Partial results are obtained for several other classes of less symmetric polytopes. 1 Introduction Historically, polyhedra have been conceived of either as closed surfaces (usually topo- logical spheres) made up of planar polygons joined edge to edge or as solids enclosed by such a surface. In recent times, mathematicians have considered polyhedra to be convex polytopes, simplicial spheres, or combinatorial structures such as abstract polytopes or incidence complexes. A Petrie polygon of a polyhedron is a sequence of edges of the polyhedron where any two consecutive elements of the sequence have a vertex and face in common, but no three consecutive edges share a commonface. For the regular polyhedra, the Petrie polygons form the equatorial skew polygons. Petrie polygons may be defined analogously for polytopes as well. Petrie polygons have been very useful in the study of polyhedra and polytopes, especially regular polytopes.
    [Show full text]
  • Four-Dimensional Regular Polytopes
    faculty of science mathematics and applied and engineering mathematics Four-dimensional regular polytopes Bachelor’s Project Mathematics November 2020 Student: S.H.E. Kamps First supervisor: Prof.dr. J. Top Second assessor: P. Kiliçer, PhD Abstract Since Ancient times, Mathematicians have been interested in the study of convex, regular poly- hedra and their beautiful symmetries. These five polyhedra are also known as the Platonic Solids. In the 19th century, the four-dimensional analogues of the Platonic solids were described mathe- matically, adding one regular polytope to the collection with no analogue regular polyhedron. This thesis describes the six convex, regular polytopes in four-dimensional Euclidean space. The focus lies on deriving information about their cells, faces, edges and vertices. Besides that, the symmetry groups of the polytopes are touched upon. To better understand the notions of regularity and sym- metry in four dimensions, our journey begins in three-dimensional space. In this light, the thesis also works out the details of a proof of prof. dr. J. Top, showing there exist exactly five convex, regular polyhedra in three-dimensional space. Keywords: Regular convex 4-polytopes, Platonic solids, symmetry groups Acknowledgements I would like to thank prof. dr. J. Top for supervising this thesis online and adapting to the cir- cumstances of Covid-19. I also want to thank him for his patience, and all his useful comments in and outside my LATEX-file. Also many thanks to my second supervisor, dr. P. Kılıçer. Furthermore, I would like to thank Jeanne for all her hospitality and kindness to welcome me in her home during the process of writing this thesis.
    [Show full text]
  • Arxiv:1705.01294V1
    Branes and Polytopes Luca Romano email address: [email protected] ABSTRACT We investigate the hierarchies of half-supersymmetric branes in maximal supergravity theories. By studying the action of the Weyl group of the U-duality group of maximal supergravities we discover a set of universal algebraic rules describing the number of independent 1/2-BPS p-branes, rank by rank, in any dimension. We show that these relations describe the symmetries of certain families of uniform polytopes. This induces a correspondence between half-supersymmetric branes and vertices of opportune uniform polytopes. We show that half-supersymmetric 0-, 1- and 2-branes are in correspondence with the vertices of the k21, 2k1 and 1k2 families of uniform polytopes, respectively, while 3-branes correspond to the vertices of the rectified version of the 2k1 family. For 4-branes and higher rank solutions we find a general behavior. The interpretation of half- supersymmetric solutions as vertices of uniform polytopes reveals some intriguing aspects. One of the most relevant is a triality relation between 0-, 1- and 2-branes. arXiv:1705.01294v1 [hep-th] 3 May 2017 Contents Introduction 2 1 Coxeter Group and Weyl Group 3 1.1 WeylGroup........................................ 6 2 Branes in E11 7 3 Algebraic Structures Behind Half-Supersymmetric Branes 12 4 Branes ad Polytopes 15 Conclusions 27 A Polytopes 30 B Petrie Polygons 30 1 Introduction Since their discovery branes gained a prominent role in the analysis of M-theories and du- alities [1]. One of the most important class of branes consists in Dirichlet branes, or D-branes. D-branes appear in string theory as boundary terms for open strings with mixed Dirichlet-Neumann boundary conditions and, due to their tension, scaling with a negative power of the string cou- pling constant, they are non-perturbative objects [2].
    [Show full text]
  • Regular Polyhedra Through Time
    Fields Institute I. Hubard Polytopes, Maps and their Symmetries September 2011 Regular polyhedra through time The greeks were the first to study the symmetries of polyhedra. Euclid, in his Elements showed that there are only five regular solids (that can be seen in Figure 1). In this context, a polyhe- dron is regular if all its polygons are regular and equal, and you can find the same number of them at each vertex. Figure 1: Platonic Solids. It is until 1619 that Kepler finds other two regular polyhedra: the great dodecahedron and the great icosahedron (on Figure 2. To do so, he allows \false" vertices and intersection of the (convex) faces of the polyhedra at points that are not vertices of the polyhedron, just as the I. Hubard Polytopes, Maps and their Symmetries Page 1 Figure 2: Kepler polyhedra. 1619. pentagram allows intersection of edges at points that are not vertices of the polygon. In this way, the vertex-figure of these two polyhedra are pentagrams (see Figure 3). Figure 3: A regular convex pentagon and a pentagram, also regular! In 1809 Poinsot re-discover Kepler's polyhedra, and discovers its duals: the small stellated dodecahedron and the great stellated dodecahedron (that are shown in Figure 4). The faces of such duals are pentagrams, and are organized on a \convex" way around each vertex. Figure 4: The other two Kepler-Poinsot polyhedra. 1809. A couple of years later Cauchy showed that these are the only four regular \star" polyhedra. We note that the convex hull of the great dodecahedron, great icosahedron and small stellated dodecahedron is the icosahedron, while the convex hull of the great stellated dodecahedron is the dodecahedron.
    [Show full text]
  • On Regular Polytope Numbers
    PROCEEDINGS OF THE AMERICAN MATHEMATICAL SOCIETY Volume 131, Number 1, Pages 65{75 S 0002-9939(02)06710-2 Article electronically published on June 12, 2002 ON REGULAR POLYTOPE NUMBERS HYUN KWANG KIM (Communicated by David E. Rohrlich) Abstract. Lagrange proved a theorem which states that every nonnegative integer can be written as a sum of four squares. This result can be generalized as the polygonal number theorem and the Hilbert-Waring problem. In this paper, we shall generalize Lagrange's sum of four squares theorem further. To each regular polytope V in a Euclidean space, we will associate a sequence of nonnegative integers which we shall call regular polytope numbers, and consider the problem of finding the order g(V )ofthesetofregularpolytope numbers associated to V . In 1770, Lagrange proved a theorem which states that every nonnegative integer can be written as a sum of four squares. There are two major generalizations of this beautiful result. One is a horizontal generalization due to Cauchy which is known as the polygonal number theorem, and the other is a higher dimensional generalization which is known as the Hilbert-Waring problem. A nonempty subset A of nonnegative integers is called a basis of order g if g is the minimum number with the property that every nonnegative integer can be written as a sum of g elements in A. Lagrange's sum of four squares can be restated as: the set n2 n =0; 1; 2;::: of nonnegative squares forms a basis of order 4. The polygonalf numbersj are sequencesg of nonnegative integers constructed geometrically from regular polygons.
    [Show full text]
  • Isotoxal Star-Shaped Polygonal Voids and Rigid Inclusions in Nonuniform Antiplane Shear fields
    Published in International Journal of Solids and Structures 85-86 (2016), 67-75 doi: http://dx.doi.org/10.1016/j.ijsolstr.2016.01.027 Isotoxal star-shaped polygonal voids and rigid inclusions in nonuniform antiplane shear fields. Part I: Formulation and full-field solution F. Dal Corso, S. Shahzad and D. Bigoni DICAM, University of Trento, via Mesiano 77, I-38123 Trento, Italy Abstract An infinite class of nonuniform antiplane shear fields is considered for a linear elastic isotropic space and (non-intersecting) isotoxal star-shaped polygonal voids and rigid inclusions per- turbing these fields are solved. Through the use of the complex potential technique together with the generalized binomial and the multinomial theorems, full-field closed-form solutions are obtained in the conformal plane. The particular (and important) cases of star-shaped cracks and rigid-line inclusions (stiffeners) are also derived. Except for special cases (ad- dressed in Part II), the obtained solutions show singularities at the inclusion corners and at the crack and stiffener ends, where the stress blows-up to infinity, and is therefore detri- mental to strength. It is for this reason that the closed-form determination of the stress field near a sharp inclusion or void is crucial for the design of ultra-resistant composites. Keywords: v-notch, star-shaped crack, stress singularity, stress annihilation, invisibility, conformal mapping, complex variable method. 1 Introduction The investigation of the perturbation induced by an inclusion (a void, or a crack, or a stiff insert) in an ambient stress field loading a linear elastic infinite space is a fundamental problem in solid mechanics, whose importance need not be emphasized.
    [Show full text]