Relational Pre-Trained Transformer Is Almost All You Need Towards Democratizing Data Preparation

Total Page:16

File Type:pdf, Size:1020Kb

Relational Pre-Trained Transformer Is Almost All You Need Towards Democratizing Data Preparation RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation Nan Tang Ju Fan Fangyi Li Jianhong Tu QCRI, HBKU, Qatar Renmin University, China Renmin University, China Renmin University, China [email protected] [email protected] [email protected] [email protected] Xiaoyong Du Guoliang Li Sam Madden Mourad Ouzzani Renmin University, China Tsinghua University, China CSAIL, MIT, USA QCRI, HBKU, Qatar [email protected] [email protected] [email protected] [email protected] ABSTRACT Q1: r1[name, expertise, city] = (Michael Jordan, Machine Learning, [M]) Berkeley Can AI help automate human-easy but computer-hard data prepara- A1: tion tasks that burden data scientists, practitioners, and crowd work- Q2: r3[name, affiliation] = (Michael [M], CSAIL MIT) ers? We answer this question by presenting RPT, a denoising autoen- A2: Cafarella coder for tuple-to-X models (“X” could be tuple, token, label, JSON, Q3: r2[name, expertise, [M]] = (Michael Jordan, Basketball, New York City) and so on). RPT is pre-trained for a tuple-to-tuple model by corrupt- A3: city ing the input tuple and then learning a model to reconstruct the (a) Sample Tasks for Value Filling ([M]: value to fill) original tuple. It adopts a Transformer-based neural translation ar- <latexit sha1_base64="PELJVxOArAWGr52Nf6vBT8UXjxI=">AAADJ3icbZLNb9MwFMCd8DXKxzo4crGoQFyo4mrtygFpGwdA4lAkulVqqspxXxprjh3ZDlIV8t9w4V/hggQIwZH/BCft2rH1RdH7/L2XPDvKBDc2CP54/rXrN27e2rnduHP33v3d5t6DE6NyzWDIlFB6FFEDgksYWm4FjDINNI0EnEZnr6r86UfQhiv5wS4ymKR0LnnMGbUuNN3zXoYRzLksLI1yQXVZfBLrp2xgHCZVa2cUYRTjTKtZzmyJny59ptKMysXaX4Brce6kkCq9yRmmAWQZhhe78kGiJGASuKqjLBPgdCcgB0719l8fO9Vt9zGXLDF4Kzlag28l28Cdbu+cfl7R22FCKvpo8G4JvnCKdPo12GuTeixUczHewCHI2Xpb02YraAe14KsGWRkttJLBtPk9nCmWpyAtE9SYMQkyOymotpwJKBthbiCj7IzOYexMSVMwk6I+5xI/cZEZjpV2r7S4jl4kCpoas0gjV5lSm5jLuSq4LTfObdyfFFxmuQXJloPiXGCrcHVp8IxrYFYsnEGZ5u5bMUuopsy6q9VwSyCXf/mqcdJpk247eL/fOjxerWMHPUKP0TNE0AE6RG/QAA0R8z57X70f3k//i//N/+X/Xpb63op5iP4T/+8/J17uTw==</latexit> product company year memory screen chitecture that consists of a bidirectional encoder (similar to BERT) e1 iPhone 10 Apple 2017 64GB 5.8 inchs and a left-to-right autoregressive decoder (similar to GPT), leading e2 iPhone X Apple Inc 2017 256GB 5.8-inch to a generalization of both BERT and GPT. The pre-trained RPT can e3 iPhone 11 AAPL 2019 128GB 6.1 inches already support several common data preparation tasks such as data (b) A Sample Entity Resolution Task cleaning, auto-completion and schema matching. Better still, RPT <latexit sha1_base64="VbzSbnmdJaP568yLMzT896PumY0=">AAADRHicbVJLb9NAELbNq4RXC0cuIyoQhzayQ2nDLU0P7QWplL5EHVXrzSRedb1r7a6hwfWP48IP4MYv4MIBhLgixklUmpaRVvp2Zr5vZmcnyaWwLgy/+sG16zdu3pq73bhz9979B/MLD/etLgzHPa6lNocJsyiFwj0nnMTD3CDLEokHyclGHT94j8YKrXbdKMdexoZKDARnjlzHC/67OMGhUKVjSSGZqcozeZaXa2GWVYSqBkCc1uIEoIyTAdQqFTybXPpouRF5rXXukyxBWUEcz3CVdphofUJZreaLza2P0F7e0AaXINrtArx12rAh3dqbXcgw02ZEkdVloXgKO+iEYtAXNpdsRAp10qz+v0J5qhVSzmozCid0pwueUp+IagkYGLRaFnXLoAfQbrVPo7VXLcjFKUq7BOvRCnQpKDjkRnO0VhuiqT5wnSF8EC6FFapP3J3111ToYiMxqv75KI/nF8NmODa4CqIpWPSmtn08/yXua15kqByXzNqjKMxdr2TGCS6xasSFxZzxExrUEUHFMrS9crwEFTwlTx8G2tBRDsbei4ySZdaOsoQyM+ZSezlWO/8XOyrcoN0rhcoLh4pPCg0KSWMd7wJ9i0Hu5IgAo2WgXoGnzDDuaO8aNITo8pOvgv1WM3rZDN+sLHY603HMeY+9J95zL/LWvI635W17ex73P/nf/B/+z+Bz8D34FfyepAb+lPPIm7Hgz1+SlACo</latexit> can be fine-tuned on a wide range of data preparation tasks, such type description label s1 notebook 2.3GHz 8-Core, 1TB Storage, 8GB memory, 8GB as value normalization, data transformation, data annotation, etc. 16-inch Retina display To complement RPT, we also discuss several appealing techniques phone 6.10-inch touchscreen, a resolution of such as collaborative training and few-shot learning for entity res- t1 828x1792 pixels, A14 Bionic processor, and 4GB olution, and few-shot learning and NLP question-answering for come with 4GB of RAM information extraction. In addition, we identify a series of research (c) A Sample Information Extraction Task (s1: example, t1: task) opportunities to advance the field of data preparation. Figure 1: Motivating Scenarios. PVLDB Reference Format: Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Sam city Madden, and Mourad Ouzzani. RPT: Relational Pre-trained Transformer Is the for the “Michael Jordan” whose expertise is “Machine Almost All You Need towards Democratizing Data Preparation. PVLDB, Learning”. (ii) Value Filling: Q2 asks for the last name of someone 14(8): 1254 - 1261, 2021. who works at “CSAIL MIT” with the first name “Michael”. doi:10.14778/3457390.3457391 Answering Q1 can help solve a series of problems such as er- ror detection, data repairing, and missing value imputation; and 1 INTRODUCTION answering Q2 can help auto-completion (e.g., give the answer A2 e.g., Data preparation — including data cleaning [1], data transforma- “Cafarella”) and auto-suggestion ( provide a list of candidate tion [31], entity resolution [24], information extraction [10], and names such as {Cafarella, Stonebraker}). so forth — is the most time-consuming and least enjoyable work Scenario 2: Attribute Filling for Schema Matching. Figure 1(a) Q3 for data scientists [18]. Next, we present several scenarios to better asks for the attribute name for the value “New York City”, w.r.t. understand these problems. name “Michael Jordan” and expertise “Basketball”. Answering Scenario 1: Data Cleaning. Figure 1(a) Q1 and Q2 show two typ- this question can help schema matching, a core data integration ical data cleaning problems. (i) Cell Filling: Question Q1 asks for problem [21], by better aligning attributes from different tables. Scenario 3: Entity Resolution (ER) Figure 1(b) shows a typical ER ∗ Ju Fan is the corresponding author. task that asks whether e1, e2 and e3 are the “same”. This work is licensed under the Creative Commons BY-NC-ND 4.0 International A human with enough knowledge can tell that “iPhone 10” = License. Visit https://creativecommons.org/licenses/by-nc-nd/4.0/ to view a copy of “iPhone X” , “iPhone 11”, “Apple” = “Apple Inc” = “AAPL”, and this license. For any use beyond those covered by this license, obtain permission by “inches” = “-inch”. Hence, one can decide that e and e do not emailing [email protected]. Copyright is held by the owner/author(s). Publication rights 1 2 licensed to the VLDB Endowment. match e3, and e1 matches e2 (if the memory does not matter). Proceedings of the VLDB Endowment, Vol. 14, No. 8 ISSN 2150-8097. doi:10.14778/3457390.3457391 Scenario 4: Information Extraction (IE) Figure 1(c) shows an IE task, which is typically done via crowdsourcing [39]. A requester 1254 provides several samples (e.g., s1) that show what “label” should be objectives (where the model is trained to recover missing pieces in extracted, and asks workers to perform similar tasks (e.g., t1). the input) work best; examples include BERT [20], BART [44], and A crowd worker needs first to interpret the task by analyzing s1 T5 [55] for NLP, and TURL [19] for relational tables. (and maybe a few more examples) and concretizes it as “what is the Contributions. We make the following notable contributions. memory size”. Afterwards, he can perform t1 by extracting the label 4GB from t1»description] by knowing “RAM” is for memory. • RPT: We describe a standard Transformer-based denoising Challenges. Scenarios (1–4) are simple for humans, but are hard autoencoder architecture to pre-train sequence-to-sequence for computers. To solve them, computers face the following chal- models for tuple-to-tuple training, with new tuple-aware lenges. (1) Knowledge: computers need to have the background masking mechanisms. (Section 2) • knowledge through understanding an enormous corpora of tables. Fine-tuning RPT: We discuss a wide range of data preparation (2) Experience: computers should be able to learn from prior and tasks that can be supported by fine-tuning RPT. (Section 3) • various tasks. (3) Adaptation: computers should (quickly) adjust to Beyond RPT: We discuss several appealing techniques that new inputs and new tasks. can complement RPT in specific data preparation tasks, e.g., collaborative training and few-shot learning for ER, and few- Vision. Indeed, these problems have been seen as ‘holy grail’ prob- shot learning and NLP question-answering for IE. (Section 4) lems for the database community for decades [1, 21, 25, 30, 36, 46], but despite thousands of papers on these topics, they still remain unsolved. Recent evidence from the NLP community, where DL- 2 RPT based models and representations have been shown to perform 2.1 Architecture nearly as well as humans on various language understanding and RPT uses a standard sequence-to-sequence (or encoder-decoder) question answering tasks, suggests that a learned approach may be Transformer [64] model, similar to BART [52], as shown in Figure 2. a viable option for these data preparation tasks as well. Encoder. RPT uses a bidirectional encoder (similar to BERT [20]) Desiderata. The desiderata for AI-powered tools to achieve near- because it has the advantage of learning to predict the corrupted human intelligence for data preparation is summarized as below, data bidirectionally, from both the context on the left and the con- in response to Challenges (1–3), respectively. (1) Deep learning ar- text on the right of the corrupted data. Moreover, it is Transformer- chitecture and self-supervised pre-training. We need a deep learning based, which can use self-attention to generate a richer representa- architecture that can learn from many tables, similar to language tion of
Recommended publications
  • Mathematics 144 Set Theory Fall 2012 Version
    MATHEMATICS 144 SET THEORY FALL 2012 VERSION Table of Contents I. General considerations.……………………………………………………………………………………………………….1 1. Overview of the course…………………………………………………………………………………………………1 2. Historical background and motivation………………………………………………………….………………4 3. Selected problems………………………………………………………………………………………………………13 I I. Basic concepts. ………………………………………………………………………………………………………………….15 1. Topics from logic…………………………………………………………………………………………………………16 2. Notation and first steps………………………………………………………………………………………………26 3. Simple examples…………………………………………………………………………………………………………30 I I I. Constructions in set theory.………………………………………………………………………………..……….34 1. Boolean algebra operations.……………………………………………………………………………………….34 2. Ordered pairs and Cartesian products……………………………………………………………………… ….40 3. Larger constructions………………………………………………………………………………………………..….42 4. A convenient assumption………………………………………………………………………………………… ….45 I V. Relations and functions ……………………………………………………………………………………………….49 1.Binary relations………………………………………………………………………………………………………… ….49 2. Partial and linear orderings……………………………..………………………………………………… ………… 56 3. Functions…………………………………………………………………………………………………………… ….…….. 61 4. Composite and inverse function.…………………………………………………………………………… …….. 70 5. Constructions involving functions ………………………………………………………………………… ……… 77 6. Order types……………………………………………………………………………………………………… …………… 80 i V. Number systems and set theory …………………………………………………………………………………. 84 1. The Natural Numbers and Integers…………………………………………………………………………….83 2. Finite induction
    [Show full text]
  • Biography Paper – Georg Cantor
    Mike Garkie Math 4010 – History of Math UCD Denver 4/1/08 Biography Paper – Georg Cantor Few mathematicians are house-hold names; perhaps only Newton and Euclid would qualify. But there is a second tier of mathematicians, those whose names might not be familiar, but whose discoveries are part of everyday math. Examples here are Napier with logarithms, Cauchy with limits and Georg Cantor (1845 – 1918) with sets. In fact, those who superficially familier with Georg Cantor probably have two impressions of the man: First, as a consequence of thinking about sets, Cantor developed a theory of the actual infinite. And second, that Cantor was a troubled genius, crippled by Freudian conflict and mental illness. The first impression is fundamentally true. Cantor almost single-handedly overturned the Aristotle’s concept of the potential infinite by developing the concept of transfinite numbers. And, even though Bolzano and Frege made significant contributions, “Set theory … is the creation of one person, Georg Cantor.” [4] The second impression is mostly false. Cantor certainly did suffer from mental illness later in his life, but the other emotional baggage assigned to him is mostly due his early biographers, particularly the infamous E.T. Bell in Men Of Mathematics [7]. In the racially charged atmosphere of 1930’s Europe, the sensational story mathematician who turned the idea of infinity on its head and went crazy in the process, probably make for good reading. The drama of the controversy over Cantor’s ideas only added spice. 1 Fortunately, modern scholars have corrected the errors and biases in older biographies.
    [Show full text]
  • SRB MEASURES for ALMOST AXIOM a DIFFEOMORPHISMS José Alves, Renaud Leplaideur
    SRB MEASURES FOR ALMOST AXIOM A DIFFEOMORPHISMS José Alves, Renaud Leplaideur To cite this version: José Alves, Renaud Leplaideur. SRB MEASURES FOR ALMOST AXIOM A DIFFEOMORPHISMS. 2013. hal-00778407 HAL Id: hal-00778407 https://hal.archives-ouvertes.fr/hal-00778407 Preprint submitted on 20 Jan 2013 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. SRB MEASURES FOR ALMOST AXIOM A DIFFEOMORPHISMS JOSE´ F. ALVES AND RENAUD LEPLAIDEUR Abstract. We consider a diffeomorphism f of a compact manifold M which is Almost Axiom A, i.e. f is hyperbolic in a neighborhood of some compact f-invariant set, except in some singular set of neutral points. We prove that if there exists some f-invariant set of hyperbolic points with positive unstable-Lebesgue measure such that for every point in this set the stable and unstable leaves are \long enough", then f admits a probability SRB measure. Contents 1. Introduction 2 1.1. Background 2 1.2. Statement of results 2 1.3. Overview 4 2. Markov rectangles 5 2.1. Neighborhood of critical zone 5 2.2. First generation of rectangles 6 2.3. Second generation rectangles 7 2.4.
    [Show full text]
  • Inscriptional Records of the Western Zhou
    INSCRIPTIONAL RECORDS OF THE WESTERN ZHOU Robert Eno Fall 2012 Note to Readers The translations in these pages cannot be considered scholarly. They were originally prepared in early 1988, under stringent time pressures, specifically for teaching use that term. Although I modified them sporadically between that time and 2012, my final year of teaching, their purpose as course materials, used in a week-long classroom exercise for undergraduate students in an early China history survey, did not warrant the type of robust academic apparatus that a scholarly edition would have required. Since no broad anthology of translations of bronze inscriptions was generally available, I have, since the late 1990s, made updated versions of this resource available online for use by teachers and students generally. As freely available materials, they may still be of use. However, as specialists have been aware all along, there are many imperfections in these translations, and I want to make sure that readers are aware that there is now a scholarly alternative, published last month: A Source Book of Ancient Chinese Bronze Inscriptions, edited by Constance Cook and Paul Goldin (Berkeley: Society for the Study of Early China, 2016). The “Source Book” includes translations of over one hundred inscriptions, prepared by ten contributors. I have chosen not to revise the materials here in light of this new resource, even in the case of a few items in the “Source Book” that were contributed by me, because a piecemeal revision seemed unhelpful, and I am now too distant from research on Western Zhou bronzes to undertake a more extensive one.
    [Show full text]
  • Bounding, Splitting and Almost Disjointness
    Bounding, splitting and almost disjointness Jonathan Schilhan Advisor: Vera Fischer Kurt G¨odelResearch Center Fakult¨atf¨urMathematik, Universit¨atWien Abstract This bachelor thesis provides an introduction to set theory, cardinal character- istics of the continuum and the generalized cardinal characteristics on arbitrary regular cardinals. It covers the well known ZFC relations of the bounding, the dominating, the almost disjointness and the splitting number as well as some more recent results about their generalizations to regular uncountable cardinals. We also present an overview on the currently known consistency results and formulate open questions. 1 CONTENTS Contents Introduction 3 1 Prerequisites 5 1.1 Model Theory/Predicate Logic . .5 1.2 Ordinals/Cardinals . .6 1.2.1 Ordinals . .7 1.2.2 Cardinals . .9 1.2.3 Regular Cardinals . 10 2 Cardinal Characteristics of the continuum 11 2.1 The bounding and the dominating number . 11 2.2 Splitting and mad families . 13 3 Cardinal Characteristics on regular uncountable cardinals 16 3.1 Generalized bounding, splitting and almost disjointness . 16 3.2 An unexpected inequality between the bounding and splitting numbers . 18 3.2.1 Elementary Submodels . 18 3.2.2 Filters and Ideals . 19 3.2.3 Proof of Theorem 3.2.1 (s(κ) ≤ b(κ)) . 20 3.3 A relation between generalized dominating and mad families . 23 References 28 2 Introduction @0 Since it was proven that the continuum hypothesis (2 = @1) is independent of the axioms of ZFC, the natural question arises what value 2@0 could take and it was shown @0 that 2 could be consistently nearly anything.
    [Show full text]
  • Piece Mold, Lost Wax & Composite Casting Techniques of The
    Piece Mold, Lost Wax & Composite Casting Techniques of the Chinese Bronze Age Behzad Bavarian and Lisa Reiner Dept. of MSEM College of Engineering and Computer Science September 2006 Table of Contents Abstract Approximate timeline 1 Introduction 2 Bronze Transition from Clay 4 Elemental Analysis of Bronze Alloys 4 Melting Temperature 7 Casting Methods 8 Casting Molds 14 Casting Flaws 21 Lost Wax Method 25 Sanxingdui 28 Environmental Effects on Surface Appearance 32 Conclusion 35 References 36 China can claim a history rich in over 5,000 years of artistic, philosophical and political advancement. As well, it is birthplace to one of the world's oldest and most complex civilizations. By 1100 BC, a high level of artistic and technical skill in bronze casting had been achieved by the Chinese. Bronze artifacts initially were copies of clay objects, but soon evolved into shapes invoking bronze material characteristics. Essentially, the bronze alloys represented in the copper-tin-lead ternary diagram are not easily hot or cold worked and are difficult to shape by hammering, the most common techniques used by the ancient Europeans and Middle Easterners. This did not deter the Chinese, however, for they had demonstrated technical proficiency with hard, thin walled ceramics by the end of the Neolithic period and were able to use these skills to develop a most unusual casting method called the piece mold process. Advances in ceramic technology played an influential role in the progress of Chinese bronze casting where the piece mold process was more of a technological extension than a distinct innovation. Certainly, the long and specialized experience in handling clay was required to form the delicate inscriptions, to properly fit the molds together and to prevent them from cracking during the pour.
    [Show full text]
  • Almost-Free E-Rings of Cardinality ℵ1 Rudige¨ R Gob¨ El, Saharon Shelah and Lutz Strung¨ Mann
    Sh:785 Canad. J. Math. Vol. 55 (4), 2003 pp. 750–765 Almost-Free E-Rings of Cardinality @1 Rudige¨ r Gob¨ el, Saharon Shelah and Lutz Strung¨ mann Abstract. An E-ring is a unital ring R such that every endomorphism of the underlying abelian group R+ is multiplication by some ring element. The existence of almost-free E-rings of cardinality greater @ than 2 0 is undecidable in ZFC. While they exist in Godel'¨ s universe, they do not exist in other models @ of set theory. For a regular cardinal @1 ≤ λ ≤ 2 0 we construct E-rings of cardinality λ in ZFC which have @1-free additive structure. For λ = @1 we therefore obtain the existence of almost-free E-rings of cardinality @1 in ZFC. 1 Introduction + Recall that a unital ring R is an E-ring if the evaluation map ": EndZ(R ) ! R given by ' 7! '(1) is a bijection. Thus every endomorphism of the abelian group R+ is multiplication by some element r 2 R. E-rings were introduced by Schultz [20] and easy examples are subrings of the rationals Q or pure subrings of the ring of p-adic integers. Schultz characterized E-rings of finite rank and the books by Feigelstock [9, 10] and an article [18] survey the results obtained in the eighties, see also [8, 19]. In a natural way the notion of E-rings extends to modules by calling a left R-module M an E(R)-module or just E-module if HomZ(R; M) = HomR(R; M) holds, see [1].
    [Show full text]
  • The Axiom of Choice and Its Implications
    THE AXIOM OF CHOICE AND ITS IMPLICATIONS KEVIN BARNUM Abstract. In this paper we will look at the Axiom of Choice and some of the various implications it has. These implications include a number of equivalent statements, and also some less accepted ideas. The proofs discussed will give us an idea of why the Axiom of Choice is so powerful, but also so controversial. Contents 1. Introduction 1 2. The Axiom of Choice and Its Equivalents 1 2.1. The Axiom of Choice and its Well-known Equivalents 1 2.2. Some Other Less Well-known Equivalents of the Axiom of Choice 3 3. Applications of the Axiom of Choice 5 3.1. Equivalence Between The Axiom of Choice and the Claim that Every Vector Space has a Basis 5 3.2. Some More Applications of the Axiom of Choice 6 4. Controversial Results 10 Acknowledgments 11 References 11 1. Introduction The Axiom of Choice states that for any family of nonempty disjoint sets, there exists a set that consists of exactly one element from each element of the family. It seems strange at first that such an innocuous sounding idea can be so powerful and controversial, but it certainly is both. To understand why, we will start by looking at some statements that are equivalent to the axiom of choice. Many of these equivalences are very useful, and we devote much time to one, namely, that every vector space has a basis. We go on from there to see a few more applications of the Axiom of Choice and its equivalents, and finish by looking at some of the reasons why the Axiom of Choice is so controversial.
    [Show full text]
  • Cardinal Invariants Concerning Functions Whose Sum Is Almost Continuous Krzysztof Ciesielski West Virginia University, [email protected]
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by The Research Repository @ WVU (West Virginia University) Faculty Scholarship 1995 Cardinal Invariants Concerning Functions Whose Sum Is Almost Continuous Krzysztof Ciesielski West Virginia University, [email protected] Follow this and additional works at: https://researchrepository.wvu.edu/faculty_publications Part of the Mathematics Commons Digital Commons Citation Ciesielski, Krzysztof, "Cardinal Invariants Concerning Functions Whose Sum Is Almost Continuous" (1995). Faculty Scholarship. 822. https://researchrepository.wvu.edu/faculty_publications/822 This Article is brought to you for free and open access by The Research Repository @ WVU. It has been accepted for inclusion in Faculty Scholarship by an authorized administrator of The Research Repository @ WVU. For more information, please contact [email protected]. Cardinal invariants concerning functions whose sum is almost continuous. Krzysztof Ciesielski1, Department of Mathematics, West Virginia University, Mor- gantown, WV 26506-6310 ([email protected]) Arnold W. Miller1, York University, Department of Mathematics, North York, Ontario M3J 1P3, Canada, Permanent address: University of Wisconsin-Madison, Department of Mathematics, Van Vleck Hall, 480 Lincoln Drive, Madison, Wis- consin 53706-1388, USA ([email protected]) Abstract Let A stand for the class of all almost continuous functions from R to R and let A(A) be the smallest cardinality of a family F ⊆ RR for which there is no g: R → R with the property that f + g ∈ A for all f ∈ F . We define cardinal number A(D) for the class D of all real functions with the Darboux property similarly.
    [Show full text]
  • Georg Cantor English Version
    GEORG CANTOR (March 3, 1845 – January 6, 1918) by HEINZ KLAUS STRICK, Germany There is hardly another mathematician whose reputation among his contemporary colleagues reflected such a wide disparity of opinion: for some, GEORG FERDINAND LUDWIG PHILIPP CANTOR was a corruptor of youth (KRONECKER), while for others, he was an exceptionally gifted mathematical researcher (DAVID HILBERT 1925: Let no one be allowed to drive us from the paradise that CANTOR created for us.) GEORG CANTOR’s father was a successful merchant and stockbroker in St. Petersburg, where he lived with his family, which included six children, in the large German colony until he was forced by ill health to move to the milder climate of Germany. In Russia, GEORG was instructed by private tutors. He then attended secondary schools in Wiesbaden and Darmstadt. After he had completed his schooling with excellent grades, particularly in mathematics, his father acceded to his son’s request to pursue mathematical studies in Zurich. GEORG CANTOR could equally well have chosen a career as a violinist, in which case he would have continued the tradition of his two grandmothers, both of whom were active as respected professional musicians in St. Petersburg. When in 1863 his father died, CANTOR transferred to Berlin, where he attended lectures by KARL WEIERSTRASS, ERNST EDUARD KUMMER, and LEOPOLD KRONECKER. On completing his doctorate in 1867 with a dissertation on a topic in number theory, CANTOR did not obtain a permanent academic position. He taught for a while at a girls’ school and at an institution for training teachers, all the while working on his habilitation thesis, which led to a teaching position at the university in Halle.
    [Show full text]
  • Almost-At-A-Distance-Accepted
    Almost at-a-distance Andrew McKenzie (Kansas) and Lydia Newkirk (Rutgers) accepted draft in Linguistics & Philosophy [email protected] [email protected] https://doi.org/10.1007/s10988-019-09275-6 Abstract We claim that the meaning of the adverbial almost contains both a scalar proximity measure and a modal that allows it to work sometimes when proximity fails, what we call the at-a-distance reading. Essentially, almost can hold if the proposition follows from the normal uninterrupted out- comes of adding a small enough number of premises to a selection of rel- evant facts. Almost at-a-distance is blocked when the temporal properties of the topic time and Davidsonian event prevent normal outcomes from coming true when they need to. This approach to almost differs from the two general approaches that have emerged in the literature, by replacing the negative polar condition (not p) with a positive antecedent condition that entails not p while avoid- ing the numerous well-documented complications of employing a polar condition. Since this approach to almost involves a circumstantial base with a non-interrupting ordering source, almost behaves in certain ways like the progressive, and shows contextual variability of the same kinds that we see with premise sets. 1 Introduction The adverbial almost has a clear intuitive meaning, but has proven very difficult to reconcile with a formal semantics. Two basic approaches characterize our understanding of almost in the literature. Under scalar alternative accounts, almost p holds when p is false but some close alternative proposition q is true. Modal closeness accounts argue that almost p holds of world w when p is false in w, but true in some close alternative world w0.
    [Show full text]
  • Equivalents to the Axiom of Choice and Their Uses A
    EQUIVALENTS TO THE AXIOM OF CHOICE AND THEIR USES A Thesis Presented to The Faculty of the Department of Mathematics California State University, Los Angeles In Partial Fulfillment of the Requirements for the Degree Master of Science in Mathematics By James Szufu Yang c 2015 James Szufu Yang ALL RIGHTS RESERVED ii The thesis of James Szufu Yang is approved. Mike Krebs, Ph.D. Kristin Webster, Ph.D. Michael Hoffman, Ph.D., Committee Chair Grant Fraser, Ph.D., Department Chair California State University, Los Angeles June 2015 iii ABSTRACT Equivalents to the Axiom of Choice and Their Uses By James Szufu Yang In set theory, the Axiom of Choice (AC) was formulated in 1904 by Ernst Zermelo. It is an addition to the older Zermelo-Fraenkel (ZF) set theory. We call it Zermelo-Fraenkel set theory with the Axiom of Choice and abbreviate it as ZFC. This paper starts with an introduction to the foundations of ZFC set the- ory, which includes the Zermelo-Fraenkel axioms, partially ordered sets (posets), the Cartesian product, the Axiom of Choice, and their related proofs. It then intro- duces several equivalent forms of the Axiom of Choice and proves that they are all equivalent. In the end, equivalents to the Axiom of Choice are used to prove a few fundamental theorems in set theory, linear analysis, and abstract algebra. This paper is concluded by a brief review of the work in it, followed by a few points of interest for further study in mathematics and/or set theory. iv ACKNOWLEDGMENTS Between the two department requirements to complete a master's degree in mathematics − the comprehensive exams and a thesis, I really wanted to experience doing a research and writing a serious academic paper.
    [Show full text]