Prototype Selection for Composite Nearest Neighbor Classifiers

Total Page:16

File Type:pdf, Size:1020Kb

Prototype Selection for Composite Nearest Neighbor Classifiers Prototype Selection for Composite Nearest Neighbor Classifiers David B. Skalak Department of Computer Science University of Massachusetts Amherst, Massachusetts 01003-4160 [email protected] CMPSCI Technical Report 96-89 1 PROTOTYPE SELECTION FOR COMPOSITE NEAREST NEIGHBOR CLASSIFIERS A Dissertation Presented by DAVID B. SKALAK Submitted to the Graduate School of the University of Massachusetts Amherst in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 1997 Department of Computer Science c Copyright by DAVID B. SKALAK 1997 All Rights Reserved PROTOTYPE SELECTION FOR COMPOSITE NEAREST NEIGHBOR CLASSIFIERS A Dissertation Presented by DAVID B. SKALAK Approved as to style and content by: Edwina L. Rissland, Chair Paul E. Utgoff, Member Andrew G. Barto, Member Michael Sutherland, Member David W. Stemple, Department Chair Computer Science ACKNOWLEDGMENTS Edwina Rissland has provided unflagging support during my decade in Amherst. More important than any single piece of research is the influence that a teacher has on one’s research career and on one’s life. I am fortunate to have worked under Edwina. Andy Barto, Paul Utgoff and Mike Sutherland have each provided intellectual support of the highest rank. I deeply thank Robert Constable, Chair of the Department of Computer Science at Cornell University, for providing me with computing facilities and office space. For valuable assistance, I thank Paul Cohen, Paul Utgoff, Jamie Callan, the members of the Experimental Knowledge Systems Laboratory at UMass, Priscilla Coe, Sharon Mallory, Neil Berkman and Zijian Zheng. I am especially grateful to Andy Golding. Most of all, I thank Claire Cardie. This work was supported in part by the Air Force Office of Scientific Research un- der Contract 90-0359, by National Science Foundation Grant No. EEC-9209623 State- University-Industry Cooperative Research on Intelligent Information Retrieval, by a University of Massachusetts Graduate School Fellowship, and by trading in S&P 100 Index options. iv ABSTRACT PROTOTYPE SELECTION FOR COMPOSITE NEAREST NEIGHBOR CLASSIFIERS MAY 1997 DAVID B. SKALAK, B.S., UNION COLLEGE M.A., DARTMOUTH COLLEGE J.D., HARVARD LAW SCHOOL M.S., UNIVERSITY OF MASSACHUSETTS AMHERST Ph.D., UNIVERSITY OF MASSACHUSETTS AMHERST Directed by: Professor Edwina L. Rissland Combining the predictions of a set of classifiers has been shown to be an effective way to create composite classifiers that are more accurate than any of the component classifiers. Increased accuracy has been shown in a variety of real-world applications, ranging from protein sequence identification to determining the fat content of ground meat. Despite such individual successes, the answers are not known to fundamental questions about classifier combination, such as “Can classifiers from any given model class be combined to create a composite classifier with higher accuracy?” or “Is it possible to increase the accuracy of a given classifier by combining its predictions with those of only a small number of other classifiers?”. The goal of this dissertation is to provide answers to these and closely related questions with respect to a particular model class, the class of nearest neighbor classifiers. We undertake the first study that investigates in depth the combination of nearest neighbor classifiers. Although previous research has questioned the utility of combining nearest neighbor classifiers, we introduce algorithms that combine a small number of component nearest neighbor classifiers, where each of the components stores a small number of prototypical instances. In a variety of domains, we show that these algorithms yield composite classifiers that are more accurate than a nearest neighbor classifier that stores all training instances as prototypes. The research presented in this dissertation also extends previous work on prototype selection for an independent nearest neighbor classifier. We show that in many domains, storing a very small number of prototypes can provide classification accuracy greater than or equal to that of a nearest neighbor classifier that stores all training instances. We extend previous work by demonstrating that algorithms that rely primarily on random sampling can effectively choose a small number of prototypes. v TABLE OF CONTENTS Page ACKNOWLEDGMENTS v ABSTRACT vi LIST OF TABLES xi LIST OF FIGURES xvi CHAPTER 1. INTRODUCTION 1 1.1 The Problem 1 1.2 Approach to the Problem 5 1.3 Overview of the Results 8 1.4 Guide to the Dissertation 9 2. COMPOSITE CLASSIFIER CONSTRUCTION 12 2.1 Composite Classifier Design Criteria 12 2.1.1 Accuracy 13 2.1.2 Diversity 13 2.1.2.1 Measuring Diversity 14 2.1.3 Efficiency 15 2.1.3.1 Number of Component Classifiers 16 2.1.3.2 Computational Resources of a Component Classifier 16 2.2 Composite Classifier Architectures 17 2.2.1 Stacked Generalization 17 2.2.1.1 Architecture and Algorithm 19 2.2.1.2 Error-Correction Output Coding 22 2.2.1.3 Network Construction Algorithms 24 2.2.2 Boosting 24 2.2.3 Recursive Partitioning 26 2.3 Component Classifier Construction 27 2.3.1 Reasoning Strategy Combination 28 2.3.2 Divide and Conquer 28 vi 2.3.3 Model Class Combination 30 2.3.4 Architecture and Parameter Modification 30 2.3.5 Randomized Search 31 2.3.6 Training Set Resampling 32 2.3.7 Feature Selection 32 2.3.8 Discussion 33 2.4 Combining Classifier Construction 33 2.4.1 Function Approximation 34 2.4.1.1 Linear 34 2.4.1.2 Non-linear 35 2.4.2 Classification 35 2.4.2.1 Voting 35 2.4.2.2 Non-voting Methods 38 2.5 Nearest Neighbor Classifier Construction 39 2.5.1 Prototype Roles 40 2.5.1.1 Cognitive Organization 40 2.5.1.2 Facilitating Case Retrieval 41 2.5.1.3 Local Learning Algorithms 42 2.5.2 Prototype Selection for Nearest Neighbor Classifiers 43 2.5.3 Limitations of Existing Prototype Selection Algorithms 48 2.6 Summary 50 3. NEAREST NEIGHBOR COMPOSITE CLASSIFIERS 52 3.1 Characteristics of the Approach 52 3.1.1 Homogeneous Nearest Neighbor Component Classifiers 52 3.1.2 Prototype Selection through Sampling 54 3.1.3 Coarse-Hypothesis Classifiers 55 3.1.4 Concise Level-1 Representation 55 3.1.5 Summary 56 3.2 Experimental Design 56 3.2.1 Data Sets 56 3.2.2 Experimental Method 59 3.3 A Nearest Neighbor Algorithm 59 3.3.1 Evaluation of Baseline Nearest Neighbor Classifier 62 3.3.2 Disadvantages of Some Nearest Neighbor Algorithms 63 vii 3.4 Classifier Sampling 64 3.4.1 Algorithm 64 3.4.2 Evaluation 67 3.4.2.1 Number of Samples 69 3.4.2.2 Clustering 75 3.4.3 A Taxonomy of Instance Types 78 3.5 Summary 82 4. BOOSTING 83 4.1 Four Algorithms for Boosting 84 4.1.1 Coarse Reclassification 85 4.1.2 Deliberate Misclassification 86 4.1.3 Composite Fitness 87 4.1.4 Composite Fitness–Feature Selection 90 4.1.5 Summary 90 4.2 Coarse Reclassification 91 4.3 Deliberate Misclassification 93 4.4 Composite Fitness 100 4.5 Composite Fitness–Feature Selection 106 4.6 Comparison of the Four Algorithms 108 4.6.1 Composite Accuracy 109 4.6.2 Component Diversity and Accuracy 112 4.7 Comparison to Schapire’s Boosting Algorithm 120 4.8 Analysis 122 4.8.1 Failure Analysis 122 4.8.2 Selective Superiority 126 4.9 Summary 126 5. STACKED GENERALIZATION 130 5.1 Comparison of Combining Algorithms 131 5.2 Number of Component Classifiers 135 5.2.1 Two Component Classifiers 135 5.2.2 More than Two Component Classifiers 138 5.3 Relationship between Accuracy and Diversity 142 5.4 Class Probability Estimates 148 5.5 Generalizing the k-Nearest Neighbor Classifier 151 5.6 Summary 155 viii 6. CONCLUSIONS 157 6.1 Summary of the Dissertation 157 6.2 Contributions 161 6.3 Issues for Future Research 161 APPENDICES A. INSTANCES RANDOMLY OMITTED FROM CROSS-VALIDATION 163 B. ACCURACY AND DIVERSITY CONTOUR GRAPHS 164 C. SUM OF SQUARES AND CROSS-PRODUCT MATRICES 177 BIBLIOGRAPHY 178 ix LIST OF TABLES Table Page 1.1 Topic summary table. 10 1.2 Component classifier summary table: prototypes stored in component classi- fiers for boosting and stacked generalation in this thesis. 11 2.1 Classical nearest neighbor editing algorithms. 44 2.2 Example machine learning nearest neighbor editing algorithms. 47 2.3 Published accuracy of machine learning prototype selection algorithms. Blank entries indicate that we could not find a value reported in the research literature for an editing algorithm. “dlM” indicates IB3 results reported by de la Maza [1991]; “C-J” indicates IB3 results reported by Cameron-Jones [1995]. 49 3.1 Summary of data sets used in experiments. In the column labeled “Feature Types”, “N” designates numeric features; “S” designates symbolic features; “B” designates binary features. 58 3.2 Percent test accuracy and standard deviation for baseline 1-nearest-neighbor algorithm. 63 3.3 Classification accuracy of prototype sampling (PS) algorithm, one prototype per class. 67 3.4 Time to sample 100 sets of prototypes, select the most accurate sample (Sample) and classify the data set using that sample (Application), compared to the time to apply the nearest neighbor algorithm to the data set (NN). Values are wall-clock times, in seconds. 68 3.5 Classification accuracy of prototype sampling (PS) algorithm, two prototypes per class. 69 3.6 Partial trace of prototype sampling algorithm on Breast Cancer Ljubljana data. Accuracies in percent correct. 70 3.7 Optimal number of samples to achieve maximal test accuracy. 75 3.8 Comparison of test accuracies based on 100 samples and 1000 samples. 76 x 3.9 Statistical correlation between (i) clustering of the training set and clustering of the test set, (ii) clustering of the training set and test accuracy, and (iii) clustering of the test set and test accuracy.
Recommended publications
  • Maybe Not--But You Can Have Fun Trying
    12/29/2010 Can You Live Forever? Maybe Not--But… Permanent Address: http://www.scientificamerican.com/article.cfm?id=e-zimmer-can-you-live-forever Can You Live Forever? Maybe Not--But You Can Have Fun Trying In this chapter from his new e-book, journalist Carl Zimmer tries to reconcile the visions of techno-immortalists with the exigencies imposed by real-world biology By Carl Zimmer | Wednesday, December 22, 2010 | 14 comments Editor's Note: Carl Zimmer, author of this month's article, "100 Trillion Connections," has just brought out a much-acclaimed e-book, Brain Cuttings: 15 Journeys Through the Mind (Scott & Nix), that compiles a series of his writings on neuroscience. In this chapter, adapted from an article that was first published in Playboy, Zimmer takes the reader on a tour of the 2009 Singularity Summit in New York City. His ability to contrast the fantastical predictions of speakers at the conference with the sometimes more skeptical assessments from other scientists makes his account a fascinating read. Let's say you transfer your mind into a computer—not all at once but gradually, having electrodes inserted into your brain and then wirelessly outsourcing your faculties. Someone reroutes your vision through cameras. Someone stores your memories on a net of microprocessors. Step by step your metamorphosis continues until at last the transfer is complete. As engineers get to work boosting the performance of your electronic mind so you can now think as a god, a nurse heaves your fleshy brain into a bag of medical waste. As you —for now let's just call it "you"—start a new chapter of existence exclusively within a machine, an existence that will last as long as there are server farms and hard-disk space and the solar power to run them, are "you" still actually you? This question was being considered carefully and thoroughly by a 43-year-old man standing on a giant stage backed by high black curtains.
    [Show full text]
  • Institute for the Future
    Institute for the Future The Institute for the Future (IFTF) is a Palo Alto, California, US– based not-for-profit think tank. It was established, in 1968, as a spin- Institute for the Future off from the RAND Corporation to help organizations plan for the long-term future, a subject known as futures studies.[1] Type Not for profit Contents Industry Future Forecasting History Founded 1968 in Middletown, Genesis Connecticut, First years United States An increase in corporate focus Founders Frank Davidson, Evolution of societal forecasting Olaf Helmer, Paul Work Baran, Arnold Kramish, and People Theodore Gordon Past leaders Headquarters 201 Hamilton References Avenue, Palo Alto, External links United States Key people Marina Gorbis Services Ten Year Forecast, History Technology Horizons, Health Horizons Genesis Website iftf.org (http://iftf.or g) First references to the idea of an Institute for the Future may be found in a 1966 Prospectus by Olaf Helmer and others.[2] While at RAND Corporation, Helmer had already been involved with developing the Delphi method of futures studies. He, and others, wished to extend the work further with an emphasis on examining multiple scenarios. This can be seen in the prospectus summary: To explore systematically the possible futures for our [USA] nation and for the international community. To ascertain which among these possible futures seems desirable, and why. To seek means by which the probability of their occurrence can be enhanced through appropriate purposeful action.[1] First years The Institute opened in 1968, in Middletown, Connecticut. The initial group was led by Frank Davidson and included Olaf Helmer, Paul Baran, Arnold Kramish, and Theodore Gordon.[1] The Institute’s work initially relied on the forecasting methods built upon by Helmer while at RAND.
    [Show full text]
  • The Salt Fish Girl
    TEACHERS’ GUIDE By James Venn, OCT ISBN: 9781459744189 @dundurnpress dundurn.com Copyright © Dundurn, 2018 This teachers’ guide was made possible by the support of the Ontario Media Development Corporation. The publisher is not responsible for websites or their content unless they are owned by the publisher. Books are available from our website (dundurn.com), your favourite bookseller, wholesalers, and UTP Distribution (t: 1-800-565-9523). CONTENTS I. Introduction 4 • About the Book • About the Author II. Literary Devices, Motifs, and Themes 6 III. Pre-Reading Activities 10 • Author Study • Cover Deconstruction • Class Discussion: Science Fiction as a Genre • Composition: Identity • On Identity Assignment IV. Chapter Summaries 13 V. During-Reading Activities 21 VI. Post-Reading Activities 42 • Comparison Study • Cover Redesign • Dramatization • Essay Assignment • Science Fiction Story VII. Assessment Rubric 45 VIII. Ontario Curriculum Connections 48 TRG | SALT FISH GIRL CONTENTS | 3 I. INTRODUCTION ABOUT THE BOOK Salt Fish Girl is a science fiction novel told by two narrators in alternating sections. These two narratives compliment and comment on each other. Miranda is a girl of Chinese heritage growing up on the North American West Coast in the near future. Governments have failed and rule by corporation has been established. Citizens of corporations live in enclaves while the majority of people make do outside these protected spaces. Economic activities, like selling produce, making shoes, advertising, or running a medical clinic, continue. The propagation of transgenic practices has led to some disturbing consequences. The corporations are cloning women for slave labour. They genetically modify these workers with genes culled from other species to deny them full human rights.
    [Show full text]
  • Predicting Future Locations and Arrival Times of Individuals
    Predicting Future Locations and Arrival Times of Individuals Ingrid E. Burbey Dissertation submitted to the faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Doctor of Philosophy In Computer Engineering Thomas L. Martin, Chair Mark T. Jones Scott F. Midkiff Manuel A. Perez-Quinones Joseph G. Tront 26 April, 2011 Blacksburg, Virginia Keywords: Context Awareness, Location Awareness, Context Prediction, Location Prediction, Time-of-Arrival Prediction Predicting Future Locations and Arrival Times of Individuals Ingrid E. Burbey ABSTRACT This work has two objectives: a) to predict people's future locations, and b) to predict when they will be at given locations. Current location-based applications react to the user‘s current location. The progression from location-awareness to location-prediction can enable the next generation of proactive, context-predicting applications. Existing location-prediction algorithms predict someone‘s next location. In contrast, this dissertation predicts someone‘s future locations. Existing algorithms use a sequence of locations and predict the next location in the sequence. This dissertation incorporates temporal information as timestamps in order to predict someone‘s location at any time in the future. Sequence predictors based on Markov models have been shown to be effective predictors of someone's next location. This dissertation applies a Markov model to two-dimensional, timestamped location information to predict future locations. This dissertation also predicts when someone will be at a given location. These predictions can support presence or understanding co-workers‘ routines. Predicting the times that someone is going to be at a given location is a very different and more difficult problem than predicting where someone will be at a given time.
    [Show full text]
  • THE FEASIBILITY of a TESTABLE GAIA HYPOTHESIS a Project Presented to the Faculty of the Undergraduate College of Mathematics and Sciences James Madison University
    THE FEASIBILITY OF A TESTABLE GAIA HYPOTHESIS A Project Presented to The Faculty of the Undergraduate College of Mathematics and Sciences James Madison University In Partial Fulfillment of the Requirements for the Degree Bachelor of Sciences by Brent Franklin Bauman 1998 Introduction In contrast to Darwinian theories of natural selection which hold that life evolves and adapts to existing geologic conditions, the Gaia hypothesis, first proposed by James Lovelock in 1969, states that Earth and the life inhabiting it coevolve as one globally integrated system. This system, Gaia, is homeostatically regulated through biological negative feedback reactions and creates conditions that are self-sustaining and comfortable for life itself. Lovelock compares the Earth's self-regulating abilities to those of a hypothetical planetary organism that is capable of maintaining the Earth's surface biogeochemistry as an organism maintains its own life-sustaining internal conditions. Views regarding the relationships between life and the Earth have historically resulted in polar opposite positions of scientific thought. Within the natural sciences, two such positions currently prevail: reductionism and holism. Reductionism is the notion that systems can be understood by analysis of their component parts. It proposes, for example, that all biological phenomena can be understood and described by the laws of physics. Holism is the notion that the component parts of a system can only be understood in context of the whole system. For example, biological phenomena exhibit emergent properties that can be ultimately reduced to or explained by the laws of physics. Although numerous holistic theories have recently emerged, reductionism has remained the dominant paradigm in the sciences since the 1950's (Goldsmith 65).
    [Show full text]
  • 1 GENERAL PURPOSE TECHNOLOGIES in THEORY, APPLICATION and CONTROVERSY: a REVIEW by Clifford Bekar, Kenneth Carlaw, Richard Lipse
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Simon Fraser University Institutional Repository GENERAL PURPOSE TECHNOLOGIES IN THEORY, APPLICATION AND CONTROVERSY: A REVIEW by Clifford Bekar, Kenneth Carlaw, Richard Lipsey Contacts Clifford Bekar: [email protected] Kenneth Carlaw:[email protected] Richard Lipsey: [email protected] Abstract Distinguishing characteristics of General Purpose Technologies (GPTs) are identified and definitions discussed. Our definition includes multipurpose and single-purpose technologies, defining them according to their micro-technological characteristics, not their macro-economic effects. Identifying technologies as GPTs requires recognizing their evolutionary nature, and accepting possible uncertainties concerning marginal cases. Many of the existing ‘tests’ of whether particular technologies are GPTs are based on misunderstandings either of what GPT theory predicts or what such tests can establish. The development of formal GPT theories is outlined, showing that only the early theories predicted the inevitability of GPT-induced showdown and surges. More recent GPT theories, designed to model the characteristics of GPTs, do not imply the necessity of specific macro effects. We show that GPTs can rejuvenate the growth process without causing slowdowns or surges. We conclude that existing criticisms of GPT theory can be resolved and that the concept remains useful for economic theory. Key Words: General Purpose Technologies, technological change, patents, slowdowns, surges, growth theories, productivity JEL Classes: N00, O30, O33, O40, O41. 1 GENERAL PURPOSE TECHNOLOGIES IN THEORY, APPLICATIONS AND CONTROVERSY: A REVIEW1 Two main approaches to studying economic growth over the very long-run are found in the literature: the analysis of economic historians and the formal model building of economic theorists.
    [Show full text]
  • Reading Horizons Vol. 25, No. 3
    Reading Horizons: A Journal of Literacy and Language Arts Volume 25 Issue 3 April 1985 Article 10 4-1985 reading Horizons vol. 25, no. 3 Follow this and additional works at: https://scholarworks.wmich.edu/reading_horizons Part of the Education Commons Recommended Citation (1985). reading Horizons vol. 25, no. 3. Reading Horizons: A Journal of Literacy and Language Arts, 25 (3). Retrieved from https://scholarworks.wmich.edu/reading_horizons/vol25/iss3/10 This Complete Issue is brought to you for free and open access by the Special Education and Literacy Studies at ScholarWorks at WMU. It has been accepted for inclusion in Reading Horizons: A Journal of Literacy and Language Arts by an authorized editor of ScholarWorks at WMU. For more information, please contact wmu- [email protected]. ~1tfJ---­ HORIZONS VOLUME 25, NUMBER 3 Editor - Ken VanderMeulen College of Fnucation Western Michigan University Kalarrazoo. MI 49008 READING HORIZONS has been published quarterly since 1960. on the campus of Western Michi~pn University in Kalarrazoo. As a journal devoted to the teaching of reading at all levels. it provides all interested professio~lls with the ideas. reports. and important developnents that constitute the ever widening horizons of reading. Copyright 1985 Western Michigan University READING HORIZONS READING HORIZONS (ISSN 0034-0502) is published quart.erly by the College of Education at Western Michigan University, in Kalarmzoo, Mi chigan, 49008. Second Class Postage paid at KalQIlTIzoo, Michigan. Po:::;trm:::;tcr: Send c.ulrln;:...;:...; Cl!clilgC:"'; GU WIVIU, [LEADING IIOIUZONS, Editor, Kalarmzoo, MI, 49008. SUBSCRIPTIONS Subscriptions are available at $10.00 per year for individuals, $12.00 for institutions.
    [Show full text]
  • Sociobiology and the Quest for Human Nature
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln Anthropology Faculty Publications Anthropology, Department of August 1987 Precis of Vaulting Ambition: Sociobiology and the Quest for Human Nature Philip Kitcher University of California, San Diego, La Jolla, California Patrick Bateson (Comment by) University of Cambridge, Cambridge, England Jon Beckwith (Comment by) Department of Microbiolgoy and Molecular Genetics, Harvard Medical School, Boston, Mass. Irwin S. Bernstein (Comment by) Department of Psychology, University of Georgia, Athens, GA Patricia Smith Churchland (Comment by) Philosophy Department and Cognitive Science Program, University of California, San Diego, La Jolla, FCaliforniaollow this and additional works at: https://digitalcommons.unl.edu/anthropologyfacpub See P nextart of page the forAnthr additionalopology authorsCommons Kitcher, Philip; Bateson, Patrick (Comment by); Beckwith, Jon (Comment by); Bernstein, Irwin S. (Comment by); Smith Churchland, Patricia (Comment by); Draper, Patricia (Comment by); Dupre, John (Comment by); Futterman, Andrew (Comment by); Ghiselin, Michael T. (Comment by); Harpending, Henry (Comment by); Johnston, Timothy D. (Comment by); Allen, Garland E. (Comment by); Lamb, Michael E. (Comment by); McGrew, W. C. (Comment by); Plotkin, H. C. (Comment by); Rosenberg, Alexander (Comment by); Saunders, Peter T. (Comment by); Ho, Mae-Wan (Comment by); Singer, Peter (Comment by); Smith, Eric Aiden (Comment by); Smith, Peter K. (Comment by); Sober, Elliot (Comment by); Stenseth, Nils C. (Comment by); and Symons, Donald (Comment by), "Precis of Vaulting Ambition: Sociobiology and the Quest for Human Nature" (1987). Anthropology Faculty Publications. 16. https://digitalcommons.unl.edu/anthropologyfacpub/16 This Article is brought to you for free and open access by the Anthropology, Department of at DigitalCommons@University of Nebraska - Lincoln.
    [Show full text]
  • Using High-Order Prior Belief Predictions in Hierarchical Temporal Memory for Streaming Anomaly Detection
    Rochester Institute of Technology RIT Scholar Works Theses 2018 Using High-Order Prior Belief Predictions in Hierarchical Temporal Memory for Streaming Anomaly Detection Brody Kutt [email protected] Follow this and additional works at: https://scholarworks.rit.edu/theses Recommended Citation Kutt, Brody, "Using High-Order Prior Belief Predictions in Hierarchical Temporal Memory for Streaming Anomaly Detection" (2018). Thesis. Rochester Institute of Technology. Accessed from This Thesis is brought to you for free and open access by RIT Scholar Works. It has been accepted for inclusion in Theses by an authorized administrator of RIT Scholar Works. For more information, please contact [email protected]. MS IN COMPUTING AND INFORMATION SCIENCES ROCHESTER INSTITUTE OF TECHNOLOGY ROCHESTER, NEW YORK CERTIFICATE OF APPROVAL MS DEGREE THESIS The MS degree thesis of Brody Kutt has been examined and approved by the thesis committee as satisfactory for the thesis required for the MS degree in Computing and Information Sciences Dr. Dhireesha Kudithipudi, Thesis Advisor Dr. Zachary Butler, Co-Chair Dr. Nathan Cahill, Reader Dr. William Hewlett, External Chair Date i Using High-Order Prior Belief Predictions in Hierarchical Temporal Memory for Streaming Anomaly Detection by Brody Kutt Submitted to the B. Thomas Golisano College of Computing and Information Sciences Department of Computer Science in partial fulfillment of the requirements for the Master of Science Degree at the Rochester Institute of Technology 2018 ii iii Abstract Autonomous streaming anomaly detection can have a significant impact in any domain where continuous, real-time data is common. Often in these domains, datasets are too large or complex to hand label.
    [Show full text]
  • Arxiv:2009.01908V1 [Astro-Ph.SR] 3 Sep 2020
    Solar Physics DOI: 10.1007/•••••-•••-•••-••••-• Progress in Solar Cycle Predictions: Sunspot Cycles 24–25 in Perspective Dibyendu Nandy1,2 c Springer •••• Abstract The dynamic activity of the Sun – sustained by a magnetohydrodynamic dynamo mechanism working in its interior – modulates the electromagnetic, particulate and radiative environment in space. While solar activity variations on short timescale create space weather, slow long-term modulation forms the basis of space climate. Space weather impacts diverse space-reliant technologies while space climate influences planetary atmospheres and climate. Having prior knowl- edge of the Sun’s activity is important in these contexts. However, forecasting solar-stellar magnetic activity has remained an outstanding challenge. In this review, predictions for sunspot cycle 24 and the upcoming cycle 25 are summa- rized, and critically assessed. The analysis demonstrates that while predictions based on diverse techniques disagree across solar cycles 24–25, physics-based predictions for solar cycle 25 have converged and indicates a weak sunspot cycle 25. It is argued that this convergence in physics-based predictions is indicative of progress in the fundamental understanding of solar cycle predictability. Based on this understanding, resolutions to several outstanding questions related to solar cycle predictions are discussed. Keywords: Solar Activity; Sunspots; Solar Cycle Prediction; Magnetohydrody- namics; Solar Dynamo 1. The Case for Solar Cycle Predictions The space environment in the solar system is governed by the variable activity of the Sun. This variability is manifested in changing flux of solar radiation, B Dibyendu Nandy [email protected] arXiv:2009.01908v1 [astro-ph.SR] 3 Sep 2020 1 Center of Excellence in Space Sciences India, Indian Institute of Science Education and Research Kolkata, Mohanpur 741246, West Bengal, India 2 Department of Physical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur 741246, West Bengal, India SOLA: ms.tex; 7 September 2020; 0:23; p.
    [Show full text]
  • Annual FTI Letter: 2019 Review + 2020 Preview
    Annual FTI Letter: 2019 Review + 2020 Preview In 2009, I began creating an end-of-year inventory of signals, outliers and trends. It was my way of methodically reflecting and getting myself organized for the coming year. It is a practice I've refined and honed. Far from being a nostalgic look back at what was or might have been, or a list of predictions that might not be any more accurate than the local weather forecast, my annual inventory is something entirely different: a way to think about the evolution of technology, science and humanity as part of a long continuum. But this isn't just the end of a year, it is the beginning of a new decade. I am reminded of a few famous science fiction stories set in the 2020s. Soylent Green, a 1973 dystopian thriller film based on a book by Harry Harrison, was set in the year 2022, a time when global warming and pollution results in food and water shortages. Unchecked capitalism leads to an unprecedented, massive wealth divide, where affordable housing is scarce and human "furniture" fulfills all of the needs of the uber- rich. Another seminal work, Alan Moore and David Lloyd's V for Vendetta, was adapted by the Wachowskis. In the wake of an unstoppable super-virus, regulation and nationalism lead to the catastrophic rise of a police state in the year 2020. Masked freedom fighters are pitted against the government -- difficult decisions have to be made. Page "1 It's hard not to think about the coming decade within the context of these two stories.
    [Show full text]
  • Educational Guide Finiansonbroadway.Com Music by Burton Lane Book & Lyrics by Yip Harburg Book by Fred Saidy Educational Guide
    educational guide FiniansonBroadway.com music By Burton lane Book & lyrics By yiP harBurg Book By Fred saidy educational guide written By Scott Lupi contriButing editors Susan Fuller, Marty Johnson, Lindsay Weiner Project manager Susan Fuller Production Photos © Joan Marcus layout & design: Danielle Johnson and Kevin M. Johnson created By iTheatrics.com www.FiniansonBroadway.com © iTheatrics 2009 INTRODUCTION HOW TO USE ThIS GUIDE ................................................................................2 BeFORE THE SHOW .......................................................................... 3–10 ABOUT THE SHOW Basic Plot Synopsis .................................................................................... 4–5 Character Descriptions ...................................................................................6 About the Writers ..........................................................................................7 Production History of the Show.......................................................................8 Vocabulary from Finian’s Rainbow ...................................................................8 IMPORTANT IDEAS IN THE SHOW Sharecroppers of the South ............................................................................9 Rainbows .....................................................................................................10 Leprechauns ................................................................................................10 aFTER THE SHOW ..........................................................................
    [Show full text]