Layout Inference and Table Detection in Spreadsheet Documents

Total Page:16

File Type:pdf, Size:1020Kb

Layout Inference and Table Detection in Spreadsheet Documents Layout Inference and Table Detection in Spreadsheet Documents Dissertation submitted April 20, 2020 by M.Sc. Elvis Koci born May 09, 1987 in Sarande, Albania at Technische Universität Dresden and Universitat Politècnica de Catalunya Supervisors: Prof. Dr.-Ing. Wolfgang Lehner Assoc. Prof. Dr. Oscar Romero IT BI D C 2 THESIS DETAILS Thesis Title: Layout Inference and Table Detection in Spreadsheet Documents Ph.D. Student: Elvis Koci Supervisors: Prof. Dr.-Ing. Wolfgang Lehner, Technische Universität Dresden Assoc. Prof. Dr. Oscar Romero, Universitat Politècnica de Catalunya The main body of this thesis consists of the following peer-reviewed publications: 1. Elvis Koci, Maik Thiele, Oscar Romero, and Wolfgang Lehner. A machine learning approach for layout inference in spreadsheets. In IC3K 2016: The 8th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Man- agement: volume 1: KDIR, pages 77–88. SciTePress, 2016 2. Elvis Koci, Maik Thiele, Oscar Romero, and Wolfgang Lehner. Cell classification for layout recognition in spreadsheets. In Ana Fred, Jan Dietz, David Aveiro, Kecheng Liu, Jorge Bernardino, and Joaquim Filipe, editors, Knowledge Discovery, Knowledge Engineering and Knowledge Management (IC3K ‘16: Revised Selected Papers), volume 914 of Communications in Computer and Information Science, pages 78–100. Springer, Cham, 2019 3. Elvis Koci, Maik Thiele, Oscar Romero, and Wolfgang Lehner. Table identification and reconstruction in spreadsheets. In the International Conference on Advanced Infor- mation Systems Engineering (CAiSE), pages 527–541. Springer, 2017 4. Elvis Koci, Maik Thiele, Wolfgang Lehner, and Oscar Romero. Table recognition in spreadsheets via a graph representation. In the 13th IAPR International Workshop on Document Analysis Systems (DAS), pages 139–144. IEEE, 2018 5. Elvis Koci, Maik Thiele, Oscar Romero, and Wolfgang Lehner. A genetic-based search for adaptive table recognition in spreadsheets. In 2019 International Confer- ence on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia, September 20-25, 2019, pages 1274–1279. IEEE, 2019 6. Elvis Koci, Maik Thiele, Josephine Rehak, Oscar Romero, and Wolfgang Lehner. DECO: A dataset of annotated spreadsheets for layout and table recognition. In 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Syd- ney, Australia, September 20-25, 2019, pages 1280–1285. IEEE, 2019 7. Elvis Koci, Dana Kuban, Nico Luettig, Dominik Olwig, Maik Thiele, Julius Gonsior, Wolfgang Lehner, and Oscar Romero. Xlindy: Interactive recognition and informa- tion extraction in spreadsheets. In Sonja Schimmler and Uwe M. Borghoff, editors, Proceedings of the ACM Symposium on Document Engineering 2019, Berlin, Germany, September 23-26, 2019, pages 25:1–25:4. ACM, 2019 This thesis is jointly submitted to the Faculty of Computer Science at Technische Univer- sität Dresden (TUD) and the Department of Service and Information System Engineering (ESSI) at Universitat Politècnica de Catalunya (UPC), in partial fulfillment of the require- ments within the scope of the IT4BI-DC program for the joint Ph.D. degree in computer science (TUD: Dr.-Ing., UPC: Ph.D. in Computer Science). The thesis is not submitted to any other organization at the same time. The author has obtained the rights to include parts of the already published articles in the thesis. 3 4 ABSTRACT Spreadsheet applications have evolved to be a tool of great importance for businesses, open data, and scientific communities. Using these applications, users can perform var- ious transformations, generate new content, analyze and format data such that they are visually comprehensive. The same data can be presented in different ways, depending on the preferences and the intentions of the user. These functionalities make spreadsheets user-friendly, but not as much machine-friendly. When it comes to integrating with other sources, the free-for-all nature of spreadsheets is disadvantageous. It is rather difficult to algorithmically infer the structure of the data when they are intermingled with formatting, formulas, layout artifacts, and textual meta- data. Therefore, user involvement is often required, which results in cumbersome and time-consuming tasks. Overall, the lack of automatic processing methods limits our abil- ity to explore and reuse a great amount of rich data stored into partially-structured doc- uments such as spreadsheets. In this thesis, we tackle this open challenge, which so far has been scarcely investigated in literature. Specifically, we are interested in extracting tabular data from spreadsheets, since they hold concise, factual, and to a large extend structured information. It is easier to process such information, in order to make it available to other applications. For in- stance, spreadsheet (tabular) data can be loaded into databases. Thus, these data would become instantly available to existing or new business processes. Furthermore, we can eliminate the risk of losing valuable company knowledge, by moving data or integrating spreadsheets with other more sophisticated information management systems. To achieve the aforementioned objectives and advancements, in this thesis, we develop a spreadsheet processing pipeline. The requirements for this pipeline were derived from a large scale empirical analysis of real-world spreadsheets, from business and Web settings. Specifically, we propose a series of specialized steps that build on top of each other with the goal of discovering the structure of data in spreadsheet documents. Our approach is bottom-up, as it starts from the smallest unit (i.e., the cell) to ultimately arrive at the individual tables of the sheet. Additionally, this thesis makes use of sophisticated machine learning and optimization techniques. In particular, we apply these techniques for layout analysis and table de- tection in spreadsheets. We target highly diverse sheet layouts, with one or multiple ta- bles and arbitrary arrangement of contents. Moreover, we foresee the presence of textual metadata and other non-tabular data in the sheet. Furthermore, we work even with prob- lematic tables (e.g., containing empty rows/columns and missing values). Finally, we bring flexibility to our approach. This not only allows us to tackle the above-mentioned challenges but also to reuse our solution for different (spreadsheet) datasets. 5 6 CONTENTS 1 INTRODUCTION 13 1.1 Motivation ..................................... 14 1.2 Contributions ................................... 15 1.3 Outline ....................................... 16 2 FOUNDATIONS AND RELATED WORK 19 2.1 The Evolution of Spreadsheet Documents ................. 20 2.1.1 Spreadsheet User Interface and Functionalities .......... 21 2.1.2 Spreadsheet File Formats ........................ 22 2.1.3 Spreadsheets Are Partially-Structured ................ 23 2.2 Analysis and Recognition in Electronic Documents ............ 23 2.2.1 A General Overview of DAR ...................... 23 2.2.2 DAR in Spreadsheets ........................... 26 2.3 Spreadsheet Research Areas ......................... 26 2.3.1 Layout Inference and Table Recognition .............. 27 2.3.2 Unifying Databases and Spreadsheets ............... 29 2.3.3 Spreadsheet Software Engineering .................. 30 2.3.4 Data Wrangling Approaches ..................... 31 3 AN EMPIRICAL STUDY OF SPREADSHEET DOCUMENTS 33 3.1 Available Corpora ................................ 34 3.2 Creating a Gold Standard Dataset ...................... 36 3.2.1 Initial Selection .............................. 36 3.2.2 Annotation Methodology ........................ 37 3.3 Dataset Analysis ................................. 42 3.3.1 Takeaways from Business Spreadsheets ............... 42 3.3.2 Comparison Between Domains .................... 47 3.4 Summary and Discussion ............................ 50 3.4.1 Datasets for Experimental Evaluation ................. 52 3.4.2 A Processing Pipeline .......................... 52 4 LAYOUT ANALYSIS 55 4.1 A Method for Layout Analysis in Spreadsheets ............... 56 7 4.2 Feature Extraction ................................ 58 4.2.1 Content Features ............................. 58 4.2.2 Style Features ............................... 59 4.2.3 Font Features ............................... 60 4.2.4 Formula and Reference Features ................... 60 4.2.5 Spatial Features .............................. 61 4.2.6 Geometrical Features .......................... 63 4.3 Cell Classification ................................. 63 4.3.1 Classification Datasets .......................... 64 4.3.2 Classifiers and Assessment Methods ................. 65 4.3.3 Optimum Under-Sampling ....................... 66 4.3.4 Feature Selection ............................. 68 4.3.5 Parameter Tuning ............................. 71 4.3.6 Classification Evaluation ......................... 72 4.4 Layout Regions .................................. 79 4.5 Summary and Discussions ............................ 82 5 CLASSIFICATION POST-PROCESSING 83 5.1 Dataset for Post-Processing ........................... 84 5.2 Pattern-Based Revisions ............................. 85 5.2.1 Misclassification Patterns ........................ 86 5.2.2 Relabeling Cells .............................. 87 5.2.3 Evaluating the Patterns ......................... 87 5.3 Region-Based Revisions ............................. 88 5.3.1 Standardization Procedure ....................... 88 5.3.2 Extracting Features from Regions ................... 91 5.3.3 Identifying
Recommended publications
  • OFFICE the Text in the Main Editing Window and Italicize It
    ic or bold to the text you type in the Insert Endnote window, so if you W'ant to itali- cize a book title in a note, you have to insert the note with no italics, then edit OFFICE the text in the main editing window and italicize it. EasySpreadsheet handled our complex sample worksheets reasonably well, although it did not even try to open the charts. Our 4MB Microsoft Excel spread- sheet opened slowly but accurately. Easy0ffice7.0 labeled Filel and File2. Each of those EasySpreadsheet makes life simple for be- E-Press Corp, www.e-press.com. menus contains more than 20 items, in- ginners by displaying a vortical Totals ••COO cluding PDF and HTML export and items column on the right edge, showing the EasyOffice 7.0 packs more applications that store and search backup, grandfather, sums of all rows, and a horizontal Totals and utilities into one freeware package and great-grandfather versions of your column at the foot, with the sums of all than you'll find in any high-priced suite. A files—features you won't find in better- columns, it supports about 125 functions, <?6MB installer expands into a word known suites. Some menu items have but none as advanced as pivot tables, pn'cessor, spreadsheet, calculator, picture shortcut keys that let you access them by array formulas, conditional formatting, fil- editor. PDr editor, presentation program, typing an underlined letter; others are ac- teritig, and macros. You cannot customize and e-mail client. EasyOffice also contains the built-in number several applications that no other office formats.
    [Show full text]
  • Openoffice.Org Теория И Практика
    В серии: Библиотека ALT Linux OpenOffice.org Теория и практика Иван Хахаев Вадим Машков Галина Губкина Инна Смирнова Дмитрий Смирнов Роман Козодаев Елена Смородина Татьяна Турченюк Москва ALT Linux; БИНОМ. Лаборатория знаний 2008 УДК 004.91 ББК 32.97 O60 Авторы: Хахаев И., Машков В., Губкина Г., Смирнова И., Смирнов Д., Козодаев Р., Смородина Е., Турченюк Т. OpenOffice.org: Теория и практика / И. Хахаев, В. Машков, O60 Г. Губкина и др. М. : ALT Linux ; БИНОМ. Лаборатория знаний, 2008. 319 с. : ил. (Библиотека ALT Linux). ISBN 978-5-94774-891-8 Данная книга открывает многие нетривиальные возможности офис- ного пакета OpenOffice.org (версии 2 и выше), которые поясняются на примерах конкретных задач. Рассмотрены автоматическая нумерация и перекрестные ссылки при оформлении курсовой работы, тонкости на- бора математических формул, вычислительные возможности электрон- ных таблиц на примере задач из курсов экономического цикла, создание презентаций и составление собственной галереи элементов для создания схем и многое другое. Для широкого круга пользователей офисных при- ложений. Сайт книги: http://books.altlinux.ru/openoffice. На сайте книги вы найдёте: • Обновлённую электронную версию текста книги с исправлениями. • Файлы примеров, использованных в книге. • Дополнительные материалы, не вошедшие в книгу. УДК 004.91 ББК 32.97 Как приобрести печатный экземпляр книги? Приобрести книгу в интернет-магазине ALT Linux. По вопросам оптовых и мелкооптовых заку- пок обращайтесь на [email protected]. Каждый имеет право воспроизводить, распространять и/или вносить измене- ния в настоящий Документ в соответствии с условиями GNU Free Documentation License, Версией 1.2 или любой более поздней версией, опубликованной Free Software Foundation; Данный Документ содержит следующий текст, помещаемый на первую стра- ницу обложки: ¾В серии “Библиотека ALT Linux”¿.
    [Show full text]
  • Teaching Students to Model Neural Circuits and Neural Networks Using an Electronic Spreadsheet Simulator
    Behavior Research Methods, Instruments, & Computers 1985, 17(2),339-344 SESSION XI TEACHING APPLICATIONS OF MICROCOMPUTERS N. John Castellan, Presider Teaching students to model neural circuits and neural networks using an electronic spreadsheet simulator THOMAS T. HEWETT Drexel University, Philadelphia. Pennsylvania An electronic spreadsheet simulator can be used to enable students to conduct simulated micro­ electrode recording experiments. In addition, it can be used both to let students explore the oper­ ation of models of hypothetical neural networks and to let them design and develop their own neural models. A recent paper (Smith et al., 1984) described the educa­ demand. This calculation cycle often involves recalcula­ tional goals of student ownership of a personal microcom­ tion of new values for previously entered formulas that puter at Drexel University. In a later paper, Hewett and depend upon still other values that may have been changed Perkey (1984) argued that many computer programs in­ during or since the last calculation cycle. The typical tended for instructional purposes are impractical or of little spreadsheet simulator also offers a number of program­ use when each student has a personal machine. However, ming language capabilities, including the ability to required student access to a personal computer creates new manipulate strings and to do iterative calculations. In ad­ opportunities for the use of microcomputer applications dition, built-in functions can be combined to produce a programs. One obvious, but nontrivial, example is the im­ limited plotting capability, thereby providing both a pact of the word processor on what is considered accept­ graphical and a numerical method for illustrating relational able written work, from lab reports to term papers.
    [Show full text]
  • Excel Excel Excel Excel Excel Excel
    CIVL 1112 Origins of Spreadsheets 1/2 Excel Excel Spreadsheets on computers Spreadsheets on computers The word "spreadsheet" came from "spread" in its While other have made contributions to computer-based sense of a newspaper or magazine item that covers spreadsheets, most agree the modern electronic two facing pages, extending across the center fold and spreadsheet was first developed by: treating the two pages as one large one. The compound word "spread-sheet" came to mean the format used to present book-keeping ledgers—with columns for categories of expenditures across the top, invoices listed down the left margin, and the amount of each payment in the cell where its row and column intersect. Dan Bricklin Bob Frankston Excel Excel Spreadsheets on computers Spreadsheets on computers Because of Dan Bricklin and Bob Frankston's Bricklin has spoken of watching his university professor implementation of VisiCalc on the Apple II in 1979 and the create a table of calculation results on a blackboard. IBM PC in 1981, the spreadsheet concept became widely known in the late 1970s and early 1980s. When the professor found an error, he had to tediously erase and rewrite a number of sequential entries in the PC World magazine called VisiCalc the first electronic table, triggering Bricklin to think that he could replicate the spreadsheet. process on a computer, using the blackboard as the model to view results of underlying formulas. His idea became VisiCalc, the first application that turned the personal computer from a hobby for computer enthusiasts into a business tool. Excel Excel Spreadsheets on computers Spreadsheets on computers VisiCalc was the first spreadsheet that combined all VisiCalc went on to become the first killer app, an essential features of modern spreadsheet applications application that was so compelling, people would buy a particular computer just to use it.
    [Show full text]
  • The Ultimate Guide to Google Sheets Everything You Need to Build Powerful Spreadsheet Workflows in Google Sheets
    The Ultimate Guide to Google Sheets Everything you need to build powerful spreadsheet workflows in Google Sheets. Zapier © 2016 Zapier Inc. Tweet This Book! Please help Zapier by spreading the word about this book on Twitter! The suggested tweet for this book is: Learn everything you need to become a spreadsheet expert with @zapier’s Ultimate Guide to Google Sheets: http://zpr.io/uBw4 It’s easy enough to list your expenses in a spreadsheet, use =sum(A1:A20) to see how much you spent, and add a graph to compare your expenses. It’s also easy to use a spreadsheet to deeply analyze your numbers, assist in research, and automate your work—but it seems a lot more tricky. Google Sheets, the free spreadsheet companion app to Google Docs, is a great tool to start out with spreadsheets. It’s free, easy to use, comes packed with hundreds of functions and the core tools you need, and lets you share spreadsheets and collaborate on them with others. But where do you start if you’ve never used a spreadsheet—or if you’re a spreadsheet professional, where do you dig in to create advanced workflows and build macros to automate your work? Here’s the guide for you. We’ll take you from beginner to expert, show you how to get started with spreadsheets, create advanced spreadsheet-powered dashboard, use spreadsheets for more than numbers, build powerful macros to automate your work, and more. You’ll also find tutorials on Google Sheets’ unique features that are only possible in an online spreadsheet, like built-in forms and survey tools and add-ons that can pull in research from the web or send emails right from your spreadsheet.
    [Show full text]
  • What Is PC MOUSE????
    PC MOUSE FACT SHEET September 1983 What is PC MOUSE???? PC Mouse is a small hand-held pointing and selecting device used to move the cursor on your screen. As you slide PC Mouse on the mouse pad, the cursor moves correspondingly. Three buttons on top of mouse allow you to activate functions by the use of pop-up menus. PC Mouse software with pop-up menus has been created for some of your favorite software programs: Lotus 1 -2-3™, Personal Editor™, VisiCalc®, WordStar®, Volkswriter™ and Multiplan™. The pop-up menus are transparent to the underlying program, and no modification to your existing software is required. Pop-up menus allow you to select and execute commands at the press of a button. No longer do you have to memorize commands. Even if you are an expert typist, moving around in spreadsheets and wordprocessors will be faster and easier than ever before. PC Mouse interfaces through an RS-232 asynchronous communication port. Since there are no moving parts, no maintenance or cleaning is required. PC Mouse software is compatible with PC-DOS 1.10 and 2.00. Only 10K of memory and disk space are used by the mouse driver. ""—signifies manufacturers' trademark -"'—signifies manufacturers' registered trademark APPLICATION: With the help of the function buttons/pop-up menus, PC Mouse can be used to do word processing, spreadsheets and graphics. PC MOUSE PACKAGE INCLUDES: HARDWARE Optical mouse and pad, RS-232 interface box and cable, and a power supply. SOFTWARE Pop-up menu software Version 2.0 provides pop-up menus for Lotus 1 -2-3, Personal Editor, VisiCalc, WordStar, Volkswriter, and Multiplan.
    [Show full text]
  • Unit 6: Computer Software
    Computer Software Unit 6: Computer Software Introduction Collectively computer programs are known as computer software. This unit consisting of four lessons presents different aspects of computer software. Lesson 1 introduces software and its classification, system software which assists the users to develop programs for solving user problems is presented in Lesson 2. Many programs for widely used applications are available commercially. These programs are popularly known as application packages or package programs or simply packages. Advantages of package programs and brief outline of popular packages for word-processing, spreadsheet analysis, database management systems, desktop publication and graphic and applications are discussed in Lesson 3. Tasks for developing computer programs and brief introduction to some common programming languages are presented in Lesson 4. Lesson 1: Introduction and Classification 1.1 Learning Objectives On completion of this lesson you will be able to • understand the concept of software • distinguish between system software and application software • know components of system software and types of application software. 1.2 Software Software of a computer system is intangible rather than physical. It is the term used for any type of program. Software consists of statements, which instruct a computer to perform the required task. Without software a computer is simply a mass of electronic components. For a computer to input, store, make decisions, arithmetically manipulate and Software consists of output data in the correct sequence it must have access to appropriate statements, which instruct programs. Thus, the software includes all the activities associated with a computer to perform the required task. the successful development and operation of the computing system other than the hardware pieces.
    [Show full text]
  • Email and Spreadsheet Program Before Microsoft
    Email And Spreadsheet Program Before Microsoft downstairsHamlen splutter and pneumatically.poorly. Hari usually Long-waisted lunches hereunderMerill sizzled or canalsforgivingly. dimly when grouty Dennis bassets Pc spreadsheet will be confused on and email program before printing options and you may have automation capability of the standard features but in Send a spreadsheet in Numbers on Mac Apple Support. Ablebitscom Excel form-ins and Outlook tools. Now fan must evidence a password before from any circle the Excel. Microsoft updates and email spreadsheet program before. Request Email Account Request Voicemail delivery to your email. Microsoft Office your on a MacBook Coolblue Before 2359. This is same spreadsheet and email program before you to discover new may often look. In computing an else suite as a collection of productivity software usually containing at least in word processor spreadsheet and a presentation program There are watching different brands and types of office suites Popular office suites include Microsoft Office Google Workspace formerly G. Introduction to Microsoft Excel 2019Office 365 Drake Training. College Application Spreadsheet. Introduction to Excel Starter Microsoft Support. This is married to email program. Share them before inserting tables wizard, microsoft and email spreadsheet program before helping them before you indicate where someone is very unfortunate that you! The floating balloon then drag and expand inside the desired size is achieved. It is now sign application before and email program. Unlike the email and spreadsheet program before microsoft office applications that works just insert. Best Spreadsheet Software Microsoft Excel vs Google Sheets. Almost three weeks, outlook and place it tends to spreadsheet and program before helping to google offers a portion of your inbox.
    [Show full text]
  • Package 'Gnumeric'
    Package ‘gnumeric’ March 9, 2017 Version 0.7-8 Date 2017-03-09 Title Read Data from Files Readable by 'gnumeric' Author Karoly Antal <[email protected]>. Maintainer Karoly Antal <[email protected]> Depends R (>= 2.8.1), XML Imports utils Description Read data files readable by 'gnumeric' into 'R'. Can read whole sheet or a range, from several file formats, including the native format of 'gnumeric'. Reading is done by using 'ssconvert' (a file converter utility included in the 'gnumeric' distribution <http://projects.gnome.org/gnumeric/>) to convert the requested part to CSV. From 'gnumeric' files (but not other formats) can list sheet names and sheet sizes or read all sheets. License GPL (>= 2) Repository CRAN Date/Publication 2017-03-09 13:20:28 NeedsCompilation no R topics documented: read.gnumeric.sheet . .2 read.gnumeric.sheet.info . .6 read.gnumeric.sheets . .7 Index 9 1 2 read.gnumeric.sheet read.gnumeric.sheet Read data from a gnumeric (or MS Excel, Openoffice Calc, Xbase, Quatro Pro, Paradox, HTML, etc) spreadsheet or database file using ssconvert from the gnumeric distribution Description Read data from a sheet of a gnumeric (or other common spreadsheet or database) file to a data.frame. Requires an external program, ‘ssconvert’ (normally installed with gnumeric in ‘PATH’. (Gnumeric home page is http://projects.gnome.org/gnumeric/) (Note: last gnumeric release for windows is 1.12.17 from 2014) Calls ‘ssconvert’ to convert the input to CSV. ‘ssconvert’ can read several file formats (see Details below). Note: During conversion to CSV ‘ssconvert’ also evaluates formulas (e.g. ‘=sum(A1:A3)’) in cells, and emits the result instead of the formula.
    [Show full text]
  • Heat Transfer Data
    Appendix A HEAT TRANSFER DATA This appendix contains data for use with problems in the text. Data have been gathered from various primary sources and text compilations as listed in the references. Emphasis is on presentation of the data in a manner suitable for computerized database manipulation. Properties of solids at room temperature are provided in a common framework. Parameters can be compared directly. Upon entrance into a database program, data can be sorted, for example, by rank order of thermal conductivity. Gases, liquids, and liquid metals are treated in a common way. Attention is given to providing properties at common temperatures (although some materials are provided with more detail than others). In addition, where numbers are multiplied by a factor of a power of 10 for display (as with viscosity) that same power is used for all materials for ease of comparison. For gases, coefficients of expansion are taken as the reciprocal of absolute temper­ ature in degrees kelvin. For liquids, actual values are used. For liquid metals, the first temperature entry corresponds to the melting point. The reader should note that there can be considerable variation in properties for classes of materials, especially for commercial products that may vary in composition from vendor to vendor, and natural materials (e.g., soil) for which variation in composition is expected. In addition, the reader may note some variations in quoted properties of common materials in different compilations. Thus, at the time the reader enters into serious profes­ sional work, he or she may find it advantageous to verify that data used correspond to the specific materials being used and are up to date.
    [Show full text]
  • Spreadsheet Development Methodologies with Resolver
    Spreadsheet development using Resolver. Moving spreadsheets into the 21st Century. Kemmis & Thomas Spreadsheet development methodologies using Resolver. Moving spreadsheets into the 21st Century. Patrick Kemmis Giles Thomas Resolver Systems Ltd 17a Clerkenwell Road, London, EC1M 5RD, UK [email protected] ABSTRACT We intend to demonstrate the innate problems with existing spreadsheet products and to show how to tackle these issues using a new type of spreadsheet program – Resolver. It addresses the issues head-on and thereby moves the 1980’s “VisiCalc paradigm” on to match the advances in computer languages and user requirements. Continuous display of the spreadsheet grid and the equivalent computer program, together with the ability to interact and add code through either interface, provides a number of new methodologies for spreadsheet development. 1. INTRODUCTION Spreadsheets have many intrinsic problems stemming from the history of their development and the difficulties in changing the core user interface and coding language. The method of interaction has not significantly changed in over twenty years. The language used for their program code has not advanced with the developments in new dynamic computer languages. Resolver Systems has developed a new type of spreadsheet product, which addresses many of the issues identified in this paper and offers a new method of interaction and development. Developing robust spreadsheets is very hard using existing products. The rigorous testing of the computer software industry is absent from almost all business user-developed spreadsheets. The need to change as data requirements and calculations evolve invariably leads to their deterioration over time. The ever increasing size of data manipulated in spreadsheets only exacerbates the problems.
    [Show full text]
  • Po Box 5487, Berkeley, Ca 94705 (415)
    VOLUME 1, NUMBER 4, OCTOBER 1984 AN INTERNATIONAL NEWSLETTER FOR USERS OF MORROW'S COMPUTERS P.O. BOX 5487, BERKELEY, CA 94705 (415) 654-3798 • If you thought you couldn't afford hard disk performance, think again. • The MDS-E hard disk Micro Decision computer with 128K RAM • Seagate Sl)t" Hard Disk with S.4M bytes formatted (Second hard disk can be added) • 384K floppy disk backup. Superfast CP/M 3.0 operating system (compatible with most CP/M 2.2 software) • NewWord word processor. Correct-it spelling checker • New tilt & swivel monitor. Low profile keyboard. Morrow does it again. At $1999, this special introductory offer shatters the price barrier for hard disk computer systems • Call (800) 521-3493 (in California (408) 980-7462) for a dealer near you. Or write to Morrow, 600 McCormick Street, San Leandro, California 94577. CONTENTS EDITORIAL EXCHANGE Edi torial. ••••••••• 2 Letters to the Editor•• 6 COLUMNS The Can File •••• • Ed Niehaus 10 David's Q & A Colurm •• Dave Block 12 Fran The Mailbox ••• •• Stan Ahal t 14 MORROW USERS GROUPS Lost & Found Department ••••• •• Clarence Heier 18 Cleo .............. •• Lionel Johnston • 18 News About MJrrow Users Groups • •• Clarence Heier 19 THE CURIOUS NOVICE'S EXPERIENCE INSIGHT: Spreadsheet Calculators, Part I •• Art Zerrx:>n • 22 Manuals .. ................ •• Milton Levison 25 How To Tell \\hat MD You Have ••••• •• Brian Leyton 26 About Surge and Spike Protectors •••••••• ••• Jerry Sheperd 27 I Thought It Would Never Happen to Me •••• •• Rick Goul ian 28 Never Too Old to Start with a MOrrow • Herb Kahler • 30 WORDSTAR AND NEWWORD MOre Printing and Editing Concurrently with WordStar • Nick Mills •••• 33 Brightening Your Day with NeWWord •••••••••• Bill Steele 35 Progr~ing Your Function Keys with NeWWord ••••• Bill Steele.
    [Show full text]