Spaghetti Code for the 21St Century

Total Page:16

File Type:pdf, Size:1020Kb

Spaghetti Code for the 21St Century Web Applications – Spaghetti Code for the 21st Century Tommi Mikkonen and Antero Taivalsaari Sun Microsystems Laboratories P.O. Box 553 (TUT), FI-33101 Tampere Finland {tommi.mikkonen, antero.taivalsaari}@sun.com Abstract have reintroduced problems that had already been eliminated years ago in the aftermath of the “spaghetti The software industry is currently in the middle of a code wars” of the 1970s. 1 paradigm shift. Applications are increasingly written In this paper , we investigate web application for the World Wide Web. Unfortunately, the development from the viewpoint of software technologies used for web application development engineering principles such as modularity, consistency, today violate well-known software engineering and portability. We argue that current web principles, and they have reintroduced problems that technologies are inadequate in supporting many of had already been eliminated years ago in the aftermath these principles, but that there is no fundamental of the “spaghetti code wars” of the 1970s. In this reason for web applications to be any worse than paper, we investigate web application development conventional applications in any of these areas. Rather, from the viewpoint of software engineering principles. the current inadequacies are just an accidental We argue that current web technologies are consequence of the poor conceptual and technological inadequate in supporting many of these principles, but foundation of the web development technologies today. also that there is no fundamental reason for web The structure of this paper is as follows. Section 2 applications to be any worse than conventional provides an overview of web development, followed applications in any of these areas. Rather, the current by an introduction to software engineering principles in inadequacies are an accidental consequence of the Section 3. Sections 4 and 5 analyze web development poor conceptual and technological foundation of the technologies in view of the software engineering web development technologies today. principles and some additional topics, and Section 6 offers possible solutions to emerging problems. Finally, Section 7 concludes the paper. 1. Introduction 2. Web development – A view from the The software industry is in the middle of a trenches paradigm shift towards web-based software. Consequently, applications are increasingly written for The World Wide Web has undergone a number of the web rather than for any specific type of operating evolutionary phases. Initially, web pages were little system, CPU architecture or device. We believe that more than simple textual documents with limited user the trend towards web-based applications will continue interaction capabilities. Soon, graphics support and and even strengthen in the future, causing a form-based data entry were added. Gradually, with the fundamental change in the way people develop, deploy introduction of DHTML [12], it became possible to and use software. create increasingly interactive web pages with built-in Unfortunately, programming languages and support for advanced graphics and animation. software development environments and tools have not Today, we are in the middle of another major yet adapted to this ongoing shift. Today, software evolutionary step towards “Web 2.0” technologies, development for the web is based on a myriad of commonly associated with systems such as Ajax [5], technologies and tools with little consistency or elegance. Even worse, many of the technologies violate 1 An earlier version of this paper has been published as Sun Labs established software engineering principles, and they Technical Report TR-2007-166, June 2007. Ruby on Rails [35], and Google Web Toolkit (GWT) ebay.com, or cnn.com contain large chunks of HTML, [34]. However, these technologies are still hybrid CSS and JavaScript code, in which HTML definitions, solutions and only partial steps in the evolution style sheets and JavaScript functions are interspersed in towards using the web as a true application platform. no specific order, other than the order established by In general, they enable building web sites that behave the tools used to generate the web page. Consequently, much like desktop applications, for example, by the source document of such web sites is often nearly allowing web pages to be updated one user interface unreadable to humans. Furthermore, the document element at a time, rather than requiring the entire page usually contains generous hard-coded references to to be updated each time something changes. In this other similar documents and resources such as images paper, we refer to such applications as web or animations. These resources may be located applications. anywhere in the world, and they are usually included in The fundamental components of the web browser the document in the form of long Uniform Resource include the Hypertext markup language (HTML), Locators (URLs). Some of these entities do not even Cascading Style Sheets (CSS), the JavaScript scripting need to be available, in which case the browser ignores language, and the Document Object Model (DOM). In the references and uses a placeholder instead or the following, we provide a summary of these overlooks the entire item. This is enabled by the technologies: general principle to ignore erroneous or incomplete 1. HTML is the predominant markup language for the parts of the web documents. creation of web pages. It provides a means to If the document that represents the web site is describe the structure of text-based information in commonly difficult to read, the actual behavior of the a document – by denoting certain text as headings, document is even harder to understand. This is because paragraphs, lists, and so on – and to supplement the typical user interaction model of the web is such that text with interactive forms, embedded images, that a new web page is generated and sent to the and other objects. browser each time the user clicks on a link or a button 2. CSS is a stylesheet language that is used to on a page. While adequate for displaying documents, describe the presentational aspects of a document the approach is not ideal for Web 2.0 development, in written in a markup language. A stylesheet allows which the web browser runs applications that behave the stylistic rules of a web page (e.g. colors and like conventional desktop applications, with rich fonts) to be defined independently of the content desktop-style user interface and direct manipulation of the web page. capabilities beyond navigation of pages. The above technologies do not introduce a coherent 3. JavaScript. In addition to declarative HTML and foundation for real applications on the web. Rather, CSS definitions, a web document can also contain they reflect the evolution of the World Wide Web, in executable code. The most commonly used which new features have been added on top of existing scripting language on the web today is JavaScript features. For instance, the I/O model to generate new [10]. Syntactically JavaScript resembles the C and web pages on the fly when anything changes seems Java programming languages. However, in outright arcane compared to the direct manipulation practice JavaScript is a much more dynamic, capabilities provided by most desktop applications in interpreted language that borrows features from the 1980s and 1990s. other highly dynamic languages such as Smalltalk In order to compensate for the inadequacies of the [11], Lisp [20] and Self [36]. underlying technology, today's web development 4. DOM is a platform-independent way of environments rely extensively on tool support. Tools representing a collection of objects that constitute such as Adobe Dreamweaver and Microsoft a page in a web browser. The DOM allows a Expression Web are utilized for enhancing the end-user scripting language such as JavaScript to experience and for providing a more reasonable programmatically examine and change the web development experience. This has made web page. Effectively, the DOM serves as an interface development a lot less arduous. However, it has also – essentially a large shared data structure – exacerbated problems: If programmers had been fully between the browser and the scripting language, exposed to the gruesome details of web development, allowing the two to communicate with each other. they would have undoubtedly questioned some of the In practice, web pages that utilize the above four technological foundations and put more pressure on technologies – serving as the logical equivalent of improving the underlying technology. “binary code” of traditional applications – are often messy. Even major web sites like amazon.com, 3. Software engineering principles these technologies bring the need to apply software engineering principles to web development. In 1968, Edsger Dijkstra started his crusade against In general, web development in its current form “spaghetti code” [7]. Spaghetti code is a pejorative resembles software development in the 1970s before term for source code that has a complex and tangled real engineering principles were applied. We argue that control structure, especially one using many gotos, there is a need to introduce engineering principles to threads, global variables, or other "unstructured" web development. constructs. Next, we investigate web development in view of As highlighted by the spaghetti code discussions, software engineering principles. Our analysis is software engineering remained an undeveloped, independent of any particular web
Recommended publications
  • Introduction: C++ and Software Engineering
    Comp151 Introduction Background Assumptions • This course assumes that you have taken COMP102/103, COMP104 or an equivalent. The topics assumed are: – Basic loop constructs, e.g., for, while, repeat, etc. – Functions –Arrays – Basic I/O – Introduction to classes – Abstract Data Types – Linked Lists – Recursion – Dynamic Objects Why Take This Course You all know how to program, so why take this course? • In COMP104 you essentially only learned “the C part” of C++ and can write “small” C++ programs. • Most of the time you write code that is (almost) the same as code that’s been written many times before. How do you avoid wasting time “re-inventing the wheel”? How do you re-use coding effort? • What if you need to write a large program and/or work with a team of other programmers? How do you maintain consistency across your large program or between the different coders? • In this course you will learn the essence of Object Oriented Programming (OOP). The goal is to teach you how to design and code large software projects. A Short “History” of Computing • Early programming languages (e.g., Basic, Fortran) were unstructured. This allowed “spaghetti code” – code with a complex and tangled structure - with goto’s and gosub’s jumping all over the place. Almost impossible to understand. • The abuses of spaghetti code led to structured programming languages supporting procedural programming (PP) – e.g., Algol, Pascal, C. • While well-written C is easier to understand, it’s still hard to write large, consistent code. This inspired researchers to borrow AI concepts (from knowledge representation, especially semantic networks) resulting in object-oriented programming (OOP )languages (e.g., Smalltalk, Simula, Eiffel, Objective C, C++, Java).
    [Show full text]
  • COBOL-Skills, Where Art Thou?
    DEGREE PROJECT IN COMPUTER ENGINEERING 180 CREDITS, BASIC LEVEL STOCKHOLM, SWEDEN 2016 COBOL-skills, Where art Thou? An assessment of future COBOL needs at Handelsbanken Samy Khatib KTH ROYAL INSTITUTE OF TECHNOLOGY i INFORMATION AND COMMUNICATION TECHNOLOGY Abstract The impending mass retirement of baby-boomer COBOL developers, has companies that wish to maintain their COBOL systems fearing a skill shortage. Due to the dominance of COBOL within the financial sector, COBOL will be continually developed over at least the coming decade. This thesis consists of two parts. The first part consists of a literature study of COBOL; both as a programming language and the skills required as a COBOL developer. Interviews were conducted with key Handelsbanken staff, regarding the current state of COBOL and the future of COBOL in Handelsbanken. The second part consists of a quantitative forecast of future COBOL workforce state in Handelsbanken. The forecast uses data that was gathered by sending out a questionnaire to all COBOL staff. The continued lack of COBOL developers entering the labor market may create a skill-shortage. It is crucial to gather the knowledge of the skilled developers before they retire, as changes in old COBOL systems may have gone undocumented, making it very hard for new developers to understand how the systems work without guidance. To mitigate the skill shortage and enable modernization, an extraction of the business knowledge from the systems should be done. Doing this before the current COBOL workforce retires will ease the understanding of the extracted data. The forecasts of Handelsbanken’s COBOL workforce are based on developer experience and hiring, averaged over the last five years.
    [Show full text]
  • Programming Languages
    Lecture 19 / Chapter 13 COSC1300/ITSC 1401/BCIS 1405 12/5/2004 Lecture 19 / Chapter 13 COSC1300/ITSC 1401/BCIS 1405 12/5/2004 General Items: Computer Program • Lab? Ok? • Read the extra credits • A series of instructions that direct a computer to perform tasks • Need to come to class • Such as? Who is the programmer? • Have a quiz / no books / use notes -> What is the big idea • School is almost over • Programming language is a series of rules for writing the instructions • • There are hundreds of computer programs – need-based! Reading Materials: Programming language • - Two basic types: Low- and high-level programming languages Miscellaneous: o Low-level: Programming language that is machine-dependent ° Must be run on specific machines o High-level: Language that is machine-independent ° Can be run on different types of machines Programming Language Low Level High Level Machine Assembly Language Language Procedural Nonprocedural Remember: Ultimately, everything must be converted to the machine language! Object Oriented F.Farahmand 1 / 12 File: lec14chap13f04.doc F.Farahmand 2 / 12 File: lec14chap13f04.doc Lecture 19 / Chapter 13 COSC1300/ITSC 1401/BCIS 1405 12/5/2004 Lecture 19 / Chapter 13 COSC1300/ITSC 1401/BCIS 1405 12/5/2004 Categories of programming languages o Nonprocedural language -> Programmer specifies only what the - Machine language program should accomplish; it does not explain how o Only language computer understands directly - Forth-generation language - Assembly language o Syntax is closer to human language than that
    [Show full text]
  • Techniques for Understanding Unstructured Code Mel A
    Association for Information Systems AIS Electronic Library (AISeL) International Conference on Information Systems ICIS 1985 Proceedings (ICIS) 1985 Techniques for Understanding Unstructured Code Mel A. Colter University of Colorado at Colorado Springs Follow this and additional works at: http://aisel.aisnet.org/icis1985 Recommended Citation Colter, Mel A., "Techniques for Understanding Unstructured Code" (1985). ICIS 1985 Proceedings. 6. http://aisel.aisnet.org/icis1985/6 This material is brought to you by the International Conference on Information Systems (ICIS) at AIS Electronic Library (AISeL). It has been accepted for inclusion in ICIS 1985 Proceedings by an authorized administrator of AIS Electronic Library (AISeL). For more information, please contact [email protected]. Techniques for Understanding Unstructured Code Mel A. Colter Associate Professor of Management Science and Information Systems College of Business Administration University of Colorado at Colorado Springs P.O. Box 7150 Colorado Springs, Colorado 80933-7150 ABSTRACT Within the maintenance activity, a great deal of time is spent in the process of understanding unstructured code prior to changing or fixing the program. This involves the comprehension of complex control structures. While automated processes are available to structure entire programs, there is a need for less formal structuring processes to be used by practicing profes- sionals on small programs or local sections of code. This paper presents methods for restruc- turing complex sequence, selection, and iteration structures into structured logic. The pro- cedures are easily taught and they result in solutions of reduced complexity as compared to the original code. Whether the maintenance programmer uses these procedures simply for understanding, or for actually re-writing the program, they will,simplify efforts on unstruc- tured code.
    [Show full text]
  • PDF Version for Printing
    SI 413, Unit 9: Control Daniel S. Roche ([email protected]) Fall 2018 In our final unit, we will look at the big picture. Within a program, the “big picture” means how the program flows from line to line during its execution. We’ll also see some of the coolest and most advanced features of modern languages. We’ve talked about many different aspects of computer programs in some detail - functions, scopes, variables, types. But how does a running program move between these different elements? What features control the path of a program’s execution through the code? These features are what constitutes the control flow in a programming language, and different languages support different methods of control flow. The concept of control flow in very early computer languages, and in modern assembly languages, is termed “unstructured” programming and consists only of sequential execution and branches (i.e., goto statements). More modern programming languages feature the control structures that you have come to know and love: selection (if/else and switch statements), iteration (while, for, and for/each loops), and recursion (function calls). 1 Unstructured programming The most basic control flow method is so ingrained in our understanding of how programs work that you might not even notice it: Sequencing is the process by which control flows continually to the next instruction. This comes from the earliest programs, which is still seen in assembly languages today with the program counter (address of the next instruction) that automatically gets incremented by 1 at each execution step. In most programming languages, sequencing happens automatically without any special instructions.
    [Show full text]
  • What Is Spaghetti Code (And Why You Should Avoid It)
    What is Spaghetti Code (And Why You Should Avoid It) Spaghetti code is not an endearing term. It’s a word that describes a type of code that some say will cause your technology infrastructure to fail. Despite the warnings about spaghetti code, many enterprise businesses are faced with issues that involve the set of processes and errors that make up spaghetti coding. This occurs for a number of reasons but can be detrimental to your overall infrastructure, most experts believe. In this article, we’ll talk about spaghetti code as a concept and what it means for your organization. What is Spaghetti Code? Spaghetti code is a pejorative piece of information technology jargon that is caused by factors like unclear project scope of work, lack of experience and planning, an inability to conform a project to programming style rules, and a number of other seemingly small errors that build up and cause your code to be less streamlined overtime. Typically, spaghetti code occurs when multiple developers work on a project over months or years, continuing to add and change code and software scope with optimizing existing programming infrastructure. This usually results in somewhat unplanned, convoluted coding structures that favor GOTO statements over programming constructs, resulting in a program that is not maintainable in the long-run. For businesses, creating a program only to have it become unmaintainable after years of work (and money) has gone into the project costs, IT managers, and other resources. In addition, it makes programmers feel frustrated to spend hours on coding an infrastructure only to have something break, and then they have to sift through years of work, often managed by different developers, to identify the issue, if they can solve it at all.
    [Show full text]
  • Generalized Structured Programs and Loop Trees Ward Douglas Maurer∗
    Science of Computer Programming 67 (2007) 223–246 www.elsevier.com/locate/scico Generalized structured programs and loop trees Ward Douglas Maurer∗ Computer Science Department, George Washington University, Washington, DC 20052, USA Received 22 September 2005; received in revised form 1 November 2006; accepted 21 February 2007 Available online 13 April 2007 Abstract Any directed graph, even a flow graph representing “spaghetti code”, is shown here to have at least one loop tree, which is a structure of loops within loops in which no loops overlap. The nodes of the graph may be rearranged in such a way that, with respect to their new order, every edge proceeds in the forward direction except for the loopbacks. Here a loopback goes from somewhere in a loop L to the head of L. We refer to such a rearrangement as a generalized structured program, in which forward goto statements remain unrestricted. Like a min-heap or a max-heap, a loop tree has an array representation, without pointers; it may be constructed in time no worse than O(n2) for any program written in this fashion. A scalable version of this construction uses a label graph, whose only nodes are the labels of the given program. A graph has a unique loop tree if and only if it is reducible. c 2007 Elsevier B.V. All rights reserved. Keywords: Generalized structured program; GSP; Loop trees; Reducible graphs; Overlapping loops; Strongly connected components 1. Introduction There exist today many variants on the idea of a structured program. B¨ohm and Jacopini [8] consider programs which are decomposable into Π , Φ,andΔ (in other words, sequencing, if -then-else,andwhile loops); these programs contain no goto logic at all, except what is implied by the constructions Φ and Δ.
    [Show full text]
  • Structured Programming
    Structured Programming Mr. Dave Clausen La Cañada High School I. What is programming? A computer can only carry out a small number of instructions or simple calculations. A computer can only solve the problem if it is broken down into smaller steps. 2 II. What is Structured programming? An attempt to formalize the logic and structure of programs. (i.e.) Procedure Bubble_Sort (Var Original, Duplicate, Sorted : ListType); {Pre: The array is filled with ramdom numbers. Post: The numbers will be sorted.} Var Element, Index : Integer; Begin WriteLn ('Sorting...'); For Element := 1 to MaxEntries do For Index := MaxEntries downto (Element+1) do If Original[Index] < Original[Index-1] Then Swap (Original[Index], Original[Index-1]); Create_Sorted(Original, Sorted); Recreate_Original(Original, Duplicate); Pause End; {Bubble} 3 III. What is the Purpose of Structured Programming? To make computer programs – Easier to read – Easier to debug – Easier to understand – Easier to maintain To allow programmers to work as a team 4 Purpose 2 To reduce testing time To increase programming productivity To increase clarity by reducing the programs’ complexity To decrease maintenance and effort 5 IV. Why is this special programming necessary? Programming in the 60's: "fiddling" with the program until it worked. Spaghetti Code "Spaghetti Code" Spaghetti Code.txt 5 REM SPAGHETTI CODE 10 GOTO 40 20 PRINT “THIS IS AN EXAMPLE” 30 GOTO 70 40 GOTO 20 50 PRINT “OF SPAGHETTI CODE” 60 GOTO 80 70 GOTO 50 80 END 6 Avoid using GOTO statements Edger W. Dijkstra 1968 “The GOTO statement should be abolished from all higher level programming languages…” “…The GOTO statement is just too primitive; it is too much of an invitation to make a mess of one’s program.” Mr.
    [Show full text]
  • Handout 7 Loops
    p October 30, 2001 ­c Achim Jung and Uday Reddy Handout 7 Loops 1. Using branching to save code. On Handout 5 we saw how by giving the processor control over its program counter we can implement alternative execution paths: flow diagram: machine code: condition branch if positive no yes condition C ½ C C ¾ statement ½ statement branch unconditionally C ¾ By using two branching instructions we can make sure that although machine instructions are lined up linearly, only one C C ¾ of the regions ½ and is executed. From this example, it is easy to see how any flow diagram could be translated into machine code. Branching can also be used to avoid a repetition of code, as in the following examples: infinite loop: while - do loop: loop body loop body condition branch if positive 2. Spaghetti code. In a sense, however, branching is too powerful, because it allows us to write programs which are very intricate and hard to understand. Here is an example: region 1 region 2 The top half looks like an ordinary loop but then from the end of region 2 the program jumps back into the middle of the body of that loop. There is nothing in the design of machine language to preclude such programs. Similarly, the language of flow diagrams also allows arbitrary jumping from one place to another. One speaks of spaghetti code for this “style” of programming. Early programming languages were very close to actual machine code. FORTRAN and BASIC, for example, retain the concept of a numbered list of instructions with jumps from one location to another as the primary control construct.
    [Show full text]
  • Structured Programming.Sample
    PAPER NO. CT 33 SECTION 3 CERTIFIED INFORMATION COMMUNICATION TECHNOLOGISTS (CICT) STRUCTURED PROGRAMMING STUDY TEXT www.masomomsingi.co.ke Contact 0728 776 317 1 KASNEB SYLLABUS STRUCTURED PROGRAMMING GENERAL OBJECTIVE This paper is intended to equip the candidate with the knowledge, skills and attitude that will enable him/her to apply the structured programming approach to develop programs LEARNING OUTCOMES A candidate who passes this paper should be able to; Analyse a problem and design an appropriate solution Write codes using C programming language Test and debug a structured program code Produce documentation, both user and technical, to support programs. CONTENT 1. Introduction to structured programming Introduction to programming languages Types of programming languages Generations of programming languages Programming approaches Language translators Basic concepts of structured programming Problem definition, structure and design Integrated development environment (IDE) 2. Programming basics Variables and data types Input/output statements Assignments Namespaces Comments Pre-processor directives Expressions and operators Control structures Writing and running a simple program 3. Functions/sub-programs Functions verses procedures Parameter passing Recursion Calling procedures Argument naming Event procedures Testing and debugging errors Writing and running a program using functions and procedures 2 www.masomomsingi.co.ke Contact 0728 776 317 4. Data structures Arrays Pointers Linked lists Unions
    [Show full text]
  • Statement Level Control Structures
    Chapter 8 Topics Chapter 8 • Introduction • Selection Statements • Iterative Statements • Unconditional Branching Statement-Level • Guarded Commands Control Structures • Conclusions Levels of Control Flow Evolution of Control Statements – Within expressions • FORTRAN I control statements were based – Controlled by operator precedence and associativity directly on IBM 704 hardware – Among program units • Much research and argument in the 1960s – Function call and concurrency about the issue – Among program statements – One important result: It was proven that all algorithms represented by flowcharts can be coded – Control statements and control structures with only two-way selection and pretest logical loops (Bohm and Jacopini, Structured Programming Theorem, 1966) – Any language with these features is “Turing- complete” – can compute anything that is computable Goto Statement Control Structures • At the machine level we really have only • A control structure is a control statement and conditional and unconditional branches (or the statements whose execution it controls gotos) • Programmers care more about readability and • Using conditional gotos we can create any kind writ ability than theoretical results of selection or iteration structure – While we can build very small languages and/or use • But undisciplined use of gotos can create very simple control structures, we would like to an spaghetti code expressive language with readable code – Languages often provide multiple control structure for iteration and selection to aid readability
    [Show full text]
  • Chapter 3: Understanding Structure Chapter Contents Book Title: Programming Logic and Design Printed By: Ronald Suchy ([email protected]) © 2013
    Chapter 3: Understanding Structure Chapter Contents Book Title: Programming Logic and Design Printed By: Ronald Suchy ([email protected]) © 2013 , Chapter 3 Undertanding Structure Chapter Introduction 3-1 The Disadvantages of Unstructured Spaghetti Code 3-2 Understanding the Three Basic Structures 3-3 Using a Priming Input to Structure a Program 3-4 Understanding the Reasons for Structure 3-5 Recognizing Structure 3-6 Structuring and Modularizing Unstructured Logic 3-7 Chapter Review 3-7a Chapter Summary 3-7b Key Terms 3-7c Review Questions 3-7d Exercise Chapter 3: Understanding Structure Chapter Introduction Book Title: Programming Logic and Design Printed By: Ronald Suchy ([email protected]) © 2013 , Chapter Introduction In this chapter, you will learn about: The disadvantages of unstructured spaghetti code The three basic structures—sequence, selection, and loop Using a priming input to structure a program The need for structure Recognizing structure Structuring and modularizing unstructured logic Chapter 3: Understanding Structure: 3-1 The Disadvantages of Unstructured Spaghetti Code Book Title: Programming Logic and Design Printed By: Ronald Suchy ([email protected]) © 2013 , 3-1 The Diadvantage of Untructured Spaghetti Code Professional business applications usually get far more complicated than the examples you have seen so far in Chapters 1 and 2. Imagine the number of instructions in the computer programs that guide an airplane’s flight or audit an income tax return. Even the program that produces your paycheck at work contains many, many instructions. Designing the logic for such a program can be a time-consuming task. When you add several thousand instructions to a program, including several hundred decisions, it is easy to create a complicated mess.
    [Show full text]