Towards a Better SCM: Revlog and Mercurial

Total Page:16

File Type:pdf, Size:1020Kb

Towards a Better SCM: Revlog and Mercurial Towards a Better SCM: Revlog and Mercurial Matt Mackall Selenic Consulting [email protected] Abstract projects demand scalability in multiple dimen- sions, including efficient handling of large numbers of files, large numbers of revisions, Large projects need scalable, performant, and and large numbers of developers. robust software configuration management sys- tems. If common revision control operations As an example of a moderately large project, are not cheap, they present a large barrier to we can look at the Linux kernel, a project proper software engineering practice. This pa- now in its second decade. The Linux kernel per will investigate the theoretical limits on source tree has tens of thousands of files and SCM performance, and examines how existing has collected on the order of a hundred thou- systems fall short of those ideals. sand changesets since adopting version control only a few years ago. It also has on the order I then describe the Revlog data storage scheme of a thousand contributors scattered around the created for the Mercurial SCM. The Revlog globe. It also continues to grow rapidly. So it’s scheme allows all common SCM operations to not hard to imagine projects growing to manage be performed in near-optimal time, while pro- millions of files, millions of changesets, and viding excellent compression and robustness. many thousands of people developing in par- Finally, I look at how a full distributed SCM allel over a timescale of decades. (Mercurial) is built on top of the Revlog scheme, some of the pitfalls we’ve surmounted At the same time, certain SCM features become in on-disk layout and I/O performance and the increasingly important. Decentralization is cru- protocols used to efficiently communicate be- cial: thousands of users with varying levels of tween repositories. network access can’t hope to efficiently cooper- ate if they’re all struggling to commit change- sets to a central repository due to locking and bandwidth concerns. So it becomes critical that 1 Introduction: the need for SCM a system be able to painlessly handle per-user scalability development branches and repeated merging with the branches of other developers. Group- ing of interdependent changes in multiple files As software projects grow larger and open into a single “atomic” changeset becomes a ne- development practices become more widely cessity for understanding and working with the adopted, the demands on source control man- huge number of changes that are introduced. agement systems are greatly increased. Large And robust, compact storage of the revision 84 • Towards a Better SCM: Revlog and Mercurial history is essential for large number of develop- seeks and network bandwidth. We have in fact ers to work with their own decentralized repos- already long since reached a point where disk itories. seeks heavily dominate many workloads, in- cluding ours. So we’ll examine the important aspects of source control from the perspective of the facet it’s most likely to eventually be con- 2 Overview of Scalability Limits strained by. and Existing Systems For simplicity, we’ll make a couple simplify- To intelligently evaluate scalability issues over ing assumptions. First, we’ll assume files of a a timescale of a decade or more, we need to constant size, or rather that performance should look at the likely trends in both project growth generally be linearly related to file size. Sec- and computer performance. Many facets of ond, we’ll assume for now that a filesystem’s computer performance have historically fol- block allocation scheme is reasonably good and lowed an exponential curve, but at different that fragmentation of single files is fairly small. rates. If we order the more important of these Third, we’ll assume that file lookup is roughly facets by rate, we might have a list like the fol- constant time for moderately-sized directories. lowing: With that in mind, let’s look at some of the the- oretical limits for the most important SCM op- • CPU speed erations as well as scalability of existing sys- • disk capacity tems, starting with operations on individual files: • memory capacity • memory bandwidth Storage compression: For compressing sin- gle file revisions, the best schemes known in- • LAN bandwidth clude SCCS-style “weaves” [8] or RCS-style deltas [5], together with a more generic com- • disk bandwidth pression algorithm like gzip. For files in a typi- • WAN bandwidth cal project, this results in average compression on the order of 10:1 to 20:1 with relatively con- • disk seek rate stant CPU overhead (see more about calculat- ing deltas below). So while CPU speed has changed by many orders of magnitude, disk seek rate has only Retrieving arbitrary file revisions: It’s easy changed slightly. In fact, seek rate is now dom- to see that we can easily achieve constant time inated by disk rotational latency and thus its (O(1) seeks) retrieval of individual file revi- rate of improvement has already run up against sions, simply by storing each revision in a sep- a wall imposed by physics. Similarly, WAN arate file in the history repository. In terms of bandwidth runs up against limits of existing big-O notation, we can do no better. This ig- communications infrastructure. nores some details of filesystem scalability, but it’s a good approximation. This is perhaps the So as technology progresses, it makes more and most fundamental operation in an SCM, so the more sense to trade off CPU power to save disk scalability of this operation is crucial. 2006 Linux Symposium, Volume Two • 85 Most SCMs, including CVS, and Bitkeeper use Checking out a project revision: Assuming delta or weave-based schemes that store all the O(1) seeks to check out file revisions, we might revisions for a given file in a single back-end expect O(files) seeks to check out all the files file. Reconstructing arbitrary versions requires in the project. But if we arrange so that file reading some or all of the history file, thus revisions are nearly consecutive, we can avoid making performance O(revisions) in disk band- most of those seeks, and instead our limit be- width and computation. As a special case, CVS comes O(files) as measured by disk bandwidth. stores the most recent revision of a given file A comparable operation is untarring an archive uncompressed, but still must read and parse the of that revision’s files. entire history file to retrieve it. As most systems must read the entirety of a SVN uses a skip-delta [7] scheme that requires file’s history to reconstruct a revision, they’ll reading O(log revisions) deltas (and seeks) to require O(total file revisions) disk bandwidth reconstruct a revision. and CPU to check out an entire project tree. SCMs that don’t visit the repository history in Unpacked git stores a back-end file for each file the filesystem’s natural layout can easily see revision, giving O(1) performance, but packed this performance degrade into O(total files) ran- git requires searching a collection of indices dom disk seeks, which can happen with SCMs that grow as the project history grows. Also backed by database engines or with systems worth noting is that git does not store an index like git which store objects by hash (unpacked) information at the file level. Thus operations or creation time (packed). like finding the previous version of a file to cal- culate a delta or the finding the common ances- Committing changes: For a change to a small tor of two file revisions can require searching set of the files in a project, we can expect to the entirety of the project’s history. be bound by O(changed files) seeks, or, as the Adding file revisions: Similarly, we can see number of changes approaches the number of that adding file revisions by the same scheme files, O(files) disk bandwidth. is also O(1) seeks. Most systems use schemes that require rewriting the entire history file, thus Systems like git can meet this target for com- making their performance decrease linearly as mit, as they simply create a new back-end file the number of revisions increase. for each file committed. Systems like CVS re- quire rewriting file histories and thus take in- Annotate file history: We could, in princi- creasingly more time as histories grow deeper. ple, incrementally calculate and store an anno- Non-distributed systems also need to deal with tated version of each version of a file we store lock contention which grows quite rapidly with and thus achieve file history annotation in O(1) the number of concurrent users, lock hold time, seeks by adding more up-front CPU and stor- and network bandwidth and latency. age overhead. Almost all systems instead take O(revision) disk bandwidth to construct anno- Finding differences in the working direc- tations, if not significantly more. While anno- tory: There are two different approaches to this tation is traditionally not performance critical, problem. One is to have the user explicitly ac- some newer merge algorithms rely on it [8]. quire write permissions on a set of files, which nicely limits the set of files that need to be ex- Next, we can look at the performance limits of amined so that we only need O(writable files) working with revisions at the project level: comparisons. Another is to allow all files to 86 • Towards a Better SCM: Revlog and Mercurial be edited and to detect changes to all managed files. As most systems require O(file revisions) to re- trieve a file version for comparison, this opera- tion can be expensive.
Recommended publications
  • Debian Developer's Reference Version 12.0, Released on 2021-09-01
    Debian Developer’s Reference Release 12.0 Developer’s Reference Team 2021-09-01 CONTENTS 1 Scope of This Document 3 2 Applying to Become a Member5 2.1 Getting started..............................................5 2.2 Debian mentors and sponsors......................................6 2.3 Registering as a Debian member.....................................6 3 Debian Developer's Duties 9 3.1 Package Maintainer's Duties.......................................9 3.1.1 Work towards the next stable release............................9 3.1.2 Maintain packages in stable .................................9 3.1.3 Manage release-critical bugs.................................. 10 3.1.4 Coordination with upstream developers............................ 10 3.2 Administrative Duties.......................................... 10 3.2.1 Maintaining your Debian information............................. 11 3.2.2 Maintaining your public key.................................. 11 3.2.3 Voting.............................................. 11 3.2.4 Going on vacation gracefully.................................. 12 3.2.5 Retiring............................................. 12 3.2.6 Returning after retirement................................... 13 4 Resources for Debian Members 15 4.1 Mailing lists............................................... 15 4.1.1 Basic rules for use....................................... 15 4.1.2 Core development mailing lists................................. 15 4.1.3 Special lists........................................... 16 4.1.4 Requesting new
    [Show full text]
  • Beginning Portable Shell Scripting from Novice to Professional
    Beginning Portable Shell Scripting From Novice to Professional Peter Seebach 10436fmfinal 1 10/23/08 10:40:24 PM Beginning Portable Shell Scripting: From Novice to Professional Copyright © 2008 by Peter Seebach All rights reserved. No part of this work may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or by any information storage or retrieval system, without the prior written permission of the copyright owner and the publisher. ISBN-13 (pbk): 978-1-4302-1043-6 ISBN-10 (pbk): 1-4302-1043-5 ISBN-13 (electronic): 978-1-4302-1044-3 ISBN-10 (electronic): 1-4302-1044-3 Printed and bound in the United States of America 9 8 7 6 5 4 3 2 1 Trademarked names may appear in this book. Rather than use a trademark symbol with every occurrence of a trademarked name, we use the names only in an editorial fashion and to the benefit of the trademark owner, with no intention of infringement of the trademark. Lead Editor: Frank Pohlmann Technical Reviewer: Gary V. Vaughan Editorial Board: Clay Andres, Steve Anglin, Ewan Buckingham, Tony Campbell, Gary Cornell, Jonathan Gennick, Michelle Lowman, Matthew Moodie, Jeffrey Pepper, Frank Pohlmann, Ben Renow-Clarke, Dominic Shakeshaft, Matt Wade, Tom Welsh Project Manager: Richard Dal Porto Copy Editor: Kim Benbow Associate Production Director: Kari Brooks-Copony Production Editor: Katie Stence Compositor: Linda Weidemann, Wolf Creek Press Proofreader: Dan Shaw Indexer: Broccoli Information Management Cover Designer: Kurt Krames Manufacturing Director: Tom Debolski Distributed to the book trade worldwide by Springer-Verlag New York, Inc., 233 Spring Street, 6th Floor, New York, NY 10013.
    [Show full text]
  • Leitfaden Für Debian-Betreuer
    Leitfaden für Debian­Betreuer Osamu Aoki, Helge Kreutzmann, and Mechtilde Stehmann August 27, 2021 Leitfaden für Debian­Betreuer by Osamu Aoki, Helge Kreutzmann, and Mechtilde Stehmann Copyright © 2014­2020 Osamu Aoki Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the ”Software”), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions: The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. THE SOFTWARE IS PROVIDED ”AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IM­ PLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE. Diese Anleitung wurde mit den nachfolgenden Dokumenten als Referenz erstellt: • »Making a Debian Package (AKA the Debmake Manual)«, Copyright © 1997 Jaldhar Vyas. • »The New­Maintainer’s Debian Packaging Howto«, Copyright © 1997 Will Lowe. • »Debian­Leitfaden für Neue Paketbetreuer«, Copyright © 1998­2002 Josip Rodin, 2005­2017 Osamu Aoki, 2010 Craig Small und 2010 Raphaël Hertzog. Die neuste Version dieser Anleitung sollte • im Paket debmake­doc und • auf der Debian­Dokumentations­Website verfügbar sein.
    [Show full text]
  • Efficient Algorithms for Comparing, Storing, and Sharing
    EFFICIENT ALGORITHMS FOR COMPARING, STORING, AND SHARING LARGE COLLECTIONS OF EVOLUTIONARY TREES A Dissertation by SUZANNE JUDE MATTHEWS Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY May 2012 Major Subject: Computer Science EFFICIENT ALGORITHMS FOR COMPARING, STORING, AND SHARING LARGE COLLECTIONS OF EVOLUTIONARY TREES A Dissertation by SUZANNE JUDE MATTHEWS Submitted to the Office of Graduate Studies of Texas A&M University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Approved by: Chair of Committee, Tiffani L. Williams Committee Members, Nancy M. Amato Jennifer L. Welch James B. Woolley Head of Department, Hank W. Walker May 2012 Major Subject: Computer Science iii ABSTRACT Efficient Algorithms for Comparing, Storing, and Sharing Large Collections of Evolutionary Trees. (May 2012) Suzanne Jude Matthews, B.S.; M.S., Rensselaer Polytechnic Institute Chair of Advisory Committee: Dr. Tiffani L. Williams Evolutionary relationships between a group of organisms are commonly summarized in a phylogenetic (or evolutionary) tree. The goal of phylogenetic inference is to infer the best tree structure that represents the relationships between a group of organisms, given a set of observations (e.g. molecular sequences). However, popular heuristics for inferring phylogenies output tens to hundreds of thousands of equally weighted candidate trees. Biologists summarize these trees into a single structure called the consensus tree. The central assumption is that the information discarded has less value than the information retained. But, what if this assumption is not true? In this dissertation, we demonstrate the value of retaining and studying tree collections.
    [Show full text]
  • Analysis of Devops Tools to Predict an Optimized Pipeline by Adding Weightage for Parameters
    International Journal of Computer Applications (0975 – 8887) Volume 181 – No. 33, December 2018 Analysis of DevOps Tools to Predict an Optimized Pipeline by Adding Weightage for Parameters R. Vaasanthi V. Prasanna Kumari, PhD S. Philip Kingston Research Scholar, HOD, MCA Project Manager SCSVMV University Rajalakshmi Engineering Infosys, Mahindra City, Kanchipuram College, Chennai Chennai ABSTRACT cloud. Now-a-days more than ever, DevOps [Development + Operations] has gained a tremendous amount of attention in 2. SCM software industry. Selecting the tools for building the DevOps Source code management (SCM) is a software tool used for pipeline is not a trivial exercise as there are plethora’s of tools development, versioning and enables team working in available in market. It requires thought, planning, and multiple locations to work together more effectively. This preferably enough time to investigate and consult other plays a vital role in increasing team’s productivity. Some of people. Unfortunately, there isn’t enough time in the day to the SCM tools, considered for this study are GIT, SVN, CVS, dig for top-rated DevOps tools and its compatibility with ClearCase, Mercurial, TFS, Monotone, Bitkeeper, Code co- other tools. Each tool has its own pros/cons and compatibility op, Darcs, Endevor, Fossil, Perforce, Rational Synergy, of integrating with other tools. The objective of this paper is Source Safe, and GNU Bazaar. Table1 consists of SCM tools to propose an approach by adding weightage to each with weightage. parameter for the curated list of the DevOps tools. 3. BUILD Keywords Build is a process that enables source code to be automatically DevOps, SCM, dependencies, compatibility and pipeline compiled into binaries including code level unit testing to ensure individual pieces of code behave as expected [4].
    [Show full text]
  • Compiling C Programs
    CSCI 2132: Software Development Lab 6: Exploring bash and C Compilation Synopsis In this lab, you will: • Customize the behaviour your bash shell • Write and compile some simple C programs • Practice the edit-compile-fix cycle using emacs and the shell • Learn to patch programs using diff and patch Contents Overview......................................................................................................2 Step 1: Login and lab setup.................................................................................3 Step 2: The name of your shell.............................................................................3 Step 3: The .bashrc file.....................................................................................3 Step 4: Customize rm ........................................................................................4 Optional Step: Install bashrc.new as your new .bashrc file............................................7 Step 5: Edit .profile .......................................................................................8 Optional Step: Install profile.new as your new .profile file.......................................... 10 Step 6: Write a simple C program.......................................................................... 11 Step 7: Explore exit codes................................................................................... 12 Step 8: diff .................................................................................................. 14 Step 9: patch................................................................................................
    [Show full text]
  • Distributed Revision Control with Mercurial
    Distributed revision control with Mercurial Bryan O’Sullivan Copyright c 2006, 2007 Bryan O’Sullivan. This material may be distributed only subject to the terms and conditions set forth in version 1.0 of the Open Publication License. Please refer to Appendix D for the license text. This book was prepared from rev 028543f67bea, dated 2008-08-20 15:27 -0700, using rev a58a611c320f of Mercurial. Contents Contents i Preface 2 0.1 This book is a work in progress ...................................... 2 0.2 About the examples in this book ..................................... 2 0.3 Colophon—this book is Free ....................................... 2 1 Introduction 3 1.1 About revision control .......................................... 3 1.1.1 Why use revision control? .................................... 3 1.1.2 The many names of revision control ............................... 4 1.2 A short history of revision control .................................... 4 1.3 Trends in revision control ......................................... 5 1.4 A few of the advantages of distributed revision control ......................... 5 1.4.1 Advantages for open source projects ............................... 6 1.4.2 Advantages for commercial projects ............................... 6 1.5 Why choose Mercurial? .......................................... 7 1.6 Mercurial compared with other tools ................................... 7 1.6.1 Subversion ............................................ 7 1.6.2 Git ................................................ 8 1.6.3
    [Show full text]
  • DVCS Or a New Way to Use Version Control Systems for Freebsd
    Brief history of VCS FreeBSD context & gures Is Arch/baz suited for FreeBSD? Mercurial to the rescue New processes & policies needed Conclusions DVCS or a new way to use Version Control Systems for FreeBSD Ollivier ROBERT <[email protected]> BSDCan 2006 Ottawa, Canada May, 12-13th, 2006 Ollivier ROBERT <[email protected]> DVCS or a new way to use Version Control Systems for FreeBSD Brief history of VCS FreeBSD context & gures Is Arch/baz suited for FreeBSD? Mercurial to the rescue New processes & policies needed Conclusions Agenda 1 Brief history of VCS 2 FreeBSD context & gures 3 Is Arch/baz suited for FreeBSD? 4 Mercurial to the rescue 5 New processes & policies needed 6 Conclusions Ollivier ROBERT <[email protected]> DVCS or a new way to use Version Control Systems for FreeBSD Brief history of VCS FreeBSD context & gures Is Arch/baz suited for FreeBSD? Mercurial to the rescue New processes & policies needed Conclusions The ancestors: SCCS, RCS File-oriented Use a subdirectory to store deltas and metadata Use lock-based architecture Support shared developments through NFS (fragile) SCCS is proprietary (System V), RCS is Open Source a SCCS clone exists: CSSC You can have a central repository with symlinks (RCS) Ollivier ROBERT <[email protected]> DVCS or a new way to use Version Control Systems for FreeBSD Brief history of VCS FreeBSD context & gures Is Arch/baz suited for FreeBSD? Mercurial to the rescue New processes & policies needed Conclusions CVS, the de facto VCS for the free world Initially written as shell wrappers over RCS then rewritten in C Centralised server Easy UI Use sandboxes to avoid locking Simple 3-way merges Can be replicated through CVSup or even rsync Extensive documentation (papers, websites, books) Free software and used everywhere (SourceForge for example) Ollivier ROBERT <[email protected]> DVCS or a new way to use Version Control Systems for FreeBSD Brief history of VCS FreeBSD context & gures Is Arch/baz suited for FreeBSD? Mercurial to the rescue New processes & policies needed Conclusions CVS annoyances and aws BUT..
    [Show full text]
  • Debianization of Predictprotein
    debian-unstable package-building git-repository lintian and quilt sources Programming challenge 10: Debianization of PredictProtein Jens Preußner Supervisor: Laszlo Kajan The Bioinformatics Lab 10th of July, 2012 debian-unstable package-building git-repository lintian and quilt sources Table of contents 1 debian-unstable Installation 2 package-building Structure of directory Building 3 git-repository Local git repository Remote git repository Initial push 4 lintian and quilt Fixing lintian warnings Working with quilt 5 References Now perform as root: $ chroot /srv/unstable/ $ adduser --home /home/<user> --uid <uid> <user> $ su - <user> debian-unstable package-building git-repository lintian and quilt sources Installing debian-unstable in L1 Required packages: debootstrap, dpkg-dev - easy Bind your home and proc, see Network filesystems debian-unstable package-building git-repository lintian and quilt sources Installing debian-unstable in L1 Required packages: debootstrap, dpkg-dev - easy Bind your home and proc, see Network filesystems Now perform as root: $ chroot /srv/unstable/ $ adduser --home /home/<user> --uid <uid> <user> $ su - <user> debian-unstable package-building git-repository lintian and quilt sources Installing debian-unstable in L1 To know for sure, that you are in wheezy/sid In new environment: $ cat /etc/debian version wheezy/sid instead of 6.0.4 debian-unstable package-building git-repository lintian and quilt sources My special case I had a perl module for uploading The module was not contained in CPAN Naming conventions:
    [Show full text]
  • C/C++ Compile Guide
    WizFi630S Guide C/C++ Compile Guide (Version 1.0.0) © 2019 WIZnet Co., Ltd. All Rights Reserved. For more information, please visit our website at http://www.wiznet.io/ © Copyright 2019 WIZnet Co., Ltd. All rights reserved. 1 WizFi630S Guide Document Revision History Date Revision Changes 2019-11-25 1.0 Release © Copyright 2019 WIZnet Co., Ltd. All rights reserved. 2 WizFi630S Guide Contents 1. Overview ................................................................................................................. 4 2. Download ................................................................................................................ 4 2.1 Prerequisites .................................................................................................. 4 2.2 Packages for Building Environment .......................................................... 4 2.3 OpenWRT Firmware Repository................................................................. 6 2.4 Menuconfig .................................................................................................... 7 3. Write C Code........................................................................................................... 7 3.1 Helloworld ...................................................................................................... 7 3.2 Make the Environment Script .................................................................... 8 4. Cross Compile ......................................................................................................... 8 4.1
    [Show full text]
  • CSE 391 Lecture 9
    CSE 391 Lecture 9 Version control with Git slides created by Ruth Anderson & Marty Stepp, images from http://git-scm.com/book/en/ http://www.cs.washington.edu/391/ 1 Problems Working Alone • Ever done one of the following? . Had code that worked, made a bunch of changes and saved it, which broke the code, and now you just want the working version back… . Accidentally deleted a critical file, hundreds of lines of code gone… . Somehow messed up the structure/contents of your code base, and want to just “undo” the crazy action you just did . Hard drive crash!!!! Everything’s gone, the day before deadline. • Possible options: . Save as (MyClass-v1.java) • Ugh. Just ugh. And now a single line change results in duplicating the entire file… 2 Problems Working in teams . Whose computer stores the "official" copy of the project? • Can we store the project files in a neutral "official" location? . Will we be able to read/write each other's changes? • Do we have the right file permissions? • Lets just email changed files back and forth! Yay! . What happens if we both try to edit the same file? • Bill just overwrote a file I worked on for 6 hours! . What happens if we make a mistake and corrupt an important file? • Is there a way to keep backups of our project files? . How do I know what code each teammate is working on? 3 Solution: Version Control • version control system: Software that tracks and manages changes to a set of files and resources. • You use version control all the time .
    [Show full text]
  • Software Development a Practical Approach!
    Software Development A Practical Approach! Hans-Petter Halvorsen https://www.halvorsen.blog https://halvorsen.blog Software Development A Practical Approach! Hans-Petter Halvorsen Software Development A Practical Approach! Hans-Petter Halvorsen Copyright © 2020 ISBN: 978-82-691106-0-9 Publisher Identifier: 978-82-691106 https://halvorsen.blog ii Preface The main goal with this document: • To give you an overview of what software engineering is • To take you beyond programming to engineering software What is Software Development? It is a complex process to develop modern and professional software today. This document tries to give a brief overview of Software Development. This document tries to focus on a practical approach regarding Software Development. So why do we need System Engineering? Here are some key factors: • Understand Customer Requirements o What does the customer needs (because they may not know it!) o Transform Customer requirements into working software • Planning o How do we reach our goals? o Will we finish within deadline? o Resources o What can go wrong? • Implementation o What kind of platforms and architecture should be used? o Split your work into manageable pieces iii • Quality and Performance o Make sure the software fulfills the customers’ needs We will learn how to build good (i.e. high quality) software, which includes: • Requirements Specification • Technical Design • Good User Experience (UX) • Improved Code Quality and Implementation • Testing • System Documentation • User Documentation • etc. You will find additional resources on this web page: http://www.halvorsen.blog/documents/programming/software_engineering/ iv Information about the author: Hans-Petter Halvorsen The author currently works at the University of South-Eastern Norway.
    [Show full text]