A Build System for Software Development in Robotic Academic Collaborative Environments

Total Page:16

File Type:pdf, Size:1020Kb

A Build System for Software Development in Robotic Academic Collaborative Environments A Build System for Software Development in Robotic Academic Collaborative Environments Daniele E. Domenichelli∗, Silvio Traversaro∗, Luca Muratorey, Alessio Rocchiy, Francesco Nori∗, Lorenzo Natale∗ ∗iCub Facility, Istituto Italiano di Tecnologia (IIT), Via Morego 30, 16163 Genoa, Italy yDepartment of Advanced Robotics, Istituto Italiano di Tecnologia (IIT), Via Morego 30, 16163 Genoa, Italy fdaniele.domenichelli, silvio.traversaro, alessio.rocchi, luca.muratore, francesco.nori, [email protected] Abstract—The software development cycle in the robotic re- and in particular the iCub software project1 [1]. This article search environment is hectic and heavily driven by project or is organized as follows: paper deadlines. Developers have only little time available for packaging the C/C++ code they write, develop and maintain the • We analyze the academic software development cycle and build system and continuous integration tools. will point the issues that arise in this environment (Sec- Research projects are joint efforts of different groups working tion II), emphasizing the critical role of the build system remotely and asynchronously. The typical solution is to rely (Section III), and describing the concept of “superbuild” on binary distributions and/or large repositories that compile explaining their importance (Section IV). all software and dependencies. This approach hinders code sharing and reuse and often leads to repositories whose inter- • We will list some of the tools commonly used in software dependencies are difficult to manage. development to handle dependencies (Section V), and we Following many years of experience leading software integra- will state our objectives and explain the reasons why tion is research projects we developed YCM, a tool that supports we decided not to introduce a new tool in our workflow our best practices addressing these issues. YCM is a set of CMake (Section VI). scripts that provides (1) build system support: to develop and package software libraries and components, and (2) superbuild • We will introduce YCM the set of CMake modules that deployment: to prepare and distribute sets of packages in source we developed, and we will show how YCM superbuilds form as a single meta build. In this paper we describe YCM are currently being used in four different European- and report on our experience adopting it as a tool for managing funded research projects by different teams, and success- software repositories in large research projects. fully promote reuse of code and collaboration between Index Terms—Build system; software development cycle; soft- ware reuse; superbuild; researchers from different groups (Section VII). II. ACADEMIC SOFTWARE DEVELOPMENT CYCLE I. INTRODUCTION It is an indisputable fact that lots of software created for In robotic research environments the software development research is wasted. Academic research products are scientific cycle is quite hectic and often heavily driven by project or papers. Data produced using the software is more important paper deadlines. The “publish or perish” model that drives the than software itself. Researchers may know how to write research has a sensible impact on the re-usability of the code code, but they are not software engineers [2] and they lack developed for research purpose on two different levels: on a experience in coding using established best practices. They micro scale, the research products are small tools that are not work individually most of the time and have to be at the sufficiently engineered to be reused in large projects; on a same time developers, designers, testers, release managers and larger scale, the products are large conglomerates of software integration managers. Moreover they have academic duties with strong coupling (i.e. lots of references between files in such as teaching, tutoring and taking classes [3]. different modules) and weak cohesion. In both cases reusing The continuous renewal of Ph.D. and Grad Students without the code is not easy. experience in software development weighs heavily on the This situation is worsened by the fact that most robotics software development process. A review process, besides software is either developed in C/C++ or has extensive depen- improving the overall quality of the code [4], could contribute dencies written in this languages, for which no standard cross- to improve the developers’ software skills. Unfortunately in platform tools for packaging and distribution is available, as most of the cases, this kind of process does not exist for opposed to higher level languages. academic software. The goal of our research is to foster re-usability and the All these factors introduce several different effects in the overall quality and maintainability of software developed for software products developed for research purposes. academic research purposes and its build system. It is moti- vated by our experience in managing large C/C++ software 1The iCub (http://www.icub.org/) is an open source humanoid robot adopted in more than 30 research groups world-wide, whose software is the result projects in collaborative robotic research environments (con- of the work of a large community (1.8M LOC, 170 contributors and 103 sisting of several research groups based on different countries), developers, source: OpenHub, https://www.openhub.net/p/robotology) On a micro scale, the research driven software development The pitfall in this approach is that when the funded project is cycle heavily impacts the self contained tools, algorithms, and over, the parts of the code produced that should be reusable are libraries released. The goal of the development phase is often supposed to be re-engineered and moved out of the repository. to produce a working prototype as fast as possible, with the Unfortunately this rarely happens, due to lack of resources intention to clean up the code later. Unfortunately experience and to the need to move on to new projects. If some code shows that the Agile software development principle known as is required for a later project, even if just a small part, “re- LeBlanc’s Law “Later equals never” [5], [6] is almost always usability” is achieved by depending on the whole software true for academic software, unless some commercial interests collection, or by duplicating the code in the new repository. intervene. The opposite approach is to let each team use their favorite While the code related to the subject of the study itself might tools and to store their code using their favorite SCM and host. be good quality, the code surrounding it is often low quality, This approach apparently has the advantage that every group written in a hurry, and good practices are often neglected. can continue its work as they did before without spending Researchers have only little time available for packaging the time to learn how the other groups work. Unfortunately when code they write and the build system maintenance is something it is time to integrate the efforts of different groups, the effect that goes often beyond their skills and interests. As a result, the is exactly the opposite. Setup of all the software required product might perform “well enough” for writing an article, becomes complicated and time consuming, due to the variety but when this is published – due to the tight research schedule of tools required, to the lack of testing and documentation [8], driven by projects, Ph.D. programs or paper deadlines – the and to the fact that build instructions are not always complete author is likely to abandon the code and start working on or up to date. Similarly, updates of the whole stack are painful, something new. The code is never properly re-engineered, and sometimes dependencies are added, and the other developers it is hardly re-usable as a component of a larger system. are not notified, so they have to figure it out by themselves. On a macro scale, the research driven software development The learning curve for newcomers is steep and it could take cycle affects the whole software ecosystem around funded days before they can be operational. research projects where different researchers, teams, and insti- Another defect of this kind of approach is that it makes tutions work remotely and asynchronously on different parts harder to give visibility to the software developed for a funded of a big project. project, since code and documentation are scattered around Trying to achieve integration and interoperability between different repositories and hosts. software developed by different groups is definitely a push In this paper we suggest a hybrid approach that uses a meta- towards better practices, but that’s not enough. repository whose only job is to fetch external repositories, When a funded project starts, one of the first decision to where the actual source code is stored, and build it eventually take is where to store the source code, which tools will be with the assistance of an external tool to handle dependencies used for development, and which operating systems, IDEs, (we called this approach “superbuild”). and compilers will be supported. Users have very different sets of skills, they work in III. BUILD SYSTEMS IN ACADEMIC SOFTWARE different fields and have different roles in the organizations: The build system of a software project converts the source students, researchers, developers, team leaders. Each of them code into libraries and executables. It has an important role in will use the software with different goals: to produce some any software project [9] and it needs to evolve together with data for a thesis or for an article, to implement some new the source code. A build system that grows uncontrolled or algorithm, to show some demo to the projects reviewers. that becomes too hard to understand and to maintain can slow Research groups are usually interested in enforcing their down the development process [10] or even stop the developers group identity, while coordinators need to show an integrated from making any progress [6].
Recommended publications
  • Developer Survey
    Developer Survey Questions requiring a response are in r ed . Questions in which a response is NOT required are in blue. This survey is a critical element of the developers workshop. We are using it to capture nuts and bolts information about codes within the community so that we can assess the landscape before the workshop and use this information to drive the discussions. Please collaborate to provide only one submission per code and submit your response using the online survey: h ttps://ucdavis.co1.qualtrics.com/jfe/form/SV_57wtv4gpuaowTsh Basic Information Code identification 1. What is the name of the code? [small text box] 2. Who are the primary authors/maintainers? [medium text box] 3. URL of webpage for the code (if different than the version control repository) [small text box] 4. URL of version control repository (if public) [small text box] Software 1. Which license(s) do you use? Select all that apply. a. Apache license b. BSD license c. GNU General Public License d. GNU Lesser General Public License e. MIT license f. Mozilla Public License g. Common Development and Distribution License h. Eclipse Public License i. Other. Please specify [small text box] j. No license 2. What programming language(s) is your code currently written in? Select all that apply a. Fortran 77 b. Fortran 90 or later c. C d. C++ e. Go f. Python g. Julia h. Matlab i. Other. Please specify. [small text box] 3. List the primary (high-level) code dependencies (e.g., PETSc, deal.ii, FEniCS) [medium text box] 4. List any additional (low-level) code dependencies (e.g., MPI, NetCDF, HDF5) [medium text box] 5.
    [Show full text]
  • Updating Systems and Adding Software in Oracle® Solaris 11.4
    Updating Systems and Adding Software ® in Oracle Solaris 11.4 Part No: E60979 November 2020 Updating Systems and Adding Software in Oracle Solaris 11.4 Part No: E60979 Copyright © 2007, 2020, Oracle and/or its affiliates. License Restrictions Warranty/Consequential Damages Disclaimer This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. Warranty Disclaimer The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. Restricted Rights Notice If this is software or related documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, then the following notice is applicable: U.S. GOVERNMENT END USERS: Oracle programs (including any operating system, integrated software, any programs embedded, installed or activated on delivered hardware, and modifications of such programs) and Oracle computer documentation or other Oracle data delivered to or accessed by U.S. Government end users are "commercial
    [Show full text]
  • Embedded Linux Systems with the Yocto Project™
    OPEN SOURCE SOFTWARE DEVELOPMENT SERIES Embedded Linux Systems with the Yocto Project" FREE SAMPLE CHAPTER SHARE WITH OTHERS �f, � � � � Embedded Linux Systems with the Yocto ProjectTM This page intentionally left blank Embedded Linux Systems with the Yocto ProjectTM Rudolf J. Streif Boston • Columbus • Indianapolis • New York • San Francisco • Amsterdam • Cape Town Dubai • London • Madrid • Milan • Munich • Paris • Montreal • Toronto • Delhi • Mexico City São Paulo • Sidney • Hong Kong • Seoul • Singapore • Taipei • Tokyo Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and the publisher was aware of a trademark claim, the designations have been printed with initial capital letters or in all capitals. The author and publisher have taken care in the preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. For information about buying this title in bulk quantities, or for special sales opportunities (which may include electronic versions; custom cover designs; and content particular to your business, training goals, marketing focus, or branding interests), please contact our corporate sales depart- ment at [email protected] or (800) 382-3419. For government sales inquiries, please contact [email protected]. For questions about sales outside the U.S., please contact [email protected]. Visit us on the Web: informit.com Cataloging-in-Publication Data is on file with the Library of Congress.
    [Show full text]
  • Project Report
    Project Report An Extension of CodeFeedr Team 1Up Project Report An Extension of CodeFeedr by Roald van der Heijden, Matthijs van Wijngaarden, Wouter Zonneveld in order to obtain the degree of Bachelor of Science in Computer Science at the Delft University of Technology, to be defended publicly on the 5th of February 2020, 10:30 Project duration: November 11, 2019 – January 31, 2020 Thesis committee: Dr. G. Gousios, Client, TU Delft Dr. A. Katsifodimos, Supervisor, TU Delft Dr. H. Wang, Bachelor Project Coordinator, TU Delft An electronic version of this thesis is available at http://repository.tudelft.nl/. Contents 1 Introduction 4 2 CodeFeedr 5 2.1 Overview.........................................5 2.2 Architecture........................................5 2.3 Dependencies.......................................6 3 Research Report 7 3.1 Overview.........................................7 3.2 Problem Description...................................7 3.3 Design Goals.......................................8 3.4 Requirement Analysis...................................9 3.5 Development Methodology................................ 10 3.6 Related Work....................................... 11 3.7 Design Choices...................................... 12 4 Software Architecture 15 4.1 Design Patterns...................................... 15 4.2 Plugins.......................................... 15 4.3 SQL REPL......................................... 17 5 Implementation 18 5.1 Plugins.......................................... 18 5.2 SQL REPL........................................
    [Show full text]
  • Lmod Or How to Protect Your Sanity from Dependency Hell
    Lmod or how to protect your sanity from dependency hell Steffen Müthing Interdisciplinary Center for Scientific Computing Heidelberg University Dune User Meeting Aachen September 26, 2013 1 Steffen Müthing | Lmod or how to protect your sanity from dependency hell ⇒ Have to keep around multiple versions of MPI, BLAS, ParMetis, ALUGrid, UGGrid, . The Issue • DUNE has a number of non-packaged dependencies • Some of those contain libraries that are bound to a compiler / MPI version • You have to support multiple compilers / MPIs • Library developer (we try to be good about this...) • Clusters with different compiler / MPI combinations • Easily switch between release / debug version of MPI (only with dynamic linking) • Use GCC in general, but switch to clang during the “fix compilation errors” stage 2 Steffen Müthing | Lmod or how to protect your sanity from dependency hell The Issue • DUNE has a number of non-packaged dependencies • Some of those contain libraries that are bound to a compiler / MPI version • You have to support multiple compilers / MPIs • Library developer (we try to be good about this...) • Clusters with different compiler / MPI combinations • Easily switch between release / debug version of MPI (only with dynamic linking) • Use GCC in general, but switch to clang during the “fix compilation errors” stage ⇒ Have to keep around multiple versions of MPI, BLAS, ParMetis, ALUGrid, UGGrid, . 2 Steffen Müthing | Lmod or how to protect your sanity from dependency hell Problems • Do I already have ALUGrid for MPICH? • If yes, where on earth did I put it? • Did I really build it with the correct dependencies? • Why does my build fail? Do all the libraries in my nice --with= actually work together? 3 Steffen Müthing | Lmod or how to protect your sanity from dependency hell • Look around for something that’s already there • Distribution package managers (APT, rpm, Portage, MacPorts, homebrew,.
    [Show full text]
  • Diplomat: Using Delegations to Protect Community Repositories
    Diplomat: Using Delegations to Protect Community Repositories Trishank Karthik Kuppusamy, Santiago Torres-Arias, Vladimir Diaz, and Justin Cappos, New York University https://www.usenix.org/conference/nsdi16/technical-sessions/presentation/kuppusamy This paper is included in the Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’16). March 16–18, 2016 • Santa Clara, CA, USA ISBN 978-1-931971-29-4 Open access to the Proceedings of the 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI ’16) is sponsored by USENIX. Diplomat: Using Delegations to Protect Community Repositories Trishank Karthik Kuppusamy Santiago Torres-Arias Vladimir Diaz Justin Cappos Tandon School of Engineering, New York University Abstract software. Major repositories run by Adobe, Apache, Debian, Fedora, FreeBSD, Gentoo, GitHub, GNU Sa- Community repositories, such as Docker Hub, PyPI, vannah, Linux, Microsoft, npm, Opera, PHP, RedHat, and RubyGems, are bustling marketplaces that distribute RubyGems, SourceForge, and WordPress repositories software. Even though these repositories use common have all been compromised at least once [4,5,7,27,28,30, software signing techniques (e.g., GPG and TLS), at- 31,35,36,39–41,48,59,61,62,67,70,79,80,82,86,87,90]. tackers can still publish malicious packages after a server For example, a compromised SourceForge repository compromise. This is mainly because a community repos- mirror located in Korea distributed a malicious ver- itory must have immediate access to signing keys in or- sion of phpMyAdmin, a popular database administration der to certify the large number of new projects that are tool [79]. The modified version allowed attackers to gain registered each day.
    [Show full text]
  • With Yocto/Openembedded
    PORTING NEW CODE TO RISC-V WITH YOCTO/OPENEMBEDDED Martin Maas ([email protected]) 1st RISC-V Workshop, January 15, 2015 Monterey, CA WHY WE NEED A LINUX DISTRIBUTION • To build an application for RISC-V, you need to: – Download and build the RISC-V toolchain + Linux – Download, patch and build application + dependencies – Create an image and run it in QEMU or on hardware • Problems with this approach: – Error-prone: Easy to corrupt FS or get a step wrong – Reproducibility: Others can’t easily reuse your work – Rigidity: If a dependency changes, need to do it all over • We need a Linux distribution! – Automatic build process with dependency tracking – Ability to distribute binary packages and SDKs 2 RISCV-POKY: A PORT OF THE YOCTO PROJECT • We ported the Yocto Project – Official Linux Foundation Workgroup, supported by a large number of industry partners – Part I: Collection of hundreds of recipes (scripts that describe how to build packages for different platforms), shared with OpenEmbedded project – Part II: Bitbake, a parallel build system that takes recipes and fetches, patches, cross-compiles and produces packages (RPM/DEB), images, SDKs, etc. • Focus on build process and customizability 3 GETTING STARTED WITH RISCV-POKY • Let’s build a full Linux system including the GCC toolchain, Linux, QEMU + a large set of packages (including bash, ssh, python, perl, apt, wget,…) • Step I: Clone riscv-poky: git clone [email protected]:ucb-bar/riscv-poky.git • Step II: Set up the build system: source oe-init-build-env • Step III: Build an image (may
    [Show full text]
  • ROOT Package Management: “Lazy Install” Approach
    ROOT package management: “lazy install” approach Oksana Shadura ROOT Monday meeting Outline ● How we can improve artifact management (“lazy-install”) system for ROOT ● How to organise dependency management for ROOT ● Improvements to ROOT CMake build system ● Use cases for installing artifacts in the same ROOT session Goals ● Familiarize ROOT team with our planned work ● Explain key misunderstandings ● Give a technical overview of root-get ● Explain how root-get and cmake can work in synergy Non Goals We are not planning to replace CMake No change to the default build system of ROOT No duplication of functionality We are planning to “fill empty holes” for CMake General overview Manifest - why we need it? ● Easy to write ● Easy to parse, while CMakeLists.txt is impossible to parse ● Collect information from ROOT’s dependencies + from “builtin dependencies” + OS dependencies + external packages to be plugged in ROOT (to be resolved after using DAG) ● It can be easily exported back as a CMakeLists.txt ● It can have extra data elements [not only what is in CMakeLists.txt, but store extra info] ○ Dependencies description (github links, semantic versioning) ■ url: "ssh://[email protected]/Greeter.git", ■ versions: Version(1,0,0)..<Version(2,0,0) Manifest is a “dump” of status of build system (BS), where root-get is just a helper for BS Manifest - Sample Usage scenarios and benefits of manifest files: LLVM/Clang LLVM use CMake as a LLVMBuild utility that organize LLVM in a hierarchy of manifest files of components to be used by build system llvm-build, that is responsible for loading, verifying, and manipulating the project's component data.
    [Show full text]
  • Designdocument < Hellasgrid/Egiumdrepository
    Table of Contents EGI Repository Design Document...................................................................................................................1 Executive Summary................................................................................................................................1 Glossary..................................................................................................................................................1 Table of Contents....................................................................................................................................1 Introduction and Requirements and Objectives......................................................................................1 Operations on the repository............................................................................................................2 Contents of the repository................................................................................................................2 Supported Projects............................................................................................................................2 Users and User's Roles in the Repository...............................................................................................2 Repository Contents for End Users..................................................................................................3 Repository Administrators...............................................................................................................3
    [Show full text]
  • Kace Asset Management Guide
    Kace Asset Management Guide Metaphorical Lucio always evangelising his synarchy if Moishe is capitular or writhe sufferably. Hymenopterous Sibyl always politicks his decimators if Aub is alabastrine or site photogenically. Which Rodolph breezed so fined that Swen inculcated her speculator? With kace customers run quest kace systems management, united states merchant marine academy. Maintenance costs have been annualized over several period on three years to extract their harvest over time. Comparing suites from sap helps clients bring their asset management solutions provider hardinge inc, kace asset management are stored in lifecycle management feature relies on active assets. Learn how asset management leaders are ensuring a reliable and sustainable supply chain, Microsoft and Symantec all require the installation of an agent to perform OS and application discovery tasks, and Symantec problems. You need to unselect a conflict with. Using Custom fields Within software Asset where in Dell KACE, etc. Are you sure you as to delete your attachment? All four evaluated solutions include: antioch university as well as it opens a solution helps track down systems management? An integrated mechanism to report problems and service requests enables prompt response to end users and reduces administrative roadblocks. Product was easy to use. To analyze the opportunities in the market for stakeholders by identifying the high growth segments. Best Practices in Lifecycle Management: Comparing Suites from Dell, and complete security. Although Microsoft does not natively include vulnerability scans, this in proper way detracts from our investment in supporting operating systems regardless of equipment brand. Past performance is just poor indicator of future performance.
    [Show full text]
  • Possible Directions for Improving Dependency Versioning in R by Jeroen Ooms
    CONTRIBUTED RESEARCH ARTICLES 197 Possible Directions for Improving Dependency Versioning in R by Jeroen Ooms Abstract One of the most powerful features of R is its infrastructure for contributed code. The built-in package manager and complementary repositories provide a great system for development and exchange of code, and have played an important role in the growth of the platform towards the de-facto standard in statistical computing that it is today. However, the number of packages on CRAN and other repositories has increased beyond what might have been foreseen, and is revealing some limitations of the current design. One such problem is the general lack of dependency versioning in the infrastructure. This paper explores this problem in greater detail, and suggests approaches taken by other open source communities that might work for R as well. Three use cases are defined that exemplify the issue, and illustrate how improving this aspect of package management could increase reliability while supporting further growth of the R community. Package management in R One of the most powerful features of R is its infrastructure for contributed code (Fox, 2009). The base R software suite that is released several times per year ships with the base and recommended packages and provides a solid foundation for statistical computing. However, most R users will quickly resort to the package manager and install packages contributed by other users. By default, these packages are installed from the “Comprehensive R Archive Network” (CRAN), featuring over 4300 contributed packages as of 2013. In addition, other repositories like BioConductor (Gentleman et al., 2004) and Github (Dabbish et al., 2012) are hosting a respectable number of packages as well.
    [Show full text]
  • Roles in a Networked Software Development Ecosystem: a Case Study in Github Patrick Wagstrom IBM TJ Watson Research Center Hawthorne, NY, [email protected]
    University of Nebraska - Lincoln DigitalCommons@University of Nebraska - Lincoln CSE Technical reports Computer Science and Engineering, Department of 2012 Roles in a Networked Software Development Ecosystem: A Case Study in GitHub Patrick Wagstrom IBM TJ Watson Research Center Hawthorne, NY, [email protected] Corey Jergensen University of Nebraska-Lincoln, [email protected] Anita Sarma University of Nebraska-Lincoln, [email protected] Follow this and additional works at: http://digitalcommons.unl.edu/csetechreports Wagstrom, Patrick; Jergensen, Corey; and Sarma, Anita, "Roles in a Networked Software Development Ecosystem: A Case Study in GitHub" (2012). CSE Technical reports. 149. http://digitalcommons.unl.edu/csetechreports/149 This Article is brought to you for free and open access by the Computer Science and Engineering, Department of at DigitalCommons@University of Nebraska - Lincoln. It has been accepted for inclusion in CSE Technical reports by an authorized administrator of DigitalCommons@University of Nebraska - Lincoln. Roles in a Networked Software Development Ecosystem: A Case Study in GitHub Patrick Wagstrom Corey Jergensen, Anita Sarma IBM TJ Watson Research Center Computer Science and Engineering Department 19 Skyline Dr University of Nebraska, Lincoln Hawthorne, NY, USA 10532 Lincoln, NE, USA 68588 [email protected] {cjergens,asarma}@cse.unl.edu ABSTRACT tiple languages and utilizing multiple different development Open source software development has evolved beyond single frameworks and libraries, For example, development of a web projects into complex networked ecosystems of projects that application may use the JavaScript library jQuery for the user share portions of their code, social norms, and developer commu- interaction, Ruby on Rails for the backend processing, and Rack nities.
    [Show full text]