System Software Assignment 6 — Living with a Garbage Collector

Total Page:16

File Type:pdf, Size:1020Kb

System Software Assignment 6 — Living with a Garbage Collector System Software Assignment 6 — Living with a garbage collector Exercise 1: Finalizers in the .NET system 1. Why Finalize-methods are not directly executed by the garbage collector? Finalizers can, in principle, contain code which takes a lot of time to execute or worse code that could block. Making the garbage collector execute the finalizers of the disposed objects could significantly hurt performance or bring to a complete system halt. 2. Suppose that someone wants to recycle objects after use and for this purpose resuscitates dead objects in their finalizer appending them at the end of an object pool. In the following example when objects of the class Test are collected they are resuscitated by the finalizer which puts them in a reachable structure (Pool). public class Test { protected override void Finalize() { /* we add the instance (this) to a globally * * reachable structure */ Pool.Add(this); } } Explain why this will not work without any additional support of the finalize mechanism. The finalizer is called only once for a given object and the second time the object dies it will not be resuscitated. The reason lies in the fact that after running the finalizer, the entry in the finalization queue for this object is deleted. To avoid this the user can re-insert the object in the finalization queue with the ReRegisterForFinalize call. 3. The execution order of the Finalize method of all the participants (objects in the freachable list) is not specified. Discuss why this could be a problem and propose a solution. Suppose that there are two objects o1 and o2 in the finalization queue and that these objects have references to each other and that these references are used in the Finalize method. The finalizer thread starts deletes the object o1 and executes the finalizer of o2: at this point in time there will be a null reference since the object was already removed. For this reason programmers should avoid to reference other objects in the finalizers and provide some Close or Dispose methods to clean up objects. 1 Exercise 2: Generations Generational garbage collectors rely on write barriers to detect pointers to a younger generation. The simplest write barrier implementation involves the generation of instrumented code by the compiler: before writing a pointer the compiler inserts a conditional check to detect the presence of a barrier. Describe an alternative, and more efficient, implementation of a write barrier. One well known approach is to divide the heap into cards of size 2k words (typically, k = 5). Let then every card have an associated bit in a separate bit vector. A store check simply marks the bit corresponding to the location being updated. At garbage collection time, the collector scans the bit vector and, whenever it finds a marked bit, examines all pointers in the corresponding card in the heap. st [%obj + offset], %ptr store ptr into object's field add %obj, offset, %temp calculate address of updated word sll %temp, k, %temp divide by card size 2k (shift left) st %g0, [%byte_map + %temp] clear byte in byte map For some stores, the compiler can statically know that no store check is necessary, for example, when storing an integer (assuming that integers are implemented as immediates rather than as real heap- allocated objects). 2 Exercise 3: Weak pointers Weak pointers are references to objects which allow the reference object to be destroyed by the garbage collector. In .NET if a user wants to follow this pointers he has to check if the reference is still valid. On other systems like Java and Python the object with a weak reference can be notified of the deletion of the referenced structure. Give some examples on how weak pointers could be used for .NET and Java. Which are the advantages and disadvantages of both systems? • On systems like .NET weak pointers are used to link large structures which can be easily recon- structed. One example is a cache of a file system’s hierarchy: if there is enough free memory the structure is kept and can be used but the garbage collector is allowed to remove it if some space is needed. • On systems with notification weak references can be used to keep an eye on certain structures: we can link objects without prohibiting their collection and we can be notified when they are removed. References 1 Jeffrey Richter, Garbage Collection: Automatic Memory Management in the Microsoft .NET Framework, MSDN Magazine, November 2000, http://www.cs.inf.ethz.ch/edu/37-201/files/GC in NET.pdf 2 Paul Wilson, Uniprocessor garbage collection techniques, Proceedings of the Memory Man- agement International Workshop, Saint-Malo, France, Sep 1992, ftp://ftp.cs.utexas.edu/pub/garbage/bigsurv.ps 3.
Recommended publications
  • Garbage Collection for Java Distributed Objects
    GARBAGE COLLECTION FOR JAVA DISTRIBUTED OBJECTS by Andrei A. Dãncus A Thesis Submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements of the Degree of Master of Science in Computer Science by ____________________________ Andrei A. Dãncus Date: May 2nd, 2001 Approved: ___________________________________ Dr. David Finkel, Advisor ___________________________________ Dr. Mark L. Claypool, Reader ___________________________________ Dr. Micha Hofri, Head of Department Abstract We present a distributed garbage collection algorithm for Java distributed objects using the object model provided by the Java Support for Distributed Objects (JSDA) object model and using weak references in Java. The algorithm can also be used for any other Java based distributed object models that use the stub-skeleton paradigm. Furthermore, the solution could also be applied to any language that supports weak references as a mean of interaction with the local garbage collector. We also give a formal definition and a proof of correctness for the proposed algorithm. i Acknowledgements I would like to express my gratitude to my advisor Dr. David Finkel, for his encouragement and guidance over the last two years. I also want to thank Dr. Mark Claypool for being the reader of this thesis. Thanks to Radu Teodorescu, co-author of the initial JSDA project, for reviewing portions of the JSDA Parser. ii Table of Contents 1. Introduction……………………………………………………………………………1 2. Background and Related Work………………………………………………………3 2.1 Distributed
    [Show full text]
  • Garbage Collection in Object Oriented Databases Using
    Garbage Collection in Ob ject Or iente d Databas e s Us ing Transactional Cyclic Reference Counting S Ashwin Prasan Roy S Se shadr i Avi Silb erschatz S Sudarshan 1 2 Indian Institute of Technology Bell Lab orator ie s Mumbai India Murray Hill NJ sashwincswi sce du avib elllabscom fprasans e shadr isudarshagcs eiitber netin Intro duction Ob ject or iente d databas e s OODBs unlike relational Ab stract databas e s supp ort the notion of ob ject identity and ob jects can refer to other ob jects via ob ject identi ers Requir ing the programmer to wr ite co de to track Garbage collection i s imp ortant in ob ject ob jects and the ir reference s and to delete ob jects that or iente d databas e s to f ree the programmer are no longer reference d i s error prone and leads to f rom explicitly deallo cating memory In thi s common programming errors such as memory leaks pap er we pre s ent a garbage collection al garbage ob jects that are not referre d to f rom any gor ithm calle d Transactional Cyclic Refer where and havent b een delete d and dangling ref ence Counting TCRC for ob ject or iente d erence s While the s e problems are pre s ent in tradi databas e s The algor ithm i s bas e d on a var i tional programming language s the eect of a memory ant of a reference counting algor ithm pro leak i s limite d to individual runs of programs s ince p os e d for functional programming language s all garbage i s implicitly collecte d when the program The algor ithm keeps track of auxiliary refer terminate s The problem b ecome s more s
    [Show full text]
  • Objective C Runtime Reference
    Objective C Runtime Reference Drawn-out Britt neighbour: he unscrambling his grosses sombrely and professedly. Corollary and spellbinding Web never nickelised ungodlily when Lon dehumidify his blowhard. Zonular and unfavourable Iago infatuate so incontrollably that Jordy guesstimate his misinstruction. Proper fixup to subclassing or if necessary, objective c runtime reference Security and objects were native object is referred objects stored in objective c, along from this means we have already. Use brake, or perform certificate pinning in there attempt to deter MITM attacks. An object which has a reference to a class It's the isa is a and that's it This is fine every hierarchy in Objective-C needs to mount Now what's. Use direct access control the man page. This function allows us to voluntary a reference on every self object. The exception handling code uses a header file implementing the generic parts of the Itanium EH ABI. If the method is almost in the cache, thanks to Medium Members. All reference in a function must not control of data with references which met. Understanding the Objective-C Runtime Logo Table Of Contents. Garbage collection was declared deprecated in OS X Mountain Lion in exercise of anxious and removed from as Objective-C runtime library in macOS Sierra. Objective-C Runtime Reference. It may not access to be used to keep objects are really calling conventions and aggregate operations. Thank has for putting so in effort than your posts. This will cut down on the alien of Objective C runtime information. Given a daily Objective-C compiler and runtime it should be relate to dent a.
    [Show full text]
  • An Evolutionary Study of Linux Memory Management for Fun and Profit Jian Huang, Moinuddin K
    An Evolutionary Study of Linux Memory Management for Fun and Profit Jian Huang, Moinuddin K. Qureshi, and Karsten Schwan, Georgia Institute of Technology https://www.usenix.org/conference/atc16/technical-sessions/presentation/huang This paper is included in the Proceedings of the 2016 USENIX Annual Technical Conference (USENIX ATC ’16). June 22–24, 2016 • Denver, CO, USA 978-1-931971-30-0 Open access to the Proceedings of the 2016 USENIX Annual Technical Conference (USENIX ATC ’16) is sponsored by USENIX. An Evolutionary Study of inu emory anagement for Fun and rofit Jian Huang, Moinuddin K. ureshi, Karsten Schwan Georgia Institute of Technology Astract the patches committed over the last five years from 2009 to 2015. The study covers 4587 patches across Linux We present a comprehensive and uantitative study on versions from 2.6.32.1 to 4.0-rc4. We manually label the development of the Linux memory manager. The each patch after carefully checking the patch, its descrip- study examines 4587 committed patches over the last tions, and follow-up discussions posted by developers. five years (2009-2015) since Linux version 2.6.32. In- To further understand patch distribution over memory se- sights derived from this study concern the development mantics, we build a tool called MChecker to identify the process of the virtual memory system, including its patch changes to the key functions in mm. MChecker matches distribution and patterns, and techniues for memory op- the patches with the source code to track the hot func- timizations and semantics. Specifically, we find that tions that have been updated intensively.
    [Show full text]
  • Memory Management
    Memory management The memory of a computer is a finite resource. Typical Memory programs use a lot of memory over their lifetime, but not all of it at the same time. The aim of memory management is to use that finite management resource as efficiently as possible, according to some criterion. Advanced Compiler Construction In general, programs dynamically allocate memory from two different areas: the stack and the heap. Since the Michel Schinz — 2014–04–10 management of the stack is trivial, the term memory management usually designates that of the heap. 1 2 The memory manager Explicit deallocation The memory manager is the part of the run time system in Explicit memory deallocation presents several problems: charge of managing heap memory. 1. memory can be freed too early, which leads to Its job consists in maintaining the set of free memory blocks dangling pointers — and then to data corruption, (also called objects later) and to use them to fulfill allocation crashes, security issues, etc. requests from the program. 2. memory can be freed too late — or never — which leads Memory deallocation can be either explicit or implicit: to space leaks. – it is explicit when the program asks for a block to be Due to these problems, most modern programming freed, languages are designed to provide implicit deallocation, – it is implicit when the memory manager automatically also called automatic memory management — or garbage tries to free unused blocks when it does not have collection, even though garbage collection refers to a enough free memory to satisfy an allocation request. specific kind of automatic memory management.
    [Show full text]
  • Programming Language Concepts Memory Management in Different
    Programming Language Concepts Memory management in different languages Janyl Jumadinova 13 April, 2017 I Use external software, such as the Boehm-Demers-Weiser collector (a.k.a. Boehm GC), to do garbage collection in C/C++: { use Boehm instead of traditional malloc and free in C http://hboehm.info/gc/ C I Memory management is typically manual: { the standard library functions for memory management in C, malloc and free, have become almost synonymous with manual memory management. 2/16 C I Memory management is typically manual: { the standard library functions for memory management in C, malloc and free, have become almost synonymous with manual memory management. I Use external software, such as the Boehm-Demers-Weiser collector (a.k.a. Boehm GC), to do garbage collection in C/C++: { use Boehm instead of traditional malloc and free in C http://hboehm.info/gc/ 2/16 I In addition to Boehm, we can use smart pointers as a memory management solution. C++ I The standard library functions for memory management in C++ are new and delete. I The higher abstraction level of C++ makes the bookkeeping required for manual memory management even harder than C. 3/16 C++ I The standard library functions for memory management in C++ are new and delete. I The higher abstraction level of C++ makes the bookkeeping required for manual memory management even harder than C. I In addition to Boehm, we can use smart pointers as a memory management solution. 3/16 Smart pointer: // declare a smart pointer on stack // and pass it the raw pointer SomeSmartPtr<MyObject> ptr(new MyObject()); ptr->DoSomething(); // use the object in some way // destruction of the object happens automatically C++ Raw pointer: MyClass *ptr = new MyClass(); ptr->doSomething(); delete ptr; // destroy the object.
    [Show full text]
  • Formal Semantics of Weak References
    Formal Semantics of Weak References Kevin Donnelly J. J. Hallett Assaf Kfoury Department of Computer Science Boston University {kevind,jhallett,kfoury}@cs.bu.edu Abstract of data structures that benefit from weak references are caches, im- Weak references are references that do not prevent the object they plementations of hash-consing, and memotables [3]. In each data point to from being garbage collected. Many realistic languages, structure we may wish to keep a reference to an object but also including Java, SML/NJ, and Haskell to name a few, support weak prevent that object from consuming unnecessary space. That is, we references. However, there is no generally accepted formal seman- would like the object to be garbage collected once it is no longer tics for weak references. Without such a formal semantics it be- reachable from outside the data structure despite the fact that it is comes impossible to formally prove properties of such a language reachable from within the data structure. A weak reference is the and the programs written in it. solution! We give a formal semantics for a calculus called λweak that in- cludes weak references and is derived from Morrisett, Felleisen, Difficulties with weak references. Despite its benefits in practice, and Harper’s λgc. The semantics is used to examine several issues defining formal semantics of weak references has been mostly ig- involving weak references. We use the framework to formalize the nored in the literature, perhaps partly because of their ambiguity semantics for the key/value weak references found in Haskell. Fur- and their different treatments in different programming languages.
    [Show full text]
  • 1. Mark-And-Sweep Garbage Collection W/ Weak References
    CS3214 Spring 2018 Exercise 4 Due: See website for due date. (no extensions). What to submit: See website. The theme of this exercise is automatic memory management. Please note that part 2 must be done on the rlogin machines and part 3 requires the installation of software on your personal machine. 1. Mark-and-Sweep Garbage Collection w/ Weak References The first part of the exercise involves a recreational programming exercise that is intended to deepen your understanding of how a garbage collector works. You are asked to im- plement a variant of a mark-and-sweep collector using a synthetic heap dump given as input. On the heap, there are n objects numbered 0 : : : n − 1 with sizes s0 : : : sn−1. Also given are r roots and m pointers, i.e., references stored in objects that refer to other objects. Write a program that performs a mark-and-sweep collection and outputs the total size of the live heap as well as the total amount of memory a mark-and-sweep collector would sweep if this heap were garbage collected. In addition, some of the references (which correspond to edges in the reachability graph) are labeled as “weak” references. Weak references are supported in multiple program- ming languages as a means to designate non-essential objects that a garbage collector could free if it cannot reclaim enough memory from a regular garbage collection. For instance, in Java, weak references can be created using classes in the java.lang.ref.- WeakReference package. To ensure that the garbage collector does not cause unsafe accesses to objects when a weakly reachable object is freed, weak references in Java are wrapped in WeakReference objects which contain a get() method that provides a strong reference to the underlying object.
    [Show full text]
  • Understanding, Using, and Debugging Java Reference Objects
    1 Jonathan Lawrence, IBM Understanding, Using, and Debugging Java Reference Objects © 2011, 2012 IBM Corporation 2 Important Disclaimers THE INFORMATION CONTAINED IN THIS PRESENTATION IS PROVIDED FOR INFORMATIONAL PURPOSES ONLY. WHILST EFFORTS WERE MADE TO VERIFY THE COMPLETENESS AND ACCURACY OF THE INFORMATION CONTAINED IN THIS PRESENTATION, IT IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED. ALL PERFORMANCE DATA INCLUDED IN THIS PRESENTATION HAVE BEEN GATHERED IN A CONTROLLED ENVIRONMENT. YOUR OWN TEST RESULTS MAY VARY BASED ON HARDWARE, SOFTWARE OR INFRASTRUCTURE DIFFERENCES. ALL DATA INCLUDED IN THIS PRESENTATION ARE MEANT TO BE USED ONLY AS A GUIDE. IN ADDITION, THE INFORMATION CONTAINED IN THIS PRESENTATION IS BASED ON IBM’S CURRENT PRODUCT PLANS AND STRATEGY, WHICH ARE SUBJECT TO CHANGE BY IBM, WITHOUT NOTICE. IBM AND ITS AFFILIATED COMPANIES SHALL NOT BE RESPONSIBLE FOR ANY DAMAGES ARISING OUT OF THE USE OF, OR OTHERWISE RELATED TO, THIS PRESENTATION OR ANY OTHER DOCUMENTATION. NOTHING CONTAINED IN THIS PRESENTATION IS INTENDED TO, OR SHALL HAVE THE EFFECT OF: - CREATING ANY WARRANT OR REPRESENTATION FROM IBM, ITS AFFILIATED COMPANIES OR ITS OR THEIR SUPPLIERS AND/OR LICENSORS © 2011, 2012 IBM Corporation 3 Introduction to the speaker ■ Many years experience Java development – IBM CICS and Java technical consultant – WAS & CICS integration using Java – IBM WebSphere sMash PHP implementation – CICS Dynamic Scripting – IBM Memory Analyzer ■ Recent work focus: – IBM Monitoring and Diagnostic Tools for Java – Eclipse Memory Analyzer (MAT) project ■ My contact information: – Contact me through the Java Technology Community on developerWorks (JonathanPLawrence). © 2011, 2012 IBM Corporation 4 Understanding, Using & Debugging Java References .
    [Show full text]
  • Debugging Memory Leaks in .NET [email protected] FURMANEKADAM
    Debugging Memory Leaks in .NET [email protected] HTTP://BLOG.ADAMFURMANEK.PL FURMANEKADAM 05.05.2021 DEBUGGING MEMORY LEAKS IN .NET - ADAM FURMANEK 1 About me Software Engineer, Blogger, Book Writer, Public Speaker. Author of Applied Integer Linear Programming and .NET Internals Cookbook. http://blog.adamfurmanek.pl [email protected] furmanekadam 05.05.2021 DEBUGGING MEMORY LEAKS IN .NET - ADAM FURMANEK 2 Agenda Garbage Collection: ◦ Reference counting ◦ Mark and Swep, Stopping the world, Mark and Sweep and Compact ◦ Generational hypothesis, card tables .NET GC: ◦ Roots, types ◦ SOH and LOH ◦ Finalization queue, IDisposable, Resurrection Demos: ◦ WinDBG ◦ Event handlers ◦ XML Generation ◦ WCF 05.05.2021 DEBUGGING MEMORY LEAKS IN .NET - ADAM FURMANEK 3 Theory 05.05.2021 DEBUGGING MEMORY LEAKS IN .NET - ADAM FURMANEK 4 Reference counting Each object has counter of references pointing to it. On each assignment the counter is incremented, when variable goes out of scope the counter is decremented. Can be implemented automatically by compiler. Fast and easy to implement. Cannot detect cycles. Used in COMs. Used in CPython and Swift. 05.05.2021 DEBUGGING MEMORY LEAKS IN .NET - ADAM FURMANEK 5 Mark and Sweep At various moments GC looks for all living objects and releases dead ones. Release means mark memory as free. There is no list of all alocated objects! GC doesn’t know whether there is an object (or objects) or not. If object needs to be released with special care (e.g., contains destructor), GC must know about it so it is rememberd during allocation. 05.05.2021 DEBUGGING MEMORY LEAKS IN .NET - ADAM FURMANEK 6 Stop the world GC stops all running threads.
    [Show full text]
  • Efficient Object Sampling Via Weak References
    Efficient Object Sampling Via Weak References Ole Agesen£ Alex Garthwaite VMware Sun Microsystems Laboratories 3145 Porter Drive 1NetworkDrive Palo Alto, CA 94304 Burlington, MA 01803-0902 [email protected] [email protected] ABSTRACT from the burden of explicitly reasoning about the use of memory The performance of automatic memory management may be im- and eliminate two classes of errors: proved if the policies used in allocating and collecting objects had ¯ memory leaks errors where the application loses track of al- knowledge of the lifetimes of objects. To date, approaches to the located memory without freeing it. pretenuring of objects in older generations have relied on profile- driven feedback gathered from trace runs. This feedback has been ¯ dangling references where the application frees memory while used to specialize allocation sites in a program. These approaches retaining references to it and subsequently accesses this mem- suffer from a number of limitations. We propose an alternative that ory through these references. through efficient sampling of objects allows for on-line adaption of allocation sites to improve the efficiency of the memory system. In Garbage collectors work by handling all requests to allocate mem- doing so, we make use of a facility already present in many col- ory, by ensuring that this allocated memory has an appropriately lectors such as those found in JavaTM virtual machines: weak ref- typed format, and by guaranteeing that no memory is freed until it erences. By judiciously tracking a subset of allocated objects with is proven that the application holds no references to that memory. weak references, we are able to gather the necessary statistics to Garbage collection services typically allocate objects in a heap.
    [Show full text]
  • Designing Efficient and Safe Weak References in Eiffel with Parametric
    Designing efficient and safe weak references in Eiffel with parametric types Frederic Merizen, Olivier Zendra, Dominique Colnet To cite this version: Frederic Merizen, Olivier Zendra, Dominique Colnet. Designing efficient and safe weak references in Eiffel with parametric types. [Intern report] A03-R-517 || merizen03a, 2003, pp.14. inria-00107752 HAL Id: inria-00107752 https://hal.inria.fr/inria-00107752 Submitted on 19 Oct 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Designing efficient and safe weak references in Eiffel with parametric types Frederic Merizen∗ Olivier Zendray Dominique Colnetz Emails: {merizen,zendra,colnet}@loria.fr LORIA 615 Rue du Jardin Botanique BP 101 54602 VILLERS-LES-NANCY Cedex FRANCE December 15, 2003 Abstract In this paper, we present our work on the design and implementation of weak references within the SmartEiffel project. We introduce the known concept of weak references, reminding how this peculiar kind of references can be used to optimize and fine-tune the memory behavior of programs, thus potentially speeding up their execution. We show that genericity (parametric types in Eiffel) is the key to implementing weak references in a statically-checked hence safer and more efficient way.
    [Show full text]