Gdb, -Wall, -O of G++, What Are Their Meanings?

Total Page:16

File Type:pdf, Size:1020Kb

Gdb, -Wall, -O of G++, What Are Their Meanings? CISC3130, Spring 2013 April 26 Getting Ready: copy sample C/C++ codes into your current directory using command copy_code 1. Study the Makefile a. Note the options –ggdb, -Wall, -o of g++, what are their meanings? b. Add rules for compiling factorial.cc and cstring.cc to programs named as factorial and cstring respectively c. Add a target “all” for compiling all three programs d. Modify it so that when called with “clean” target, all three program files will be deleted 2. Many ways to go about debugging: a. Thinking about what the program is doing and making an educated guess as to what the problem is. b. Printing out variables (using cout statement, or printf in C) c. Adding assert statements in critical points in the program (for example, in the beginning of a function body, to make sure the pre-condition of the function is met) d. Using a debugger: a more interactive approach For the most tough bugs related to memory issues (bus errors, segmentation faults), one can use gdb to find on which line of code the segmentation fault occurs. Once the line of code in question has been found, it is useful to know about the values in that method, who called the method, and why (specifically) the error is occuring. Using a debugger makes finding all of this information very simple. 3. Loading a program into gdb gdb cstring gdb commands: • run • help: show help message • backtrace, where: displaying the calling stack o up: go to the caller of current function o down: go back to the function called by the current function • step: execute the next statement, • break: tell gdb which function to break in o To set a breakpoint at a line: break [filename] lineno o To set a breakpoint at a function: break func1 o To set a breakpoint in a method: break TestClass::testFunc(int) (use info breakpoints to see all breakpoints Use disable command to disable a breakpoint) • condition: add condition to a breakpoint, i.e., only break when the condition is met • print: display value of a variable • set x=3: set variable x to 3 • quit: exit gdb 4. Memory related runtime error a. A program's address space The following diagram illustrates the layout of memory space that is used by a process (a program running in the system): Stack is a certain amount of memory given for each process for storing information related to each function calls (i.e., invocations). Whenever a function call is invoked, a stack frame is created for the function call for storing the local variables of the function, the arguments passed to the functions, as well as the return address (i.e., when returning from the function, which instruction to resume execute). The stack frame is pushed onto the top of the stack upon the function call, and it will be pop off from the stack upon function exits. When there is a sequence of function calls, such as main() call func1(), which in turns call func2(), which again call func3(), the stack would contain stack frame for all currently active functions: main in the bottom of the stack, then func1, func2, and func3 will be on the top. • In gdb, you can use command where to display the current calling stack. The heap, or free store (not a heap, as in heap sort), is a chunk of memory set aside for processes to borrow memory from, dynamically. If you borrow it (using malloc in C, and new in C++), you have to return it (using free and delete respectively). If you don't, you cause memory leaks, which is a serious problem for long-running program as the memory available in the system will decrease overtime, degrading system performance. In addition to the stack and heap, you get memory for your code and static data (Text as both of them cannot be modified), and for global data (Data). b. Segmentation fault (based on wikipedia entry) A segmentation fault occurs when a program attempts to access a memory location that it is not allowed to access, or attempts to access a memory location in a way that is not allowed (for example, attempting to write to a read-only location, or to overwrite part of the operating system). A few causes of a segmentation fault can be summarized as follows: • attempting to execute a program that does not compile correctly. Note that most compilers will not output a binary given a compile-time error. • a buffer overflow, i.e., when you access an array with an index value less than 0, or larger than the size of the array minus 1 • using uninitialized pointers. • dereferencing NULL pointers. • attempting to access memory the program does not own. • attempting to alter memory the program does not own (storage violation). Generally, segmentation faults occur because: a pointer is either NULL, points to random memory (probably never initialized to anything), or points to memory that has been freed/deallocated/"deleted". How to fix segmentation fault In order to fix a segmentation fault, you need to figure out which statement causes the problem. The easiest way is to use gdb. Exercise : debug cstring.c program 1. Use gdb to run the program, and use gdb command where to figure out at which point in the execution of the program the segmentation fault occurs. 2. Sometimes the program breaks due to the fact that you are passing invalid pointer value to a library call (such as printf and getchar, etc), which might in turn call other low-level system calls. You will notice this from the stack trace printed out by where command. In such case, you want to use command up to go up to the statement in your program, check the variable's value (such as array index, pointer value) at this point for problematic memory access. 3. Fix the problem. c. Stack overflow Segmentation fault an also be caused by stack overflow. We have seen that stack is used for storing stack frames that are created for function calls, on a last-in- first-out basis. If a program has an indefinite recursive function call, stack is eventually overflown. This sometimes leads to segmentation fault problem, and sometimes leads to stack overflow problem. Exercise : use gdb to debug factorial.c 1. Set breakpoint in the function, and use display command to see the value of n and &n (the memory address of variable n). Note that by default, the address is displayed as a hexadecimal number (i.e., base 16 numbers). For example: (int *) 0xbffff450 here 0x denotes it's a base 16 number, and a,b,c,d,e,f are used to represent 10,11,12,13,14,15 respectively. There is an easy conversion between binary and hexadecimal number, therefore the latter is often used for its shorter form. 2. Now rerun the program in gdb, and write down the value of n and &n for the first 6 times that the program stops in the breakpoint. Do you see any pattern in the value of &n? How large (in number of bytes) is the stack frame for the factorial function? In this system, does the stack grow up or down in the memory? 3. Fix the program Optional: debug main.cc (which uses template class in C++). .
Recommended publications
  • University of California at Berkeley College of Engineering Department of Electrical Engineering and Computer Science
    University of California at Berkeley College of Engineering Department of Electrical Engineering and Computer Science EECS 61C, Fall 2003 Lab 2: Strings and pointers; the GDB debugger PRELIMINARY VERSION Goals To learn to use the gdb debugger to debug string and pointer programs in C. Reading Sections 5.1-5.5, in K&R GDB Reference Card (linked to class page under “resources.”) Optional: Complete GDB documentation (http://www.gnu.org/manual/gdb-5.1.1/gdb.html) Note: GDB currently only works on the following machines: • torus.cs.berkeley.edu • rhombus.cs.berkeley.edu • pentagon.cs.berkeley.edu Please ssh into one of these machines before starting the lab. Basic tasks in GDB There are two ways to start the debugger: 1. In EMACS, type M-x gdb, then type gdb <filename> 2. Run gdb <filename> from the command line The following are fundamental operations in gdb. Please make sure you know the gdb commands for the following operations before you proceed. 1. How do you run a program in gdb? 2. How do you pass arguments to a program when using gdb? 3. How do you set a breakpoint in a program? 4. How do you set a breakpoint which which only occurs when a set of conditions is true (eg when certain variables are a certain value)? 5. How do you execute the next line of C code in the program after a break? 1 6. If the next line is a function call, you'll execute the call in one step. How do you execute the C code, line by line, inside the function call? 7.
    [Show full text]
  • Compiling and Debugging Basics
    Compiling and Debugging Basics Service CoSiNus IMFT P. Elyakime H. Neau A. Pedrono A. Stoukov Avril 2015 Outline ● Compilers available at IMFT? (Fortran, C and C++) ● Good practices ● Debugging Why? Compilation errors and warning Run time errors and wrong results Fortran specificities C/C++ specificities ● Basic introduction to gdb, valgrind and TotalView IMFT - CoSiNus 2 Compilers on linux platforms ● Gnu compilers: gcc, g++, gfortran ● Intel compilers ( 2 licenses INPT): icc, icpc, ifort ● PGI compiler fortran only (2 licenses INPT): pgf77, pgf90 ● Wrappers mpich2 for MPI codes: mpicc, mpicxx, mpif90 IMFT - CoSiNus 3 Installation ● Gnu compilers: included in linux package (Ubuntu 12.04 LTS, gcc/gfortran version 4.6.3) ● Intel and PGI compilers installed on a centralized server (/PRODCOM), to use it: source /PRODCOM/bin/config.sh # in bash source /PRODCOM/bin/config.csh # in csh/tcsh ● Wrappers mpich2 installed on PRODCOM: FORTRAN : mympi intel # (or pgi or gnu) C/C++ : mympi intel # (or gnu) IMFT - CoSiNus 4 Good practices • Avoid too long source files! • Use a makefile if you have more than one file to compile • In Fortran : ” implicit none” mandatory at the beginning of each program, module and subroutine! • Use compiler’s check options IMFT - CoSiNus 5 Why talk about debugging ? Yesterday, my program was running well: % gfortran myprog.f90 % ./a.out % vmax= 3.3e-2 And today: % gfortran myprog.f90 % ./a.out % Segmentation fault Yet I have not changed anything… Because black magic is not the reason most often, debugging could be helpful! (If you really think that the cause of your problem is evil, no need to apply to CoSiNus, we are not God!) IMFT - CoSiNus 6 Debugging Methodical process to find and fix flows in a code.
    [Show full text]
  • Lecture 15 15.1 Paging
    CMPSCI 377 Operating Systems Fall 2009 Lecture 15 Lecturer: Emery Berger Scribe: Bruno Silva,Jim Partan 15.1 Paging In recent lectures, we have been discussing virtual memory. The valid addresses in a process' virtual address space correspond to actual data or code somewhere in the system, either in physical memory or on the disk. Since physical memory is fast and is a limited resource, we use the physical memory as a cache for the disk (another way of saying this is that the physical memory is \backed by" the disk, just as the L1 cache is \backed by" the L2 cache). Just as with any cache, we need to specify our policies for when to read a page into physical memory, when to evict a page from physical memory, and when to write a page from physical memory back to the disk. 15.1.1 Reading Pages into Physical Memory For reading, most operating systems use demand paging. This means that pages are only read from the disk into physical memory when they are needed. In the page table, there is a resident status bit, which says whether or not a valid page resides in physical memory. If the MMU tries to get a physical page number for a valid page which is not resident in physical memory, it issues a pagefault to the operating system. The OS then loads that page from disk, and then returns to the MMU to finish the translation.1 In addition, many operating systems make some use of pre-fetching, which is called pre-paging when used for pages.
    [Show full text]
  • NASM for Linux
    1 NASM for Linux Microprocessors II 2 NASM for Linux Microprocessors II NASM Package nasm package available as source or as executables Typically /usr/bin/nasm and /usr/bin/ndisasm Assembly NASM Linux requires elf format for object files ELF = Executable and Linking Format Typical header size = 330h bytes for nasm −f elf [−o <output>] <filename> Linking Linux Object files can be linked with gcc gcc [−options] <filename.o> [other_files.o] Disassembly View executable as 32-bit assembly code ndisasm −e 330h –b 32 a.out | less objdump –d a.out | less Fall 2007 Hadassah College Dr. Martin Land Fall 2007 Hadassah College Dr. Martin Land 3 NASM for Linux Microprocessors II 4 NASM for Linux Microprocessors II gcc Stages Example — 1 Stages of Gnu C compilation factorial2.c #include <math.h> main #include <stdio.h> sets j = 12 main() Source Translation Assembly Object Executable calls factorial 10,000,000 times Code Unit Code Code File { int times; prog.c prog.i prog.s prog.o a.out int i , j = 12; preprocess compile assemble link for (times = 0 ; times < 10000000 ; ++times){ i = factorial(j); gcc -E } gcc -S printf("%d\n",i); gcc -c } gcc int factorial(n) int n; factorial calculates n! by recursion { if (n == 0) return 1; else return n * factorial(n-1); } Fall 2007 Hadassah College Dr. Martin Land Fall 2007 Hadassah College Dr. Martin Land 5 NASM for Linux Microprocessors II 6 NASM for Linux Microprocessors II Example — 2 Example — 3 ~/gcc$ gcc factorial2.c Compile program as separate files produces executable a.out factorial2a.c ~/gcc$ time a.out main() { 479001600 int times; int i,j=12; for (times = 0 ; times < 10000000 ; ++times){ real 0m9.281s i = factorial(j); factorial2b.c } #include <math.h> printf("%d\n",i); user 0m8.339s #include <stdio.h> } sys 0m0.008s int factorial(n) int n; { Program a.out runs in 8.339 seconds on 300 MHz if (n == 0) Pentium II return 1; else return n * factorial(n-1); } Fall 2007 Hadassah College Dr.
    [Show full text]
  • Memory Management
    Memory management Virtual address space ● Each process in a multi-tasking OS runs in its own memory sandbox called the virtual address space. ● In 32-bit mode this is a 4GB block of memory addresses. ● These virtual addresses are mapped to physical memory by page tables, which are maintained by the operating system kernel and consulted by the processor. ● Each process has its own set of page tables. ● Once virtual addresses are enabled, they apply to all software running in the machine, including the kernel itself. ● Thus a portion of the virtual address space must be reserved to the kernel Kernel and user space ● Kernel might not use 1 GB much physical memory. ● It has that portion of address space available to map whatever physical memory it wishes. ● Kernel space is flagged in the page tables as exclusive to privileged code (ring 2 or lower), hence a page fault is triggered if user-mode programs try to touch it. ● In Linux, kernel space is constantly present and maps the same physical memory in all processes. ● Kernel code and data are always addressable, ready to handle interrupts or system calls at any time. ● By contrast, the mapping for the user-mode portion of the address space changes whenever a process switch happens Kernel virtual address space ● Kernel address space is the area above CONFIG_PAGE_OFFSET. ● For 32-bit, this is configurable at kernel build time. The kernel can be given a different amount of address space as desired. ● Two kinds of addresses in kernel virtual address space – Kernel logical address – Kernel virtual address Kernel logical address ● Allocated with kmalloc() ● Holds all the kernel data structures ● Can never be swapped out ● Virtual addresses are a fixed offset from their physical addresses.
    [Show full text]
  • Benchmarking the Stack Trace Analysis Tool for Bluegene/L
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Juelich Shared Electronic Resources John von Neumann Institute for Computing Benchmarking the Stack Trace Analysis Tool for BlueGene/L Gregory L. Lee, Dong H. Ahn, Dorian C. Arnold, Bronis R. de Supinski, Barton P. Miller, Martin Schulz published in Parallel Computing: Architectures, Algorithms and Applications , C. Bischof, M. B¨ucker, P. Gibbon, G.R. Joubert, T. Lippert, B. Mohr, F. Peters (Eds.), John von Neumann Institute for Computing, J¨ulich, NIC Series, Vol. 38, ISBN 978-3-9810843-4-4, pp. 621-628, 2007. Reprinted in: Advances in Parallel Computing, Volume 15, ISSN 0927-5452, ISBN 978-1-58603-796-3 (IOS Press), 2008. c 2007 by John von Neumann Institute for Computing Permission to make digital or hard copies of portions of this work for personal or classroom use is granted provided that the copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise requires prior specific permission by the publisher mentioned above. http://www.fz-juelich.de/nic-series/volume38 Benchmarking the Stack Trace Analysis Tool for BlueGene/L Gregory L. Lee1, Dong H. Ahn1, Dorian C. Arnold2, Bronis R. de Supinski1, Barton P. Miller2, and Martin Schulz1 1 Computation Directorate Lawrence Livermore National Laboratory, Livermore, California, U.S.A. E-mail: {lee218, ahn1, bronis, schulzm}@llnl.gov 2 Computer Sciences Department University of Wisconsin, Madison, Wisconsin, U.S.A. E-mail: {darnold, bart}@cs.wisc.edu We present STATBench, an emulator of a scalable, lightweight, and effective tool to help debug extreme-scale parallel applications, the Stack Trace Analysis Tool (STAT).
    [Show full text]
  • ENCM 335 Fall 2018 Lab 3 for the Week of October 1
    page 1 of 11 ENCM 335 Fall 2018 Lab 3 for the Week of October 1 Steve Norman Department of Electrical & Computer Engineering University of Calgary September 2018 Lab instructions and other documents for ENCM 335 can be found at https://people.ucalgary.ca/~norman/encm335fall2018/ Administrative details Each student must hand in their own assignment Later in the course, you may be allowed to work in pairs on some assignments. Due Dates The Due Date for this assignment is 3:30pm Friday, October 5. The Late Due Date is 3:30pm Tuesday, October 9 (not Monday the 8th, because that is a holiday). The penalty for handing in an assignment after the Due Date but before the Late Due Date is 3 marks. In other words, X/Y becomes (X{3)/Y if the assignment is late. There will be no credit for assignments turned in after the Late Due Date; they will be returned unmarked. Marking scheme A 4 marks B 8 marks C unmarked D 2 marks E 8 marks F 2 marks G 4 marks total 28 marks How to package and hand in your assignments Please see the information in the Lab 1 instructions. Function interface comments, continued For Lab 2, you were asked to read a document called \Function Interface Com- ments". ENCM 335 Fall 2018 Lab 3 page 2 of 11 Figure 1: Sketch of a program with a function to find the average element value of an array of ints. The function can be given only one argument, and the function is supposed to work correctly for whatever number of elements the array has.
    [Show full text]
  • Theory of Operating Systems
    Exam Review ● booting ● I/O hardware, DMA, I/O software ● device drivers ● virtual memory 1 booting ● hardware is configured to execute a program in Read-Only Memory (ROM) or flash memory: – the BIOS, basic I/O system – UEFI is the current equivalent ● BIOS knows how to access all the disk drives, chooses one to boot (perhaps with user assistance), loads the first sector (512 bytes) into memory, and starts to execute it (jmp) – first sector often includes a partition table 2 I/O hardware and DMA ● electronics, and sometimes moving parts, e.g. for disks or printers ● status registers and control registers read and set by CPU software – registers can directly control hardware, or be read and set by the device controller ● device controller can be instructed to do Direct Memory Access to transfer data to and from the CPU's memory – DMA typically uses physical addresses 3 Structure of I/O software ● user programs request I/O: read/write, send/recv, etc – daemons and servers work autonomously ● device-independent software converts the request to a device-dependent operation, and also handles requests from device drivers – e.g file systems and protocol stacks – e.g. servers in Minix ● one device driver may manage multiple devices – and handles interrupts ● buffer management required! 4 Device Drivers ● configure the device or device controller – i.e. must know specifics about the hardware ● respond to I/O requests from higher-level software: read, write, ioctl ● respond to interrupts, usually by reading status registers, writing to control registers, and transferring data (either via DMA, or by reading and writing data registers) 5 Memory Management ● linear array of memory locations ● memory is either divided into fixed-sized units (e.g.
    [Show full text]
  • Theory of Operating Systems
    Exam Review ● booting ● I/O hardware, DMA, I/O software ● device drivers ● memory (i.e. address space) management ● virtual memory 1 booting ● hardware is configured to execute a program in Read-Only Memory (ROM) or flash memory: – the BIOS, basic I/O system – UEFI is the current equivalent ● BIOS knows how to access all the disk drives, chooses one to boot (perhaps with user assistance), loads the first sector (512 bytes) into memory, and starts to execute it (jmp) – first sector often includes a partition table 2 I/O hardware and DMA ● electronics, and sometimes (disks, printers) moving parts ● status registers and control registers read and set by CPU software – registers can directly control hardware, or be read and set by the device controller ● device controller can be instructed to do Direct Memory Access to transfer data to and from the CPU's memory – DMA typically uses physical addresses 3 Structure of I/O software ● user programs request I/O: read/write, send/recv, etc – daemons and servers work autonomously ● device-independent software converts the request to a device-dependent operation, and also handles requests from device drivers – e.g file systems and protocol stacks – e.g. servers in Minix ● one device driver may manage multiple devices – and handles interrupts ● buffer management required! 4 Device Drivers ● configure the device or device controller – i.e. must know specifics about the hardware ● respond to I/O requests from higher-level software: read, write, ioctl ● respond to interrupts, usually by reading status registers, writing to control registers, and transferring data (either via DMA, or by reading and writing data registers) 5 Memory Management ● linear array of memory locations ● memory is either divided into fixed-sized units (e.g.
    [Show full text]
  • Warrior1: a Performance Sanitizer for C++ Arxiv:2010.09583V1 [Cs.SE]
    Warrior1: A Performance Sanitizer for C++ Nadav Rotem, Lee Howes, David Goldblatt Facebook, Inc. October 20, 2020 1 Abstract buffer, copies the data and deletes the old buffer. As the vector grows the buffer size expands in a geometric se- This paper presents Warrior1, a tool that detects perfor- quence. Constructing a vector of 10 elements in a loop mance anti-patterns in C++ libraries. Many programs results in 5 calls to ’malloc’ and 4 calls to ’free’. These are slowed down by many small inefficiencies. Large- operations are relatively expensive. Moreover, the 4 dif- scale C++ applications are large, complex, and devel- ferent buffers pollute the cache and make the program run oped by large groups of engineers over a long period of slower. time, which makes the task of identifying inefficiencies One way to optimize the performance of this code is to difficult. Warrior1 was designed to detect the numerous call the ’reserve’ method of vector. This method will small performance issues that are the result of inefficient grow the underlying storage of the vector just once and use of C++ libraries. The tool detects performance anti- allow non-allocating growth of the vector up to the speci- patterns such as map double-lookup, vector reallocation, fied size. The vector growth reallocation is a well known short lived objects, and lambda object capture by value. problem, and there are many other patterns of inefficiency, Warrior1 is implemented as an instrumented C++ stan- some of which are described in section 3.4. dard library and an off-line diagnostics tool.
    [Show full text]
  • A Study on Faults and Error Propagation in the Linux Operating System
    A Thesis for the Degree of Ph.D. in Engineering A Study on Faults and Error Propagation in the Linux Operating System March 2016 Graduate School of Science and Technology Keio University Takeshi Yoshimura Acknowledgement I would like to thank my adviser, Prof. Kenji Kono. His guidance helped me in all the time of research. I would like to express my sincere gratitude to Prof. Hiroshi Yamada. This dissertation would not have been possible without their advice and encouragement. I am also grateful to the members of my thesis committee: Prof. Shingo Takada, Prof. Hiroaki Saito, and Prof. Kenichi Kourai. This dissertation was greatly improved by their invaluable feedback. During my Ph.D., I did an internship at NEC. I enjoyed working with Dr. Masato Asahara and the opportunity had a significant impact on my research skills. He also taught me LDA, which is the core of the fault study in this dis- sertation. I am also thankful to my colleagues in the sslab. Their surprising enthusiasm and skills have always inspired me. I appreciate the financial supports from the Research Fellowships of the Japan Society for the Promotion of Science for Young Scientists and the Core Re- search for Evolutional Science and Technology of Japan Science and Technology Agency. Finally, I would like to thank my family, my parents, sister for their support all these years. Without their support and encouragement, many accomplishments in my life including this dissertation would not have been possible. 2 Abstract A Study on Faults and Error Propagation in the Linux Operating System Takeshi Yoshimura Operating systems are crucial for application reliability.
    [Show full text]
  • GDB Debugger CS 211 – Programming Practicum GDB Debugger
    GDB Debugger CS 211 – Programming Practicum GDB Debugger • Part of the GNU Software Tools • Many Debuggers in IDEs are just Wrappers for GDB • Huge amount of commands in GBD, many options available to help debug your program • Even to most basic knowledge can save you lots of time GDB Debugger • Step 1 – Compile your program using the –g flag • gcc –g mazeflawed.c • The –g flag creates additional information for the executable that gdb used to convert machine code lines back to the source code line of the original program GDB Debugger • Step 2 - Open the GDB debugger • gdb a.out • Give the name of the executable created by the compiler GDB Debugger • Step 3 – Run your program with any command line arguments • run <command-line-arguments> • If no command line arguments are needed, just type in run. • For the mazeflawed.c program, you need the datafile name, so: • run mazedata1.txt GDB Debugger • Step 4 – Let GDB tell you on which line the Segmentation Fault occurs Program received signal SIGSEGV, Segmentation fault. 0x00000000004007a9 in main (argc=2, argv=0x7fffffffca78) at mazeflawed.c:52 52 m1.arr[i][j] = '.’; • The above states the Segmentation fault occurred at line 52. • It also shows the code at line 52 GDB Debugger • Step 5 – The list command will display more lines of code • Use the help command to find out about more commands. • GDB has so many commands that no one knows them all. • Most people learn a few key commands. GDB Debugger • Step 6 – The print command will display values stored in variables 52 m1.arr[i][j] = ‘.’; print i $1 = 14 • The print i command shows that the variable i contains the value of 14 at the time of the segmentation fault.
    [Show full text]