UNIT 6C Organizing Data: Trees and Graphs Trees

UNIT 6C Organizing Data: Trees and Graphs 15110 Principles of Computing, 1 Carnegie Mellon University - CORTINA Trees 15110 Principles of Computing, 2 Carnegie Mellon University - CORTINA 1 Trees • A tree is a hierarchical data structure. – Every tree has a node called the root . – Each node can have 1 or more nodes as children . – A node that has no children is called a leaf . • A common tree in computing is a binary tree . – A binary tree consists of nodes that have at most 2 children. – A complete binary tree has the maximum number of nodes on each of its levels. • Applications: data compression, file storage, game trees 15110 Principles of Computing, 3 Carnegie Mellon University - CORTINA Binary Tree 84 41 96 24 50 98 13 37 The root has the data value 84. There are 4 leaves in this binary tree: 13, 37, 50, 98. This binary tree is not complete. 15110 Principles of Computing, 4 Carnegie Mellon University - CORTINA 2 Binary Trees: Implementation • One common implementation of binary trees uses nodes like a linked list does. – Instead of having a “next” pointer, each node has a “left” pointer and a “right” pointer. • We could also use an array. 45 Level 1 45 31 70 19 38 86 Level 1 Level 2 Level 3 31 70 Level 2 19 38 86 Level 3 15110 Principles of Computing, 5 Carnegie Mellon University - CORTINA Binary Search Tree (BST) • A binary search tree (BST) is a binary tree such that – All nodes to the left of any node have data values less than that node – All nodes to the right of any node have data values greater than that node 15110 Principles of Computing, 6 Carnegie Mellon University - CORTINA 3 Inserting into a BST • For each data value that you wish to insert into the binary search tree: – Start at the root and compare the new data value with the root. – If it is less, move down left. If it is greater, move down right. – Repeat on the child of the root until you end up in a position that has no node. – Insert a new node at this empty position. 15110 Principles of Computing, 7 Carnegie Mellon University - CORTINA Example • Insert: 84, 41, 96, 24, 37, 50, 13, 98 84 41 96 24 50 98 13 37 15110 Principles of Computing, 8 Carnegie Mellon University - CORTINA 4 Using a BST • How would you search for an element in a BST? 84 41 96 24 50 98 13 37 15110 Principles of Computing, 9 Carnegie Mellon University - CORTINA Max-Heaps • A max-heap is a binary tree such that – The largest data value is in the root – For every node in the max-heap, its children contain smaller data. – The max-heap is an almost-complete binary tree. • An almost-complete binary tree is a binary tree such that every level of the tree has the maximum number of nodes possible except possibly the last level, where its nodes are attached as far left as possible. 15110 Principles of Computing, 10 Carnegie Mellon University - CORTINA 5 These are not heaps! Why? 84 68 41 56 53 26 24 10 38 72 30 13 7 15110 Principles of Computing, 11 Carnegie Mellon University - CORTINA Adding data to a Max-Heap Insert new data into next available tree position so the tree remains (almost) complete. 84 Exchange data with its 62 parent(s) (if necessary) 41 3. 56 until the tree is restored to a heap. 24 106241 13 38 2. 13 7 6210 1. insert here 15110 Principles of Computing, 12 Carnegie Mellon University - CORTINA 6 Building a Max-Heap • To build a max-heap, just insert each element one at a time into the heap using the previous algorithm. 15110 Principles of Computing, 13 Carnegie Mellon University - CORTINA Removing Data from a Max-Heap Remove root (maximum value) and replace it with "last" node 10 of the lowest level. 2. 8462 Exchange data with its larger child (if necessary) 624110 3. 56 until the tree is restored to a heap. 24 4110 13 38 1. move last node to root 13 7 10 15110 Principles of Computing, 14 Carnegie Mellon University - CORTINA 7 BSTs vs. Max-Heaps • Which tree is designed for easier searching? • Which tree is designed for retrieving the maximum value quickly? • A heap is guaranteed to be “balanced” (complete or almost-complete). What about a BST? 15110 Principles of Computing, 15 Carnegie Mellon University - CORTINA BSTs vs Max-Heaps • BST with n elements – Insert and Search: • worst case O(log n) if tree is “balanced” • worst case O(n) in general since tree could have one node per level • Max-Heap with n elements – Insert and Remove-Max • worst case O(log n) since tree is always “balanced” – Find-Max • worst case O(1) since max is always at the root 15110 Principles of Computing, 16 Carnegie Mellon University - CORTINA 8 Graphs 15110 Principles of Computing, 17 Carnegie Mellon University - CORTINA 15110 Principles of Computing, 18 Carnegie Mellon University - CORTINA 9 Graphs • A graph is a data structure that consists of a set of vertices and a set of edges connecting pairs of the vertices. – A graph doesn’t have a root, per se. – A vertex can be connected to any number of other vertices using edges. – An edge may be bidirectional or directed (one-way). – An edge may have a weight on it that indicates a cost for traveling over that edge in the graph. • Applications: computer networks, transportation systems, social relationships 15110 Principles of Computing, 19 Carnegie Mellon University - CORTINA A Weighted Graph 12 Will 7 0 1 2 3 4 5 6 Erie 6 0 0 10 0 8 7 0 0 Scr 10 7 SC 1 10 0 12 7 0 0 0 4 3 8 5 Pitt 9 0 12 0 6 0 7 5 Phil 2 7Harr 11 3 8 7 6 0 9 4 0 4 7 0 0 9 0 011 0 1 2 3 4 5 6 5 0 0 7 4 0 0 3 Pitt. Erie Will. S.C. Harr. Scr. Phil. 6 0 0 5 0 11 3 0 vertices edges 15110 Principles of Computing, 20 Carnegie Mellon University - CORTINA 10 Original Graph 12 Will. 7 Erie 7 6 Scr. 10 4 8 S.C. 5 3 Pitt 9 Phil. 7 Harr. 11 15110 Principles of Computing, 21 Carnegie Mellon University - CORTINA Minimal Spanning Tree Will. Erie 7 Scr. 4 8 S.C. 5 3 Pitt Phil. 7 Harr. The minimum total cost to connect all vertices using edges from the original graph without using cycles. (minimum total cost = 34) 15110 Principles of Computing, 22 Carnegie Mellon University - CORTINA 11 Original Graph 12 Will. 7 Erie 7 6 Scr. 10 4 8 S.C. 5 3 Pitt 9 Phil. 7 Harr. 11 15110 Principles of Computing, 23 Carnegie Mellon University - CORTINA Shortest Paths from Pittsburgh 14 10 Will. 12 Erie 6 Scr. 4 10 S.C. 8 8 3 Pitt 7 Phil. 15 7 Harr. The total costs of the shortest path from Pittsburgh to every other location using only edges from the original graph. 15110 Principles of Computing, 24 Carnegie Mellon University - CORTINA 12.

UNIT 6C Organizing Data: Trees and Graphs Trees

Trees • Binary Trees • Newer Types of Tree Structures • M-Ary Trees

Balanced Trees Part One

CSE 326: Data Structures Binary Search Trees

Assignment 3: Kdtree ______Due June 4, 11:59 PM

Splay Trees Last Changed: January 28, 2017

Binary Search Tree

Search Trees for Strings a Balanced Binary Search Tree Is a Powerful Data Structure That Stores a Set of Objects and Supports Many Operations Including

Trees, Binary Search Trees, Heaps & Applications Dr. Chris Bourke

CMSC 420: Lecture 7 Randomized Search Structures: Treaps and Skip Lists

Red-Black Trees

Simple Balanced Binary Search Trees

Balanced Binary Search Trees – AVL Trees, 2-3 Trees, B-Trees