The Meerkat Multicomputer: Tradeoffs in Multicomputer Architecture
by
Robert C. Bedichek
A dissertation submitted in partial ful®llment of the requirements for the degree of
Doctor of Philosophy
University of Washington
1994
Approved by (Co-Chairperson of Supervisory Committee)
(Co-Chairperson of Supervisory Committee)
Program Authorized to Offer Degree
Date
In presenting this dissertation in partial ful®llment of the requirements for the Doctoral degree at the University of Washington, I agree that the Library shall make its copies freely available for inspection. I further agree that extensive copying of this dissertation is allowable only for scholarly purposes, consistent with ªfair useº as prescribed in the U.S. Copyright Law. Requests for copying or reproduction of this dissertation may be referred to University Micro®lms, 1490 Eisenhower Place, P.O. Box 975, Ann Arbor, MI 48106, to whom the author has granted ªthe right to reproduce and sell (a) copies of the manuscript in microform and/or (b) printed copies of the manuscript made from microform.º
Signature
Date
University of Washington
Abstract
The Meerkat Multicomputer: Tradeoffs in Multicomputer Architecture
by Robert C. Bedichek
Co-Chairpersons of Supervisory Committee: Professor Henry M. Levy Professor Edward D. Lazowska Department of Computer Science and Engineering
A central problem preventing the wide application of distributed memory multicomputers has been their high price, especially for small installations. High prices are due to long design times, support for scaling to thousands of nodes, and high production costs. This thesis demonstrates a new approach that combines some carefully chosen restrictions on scaling with a software-intensive methodology. We used a concrete design, Meerkat, as a vehicle for exploring the multicomputer design space. Meerkat is a distributed memory multicomputer architecture effective in the range from several to hundreds of processors. It uses a two-dimensional, passive backplane to connect nodes composed of processors, memory, and I/O devices. The interconnect is conceptually simple, inexpensive to design and build, has low latency, and provides high bandwidthon longmessages. However,itdoes notscale tothousands of processors, provide high bandwidth on short messages, or implement cache coherent shared memory. This thesis describes Meerkat's architecture, the niche that Meerkat ®lls, and the ratio- nale for our design choices. We present performance results obtained from our hardware prototype and a calibrated simulator to demonstrate that parallel numerical workloads work well on Meerkat.
Table of Contents
List of Figures vi
List of Tables viii
Chapter1: Introduction 1