Real-Time Fault Detection and Situational Awareness for Rovers: Report on the Mars Technology Program Task

Real-time Fault Detection and Situational Awareness for Rovers: Report on the Mars Technology Program Task Richard Dearden, Thomas Willeke Frank Hutter fRIACS, QSS Group Inc.g / NASA Ames Research Center Fachbereich Informatik M.S. 269-3, Moffett Field, CA 94035, USA Technische Universitat¨ Darmstadt, Germany fdearden,[email protected] [email protected] Reid Simmons, Vandi Verma Sebastian Thrun Carnegie Mellon University Computer Science Department, Stanford University Pittsburg, PA 15213, USA 353 Serra Mall, Gates Bldg 154 freids,[email protected] Stanford, CA 94305, USA [email protected] Abstract— An increased level of autonomy is critical for 1. INTRODUCTION meeting many of the goals of advanced planetary rover mis- This paper reports the results from a study done as part of sions such as NASA’s 2009 Mars Science Lab. One impor- NASA’s Mars Technology Program entitled “Real-time Fault tant component of this is state estimation, and in particular Detection and Situational Awareness for Rovers”. The main fault detection on-board the rover. In this paper we describe goal of this project was to develop and demonstrate a fault- the results of a project funded by the Mars Technology Pro- detection technology capable of operating on-board a Mars gram at NASA, aimed at developing algorithms to meet this rover in real-time. requirement. We describe a number of particle filtering-based algorithms for state estimation which we have demonstrated Fault diagnosis is a critical task for autonomous operation of successfully on diagnosis problems including the K-9 rover systems such as spacecraft and planetary rovers. The diag- at NASA Ames Research Center and the Hyperion rover at nosis problem is to determine the state of a system over time CMU. Because of the close interaction between a rover and given a stream of observations of that system. A common its environment, traditional discrete approaches to diagnosis approach to this problem is model-based diagnosis [3], [4], are impractical for this domain. Therefore we model rover in which the overall system state is represented as an assign- subsystems as hybrid discrete/continuous systems. There are ment of a mode (a discrete state) to each component of the three major challenges to make particle filters work in this system. Such an assignment is a possible description of the domain. The first is that fault states typically have a very current state of the system if the set of models associated with low probability of occurring, so there is a risk that no sam- the modes is consistent with the observed sensor values. An ples will enter fault states. The second issue is coping with example model-based diagnosis system is Livingstone [24], the high-dimensional continuous state spaces of the hybrid which flew on the Deep Space One spacecraft as part of the system models, and the third is the severely constrained com- Remote Agent Experiment [15] in May 1999. In Living- putational power available on the rover. This means that very stone, diagnosis is done by maintaining a candidate hypothe- few samples can be used if we wish to track the system state ses (in other systems more than one hypothesis is kept) about in real time. We describe a number of approaches to rover the current state of each system component, and comparing diagnosis specifically designed to address these challenges. the candidate’s predicted behaviour with the system sensors. Traditional approaches operate on discrete models and use TABLE OF CONTENTS monitors to translate continuous sensor readings into discrete values. The monitors are typically only used once the sensor 1 INTRODUCTION readings have settled on a consistent value, and hence these 2 HYBRID DIAGNOSIS USING PARTICLE FILTERS systems cannot generally diagnose transient events. 3 RISK-SENSITIVE PARTICLE FILTERS For many applications, e.g. planetary rovers, the complex dy- 4 VARIABLE RESOLUTION PARTICLE FILTER namics of the system make reasoning with a discrete model 5 RAO-BLACKWELLIZED PARTICLE FILTERS inadequate. This is because too fine a discretization is required to accurately model the system; because the monitors 6 FUTURE WORK would need global sensor information to discretize a single sensor correctly; and because transient events must be diag- nosed. To overcome this we need to reason directly with the continuous values we receive from sensors: Our model needs 0-7803-8155-6/04/$17.00/ c 2004 IEEE to be a hybrid system. IEEEAC paper # 1209 A hybrid system consists of a set of discrete modes, which ular Kalman Filter (see for example [7]) approximates the dis- represent fault states or operational modes of the system, and tribution by a single Gaussian distribution. The particle filter a set of continuous variables which model the continuous has a number of important advantages: quantities that affect system behaviour. We will use the term state to refer to the combination of these, that is, a state is • It can be applied more easily to hybrid models. The par- a mode plus a value for each continuous variable, while the ticle filter simply maintains a discrete and continuous state mode of a system refers only to the discrete part of the state. for every sample, so the whole is a distribution over the com- For example, consider a motor. It can be idle or powered, and plete model. Banks of Kalman filters—one for each discrete has a number of fault modes such as having a faulty encoder. state—can also be used, but there is no simple way to deter- These correspond to the discrete part of the model. It also has mine the contribution of each filter to the overall distribution. continuous state, such as its running speed, the current pow- • It can represent non-Gaussian distributions. This al- ering it, and so on. In each discrete mode, there is a set of dif- lows models with non-linear continuous dynamics and non- ferential equations that describe the relationship between the Gaussian noise. As we shall see, these are important consid- various continuous values, and the way those values evolve erations for the rover domain. over time. There is also a transition function that describes • It can easily be adjusted to available computation, simply how the system moves from one mode to another. In many by increasing or decreasing the number of samples. This can cases, not all of the hybrid system will be observable. There- be done on the fly as the algorithm is running. fore, we also have an observation function that defines the likelihood of an observation given the mode and the values The essence of the particle filter approach is to simulate the of the continuous variables. All these processes are inher- behaviour of the system. Each sample predicts a future be- ently noisy, and the representation reflects this by explicitly haviour of the system in a Monte-Carlo fashion, and the sam- including noise in the continuous values, and stochastic tran- ples that match the observed system behaviour are kept, while sitions between system modes. We describe our hybrid model ones that fail to predict the observations tend to die out. We in more detail below. describe the basic particle filter algorithm in Section 2. The complex dynamics of the rover, along with its interac- The new algorithms we have developed for this project are tion with an extremely complex, poorly modeled, and noisy motivated by a number of problems with applying the stan- environment—the surface of Mars—makes it very difficult to dard particle filter to diagnosis problems: determine the true state of the rover at any point in time with certainty. To combat this, we advocate diagnosis algorithms 1. Very low prior fault probabilities: Diagnosis problems that explicitly represent uncertainty at every point, thus allow- are particularly difficult for approximation algorithms based ing control of the rover to reason about the uncertainty when on sampling because the low probabilities of transitions to selecting actions to perform. This is extremely important for fault states can lead to incorrect diagnoses because there are the rover as actions that appear good for the most likely state no samples in a state even though it has a non-zero probability may be catastrophic if the rover turns out to be in another of of occurring. its possible states. Explicitly representing uncertainty about 2. Restricted computational resources: For space applica- state also makes diagnosis easier as we automatically have a tions, computation time is often at a premium, particularly for set of alternate states when a new observation is inconsistent on-board real-time diagnosis. For this reason, diagnosis must with the most likely state or states. be as efficient as possible. 3. High dimensional state spaces: As the dimensionality of To represent uncertainty about the state of the rover, the a problem grows, the number of samples required to accu- diagnosis algorithms we will present maintain a belief rately approximate the posterior distribution grows exponen- distribution—a probability distribution over the states the tially. rover could be in. To maintain this distribution, the al- 4. Non-linear stochastic transitions and observations: gorithms will perform Bayesian belief updating. In this Many algorithms are restricted to linear models with Gaus- approach, we begin with a prior probability distribution sian noise. Our domains frequently behave non-linearly, so P(S0) = Φ that represents our initial beliefs about the state we would prefer an algorithm without this restriction. of the system, and as a sequence of observations y1:t are 5. Multimodal system behaviour: Even in a single discrete made of the system, we update the distribution to produce mode, the observations are often consistent with several val- P(StjΦ; y1:t), the probability at time t of each state given the ues for the continuous variables, and so multi-modal distri- prior and the observations.

Real-Time Fault Detection and Situational Awareness for Rovers: Report on the Mars Technology Program Task

Refactoring the Curiosity Rover's Sample Handling Architecture on Mars

(PLEXIL) for Executable Plans and Command Sequences”, International Symposium on Artificial Intelligence, Robotics and Automation in Space (Isairas), 2005

Enabling and Enhancing Astrophysical Observations with Autonomous Systems

Intelligence for Space Robotics

Survey of Command Execution Systems for NASA Spacecraft and Robots

Ad Astra Academy: Using Space Exploration to Promote Student Learning and Motivation in the City of God, Rio De Janeiro, Brazil

Development of Autonomous Actions to Enable the Next Decade of Ocean World Exploration

Hyde-A General Purpose Framework For

2020 Mars Society Convention Schedule

Autonomous Science Restart for the Planned Europa Mission with Lightweight Planning and Execution

Ssim: NASA Mars Rover Robotics Flight Software Simulation

So You Passed an Earned Value Management Government Validation - Now What?