i
PREFACE
Teaching stochastic processes to students whose primary interests are in applications has long been a problem. On one hand, the subject can quickly become highly technical and if mathe- matical concerns are allowed to dominate there may be no time available for exploring the many interesting areas of applications. On the other hand, the treatment of stochastic calculus in a cavalier fashion leaves the student with a feeling of great uncertainty when it comes to exploring new material. Moreover, the problem has become more acute as the power of the di erential equation point of view has become more widely appreciated. In these notes, an attempt is made to resolve this dilemma with the needs of those interested in building models and designing al- gorithms for estimation and control in mind. The approach is to start with Poisson counters and to identity the Wiener process with a certain limiting form. We do not attempt to de ne the Wiener process per se. Instead, everything is done in terms of limits of jump processes. The Poisson counter and di erential equations whose right-hand sides include the di erential of Poisson counters are developed rst. This leads to the construction of a sample path representa- tions of a continuous time jump process using Poisson counters. This point of view leads to an e cient problem solving technique and permits a uni ed treatment of time varying and nonlinear problems. More importantly, it provides sound intuition for stochastic di erential equations and their uses without allowing the technicalities to dominate. In treating estimation theory, the conditional density equation is given a central role. In addition to the standard additive white noise observation models, a number of other models are developed as well. For example, the wide spread interest in problems arising in speech recognition and computer vision has in uenced the choice of topics in several places. ii Contents
iii iv CONTENTS Chapter 1
Probability Spaces
We recall a few basic ideas from probability theory and, in the process, establish some of the notation and language we will use.
1.1 Sets and Probability Spaces
A set consisting of a nite collection of elements is said to be nite. In this case the cardinality of the set is the number of elements in the set. If a set is nite, or if its elements can be put into one-to-one correspondence with the positive integers, it is said to be countable. Sets that are not countable, such as the set of real numbers between zero and one, are said to be non-denumerably in nite.IfA and B are subsets of a set S we use A ∪ B and A ∩ B to denote the union and intersection of A and B, respectively. We denote the empty set by . We use the notation {0, 1}S to indicate the set of all subsets of S with and S included. This is sometimes called the power set. It is easy to see that if S is a nite set with cardinality s then the cardinality of {0, 1}S is 2s and hence {0, 1}S is also nite. On the other hand, if S is countably in nite then, as was discussed by Cantor, {0, 1}S is non-denumerably in nite. This is a strong suggestion that one must exercise care in attempting to reason about the set of all subsets of S when S is in nite.
In 1933 the mathematician A. N. Kolmogorov described a precise mathematical model for the subject of probability. This model has come to be widely used because it is both elegant and self contained. It is not, however, necessarily easy to relate the Kolmogorov axioms to real world phenomena, nor are the axioms he used the only ones that deserve consideration. For example, the use of probability amplitudes in quantum mechanics calls for a rather di erent set of ideas. Although we will not give the arguments in detail, everything in these notes is consistent with the Kolmogorov point of view. The idea behind the Kolmogorov formalism is that one associates to the set S a collection of subsets P, called the events, such that each P ∈Phas a well de ned probability, (P ). Our intuitive ideas about probability are mostly based on the analysis of simple situations for which S is a nite set and P is the set of all subsets of S. In this case we ask that (S)=1, ( ) = 0 and that be additive on disjoint sets. This amounts to asking that