Multivariate Statistics Chapter 2: Multivariate distributions and inference
Pedro Galeano Departamento de Estad´ıstica Universidad Carlos III de Madrid [email protected]
Course 2017/2018
Master in Mathematical Engineering
Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 3 Master in Mathematical Engineering 1 / 106 1 Introduction
2 Basic concepts
3 Multivariate distributions
5 Hypothesis testing
Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 3 Master in Mathematical Engineering 2 / 106 Introduction
Multivariate statistical analysis is concerned with analysing and understanding data in more than one (high) dimensions.
Therefore, as in Chapter 1, we assume that we are given a set of n observations of p univariate random variables x1,..., xp. The p univariate random variables can be summarized in a multivariate random 0 p variable x = (x1,..., xp) defined in R . In this chapter we give an introduction to the basic probability tools associated with the multivariate random variable x that are useful in multivariate statistical analysis.
Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 3 Master in Mathematical Engineering 3 / 106 Introduction
In particular, we present:
I the basic probability tools used to describe a multivariate random variable, inclu- ding the concepts of marginal and conditional distributions and the concept of independence;
I the mean vector, the covariance matrix and the correlation matrix of a mul- tivariate random variable and their counterparts for marginal and conditional distributions;
I the basic techniques needed to derive the distribution of transformations with special emphasis on linear transformations;
I several multivariate distributions, including the multivariate Gaussian distribution, along with most of its companion distributions and other interesting alternatives; and
I statistical inference for multivariate samples, including parameter estimation and hypothesis testing.
Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 3 Master in Mathematical Engineering 4 / 106 Basic concepts
We can say that we have the joint distribution of a multivariate random variable when the following are specified:
p 1 The sample space of the possible values, which, in general, is a subset of R .
2 The probabilities of each possible result of the sample space.
We say that a p-dimensional random variable is discrete when each of the p scalar variables that comprise it are discrete as well.
Analogously, we say that the variable is continuous if its components are conti- nuous as well.
Otherwise, the variable is mixed.
Pedro Galeano (Course 2017/2018) Multivariate Statistics - Chapter 3 Master in Mathematical Engineering 5 / 106 Basic concepts
0 Let x = (x1,..., xp) be a multivariate random variable.
0 0 00 The cumulative distribution function (cdf) of x at a point x = x1 ,..., xp , 0 is denoted by Fx x and is given by: