Math 1 Lab Day 4 Statistics

Total Page:16

File Type:pdf, Size:1020Kb

Math 1 Lab Day 4 Statistics Day 11.3 - Range & Measures of Central Tendency One of the topics to be covered in statistics is the analysis of the data. We will do that by looking at the : RANGE, MEAN, MEDIAN, AND MODE. RANGE The RANGE of a set of data is the difference between the greatest and the least values. DATA 6, 8, 3, 2, 1, 5, 9, 4 The range is MEAN The MEAN , or average, denoted by 푥̅, of a set of numbers is found by adding the numbers and then dividing that total by the number of data items in the set. Example 1: DATA 80, 92, 85, 91, 95, 88 The mean is Example 2: Mary got 80, 78, 85, and 82 on four math tests. What score must she get on the fifth test to get an 80 average? Example 3: The average grade for 26 students in Mr. Feheley’s math lab is 82. The average For the 24 students in Mr. Bill’s math lab is 86. What is the mean grade for all the students in both classes? Mr. Feheley : Mr. Bill: 퐹푒ℎ푒푙푒푦 푠푢푚+퐵푙푙 푠푢푚 Combined mean : 푡표푡푎푙 푠푡푢푑푒푛푡푠 Example 4: Find the mean of the data in this table: DATA 3.5 3.4 3.7 3.8 4.0 FREQUENCY 6 3 2 3 1 Example 5: Frank’s scores on his first 4 tests are 65, 72, 83, and 91. What does Frank need to earn on the fifth test to earn an average of 80? MEDIAN The MEDIAN is the middle value when a set of numbers is arranged in order from least to greatest or greatest to least. if there is an odd number of items the median is the middle number Example: 2, 5, 7, 19, 20 The median is: if there is an even number of items, the median is the average of the two middle numbers Example: 3, 4, 6, 7, 8, 9 The median is: **** it is possible that the median could be a number not in the set ( as in above example) **** the median means that 50% scored below and 50% scored above this number Example 1: Find the median of (3, 6, 7, 2, 4, 8, 3 ) Example 2: Find the median of (3, 7, 8, 1, 3, 8, 4, 6) Example 3: INTERVAL FREQUENCY CUMULATIVE Determine the median interval based FREQUENCY on the table to the left. 40 - 49 2 2 50 - 59 2 4 60 - 69 10 14 70 - 79 15 29 80 - 89 11 40 90 - 99 8 48 MODE The MODE of a data set is any data that has the greatest frequency. Example 1: { 3, 6, 7, 2, 4, 8, 3 } The mode is: Example 2: In the preceding table the mode interval is: **** If each data value has a frequency of 1 or all have the same frequency you can say that the set has no mode. **** Data can have 2 or 3 or more modes if the data values have the same high frequency. Two modes are called bimodal sets ************************************************************************************ SOME IMPORTANT FACTS The mean, median and mode are called measures of central tendencies because they are giving us an idea of where the "center" of the data lies. The mode is useful when dealing with data that is not numerical, such as the color of eyes, type of cars, etc. The mean is influenced by very large or very small numbers while the median is not. A "good" or "fair" test is one where the median and the mean are close in value. A value that is widely separated from the rest of the data in a data set is called and outlier. This value will influence the mean more than the median. It should be taken into account when you analyze the measures of central tendencies. Before you determine the range, mean, median & mode, put the data in numerical order. ************************************************************************************ Ex. Mr. Bill had the following cell phone bills over the course of a year: 45, 45, 45, 45, 45, 120, 90, 45, 45, 45, 45, 45. His cell phone company called and offered him a plan for $50 a month to save him money based on his average cell phone bill. He refused, based on the median and mode of his bills. What was the mean, median, and mode for his bills. Should he have accepted the offer? Why or why not? 2) Given the scores: 3, 3, 4, 4, 4, 5, 5. Which of the following statements is true? 1) mean > median 2) mean > mode 3) median < mode 4) mean = median 3) The prices of seven race cars sold last week are listed in the table below. What is the mean value of these race cars, in dollars? What is the median value of these race cars, in dollars? State which of these measures of central tendency best represents the value of the seven race cars. Justify your answer. Day 11.3 Homework 1. Which value of x will make the mean of the data equal to 6? _______ { 3, 3, 4, 5, 6, 7, 8, x } a) 8 b) 12 c) 16 d) 48 2. If the average ( mean) of 4, -8, 6 and x is 3 then x is equal to _______ a) -6 b) 1 c) 4 d) 10 3. Jane's test grades were 90, 75, 90, 83, 87. What is the median. _______ 4. Which data has 54 as its median. _______ a) 10, 30, 40, 45, 50, 58, 60, 65, 75, 90 b) 25, 25, 30, 35, 50, 56, 65, 70, 75, 80 c) 30, 40, 45, 47, 54, 55, 60, 65, 75, 85 d) 10, 20, 30, 40, 50, 54, 60, 70, 80, 90 5. Which statement is ALWAYS true ? _______ a) The median is a number in the data set. b) The median is also the maximum value. c) The median and the mean are the same. d) 50% of the data is at or below the median. 6. The accompanying graph shows the high temperatures in Elmira, New York, for a 5-day period in January. Which statement describes the data? (1) median = mode (3) mean < mode (2) median = mean (4) mean = mode 7. INTERVAL FREQUENCY Which interval contains the ________ 14 - 16 2 Median? 17 - 19 6 20 - 22 9 23 - 25 8 8. VOTER SURVEYS This bar graph shows the number of 700 600 people in favor of a proposition in 4 500 400 different surveys. What is the mean 300 200 number of favorable responses in the 4 100 NUMBER OF VOTERS OF NUMBER 0 surveys? #1 #2 #3 #4 SURVEY NUMBER 9. Alex earned scores of 60, 74, 82, 87, 87, and 94 on his first six algebra tests. What is the relationship between the measures of central tendency of these scores? (1) median < mode < mean (3) mode < median < mean (2) mean < mode < median (4) mean < median < mode 10. Jerry's quiz grade are 91, 83, 92, 85, 85, 90 , 94. His teacher says he may use either his mean or median for his report card grade. Jerry wants the highest grade. Which should he choose and what will the grade be? (Show work) 11. The average number of CD's owned by John, Sue, Juan and Kate is 114. John owns 200, Sue owns 121 and Kate owns 53. How many CD's does Juan own? 12. The average (mean) of Fred's 6 test scores is 75. His teacher will drop his lowest grade of 55. What is the mean of the remaining 5 test scores? .
Recommended publications
  • 5. the Student T Distribution
    Virtual Laboratories > 4. Special Distributions > 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 5. The Student t Distribution In this section we will study a distribution that has special importance in statistics. In particular, this distribution will arise in the study of a standardized version of the sample mean when the underlying distribution is normal. The Probability Density Function Suppose that Z has the standard normal distribution, V has the chi-squared distribution with n degrees of freedom, and that Z and V are independent. Let Z T= √V/n In the following exercise, you will show that T has probability density function given by −(n +1) /2 Γ((n + 1) / 2) t2 f(t)= 1 + , t∈ℝ ( n ) √n π Γ(n / 2) 1. Show that T has the given probability density function by using the following steps. n a. Show first that the conditional distribution of T given V=v is normal with mean 0 a nd variance v . b. Use (a) to find the joint probability density function of (T,V). c. Integrate the joint probability density function in (b) with respect to v to find the probability density function of T. The distribution of T is known as the Student t distribution with n degree of freedom. The distribution is well defined for any n > 0, but in practice, only positive integer values of n are of interest. This distribution was first studied by William Gosset, who published under the pseudonym Student. In addition to supplying the proof, Exercise 1 provides a good way of thinking of the t distribution: the t distribution arises when the variance of a mean 0 normal distribution is randomized in a certain way.
    [Show full text]
  • Chapter 8 Fundamental Sampling Distributions And
    CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS 8.1 Random Sampling pling procedure, it is desirable to choose a random sample in the sense that the observations are made The basic idea of the statistical inference is that we independently and at random. are allowed to draw inferences or conclusions about a Random Sample population based on the statistics computed from the sample data so that we could infer something about Let X1;X2;:::;Xn be n independent random variables, the parameters and obtain more information about the each having the same probability distribution f (x). population. Thus we must make sure that the samples Define X1;X2;:::;Xn to be a random sample of size must be good representatives of the population and n from the population f (x) and write its joint proba- pay attention on the sampling bias and variability to bility distribution as ensure the validity of statistical inference. f (x1;x2;:::;xn) = f (x1) f (x2) f (xn): ··· 8.2 Some Important Statistics It is important to measure the center and the variabil- ity of the population. For the purpose of the inference, we study the following measures regarding to the cen- ter and the variability. 8.2.1 Location Measures of a Sample The most commonly used statistics for measuring the center of a set of data, arranged in order of mag- nitude, are the sample mean, sample median, and sample mode. Let X1;X2;:::;Xn represent n random variables. Sample Mean To calculate the average, or mean, add all values, then Bias divide by the number of individuals.
    [Show full text]
  • Chapter 4: Central Tendency and Dispersion
    Central Tendency and Dispersion 4 In this chapter, you can learn • how the values of the cases on a single variable can be summarized using measures of central tendency and measures of dispersion; • how the central tendency can be described using statistics such as the mode, median, and mean; • how the dispersion of scores on a variable can be described using statistics such as a percent distribution, minimum, maximum, range, and standard deviation along with a few others; and • how a variable’s level of measurement determines what measures of central tendency and dispersion to use. Schooling, Politics, and Life After Death Once again, we will use some questions about 1980 GSS young adults as opportunities to explain and demonstrate the statistics introduced in the chapter: • Among the 1980 GSS young adults, are there both believers and nonbelievers in a life after death? Which is the more common view? • On a seven-attribute political party allegiance variable anchored at one end by “strong Democrat” and at the other by “strong Republican,” what was the most Democratic attribute used by any of the 1980 GSS young adults? The most Republican attribute? If we put all 1980 95 96 PART II :: DESCRIPTIVE STATISTICS GSS young adults in order from the strongest Democrat to the strongest Republican, what is the political party affiliation of the person in the middle? • What was the average number of years of schooling completed at the time of the survey by 1980 GSS young adults? Were most of these twentysomethings pretty close to the average on
    [Show full text]
  • Notes Mean, Median, Mode & Range
    Notes Mean, Median, Mode & Range How Do You Use Mode, Median, Mean, and Range to Describe Data? There are many ways to describe the characteristics of a set of data. The mode, median, and mean are all called measures of central tendency. These measures of central tendency and range are described in the table below. The mode of a set of data Use the mode to show which describes which value occurs value in a set of data occurs most frequently. If two or more most often. For the set numbers occur the same number {1, 1, 2, 3, 5, 6, 10}, of times and occur more often the mode is 1 because it occurs Mode than all the other numbers in the most frequently. set, those numbers are all modes for the data set. If each number in the set occurs the same number of times, the set of data has no mode. The median of a set of data Use the median to show which describes what value is in the number in a set of data is in the middle if the set is ordered from middle when the numbers are greatest to least or from least to listed in order. greatest. If there are an even For the set {1, 1, 2, 3, 5, 6, 10}, number of values, the median is the median is 3 because it is in the average of the two middle the middle when the numbers are Median values. Half of the values are listed in order. greater than the median, and half of the values are less than the median.
    [Show full text]
  • Volatility Modeling Using the Student's T Distribution
    Volatility Modeling Using the Student’s t Distribution Maria S. Heracleous Dissertation submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Economics Aris Spanos, Chair Richard Ashley Raman Kumar Anya McGuirk Dennis Yang August 29, 2003 Blacksburg, Virginia Keywords: Student’s t Distribution, Multivariate GARCH, VAR, Exchange Rates Copyright 2003, Maria S. Heracleous Volatility Modeling Using the Student’s t Distribution Maria S. Heracleous (ABSTRACT) Over the last twenty years or so the Dynamic Volatility literature has produced a wealth of uni- variateandmultivariateGARCHtypemodels.Whiletheunivariatemodelshavebeenrelatively successful in empirical studies, they suffer from a number of weaknesses, such as unverifiable param- eter restrictions, existence of moment conditions and the retention of Normality. These problems are naturally more acute in the multivariate GARCH type models, which in addition have the problem of overparameterization. This dissertation uses the Student’s t distribution and follows the Probabilistic Reduction (PR) methodology to modify and extend the univariate and multivariate volatility models viewed as alternative to the GARCH models. Its most important advantage is that it gives rise to internally consistent statistical models that do not require ad hoc parameter restrictions unlike the GARCH formulations. Chapters 1 and 2 provide an overview of my dissertation and recent developments in the volatil- ity literature. In Chapter 3 we provide an empirical illustration of the PR approach for modeling univariate volatility. Estimation results suggest that the Student’s t AR model is a parsimonious and statistically adequate representation of exchange rate returns and Dow Jones returns data.
    [Show full text]
  • The Value at the Mode in Multivariate T Distributions: a Curiosity Or Not?
    The value at the mode in multivariate t distributions: a curiosity or not? Christophe Ley and Anouk Neven Universit´eLibre de Bruxelles, ECARES and D´epartement de Math´ematique,Campus Plaine, boulevard du Triomphe, CP210, B-1050, Brussels, Belgium Universit´edu Luxembourg, UR en math´ematiques,Campus Kirchberg, 6, rue Richard Coudenhove-Kalergi, L-1359, Luxembourg, Grand-Duchy of Luxembourg e-mail: [email protected]; [email protected] Abstract: It is a well-known fact that multivariate Student t distributions converge to multivariate Gaussian distributions as the number of degrees of freedom ν tends to infinity, irrespective of the dimension k ≥ 1. In particular, the Student's value at the mode (that is, the normalizing ν+k Γ( 2 ) constant obtained by evaluating the density at the center) cν;k = k=2 ν converges towards (πν) Γ( 2 ) the Gaussian value at the mode c = 1 . In this note, we prove a curious fact: c tends k (2π)k=2 ν;k monotonically to ck for each k, but the monotonicity changes from increasing in dimension k = 1 to decreasing in dimensions k ≥ 3 whilst being constant in dimension k = 2. A brief discussion raises the question whether this a priori curious finding is a curiosity, in fine. AMS 2000 subject classifications: Primary 62E10; secondary 60E05. Keywords and phrases: Gamma function, Gaussian distribution, value at the mode, Student t distribution, tail weight. 1. Foreword. One of the first things we learn about Student t distributions is the fact that, when the degrees of freedom ν tend to infinity, we retrieve the Gaussian distribution, which has the lightest tails in the Student family of distributions.
    [Show full text]
  • Statistics, Measures of Central Tendency I
    Statistics, Measures of Central TendencyI We are considering a random variable X with a probability distribution which has some parameters. We want to get an idea what these parameters are. We perfom an experiment n times and record the outcome. This means we have X1;:::; Xn i.i.d. random variables, with probability distribution same as X . We want to use the outcome to infer what the parameters are. Mean The outcomes are x1;:::; xn. The Sample Mean is x1+···+xn x := n . Also sometimes called the average. The expected value of X , EX , is also called the mean of X . Often denoted by µ. Sometimes called population mean. Median The number so that half the values are below, half above. If the sample is of even size, you take the average of the middle terms. Mode The number that occurs most frequently. There could be several modes, or no mode. Dan Barbasch Math 1105 Chapter 9 Week of September 25 1 / 24 Statistics, Measures of Central TendencyII Example You have a coin for which you know that P(H) = p and P(T ) = 1 − p: You would like to estimate p. You toss it n times. You count the number of heads. The sample mean should be an estimate of p: EX = p, and E(X1 + ··· + Xn) = np: So X + ··· + X E 1 n = p: n Dan Barbasch Math 1105 Chapter 9 Week of September 25 2 / 24 Descriptive StatisticsI Frequency Distribution Divide into a number of equal disjoint intervals. For each interval count the number of elements in the sample occuring.
    [Show full text]
  • Exploring the Mode – the Most Likely Value? Jerzy Letkowski Western New England University
    OC13077 Exploring the Mode – the Most Likely Value? Jerzy Letkowski Western New England University Abstract The mean, median and mode are typical summary measures, each representing some focal tendency in random data. Even so the focus of each of the measures is different; they usually are packaged together as measures of central tendency. Among these three measures, the mode is granted the least amount of attention and it is frequently misinterpreted in many introductory textbooks on statistics. Also its centrality is questionable in some situations. The purpose of the paper is to provide a formal definition of the mode and to show appropriate interpretation of this measure with respect to different random data types: qualitative and quantitative (discrete and continuous). Several cases are presented to exemplify this purpose. Detail data processing steps are shown in Google spreadsheets. Keywords: mode, statistics, summary measures, probability distribution, data types, spreadsheet. Exploring the Mode – the Most Likely Value? OC13077 WHAT IS THE MODE? In general, in the context of a population, the concept of the mode of a random variable, X, having domain DX, is quite simple. Formally speaking (Spiegel, 1975, p. 84, McClave, 2011, p. 58, Sharpie, 2010, p. 115), it is a value, x*, of the variable that uniquely maximizes the probability-distribution function, f(x) : x* : f (x* ) = max {f (x)} or None (i) x∈DX One could refer to such a definition of the mode as a probability-distribution driven definition. In a way, it is a universal definition. For example, the maximum point for the Normal distribution (density function) is µ.
    [Show full text]
  • UNIT NUMBER 19.4 PROBABILITY 4 (Measures of Location and Dispersion)
    “JUST THE MATHS” UNIT NUMBER 19.4 PROBABILITY 4 (Measures of location and dispersion) by A.J.Hobson 19.4.1 Common types of measure 19.4.2 Exercises 19.4.3 Answers to exercises UNIT 19.4 - PROBABILITY 4 MEASURES OF LOCATION AND DISPERSION 19.4.1 COMMON TYPES OF MEASURE We include, here three common measures of location (or central tendency), and one common measure of dispersion (or scatter), used in the discussion of probability distributions. (a) The Mean (i) For Discrete Random Variables If the values x1, x2, x3, . , xn of a discrete random variable, x, have probabilities P1,P2,P3,....,Pn, respectively, then Pi represents the expected frequency of xi divided by the total number of possible outcomes. For example, if the probability of a certain value of x is 0.25, then there is a one in four chance of its occurring. The arithmetic mean, µ, of the distribution may therefore be given by the formula n X µ = xiPi. i=1 (ii) For Continuous Random Variables In this case, it is necessary to use the probability density function, f(x), for the distribution which is the rate of increase of the probability distribution function, F (x). For a small interval, δx of x-values, the probability that any of these values occurs is ap- proximately f(x)δx, which leads to the formula Z ∞ µ = xf(x) dx. −∞ (b) The Median (i) For Discrete Random Variables The median provides an estimate of the middle value of x, taking into account the frequency at which each value occurs.
    [Show full text]
  • Skewness and Kurtosis 2
    A distribution is said to be 'skewed' when the mean and the median fall at different points in the distribution, and the balance (or centre of gravity) is shifted to one side or the other-to left or right. Measures of skewness tell us the direction and the extent of Skewness. In symmetrical distribution the mean, median and mode are identical. The more the mean moves away from the mode, the larger the asymmetry or skewness A frequency distribution is said to be symmetrical if the frequencies are equally distributed on both the sides of central value. A symmetrical distribution may be either bell – shaped or U shaped. 1- Bell – shaped or unimodel Symmetrical Distribution A symmetrical distribution is bell – shaped if the frequencies are first steadily rise and then steadily fall. There is only one mode and the values of mean, median and mode are equal. Mean = Median = Mode A frequency distribution is said to be skewed if the frequencies are not equally distributed on both the sides of the central value. A skewed distribution may be • Positively Skewed • Negatively Skewed • ‘L’ shaped positively skewed • ‘J’ shaped negatively skewed Positively skewed Mean ˃ Median ˃ Mode Negatively skewed Mean ˂ Median ˂ Mode ‘L’ Shaped Positively skewed Mean ˂ Mode Mean ˂ Median ‘J’ Shaped Negatively Skewed Mean ˃ Mode Mean ˃ Median In order to ascertain whether a distribution is skewed or not the following tests may be applied. Skewness is present if: The values of mean, median and mode do not coincide. When the data are plotted on a graph they do not give the normal bell shaped form i.e.
    [Show full text]
  • Measures of Central Tendency & Dispersion
    Measures of Central Tendency & Dispersion Measures that indicate the approximate center of a distribution are called measures of central tendency. Measures that describe the spread of the data are measures of dispersion. These measures include the mean, median, mode, range, upper and lower quartiles, variance, and standard deviation. A. Finding the Mean The mean of a set of data is the sum of all values in a data set divided by the number of values in the set. It is also often referred to as an arithmetic average. The Greek letter (“mu”) is used as the symbol for population mean and the symbol ̅ is used to represent the mean of a sample. To determine the mean of a data set: 1. Add together all of the data values. 2. Divide the sum from Step 1 by the number of data values in the set. Formula: ∑ Example: Consider the data set: 17, 10, 9, 14, 13, 17, 12, 20, 14 ∑ The mean of this data set is 14. B. Finding the Median The median of a set of data is the “middle element” when the data is arranged in ascending order. To determine the median: 1. Put the data in order from smallest to largest. 2. Determine the number in the exact center. i. If there are an odd number of data points, the median will be the number in the absolute middle. ii. If there is an even number of data points, the median is the mean of the two center data points, meaning the two center values should be added together and divided by 2.
    [Show full text]
  • Fluorescence Correlation Spectroscopy 1
    Petra Schwille and Elke Haustein Fluorescence Correlation Spectroscopy 1 Fluorescence Correlation Spectroscopy An Introduction to its Concepts and Applications Petra Schwille and Elke Haustein Experimental Biophysics Group Max-Planck-Institute for Biophysical Chemistry Am Fassberg 11 D-37077 Göttingen Germany 2 Fluorescence Correlation Spectroscopy Petra Schwille and Elke Haustein Table of Contents TABLE OF CONTENTS 1 1. INTRODUCTION 3 2. EXPERIMENTAL REALIZATION 5 2.1. ONE-PHOTON EXCITATION 5 2.2. TWO-PHOTON EXCITATION 7 2.3. FLUORESCENT DYES 9 3. THEORETICAL CONCEPTS 10 3.1. AUTOCORRELATION ANALYSIS 10 3.2. CROSS-CORRELATION ANALYSIS 18 4. FCS APPLICATIONS 21 4.1. CONCENTRATION AND AGGREGATION MEASUREMENTS 21 4.2. CONSIDERATION OF RESIDENCE TIMES: DETERMINING MOBILITY AND MOLECULAR INTERACTIONS 23 4.2.1 DIFFUSION ANALYSIS 23 4.2.2 CONFINED AND ANOMALOUS DIFFUSION 24 4.2.3 ACTIVE TRANSPORT PHENOMENA IN TUBULAR STRUCTURES 24 4.2.4 DETERMINATION OF MOLECULAR INTERACTIONS 26 4.2.5 CONFORMATIONAL CHANGES 27 4.3. CONSIDERATION OF CROSS-CORRELATION AMPLITUDES: A DIRECT WAY TO MONITOR ASSOCIATION/DISSOCIATION AND ENZYME KINETICS 28 4.4. CONSIDERATION OF FAST FLICKERING: INTRAMOLECULAR DYNAMICS AND PROBING OF THE MICROENVIRONMENT 30 5. CONCLUSIONS AND OUTLOOK 31 6. REFERENCES 32 Petra Schwille and Elke Haustein Fluorescence Correlation Spectroscopy 3 1. Introduction Currently, it is almost impossible to open your daily newspaper without stumbling over words like gene manipulation, genetic engineering, cloning, etc. that reflect the tremendous impact of life sciences to our modern world. Press coverage of these features has long left behind the stage of merely reporting scientific facts. Instead, it focuses on general pros and cons, and expresses worries about what scientists have to do and what to refrain from.
    [Show full text]