CHAPTER – 11 Univariate and Bivariate Analysis of Data

CHAPTER – 11 Univariate and Bivariate Analysis of Data 1. Which of the following cannot be covered under univariate analysis of data? a. Association between two variables b. Computation of mean, median and mode c. Preparation of frequency table d. Computation of percentage frequency for a variable 2. Both mean and mode can be computed for what type of measurement scales. a. Nominal b. Ordinal c. Interval d. Ratio e. c and d f. a and c 3. Which of the following option is not available for the treatment of missing value? a. Substitute the average value of the response b. Substitute a neutral value c. Return to the field to get the desired observation d. The concerned questionnaire should be eliminated from the analysis. 4. In the interpretation of the cross table, the percentages should be computed a. Row wise b. Column wise c. In the direction of the independent variable d. Computing percentages either row wise or column wise does not make a difference. 5. For which type of measurement, the coefficient of variation can be computed. a. Nominal scale b. Ordinal scale c. Interval scale d. Ratio scale 6. A third variable is introduced in the two variable table to a. Refine the association that was observed originally between two variables. b. The introduction of third variable may show that there was no association between the original two variables. c. Introducing a third variable may indicate association between two original variables although initially no relationship was found between them. d. All of the above are true. 7. Which of the following is true about the missing observation? a. Missing value could be coded with another number which should not be equal to the value of the variable obtained as a part of the survey. b. It is possible to get different research conclusions if the value of the missing observation was available. c. The actual results may deviate from the observed results depends upon the number of missing observation and the extent to which the missing data would be different from the actual observation. d. All of the above statements are true. 8. Which of the following is true in case of interpretation of a table with analysis of multiple responses? a. The percentages add up to 100%. b. The percentages exceed 100% because of multiplicity of answer. c. The percentages cannot be computed. d. All the above statements are incorrect. 9. There are 20% female students in a class – this is an example of a. Descriptive analysis b. Inferential analysis c. Gender bias d. All of the above are correct 10. For which type of measurement, median cannot be computed. a. Nominal b. Ordinal c. Interval d. Ratio 11. Which of the following analysis cannot be carried out using ordinal scale data? a. Preparation of frequency distribution b. Computation of the quartiles of the distribution c. Computing the mode of the distribution d. Computing the standard deviation of the variable 12. The uses of a frequency distribution are a. To get the extent of non response. b. To detect the presence of extreme cases (outliers in the distribution). c. To get the extent of illegitimate responses. d. All of the above 13. That measure of central tendency above which 50% values fall and below which the remaining 50% fall is called: a. Mean b. Median c. Mode d. Range 14. When a respondent assigns an order of preference using values as 1, 2, 3 and so on, he is using a. Nominal values b. Ordinal values c. Interval values d. Ratio values 15. The median can be computed from a. Ordinal, interval and nominal data b. Ratio, ordinal and nominal data c. Ratio, interval and ordinal data d. Ratio, interval and nominal data 16. Which of the following gives the measure of consistency of data? a. Mean b. Standard deviation c. Mode d. Median 17. The simple way to look at association for the data which requires only the ability to compute percentages. a. Cross-tabulation b. Correlation coefficient c. Spearman rank correlation coefficient d. Simple graph e. None of the above 18. Factor analysis is a technique for a. Univariate data b. Bivariate data c. Multivariate data d. Both (a) and (b) e. Both (a) and (c) f. Both (b) and (c) 19. Which of the following may not be the right method for dealing with missing information on a questionnaire? a. Discard the questionnaire b. Ignore that particular question and code the remaining c. Based on responses of similar respondents, substitute a value d. All of them may be appropriate e. Both (a) and (b) f. Both (b) and (c) 20. Median can be computed for a. Closed ended class interval b. Open ended class interval c. Ordinal scale data d. Interval scale data e. Ratio scale data f. All of them are true 21. The one-way table cannot be used for a. Frequency distribution b. Percentage distribution c. Cumulative frequency distribution d. To compute measure of central tendency e. It can be used for all of the above. For the next 4 questions, read the following table: TABLE Consumption of ice cream and household income Low Consumption of Ice cream High Consumption of Ice cream Total Low Income 30 10 40 Middle Income 20 20 40 High Income 12 28 40 Total 62 58 120 22 The above table is an example of a. Cross-tabulation b. One way tabulation c. Four way classification d. None of the above 23 What percentage of household have less consumption of Ice cream? a. 50 b 51.67 c 54 d 49.38 24 How many households are there with middle income? a 30 b 28 c 40 d None of the above 25 How many household with middle income have high consumption of Ice cream\/ a 20 b 30 c 28 d 12 For the next 3 question read the following data Given below are the marks of 10 students in an examination 40, 60, 50, 55, 45, 70, 60, 55, 75, 80. 26 What is the average score of the students? a 55 b 60 c 65 d 59 27 What is median of the distribution of marks? a 55 b 57.5 c 58.5 d 60 28 What is the range of the marks of the distribution? a 40 b 18 c 20 d 30 CHAPTER-12 Testing of Hypotheses 1 A p value of o.o5 means a. There is only 5% chance that the results are incorrect b. There is probability of 5 in 100 that this result would occur if the null hypotheses were true c. There is only 5% chance of getting this result d. All of the above are true 2. The probability of rejecting a null hypothesis when it is true is called a Level of significance b Type II error c Type I error d Beta 3 While testing for single population mean when the sample size n is less than 30 and population standard deviation is unknown a z test is used. b t test with n-2 degrees of freedom is used. c t test with n-1 degrees of freedom is used. d None of the above statement is true. 4 Which of the following statements is false? a t distribution is symmetrical. b z distribution is symmetrical. c t distribution is flatter than z distribution. d t distribution is applicable only when sample size n is greater than 50. 5 Which of the following distribution is useful for small sample while testing for population means? a z distribution b. F distribution c. chi-square distribution d. t distribution 6 When testing for the equality of two means, the hypotheses would take which of following forms? a. 푯풐:풖ퟏ = 풖ퟐ 퐻1:푢1 ≠푢2 2 2 b. 퐻표: 휎1 = 휎2 2 2 퐻1: 휎1 ≠ 휎2 C 퐻표 : μ ≤ 2.0 퐻1 : μ > 2.0 d 퐻표:푝1 = 푝2 퐻1:푝1 ≠ 푝2 7 Testing hypotheses concerning population parameters using sample data is called a Exploratory research b Descriptive research c Descriptive analysis d Inferential analysis 8 Which of the following could be labeled as a null hypothesis? a μ ≠ 25 b μ > 25 c μ = 25 d μ < 25 9 The degrees of freedom for testing the equality of two population means assuming equal variance using a t test are a 푛1 + 푛2 b 푛1 - 푛2 c 푛1 + 푛2 - 1 d 풏ퟏ + 풏ퟐ – 2 10 When testing for the proportion of a population, we use a t test b F test c Z test assuming normal approximation to binominal distribution d Paired sample t test 11 When we accept the null hypothesis when it is false we, are committing a type 1 error b type 2 error c neither type 1 nor type 2 error d none of the above is true 12 The alternative hypothesis is “that more than 80% of the students know driving” is an example of a One-tailed test b Two-tailed test c Type 1 error d Type 2 error 13 The degrees of freedom for the t test to test the hypothesis about paired sample are a n b 푛1 + 푛2 c 푛1 + 푛2 – 2 d n -1 14 In case of small sample with the objective of testing the equality of two population means with unequal population variance a z test is used b t test with degrees of freedom 푛1 + 푛2 – 2 is used c t test with 푛1 + 푛2 degrees of freedom is used d None of the above statements is correct. 15 The null hypothesis to test the equality of two population means using t test would be a 푡1 = 푡2 b 푋1 = 푋2 c μ1>μ2 d 훍 ퟏ = 훍 ퟐ 16 The most appropriate level of significance used in testing of hypothesis is a.

CHAPTER – 11 Univariate and Bivariate Analysis of Data

T-TEST Outline Hypothesis Testing Steps Bivariate Analysis

Chapter Five Multivariate Statistics

Package 'Biwavelet'

Meta4diag: Bayesian Bivariate Meta-Analysis of Diagnostic Test Studies for Routine Practice

UNIT 1 INTRODUCTION to STATISTICS Introduction to Statistics

Correlation and Regression Analysis

Multiple Linear Regression

The Landscape of R Packages for Automated Exploratory Data Analysis by Mateusz Staniak and Przemysław Biecek

WHY MULTIVARIATE ANALYSIS? • Used for Analysing Complicated

Geographic Information Systems Correlation Modeling As A

Univariate Analysis and Normality Test Using SAS, STATA, and SPSS

Chapter Four: Univariate Statistics SPSS V11 Chapter Four: Univariate Statistics