Ratio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute
Total Page:16
File Type:pdf, Size:1020Kb
Ratio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute Rajesh Singh Department of Statistics, Banaras Hindu University Varanasi (U.P.), India Email: [email protected] Pankaj Chauhan School of Statistics, DAVV, Indore (M.P.), India Nirmala Sawan School of Statistics, DAVV, Indore (M.P.), India Florentin Smarandache Department of Mathematics, University of New Mexico, Gallup, USA Email: [email protected] Abstract Some ratio estimators for estimating the population mean of the variable under study, which make use of information regarding the population proportion possessing certain attribute, are proposed. Under simple random sampling without replacement (SRSWOR) scheme, the expressions of bias and mean-squared error (MSE) up to the first order of approximation are derived. The results obtained have been illustrated numerically by taking some empirical population considered in the literature. Key words: Proportion, bias, MSE, ratio estimator. 1. Introduction The use of auxiliary information can increase the precision of an estimator when study variable y is highly correlated with auxiliary variable x. There exist situations when information is available in the form of attribute φ , which is highly correlated with y. For example a) Sex and height of the persons, b) Amount of milk produced and a particular breed of the cow, c) Amount of yield of wheat crop and a particular variety of wheat etc. (see Jhajj et. al. [1]). Consider a sample of size n drawn by SRSWOR from a population of size N. Let th yi and φi denote the observations on variable y and φ respectively for i unit (i =1,2,....N) . Suppose there is a complete dichotomy in the population with respect to the presence or absence of an attribute, say φ , and it is assumed that attribute φ takes only the two values 0 and 1 according as φi = 1, if ith unit of the population possesses attribute φ = 0, otherwise. Pak.j.stat.oper.res. Vol.IV No.1 2008 pp47-53 Rajesh Singh, Pankaj Chauhan, Nirmala Sawan, Florentin Smarandache N n Let A = ∑φi and a = ∑φi denote the total number of units in the population and i=1 i=1 A a sample respectively possessing attribute φ . Let P = and p = denote the N n proportion of units in the population and sample respectively possessing attribute φ . Taking into consideration the point biserial correlation between a variable and an attribute, Naik and Gupta [2] defined ratio estimator of population mean when the prior information of population proportion of units, possessing the same attribute is available, as follows: ⎛ P ⎞ t NG = y⎜ ⎟ (1.1) ⎝ p ⎠ Here y is the sample mean of variable of interest. The MSE of t NG up to the first order of approximation is – ⎛1− f ⎞ 2 2 2 MSE(t NG ) = ⎜ ⎟[]Sy + R1 Sφ − 2R1Syφ (1.2) ⎝ n ⎠ N N n Y 2 1 2 2 1 2 Where f = , R1 = , Sy = ∑()yi − Y , Sφ = ∑ ()φi − P , N P N −1 i=1 N −1 i=1 1 N Syφ = ∑ ()φi − P ()yi − Y . N −1 i=1 In the present paper, some ratio estimators for estimating the population mean of the variable under study, which make use of information regarding the population proportion possessing certain attribute, are proposed. The expressions of bias and MSE have been obtained. The numerical illustrations have also been done by taking some empirical populations considered in the literature. 2. The suggested estimator Following Ray and Singh [3], we propose the following estimator – y + b (P − p) t = φ P = R *P (2.1) 1 p n s yφ y + bφ (P − p) ⎛ 1 ⎞ 2 Where , R * = , s 2 = φ − p and bφ = 2 φ ⎜ ⎟∑()i sφ p ⎝ n −1⎠ i=1 ⎛ 1 ⎞ n s yφ = ⎜ ⎟∑()φi − p ()yi − Y . ⎝ n −1⎠ i=1 Remark 1: When we put bφ = 0 in (2.1) the proposed estimator turns to the Naik and Gupta [2] ratio estimator t NG given in (1.1). 48 Pak.j.stat.oper.res. Vol.IV No.1 2008 pp47-53 Ratio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute MSE of this estimator can be found by using Taylor series expansion given by – ∂f (c,d) ∂f (c,d) f (p,y) ≅ f (P, y) + P,Y (p − P) + (y − Y) (2.2) ∂c ∂c P,Y * Where f (p,y) = R and f (P,Y) = R1 . Expression (2.2) can be applied to the proposed estimator in order to obtain MSE equation as follows: * ∂()()y + bφ (P − p) / p ∂((y + bφ (P − p)))/ p R − R1 ≅ ()p − P + ()y − Y ∂p P,Y ∂y P,Y ⎛ y b P ⎞ 1 ≅ −⎜ + φ ⎟ (p − P) + (y − Y) ⎜ p2 p2 ⎟ p ⎝ ⎠ P,Y P,Y 2 ()Y + B P 2(Y + B P) 1 E(R* − R )2 ≅ φ V(p) − φ Cov(p, y) + V(y) 1 P4 P3 P2 2 1 ⎧(Y + BφP) 2(Y + BφP) ⎫ ≅ 2 ⎨ 2 V(p) − Cov(p, y) + V(y)⎬ (2.3) P ⎩ P P ⎭ Sφy ρpbSy Where Bφ = 2 = . Sφ Sφ Syφ ρpb = , is the point biserial correlation coefficient. SySφ Now, 2 2 MSE(t1) = P E(R1 − R φ ) ()Y + B P 2 2(Y + B P) ≅ φ V()p − φ Cov()p, y + V()y (2.4) P2 P Simplifying (2.4), we get MSE of t1 as ⎛1− f ⎞ 2 2 2 2 MSE()t1 ≅ ⎜ ⎟[]R1Sφ + Sy ()1− ρpb (2.5) ⎝ n ⎠ Several authors have used prior value of certain population parameters (s) to find more precise estimates. Searls (1964) used Coefficient of Variation (CV) of study character at estimation stage. In practice this CV is seldom known. Motivated by Searls (1964) work, Sen (1978), Sisodiya and Dwivedi (1981), and Upadhyaya and Singh (1984) used the known CV of the auxiliary character for estimating population mean of a study character in ratio method of estimation. The use of prior value of Coefficient of Kurtosis in estimating the population variance of study character y was first made by Singh et. al. (1973). Later, used by and Searls and Intarapanich (1990), Upadhyaya and Singh (1999), Singh (2003) and Singh et. al. (2004) in the estimation of population mean of study character. Recently Singh and Tailor (2003) proposed a modified ratio estimator by using the known value of correlation coefficient. Pak.j.stat.oper.res. Vol.IV No.1 2008 pp47-53 49 Rajesh Singh, Pankaj Chauhan, Nirmala Sawan, Florentin Smarandache In next section, we propose some ratio estimators for estimating the population mean of the variable under study using known parameters of the attribute φ such as coefficient of variation Cp, Kurtosis (β2 (φ)) and point biserial correlation coefficientρpb . 3. Suggested Estimators We suggest following estimator – y + bφ (P − p) t = (m1P +m2) (3.1) (m1p + m2 ) Where m1(≠ 0) , m2 are either real number or the functions of the known parameters of the attribute such as Cp, (β2 (φ)) andρpb . The following scheme presents some of the important estimators of the population mean, which can be obtained by suitable choice of constants m1 and m2: Estimator Values of m1 m2 y + b (P − p) 1 1 t = φ P 1 p y + bφ (P − p) 1 β2 (φ) t 2 = []P + β2 (φ) ()p + β2 (φ) y + bφ (P − p) 1 Cp t3 = []P + Cp ()p + Cp y + bφ (P − p) 1 ρpb t 4 = []P + ρpb ()p + ρpb ) y + bφ (P − p) β2 (φ) Cp t5 = []Pβ2 (φ) + Cp ()pβ2 (φ) + Cp y + bφ (P − p) Cp β2 (φ) t6 = []PCp + β2 (φ) ()pCp + β2 (φ) y + bφ (P − p) Cp ρpb t7 = []PCp + ρpb ()pCp + ρpb y + bφ (P − p) ρpb Cp t8 = []Pρpb + Cp ()pρpb + Cp y + bφ (P − p) β2 (φ) ρpb t9 = []Pβ2 (φ) + ρpb ()pβ2 (φ) + ρpb y + bφ (P − p) ρpb β2 (φ) t10 = []Pρpb + β2 (φ) ()pρpb + β2 (φ) 50 Pak.j.stat.oper.res. Vol.IV No.1 2008 pp47-53 Ratio Estimators in Simple Random Sampling Using Information on Auxiliary Attribute Following the approach of section 2, we obtain the MSE expression for these proposed estimators as – ⎛1− f ⎞ 2 2 2 MSE(ti ) ≅ ⎜ ⎟[]R iSφ + Sy (1− ρpb ) , (i = 1,2,3,....,10 ) (3.2) ⎝ n ⎠ Y Y Y Y Where R1 = , R 2 = , R 3 = , R 4 = , P P + β2 (φ) P + Cp P + ρpb Yβ2 (φ) YCp YCp Yρpb R 5 = , R 6 = , R 7 = , R8 = , Pβ2 (φ) + Cp PCp + β2 (φ) PCp + ρpb Pρpb + Cp Yβ2 (φ) Yρpb R 9 = , R10 = . Pβ2 (φ) + ρpb Pρpb + β2 (φ) 4. Efficiency comparisons It is well known that under simple random sampling without replacement (SRSWOR) the variance of the sample mean is ⎛1− f ⎞ 2 V(y) = ⎜ ⎟Sy (4.1) ⎝ n ⎠ From (4.1) and (3.2), we have V(y) − MSE(ti ) ≥ 0 , i = 1,2,.....,10 2 2 Sφ 2 ⇒ ρpb > 2 R i (4.2) Sy When this condition is satisfied, proposed estimators are more efficient than the sample mean. Now, we compare the MSE of the proposed estimators with the MSE of Naik and Gupta (1996) estimator t NG . From (3.2) and (1.1) we have MSE(t NG ) − MSE(ti ) ≥ 0 , (i = 1,2,.....,10 ) 2 2 Sφ 2 2 ⇒ ρpb ≥ 2 []R i − R φ + 2R φK yp (4.3) Sy Cy where K yp = ρyp . Cp Pak.j.stat.oper.res. Vol.IV No.1 2008 pp47-53 51 Rajesh Singh, Pankaj Chauhan, Nirmala Sawan, Florentin Smarandache 5. Empirical Study The data for the empirical study is taken from natural population data set considered by Sukhatme and Sukhatme (1970): y = Number of villages in the circles and φ = A circle consisting more than five villages N = 89, Y = 3.36, P = 0.1236, ρpb = 0.766, Cy = 0.604, Cp = 2.19, β2 (φ) = 6.23181.