<<

J. Stat. Appl. Pro. 7, No. 2, 379-388 (2018) 379

Journal of Applications & Probability An International Journal

http://dx.doi.org/10.18576/jsap/070215

Modified Estimators Using quartiles and

Asra Nazir1, S. Maqbool2 and Showkat Ahmad3

1Department of statistics, university of Kashmir, J & k, India. 2Division of Agricultural statistics, SKUAST-Kashmir, India. 3Department of Statistics and Operation Research, Aligarh Muslim University, Aligarh UP, India.

Received: 12 Dec. 2018, Revised: 22 Jun. 2018, Accepted: 27 Jun. 2018. Published online: 1 Jul. 2018.

Abstract: In this paper, we proposed some regression estimators for estimating the population using linear combination of first quartile, third quartile and standard deviation of the auxiliary variable which are known. square error (MSE) of the proposed estimator is obtained and compared with that of simple random without replacement (SRSWOR) sample mean, classical ratio estimator and existing ratio estimator’s .Theoretical result is supported by a numerical illustration. Keywords: First quartile, Third quartile, standard quartile deviation, Simple random sampling, Standard deviation.

1 Introduction

Consider a finite populationU {U1 ,U 2 ,U 3 ,...... U N }of N distinct and identifiable units. Let Y be a study variable with value Yi measured on Ui=1,2,3……N giving a vector y={y1,y2………yN}.The problem is to estimate the population mean on the basis of a random sample selected from the population U. The SRSWOR sample mean is the simplest estimator for estimating the population mean.If an auxiliary variable X,closely related to the study variable Y is available then one can improve the performance of the estimators of the study variable by using the known values of the population parameters of the auxiliary variable.That is ,when the population parameter of the auxiliary variable X such as population mean, , , , are known ,a no of estimators such as ratio,product and linear regression estimators and their modifications are suggested in literature and are performing better than the SRSWOR sample mean under certain conditions. Among these estimators the ratio estimators and its modifications are widely attracted many researchers for the estimation of the mean of the study variable(see for example Murthy[1],Singh and Chaudhary[2],Al-Omari, et al.[3],Kadilar and Cingi[4,5],Yan and Tian[6],Subramanian[7] andcingi and Kadilar[8])Before discussing further about the existing modified ratio estimators and the proposed modified ratio estimators, the notations to be used in this paper are described below:  N-Population size  f=n\N, Sampling fraction  Y-Study variable  X-Auxiliary variable  X ,Y  Population means  X,y-sample means

 SX,SY-Population standard deviations

 SXY-Population between X and Y

 CX,CY-Coefficient of variations   -coefficient of correlation between X and Y *Corresponding author e-mail: [email protected] © 2018 NSP Natural Sciences Publishing Cor. 380 A. Naziret al.: Modified linear regression estimators …

1 N  r rth order central r  (X i  X ) , N i1  2  B  3 1 3 ,Coefficient of skewness of the auxiliary variable 2

 4  B1  2 , Coefficient of kurtosis of the auxiliary variable  2

sxy  b= ,Regression coefficient of Y on X   2 s x

 Q1-First (lower )quartile of auxiliary variable

 Q3-Third (upper)quartile of auxiliary variable

 Qr=(Q3-Q1)-Inter-quartile of auxiliary variable

(Q3  Q1 )  Qd = -Semi-quartile range of auxiliary 2

(Q3  Q1 )  Qd = -Semi –quartile average of auxiliary variable 2  B(.)-Bias of the estimator  MSE(.)-Mean Squared error of the estimator

ˆ ˆ th th  Yi (YPj )  i Existing(j proposed)modified ratio estimator of Y

In the case of simple random sampling without replacement (SRSWOR), the sample ys r s is used to estimate population mean Y which is an unbiased estimator and its is

(1 f ) V (y )  S 2 (1) srs n y The ratio estimator for estimating the population mean Y of the study variable Y is defined as ˆ y Y  X (2) R x ˆ The bias and mean squared error of YR to the first order of approximation are given below:

ˆ 1 f 2 B(Y )  Y (C  C C ) (3) R n x X Y

ˆ 1 f 2 2 2 MSE(Y )  Y (C  C  2C C ) (4) R n y x x y The usual linear regression estimator and its variance are given b

ˆ Ylr  y   (X  x) (5)

ˆ 1 f 2 2 V (Ylr )  S y (1  ) (6) n

© 2018 NSP Natural Sciences Publishing Cor.

J. Stat. Appl. Pro. 7, No. 2, 379-388 (2018) / http://www.naturalspublishing.com/Journals.asp 381

The ratio and the linear regression estimators are used for improving the precision of estimate of the population mean based on SRSWOR when thereexistan auxiliary variable X and positively correlated with Y. It has been established; in general that the linear regression estimator is more efficient than the ratio estimator whenever the regression line of the study variable on the auxiliary variable does not passes through the neighborhood of the origin. Thus,the linear regression estimator is more precise than the ratio estimator unless   R Kadilarandcingi[4]have suggested a class of modified ratio estimators in the form of linear regression estimator as given below: ˆ  X  Y 1 [y  b(X  x)]  (7)  x  Kadilar and Cingi[4] have shown that the modified linear regression type ratio estimator given in (7) perform well compare to the usual ratio estimator under certain conditions.Further estimators are proposed by modifying the estimators given in (7) by using the known first quartile,third quartile and the standard deviation etc, together with some other modified ratio estimator in kadilarandcingi [4,5] and Yian and Tian [6].Khare BB, Srivastava U, Kumar K [10] A generalized chain ratio in regression estimator for population mean using two auxiliary characters in sample survey. S. Kumar, P. Maheshwari& P. Chhaparwal [11]An improved regression type estimator to estimate population mean under non-normality in simple random sampling.The list of existing modified linear regression type ratio estimators together with the constant,the bias and the mean squared error are given below in table 1: Table 1: Existing modified ratio estimators class 1 together with the constant, bias and mean squared error.

Estimators Constant MSE (.)  X    Y 1 f 2 2 2 2 1 (R2 S x  S y (1  ) Y1  y  b(X  x)  R1 = x   X   n  1  1  X    ˆ 1 2  Y 1 f 2 2 2 2 Y2  y  b(X  x) 1 (R3 S x  S y (1  )   R   1 x   2  2 n 1 X   2

It is to be noted that “the existing modified ratio estimators “means the list of modified ratio estimators to be considered in this paper unless otherwise stated. It does not mean to the entire list of modified ratio estimator available in the literature. The list of modified linear regression type ratio estimator given in table 1 uses the known values of the parameters like and their linear combinations to improve the ratio estimator in estimation of population mean. In this paper an attempt is made to use the first quartile, third quartile and standard deviation of the auxiliary variables to introduced modified linear regression type ratio estimators for estimating population mean in line with kadilar and cingi [4].

2 ProposedModified Linear Regression Type Ratio Estimators

In this section, a class of modified linear regression type ratio estimators is proposed for estimating the finite population mean and also derived the bias and the mean squared error of the proposed estimators (see Appendix A).The proposed ˆ estimators is defined as Ypj ; j  1,2 for estimating the population mean Y together with the constant, the bias and the mean squared error are presented in the following table 2: Table 2: Proposed class of estimators. Estimators Constant Rpj MSE (.)

1 f 2 2 2 2 1 X  (Q1  S x ) 1Y Y  y  b(X  x) R  (R p1S x  S y (1  ) p1    p1 n  1 x  (Q1  S x )  1 X  (Q1  S x )

 Y 1 f 2 2 2 2 1 (R S  S (1  )  X  (Q  Q ) R p2  p2 x y 1 1 3  X  (Q  Q ) n Yp2  y  b(X  x)  1 1 3  1 x  (Q1  Q3 ) 

© 2018 NSP Natural Sciences Publishing Cor.

382 A. Naziret al.: Modified linear regression estimators …

3 Efficiencyofthe ProposedEstimators

The variance of SRSWOR sample mean ys r s is given below: 1 f V (y )  S 2 (8) srs n y ˆ The mean squared error of the usual ratio estimator Y to the first degree approximation is given below: ˆ 1 f 2 2 2 MSE(Y )  Y (C  C  2C C ) (9) n y x x y The modified ratio estimators given in table 1 and the proposed modified ratio estimators given in table 2 are represented in three classes as given below: ˆ ˆ Class 1 :The mean squared error and the constant of the existing modified linear regression type ratio estimator Y1 to Y2 listed in the Table 1 are represented in a single class(say,class1),which will be very much useful for comparing with that of proposed modified linear regression type ratio estimators and are given below:

ˆ 1 f 2 2 2 2 MSE(Y )  (R S  S (1  ); j  1,2 (10) pj n pj x y ˆ ˆ Class 2: The mean squared error and the constant of the proposed modified linear regression estimator Yp1 to Yp2 listed in table 2 are represented in a single class, which will be very much useful for comparing with that of existing modified linear regression type ratio estimators

ˆ 1 f 2 2 2 2 MSE(Y )  (R S  S (1  ); j  1,2 (11) pj n pj x y From the expression given in (8)and (11) we have derived the conditions (see Appendix –B)for which the proposed ˆ estimatorYpj are more efficient than the simple random sampling without replacement (SRSWOR) sample mean ys r s and are given below

ˆ s y MSE(Ypj )  V (ysrs )if Rpj   sx

(12) From the expression given in (9) and (11)we have derived the conditions(see Appendix –C) for which the proposed ˆ estimators Ypj are more efficient than the usual ratio estimators and are given below:

C x  C y C y  C x Y ( )  R pj  Y ( ) ˆ ˆ S x S x MSE( Ypj )  MSE(Y ) if (13) C y  C x C x  C y Y ( )  R pj  Y ( ) S x S x From the expression given in (10) and (11) we have derived the conditions (see Appendix-D) for which the proposed estimators ˆ Ypj ;j=1,2 and are given below: ˆ ˆ MSE(Ypj )  MSE(Yi )if Rpj  Ri ;i 1,2,...... 12 (14)

4 Empirical Studies

The performance of the proposed modified linear regression type ratio estimators are assessed with that of the SRSWOR sample mean, the usual ratio estimator and the existing modified ratio estimators for a natural population considered by kadilar and cingi[4].The population consists of data on apple production amount (as study variable) and no of apple trees (as auxiliary variable) in 106 villages of Aegean region in 1999.the parameters computed from the above population are

© 2018 NSP Natural Sciences Publishing Cor.

J. Stat. Appl. Pro. 7, No. 2, 379-388 (2018) / http://www.naturalspublishing.com/Journals.asp 383 given below:

Table 3: Data Statistics. N=106 n=40 Y  2 2 1 2.5 9 4 3 X  2 7 4 2 1.6 9 8 1

S y 1 1 4 9 6.9 1 0 2 Cy  5.1961

Cx  2.0855 S x  5 7 1 8 8.9320

 2 (x)  34.5723   0.8 5 6 0

b  0.1721 1 (x)  5.1 2 3 8

Q3  2 6 7 0 0 Q1  2387.5

Table 4: The constant, the bias and the mean squared error of the existing and proposed modified ratio estimators are given. Estimators Constant B(.) MSE(.) SRSWOR sample mean - 2057484.9094 ys r s - Ratio estimator ˆ 169.6823 975171.1057 YR 0.0807 Linear Regression ˆ - 549896.3301 Y - Estimator lr Existing Modified linear ˆ 72.58 851646.74 Y 0.081 regression type ratio 1 66.31 798418.15 estimators(class 1) ˆ Y2 0.074 Proposed Modified linear ˆ 60.03 749997.08 Y 0.067 regression type ratio p1 48.38 672828.112 estimators(class 2) ˆ Yp2 0.054

From the values of table 4,it is observed that mean squared error of the proposed modified linear regression type ratio estimator are less than the variance of the SRSWOR sample mean ,mean squared error of the usual ratio type estimator and existing modified linear regression estimators.

5Conclusions

In this paper, we have proposed a class of modified ratio estimators using the quartiles and standard deviation of the auxiliary variable. The bias and mean squared error of the proposed estimators are obtained and compared with that of the existing estimators. We have derived the conditions for which the proposed estimators are more efficient than the SRSWOR sample mean, the usual ratio estimator and the existing modified linear regression type ratio estimators. Hence we recommend the proposed estimators for the practical use of application.

References

[1] M. N. Murthy, Sampling theory and methods. Calcutta: Statistical Publishing Society,(1967). [2] D. Singh and F. S .Chaudhary, Theory and analysis of sample survey designs. New Delhi: New Age International Publisher.(1986). [3] A. L. Al- Omari, A. A. Jemain, and K. Ibrahim, "New ratio estimators of the mean using simple random sampling and ranked set sampling methods," RevistaInvestigaciónOperational., 30, 97-108(2009).

© 2018 NSP Natural Sciences Publishing Cor.

384 A. Naziret al.: Modified linear regression estimators …

[4] C. Kadilar and H. Cingi, "Ratio estimators in simple random sampling," Applied Mathematics and Computation.151, 893- 902(2004). [5] C.Kadilar and H. Cingi, "An improvement in estimating the population mean by using the correlation coefficient," Hacettepe Journal of Mathematics and Statistics., 35, 103-109(2006). [6] Z. Yan and B. Tian, "Ratio method to the mean estimation using coefficient of skewness of auxiliary variable," ICICA 2010, Part II, CCIS.,106, 103–110(2010). [7] J. Subramani, "Generalized modified ratio estimator for estimation of finite population mean," Journal of Modern Applied Statistical Methods., 12, 121-155(2013). [8] H. Cingi and C. Kadilar, Advances in sampling theory- ratio method of estimation. Sharjah, United Arab Emirates: Bentham Science Publishers, (2009). [9] W. G. Cochran, Sampling techniques, 3rd ed. New Delhi, India: Wiley Eastern [10] Khare BB, Srivastava U, Kumar K (2013) A generalized chain ratio in regression estimator for population mean using two auxiliary characters in sample survey. J Sci Res,57, 147–153(2013). [11] S. Kumar, P. Maheshwari& P. Chhaparwal (2018) An improved regression type estimator to estimate population mean under non- normality in simple random sampling. 20(6), 1035–1050(2017).

Appendix –A

We have derived the bias and mean squared error of the proposed modified linear regression type ratio estimator ˆ Ypj ; j 1,2,3 to first order of approximation and are given below: y Y x  X Let e  and e  .Further we can write y =Y (1 e ) and x  X (1 e )and from the definition of o Y o X o 1 eo and e1 we obtain:

E[ eo ]=E[ e1 ]=0 (1 f ) (1 f ) (1 f ) E[e2 ]  C 2 ; E[e2 ]  C 2 ; E[e e ]  C C o n y 1 n x 0 1 n y x ˆ The proposed estimators Ypj in the form of and e1 is given below:

ˆ  (X   j )  Ypj  Y (1 e0 )  b(X  x) ; j  1,2 (X (1 e1 )   j )

Q1  S X Q1  Q3 Where 1  and 2   1  1     ˆ (X   j ) Y  Y (1 e )  b(X  x)  pj 0   e1 X (X   j )(1 )  X   j 

ˆ Y (1 e0 )  b(X  x) X Ypj  Where pj  ; j  1,2 (1 pje1 ) X   j

ˆ -1 Ypj  Y (1 e0 )  b(X  x)( (1 pje1)

ˆ 2 2 3 3 -1 Ypj  Y (1 e0 )  b(X  x)( (1 pje1  pje 1  pje 1  ...... ) Neglecting the terms higher than third order, we will get:

© 2018 NSP Natural Sciences Publishing Cor.

J. Stat. Appl. Pro. 7, No. 2, 379-388 (2018) / http://www.naturalspublishing.com/Journals.asp 385

ˆ 2 2 2 Ypj  Y Ye0  bXe1 Ypje1 Y pje0e1  bX pje1 Y pje1 ˆ 2 2 2 Ypj Y  Ye0  bXe1 Ypje1 Y pje0e1  bX pje1 Y pje1 (A) Taking expectations on both sides of (A),we get: E(Yˆ Y )  YE(e )  bXE(e ) Y E(e ) Y E(e e )  bX E(e2 ) Y 2 E(e2 ) pj 0 1 pj 1 pj 0 1 pj 1 pj 1

ˆ (1 f )  2 2 S xy S xy    Bias(Ypj )  Y pj Cx  pj Cx  pj Cx  n  S x S x 

ˆ (1 f ) 2 2 Bias(Y )  Y C ; j  1,2 (B) pj n pj x

Substitute the value of  pj in (B),we get the bias of the proposed estimator ˆ (Ypj ); j 1,2 as given below: (1 f )  X 2  Bias(Yˆ )  Y C 2  (C) pj  x  n  X   j  Multiply and divide by Y 2 in (3.19), we get 2 ˆ (1 f )  S x 2  Y Bias(Y )   R  where (D) pj  pj  R pj  ; j  1,2 n  Y  X   j Squaring both sides of (A),neglecting the terms more than 2nd order and taking expectation on both sides we get ˆ 2 2 2 2 2 2 2 2 2 2 2 E(Ypj Y )  E(Y eo )  b X E(e1 ) Y  pj E(e1 )  2bYXE(e0e1 )  2Y  pj E(eoe1 )  2bXY pj E(e1 )

ˆ (1 f ) 2 2 2 2 2 2 2 2 2 2 MSE(Y )  (Y C  b X C  Y  C  2bYXC C  2Y  C C  2bXY C ) pj n y x pj x y x pj y x j x 2 (1  f ) S xy S xy S xy S xy S xy ˆ 2 2 2 2 2 2 2 MSE(Ypj )  (Y C y  2 S x  Y  pjC x  2 2 S y S x  2Y  pj S y C x  2 2 S xY j C x ) n S x S x S x S y S x S y S x 2 (1  f ) S xy S xy S xy S xy S xy MSE(Yˆ )  (Y 2C 2  S 2  Y 2 2 C 2  2 S S  2Y 2 S C  2 S Y C ) pj n y S 2 x pj x S 2 S S y x pj S S y x S 2 x j x x x x y x y x 2 (1 f ) S S S S MSE(Yˆ )  (Y 2C 2 Y 2 2 C 2  xy S 2  2 xy S  2Y 2 xy C  2Y C 2 xy C ) pj n y pj x S 2 x S 2 pj S S x j x S 2 x x x x y x (1 f ) S 2 MSE(Yˆ )  (Y 2 2 C 2 Y 2C 2  xy ) pj n pj x y S 2 x 2 2 (1 f ) S S S xy MSE(Yˆ )  (Y 2 2 C 2 Y 2C 2  x y )   pj n pj x y S 2 S S x since x y (1 f ) MSE(Yˆ )  (Y 2 2 C 2  S 2   2 S 2 ) pj n pj x y y

(1  f ) MSE(Yˆ )  (Y 2 2 C 2  S 2 (1   2 )) pj n pj x y

 pj in the above expression, we will get: Substitute the value of (1 f ) X 2 MSE(Yˆ )  (Y 2 C 2  S 2 (1  2 )) pj 2 x y n (X   j )

© 2018 NSP Natural Sciences Publishing Cor.

386 A. Naziret al.: Modified linear regression estimators …

(1 f ) Y 2 MSE(Yˆ )  (Y 2 X 2C 2  S 2 (1  2 )) pj n (X   ) 2 x y j The mean squared error of the proposed modified linear regression type ratio estimator ˆ Is given below: Ypj Y ˆ (1 f ) 2 2 2 2 MSE(Y )  (R S  S (1  )) Rpj  ; j  1,2 (E) pj n pj x y X   Where j Appendix-(B)

The condition for which the proposed modified linear regression type ratio estimators(class 2) perform better than the SRSWOR sample mean are derived and are given below: ˆ For MSE(Ypj )  V (ysrs ) (1 f ) 1 f (R 2 S 2  S 2 (1  2 )  S 2 n pj x y n y 2 2 2 2 2  (RpjS x  S y (1  )  S y 2 2 2 2 2 2  RpjS x  S y  S y   S y  0 2 2 2 2  RpjS x  S y   0 2 2 2 2  RpjS x  S y 

 RpjSx  S y 

S x    R pj S y

S y  R pj   S x

ˆ S y That is MSE(Ypj )  V (ysrs ) if R pj   S x Appendix-C

The conditions for which the proposed modified ratio estimators (class 2) perform better than the usual ratio estimators are derived and are given below: ˆ ˆ For MSE(Ypj )  MSE(YR ) 1 f 1 f (R 2 S 2  S 2 (1  2 ))  Y 2 (C 2  C 2  2C C ) n pj x y n y x x y

2 *2 2 2 2 2 2 2 * R pj  Y (R pjS  S (1  ))  Y (C  C  2C C ) Where R  x y y x x y pj Y *2 2 2 2 2 2  R pjS x  Cy (1  )  Cy  Cx  2CxCy  0

*2 2 2 2 2 2 2  R pjS x  C y   C y  C y  Cx  2CxCy  0

*2 2 2 2 2  R pjS x   C y  Cx  2CxC y  0

2 *2 2  (Cx  C y )  R pjS x  0

© 2018 NSP Natural Sciences Publishing Cor.

J. Stat. Appl. Pro. 7, No. 2, 379-388 (2018) / http://www.naturalspublishing.com/Journals.asp 387

* *  (C  C R pjS )(C  C R pjS )  0 x y x x y x * * Condition 1: (Cx  C y R pjS x )  (Cx  C y R pjS x )  0 * * Cx  R pjS x  C y andCx  R pjS x  C y

* C y  C x * C x  C y  R pj  andRpj  S x S x

Cx  C y * C y  Cx * R pj   Rpj  where R pj  S x S x Y

C x  C y * C y  C x  Y ( )  R pj  Y ( ) S x S x * * Condition 2: (Cx  C y R pjS x )  (Cx  C y R pjS x )  0 * * C x  R pj S x  C y andCx  R pj S x  C y

* C y  C x * C x  C y  R pj  andRpj  S x S x

C x  C y * C y  C x * R pj   R pj  where R pj  S x S x Y

C x  C y * C y  C x  Y ( )  R pj  Y ( ) S x S x

© 2018 NSP Natural Sciences Publishing Cor.

388 A. Naziret al.: Modified linear regression estimators …

Asra Nazir Research Scholar, Pursuing Ph.D in the Department of Statistics, University of Kashmir.

Showkat Maqbool. is faculty member in the Department of Agricultural Statistics,Wadura SKUAST-K. He Obtained his Doctorate from the University of Kashmir.His field of interest is mainly in the area of Sampling Theory.

Showkat Ahmad Lone is faculty member in the Department of Statistics,Govt.Degree.College Sumbal. He Obtained his Doctorate from Aligarh Muslim University.His field of interest is mainly in the area of Reliability.

© 2018 NSP Natural Sciences Publishing Cor.