On a Case of Inadequacy in Using the Arithmetic Mean

On a Case of Inadequacy in Using the Arithmetic Mean

Available online at www.sciencedirect.com ScienceDirect Procedia Economics and Finance 10 ( 2014 ) 134 – 140 7th International Conference on Applied Statistics On a case of inadequacy in using the arithmetic mean Irina-Maria Dragana* a University of Economic Studies, Statistics and Econometrics Department, Virgil Madgearu Building, Calea Dorobantilor No.15-17, 6th floor, Sector 1, Bucharest 010552, Romania Abstract In strong heterogeneous populations, the use of the average indicator is not appropriate in order to summarize the values, because this involves a relatively homogeneous population and a normal, or approximately normal, distribution around the central value. In the case of some genuine economic processes, there is any possibility to apply even the homogenization process of data, by taking into consideration the string boundaries as outliers’ values, as it can be proceed with a series of measurements. In such heterogeneous populations, the characteristic respect a Cauchy distribution, which is somewhat similar to Gauss-Laplace distribution, however it has more elongated tails and a specific density of the distribution. For these cases it is appropriate as the middle value to be established through the median. The case study, conducted and released in this paper, refers to the financial performance of SMEs. This population is characterized by a strong heterogeneity, so in this case the merger at the level of the population is risky by using the mean, because it generates distorted indicators of the shaped reality. The obtained results in this research might represent a guide for the study of situations in which there are heterogeneous populations, moreover where it is not possible to clean their extreme values. © 2014 Elsevier B.V. This is an open access article under the CC BY-NC-ND license © 2014 The Authors. Published by Elsevier B.V. (http://creativecommons.org/licenses/by-nc-nd/3.0/). Selection and peer-review under responsibility of the Department of Statistics and Econometrics, Bucharest University of Selection and peer-review under responsibility of the Department of Statistics and Econometrics, Bucharest University of Economic Studies. Key words: Cauchy distribution; heterogeneous populations; SMEs; arithmetic mean (average). * Corresponding author. Tel.:+40723408598. E-mail address: [email protected]. 2212-5671 © 2014 Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/3.0/). Selection and peer-review under responsibility of the Department of Statistics and Econometrics, Bucharest University of Economic Studies. doi: 10.1016/S2212-5671(14)00286-X Irina-Maria Dragan / Procedia Economics and Finance 10 ( 2014 ) 134 – 140 135 1. Introduction The private sector of the economy is dominated by small and medium enterprises (Nicolescu et al., 2012), a fact highlighted by the structure of all the enterprises that submitted and been approved balance sheet statements (table 1). Table 1 Company's distribution by size Average number of employees Total 0-9 10-49 50-249 Over 250 56,417 42,197 7,912 1,552 613,078 Of the total of 613,078 active enterprises, a number of 601,428, representing 98.10%, achieved a turnover of up to 2 million euros and only 11,650 firms have a turnover exceeding this ceiling, increase of 4.5% compared to the previous year. Of enterprises with turnover of up to 2 million euros, 93.1% are micro-enterprises, with up to 10 employees, and 99.97% are SMEs. In this sector, but also in many other cases of social and economic practice, there are situations distinguished by a great heterogeneity, in which neither Gauss-Laplace distribution assumption can be accepted, nor homogeneity can be achieved by treating the extreme values as outliers and removing them. In the case of SMEs, but also in others in which it is not acceptable to establish the average value as the arithmetic mean and which assumed a normal distribution of values, can be used the substitution variants of the normal law, from which an apparently peculiar model is proposed by Augustin Cauchy, who developed an earlier proposal made by Maria Agnesi. 2. Substituting Gauss-Laplace distribution and the central value calculation The proposal of Maria Agnesi (Dauben and Scriba, 2002) came into literature under the name “Maria Agnesi loop” or “Witch of Agnesi” or as she called it “la Versiera”. The curve has the following equation yabb222/( x ), a,b>0, x R . (1) If a = b, the simpler form: ya322/( a x ), a>0, x R . (2) The Italian mathematician has deduced the curve in the following manner: a variable line (d), which passes through the origin, cut the circle x2 + y2 = ay in a point A and the line y = a in a point B. The point M of intersection of the lines AM parallel to axis OX and BM parallel to axis OY describes the curve ya 22 x a 3 that is the curve versiera. Cauchy reconsidered the proposal of Maria Agnesi as: a2 Ax, a,b>0, x \ (3) bx22 Agnesiana is positive, A(x) > 0 whatever x \ , it is symmetrical to 0y axis, has two inflections and the axis Ox is horizontal asymptote for curve y = A(x) to both minus and plus infinity limAx 0 . x The graphics similarity to Agnesi loop with a normal density is enticing but also generating potential confusions. Cauchy turned A(x) in a probability density function, considering ab1 and inserting a control factor: 11 Ax* , x \ (4) 1 x2 In this way, A*(x) has become a function of frequency Ax* 0 and Axdx* 1 . R 136 Irina-Maria Dragan / Procedia Economics and Finance 10 ( 2014 ) 134 – 140 However, the major problem of this density is that the theoretical average value is indefinite as 1121 xx E x A* x dx dx dx ln 1 x2 ! (5) 22 R 11xx22 As a result, the credibility of the Cauchy distribution, as a substitute for the normal law, is void precisely by this property, even if visually we see that the mean would be zero, as well as the median and modal value. Because the mean for Cauchy variable cannot be establish with the formula of the Central Limit Theorem, this does not apply. The mean of n variables assigned to Cauchy, is not normal but Cauchy. The „Cauchy random variable”, as it was called later (Smithies, 2008), has the simple density: 11 Xfx:;, 1/x 2 2 (6) where x \\, şi 0 . Here the parameters and do not have the meanings that they acquire in the case of Gauss-Laplace's law: here is a position parameter and is a simple real scale parameter. In fact, if we take 0 and 1 we obtain uncomplicated form: 11 \ fx2 , x 1 x (7) which is indeed a density, whereas fx() 0 and 11 1 dx arctgx 1 (8) 2 1 x 12 The distribution function has the form 2 11x u 1x 2 F x;, 1 2 du arctg (9) 2 or if 0 and 1, than 11 XFx: arctgx (10) 2 If we compute the mean of X namely 11 xdx EX xdFxln 1 x2 (11) 2 x 21 so the mean does not exist, although practically it can be distinguish on the loop as being zero. But as the theory does not allow writing E(X), the location parameter will be estimated using the median. Moments of order r, determine such Irina-Maria Dragan / Procedia Economics and Finance 10 ( 2014 ) 134 – 140 137 r rr 1 x EX xdFx dx (12) 2 1 x where r=2,3…., are infinite. For example if r=2 11 xdx2 1 E X2 dx dx x 1 (13) 22 11xx An important property of Cauchy’s law is that the ratio of two independent random normally distributed variables, zero mean, follows a Cauchy distribution with 0 and 1 (Johnson et al., 1994). 2 2 Furthermore, if X and Y are respectively XN (0, x ) and ZN (0, z ) , then the ratio X/Y follows a Cauchy distribution with 0 and xz/ . The Cauchy distribution has been rediscovered along with the creation of the well-known today Student distribution (the t-distribution) in 1908 by the British chemist and statistician William Sealy Gosset (Pearson, 1990): n 1 2 n 1/2 2 t Tft:1 (14) n n n 2 where tn\`, * , is Gamma function, with (1) 1 and (1 / 2) . For n =1 we obtained the Cauchy distribution (Neubrander, 1984). Student variable has zero mean E(t) = 0 and the variance Var t n/2 n doesn’t make sense but only for n 2 . Gradually, the statistics literature has been enriched with other contributions and inferential applications of this distribution (McCullagh, 1992; Osu and Ohakwe, 2011; Carrillo et al., 2010; Arnold and Beaver, 2000). Because the Cauchy density tails are much broader, compared to the density of the normal distribution, this model can be used to study the variables with extreme values, the same as our case concerning the economic and financial performance of SMEs. 3. The profitability performances of the SMEs in Romania The annual balance sheet data indicate, for the SMEs sector of the economy, a high heterogeneity, the recorded values being placed in an area of particularly large amplitude, more pronounced for the financial performance indicators (Isaic-Maniu and Dragan, 2010). For example, considering for the first and the last five NACE activities (totally, there are 88 NACE activities in which operate the SMEs sector) the values of profitability ratios (computed as a percentage ratio between net result and the turnover) were between 74.57%, for real estate transactions, and 0.42% in the manufacture of tobacco products (Dragan, 2012).

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    7 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us