Available online at www.sciencedirect.com

Procedia Engineering 15 ( 2011 ) 5020 – 5024

$GYDQFHGLQ&RQWURO(QJLQHHULQJDQG,QIRUPDWLRQ6FLHQFH

Prediction of Coal Output in Wuhai Using Grey-Markvo Model Improved by Nonlinear Regression

Bo Zhang1,Junhai Ma1,2 *

1 College of Management EconomicˈTianjin University ˈTianjin 300072ˈChina 2 University of Finance & Economics, Tianjin 300222,

Abstract

Coal resource is abundant in Wuhai city. Its production had better meet the need of economic growth and be preserved for sustainable development. Accuracy prediction is critical to make program. The final model of grey theory is exponential growth. Exponential and quadratic functions are directly assumed. Parameters are estimated by Least Square. Residuals sum of squares indicate superiority of quadratic function. Long trend is fitted by quadratic. Deviation is measured by sine function and Markov chain respectively. Coal outturn of Wuhai in 2015 is predicted to be 25.1949 and 30.2917 million tones. Since output in 2009 is 29.73 million tones, quadratic and Markov chain reveal underlying trend of Wuhai coal production. Keywords. Coal production, grey system, nonlinear regression, Markov Chain

1. Introduction

Wuhai with 58.8 percents of revealed coking coal in lies in southwest of as well as west of Erdos, is an important national base of coking for its rich coal reserve. Coal plays a significant role in Wuhai which contributes a lot to industry and living of Inner Mongolia[1]. Inadequate mining maybe decrease economic growth, while unreasonable exploitation overdraw sustainable ability. Coal price must be brought down for its sufficiency as well as its profit. For sustainable development in Wuhai, it is essential to supply coal in accordance with market demand to avoid blind exploitation, which helps to ensure healthy progress of market. Prediction of coal production is a meaningful work. There are two ways to predict coal output, one is to find out regular principle of itself, and the other is to analysis from its numerous influencing factors such as oil and natural gas. Those factors, difficult to acquire history datum[3], have complicate relations with coal yield. We directly figure out regular pattern from history datum of coal output by grey model. As a traditional prediction method[4-6], grey model GM(1,1) whose tendency is a smoothing curve, is designed for short-term, equidistance, less information and deviation series and not suitable for systems

* Corresponding author. Tel.:+8615222025960 E-mail address: [email protected],[email protected]

1877-7058 © 2011 Published by Elsevier Ltd. doi: 10.1016/j.proeng.2011.08.933 Bo Zhang and Junhai Ma / Procedia Engineering 15 ( 2011 ) 5020 – 5024 5021 with large fluctuation. The appreciating nonlinear model for historical information is firstly assumed to break through the default and the parameters are estimated by Least Square. The residuals of series with large volatility can be still fitted by Markov chain[7-8] which describes a random system and forecasts future state on the basis of probability transition matrix. The matrix reflects intrinsic principle of transition and influencing strength of random factors. Prediction of large random volatility and system amendment influenced by stochastic factors are suitable for Markov chain which remedy the defect of grey model. Based on history coal productions in Wuhai, future ones are predicted by time series model through finding fixed pattern, which is helpful to make production plan for relevant enterprises and government.

2. GREY-Markov Model Improved By Nonlinear Regession Great attention had been paid to grey system after its birth and many researchers devoted themselves to its theory and application research until now. (0) (0) (0) (0) (0) XXX= { (1), (2), , Xn ( )} Assume time series X has n observations , by grey theory, the final model ∧ (0) +=−aakªº(0) −b − = Xk( 1) (1 eX )«» (1) ek , 0,1, 2, , n , is ¬¼a (1) where parameters ab, are estimated by Least Square through 1-accumulating generation operator data. Grey model works better for given series with three conditions as follows: ρ(1)k + <=1,kn 2, 3, , − 1 ρ()k (2) ρ ∈=ε ()kk [0,], 3,4,, n (3) ε < 0.5 (4) ρ ==Xk() ()kkk−1 , 2,3,,. n ¦ Xi() Where i=1 The key hypothesis of grey theory is exponential growth of 1-accumulating generation operator data. For series not fulfilling the condition and having unsatisfactory forecast, function type between series and Xk(0) ()= fk (;α ) k f time is directly assumed to be α , where time is input variable of unknown function according with scatter plot, and is parameter estimated by Least Square[9]. Prediction precision can be enhanced by combining Markov chain and grey model, since grey model prefer short, less sample and deviation system series and Markov appreciate large stochastic volatility. Its effects are certified to be better than individual grey model in practical applications[7-8]. The idea of Grey-Markov model is concluded from some applications as follows: GM (1,1 ) is firstly built as well as residuals are divided into several states, then one step probability transition matrix is estimated for Markov predicting pseudo residuals, in which states division and interval limits are critical for forecast. For high accuracy, Researchers proposed two ways to split states, one was on the basis of the limits of residuals or relative ones, and the other was in accordance with prediction curves which was followed by several paralleled lines with equal or unequal space. Each method has its own sphere of application of good forecast precision.

3. Forecasting Results 5022 Bo Zhang and Junhai Ma / Procedia Engineering 15 ( 2011 ) 5020 – 5024

Coal production in Wuhai from 1976 to 2010 was shown in Fig. 1 from which increasing trend and small deviation were obvious. The biggest increasing amplitude, lied in 2009 to 2010, took great inconvenient for prediction. According to grey theory, the function between output and time was assumed to

aaªºb −−(1976)t ft( )=− (1 e ) 492 − e , ¬¼«»a (5) where f and t restimpresented coal output and year respectively as well as parameters ab, were e ated by function nls in R language with initial values a =−0.1,b = 300 . The final estimations were   ab=−0.049, = 326.782 coincided with GM (1, 1) model parameter estimate, and the summation of residuals square was 2540275. =++2 Quadratic grown, displayed in Fig. 1, could be hypothesized as f ()tatbtc (6) === with parameters abc,,. Initial values were selected as ab 1 , -2, c 300 after several   trying. The fitted values were ab==2.442, -48.339,cˆ = 780.747 and residual sum of squares was 1899084 less than 2540275, the squares of the grey model, which indicated the superiority of quadratic model.

Considering small deviation in Fig. 1, sine function was added to quadratic model,

t − μ f() t=+++ at2 bt c d sin( ) σ μ σ (7) with six parameters abc,,,,d , . The initial values of abc,, were set to the estimation of quadratic model as d,,μ σ well as parameters was 50, 2, 0.5. The optimized a ˈ b ˈ c ˈ values were μ =2.446 =-48.976 =790.026 d =87.874ˈ =-41.261ˈσ =0.511. The residual sum of squares of Eqn. (7), the smallest one of our three Fig. 1.History coal production in Wuhai model, was 1762818, which demonstrated that sine function adding quadratic model improved effect. The real and predicted values of three models were depicted as Fig. 2.

Fig. 2 listed the real and predicted values, where scatter point, solid line, long dashed line and short dashed line represented real data, quadratic adding sine function, quadratic and grey model respectively. Quadratic exemplified growing trend and sine function, which fluctuated apart from data deviation in Fig. 2, modified volatility unsatisfactorily. Tab.1 showed the predicted values from 2010 to 2015 where predicted ones were lower since sine function did not simulate a rapid growth from 2009 to 2010 occurred in true data

Fig. 2. Compare between real and predicted values of three models of grey model, quadratic and quadratic adding sine function in Wu coal output Bo Zhang and Junhai Ma / Procedia Engineering 15 ( 2011 ) 5020 – 5024 5023

Markov chain was introduced to TABLE I present volatility based on relative PREDICTION OF QUADRATIC ADDING SINE residuals of quadratic model, i.e. the ratio VOLATILITY between residuals and real values. Four Year Predicted Value (ten thousands tones) states were divided between maximum 34% and minimum -58% with first state [- 2010 2040.37 60% ˈ -30%], second state [-30% ˈ 0], 2011 2036.53 third state [0ˈ20%], forth state [20%ˈ 2012 2136.26 40%], whose process was demonstrated in 2013 2408.35 Fig. 3. 2014 2460.16 2015 2519.49

Followed from Fig. 3, one step probability transition matrix was calculated as §·1/3 2/3 0 0 ¨¸1/14 10/14 3/14 0 ¨¸. ¨¸0 /11 3 /11 7 /11 1/11 ¨¸ ©¹0/6 0/6 1/6 5/6 Relative residual in 2010 was 34 percent, belonging to forth state. The probability belonging to four stats in 2011 was evaluated as follows Famous Markov property assumed that the present value only have correlation to the past one period and no relation to the other past. Using the test of Markov property in reference [10], the p- Fig. 3. State division of relative residuals of quadratic value was 0.36>0.05 and accepted the null model hypothesis that Markov model was suitable. TABLE 2 Predicting results of quadratic model adding Markov chain (ten thousands tones)

Seco First nd Third Forth Relat §·1/3 2/3 0 0 State Predict Modifi ¨¸1/14 10/14 3/14 0 State State State ive ()¨¸ Year Prob ed ed 0001 Prob Proba Proba Resu ¨¸0 /11 3 /11 7 /11 1/11 abilit Values Value ¨¸ abilit bility bility ials ©¹0/6 0/6 1/6 5/6 y y = ()0 0 0.17 0.83 1959.6 2969.1 2010 0.00 0.00 0.00 1.00 34% Mean of boundary of each state, 5 7 2079.7 2836.0 being -45%,-15%ˈ10% and 30% 2011 0.00 0.00 0.17 0.83 27% 8 6 respectively, could be viewed as 2204.7 2865.4 2012 0.00 0.05 0.24 0.71 23% relative residuals in this state. 9 3 Relative residuals in 2011 was 0×(- 2334.6 2904.2 2013 0.00 0.10 0.28 0.61 20% 9 6 45%)+0×(- 2469.4 2958.5 15%)+0.17×10%+0.83×30%=27%. 2014 0.01 0.15 0.30 0.54 17% 6 3 Substituting the year into Eqn. (6), 2609.1 3029.1 2015 001 020 032 048 14% 20.7978 million tones was obtained, 5024 Bo Zhang and Junhai Ma / Procedia Engineering 15 ( 2011 ) 5020 – 5024

and coal yield in Wuhai in 2011 was predicted to be 20.7978×(1ˉ27%)=28.3606 million tones. Coal outputs in other years were forecasted at the same way and the results were expressed in Tab. 2.

Conclusion Using extensively, grey model whose fundamental hypothesis is exponential growth, is essentially equal to nonlinear regression with parameter estimation by Least Square. Since our coal production in Wuhai has quadratic increasing, quadratic model is directly assumed between output and time with sine function and Markov chain fitting volatility. The result shows that quadratic model adding Markov chain has its own superiority for our data, by which coal yield of Wuhai in 2015 is predicted to be 30.2917 million tones.

Acknowledgment

The research work of thesis is fully supported by the National Natural Science Foundation of China (Grant NO. 10772132) and the Doctoral Foundation of Ministry of Education of China (Grant No. 20070056063).

References

[1] D. Z. Ping, “Some Discussion on Coal Resources Exploitation and its Environmental Protection in Wuhai City,” Conservation and Utilization of mineral Resources, no. 2, pp. 45-47, Feb. 2005. [2] W. Z. Hong, Z. A. Guo, “Study on Forecasted Trend Of China’s Coal Output,” China Mining Magazine, vol. 12, no. 11, pp. 5–8, Nov. 2003. [3] T. J. Ming, Development of AI Forecasting System to Coal’s Output based on EF Model, Science & Technology Information, no. 34, pp. 45–46, Dec. 2007. [4] D. J. Long, Grey System Theory ⚠⨶䇪ส⹰, : Huazhong University Press, 2002, pp. 102–190. [5] T. Bin, “Research of Random Model of Bridge Load by Grey Markov Chain Prediction,” Master dissertation, Structure Engineering, Central South University, , China, 2007. [6] J. L. Deng, Introduction to grey system theory, The Journal of Grey System, vol. 1, no. 1, pp.1–24, Jan. 1989. [7] S. P. Meyn, R. L. Tweedie, Markov Chains and Stochastic Stability. London: Springer-Verlag, 1993, pp. 37-58. [8] H. James, A New Approach to The Economic Analysis Of Nonstationary Time Series And The Business Cycle, Econometrica , vol. 57, no. 2, pp. 357–84, Apr. 1989. [9] G. A. F. Seber, C. J. Wild, Nonlinear Regression. New York: John Wiley and Sons, 1989, pp. 1-100. [10] J. A. Matos and M. Fernandesbes, “Testing the Markov property with high frequency data,” Journal of Econometrics, vol.141, no. 1, pp. 44-64, Nov. 2007.