Contribution of Prof. H. Akaike Tsttitilmdlito Statistical Modeling of Complex Systems
Total Page:16
File Type:pdf, Size:1020Kb
Contribution of Prof. H. Akaike tSttitilMdlito Statistical Modeling of Complex Systems Research Organization of Information and Systems Genshiro Kitagawa Research Organization of ISI, Dublin, August 23, 2011 Information and Systems Hirotugu Akaike(1927-2009) Research Organization of Information and Systems 2 Brief History 1952 Graduated from Math. Dept., Tokyo University Researcher of the Institute of Statistical Mathematics 1962 Head of 2nd Section, 1st Division 1973 Director of 5th Division 1985 Director of Dept. of Prediction and Control 1986 Director-General of ISM (-1994) 1988 Member of Science Council of Japan (- 1991) Head of Dept. of Statistical Science, Graduate University of Advanced Study 1994 PfEProf. Emerit itThIus, The Inst tSttitMth. Statist. Math. Prof. Emeritus, Graduate Univ. for Advance Study Research Organization of Information and Systems 3 Prizes 1972 Ishikawa Prize (Establishment of statistical analysis and control method for dynamic systems) 1980 Okochi Prize (Research and realization of optimal steam temperature control of thermal electric plant) 1989 AhiPiAsahi Prize (Research on statistics, in particular theory and applications of AIC) The Purple Ribbon Medal (Statistics, in particular time series analysis and its applications) 1996 The 1st Japan Statistical Society Prize (Contributions to statistical theory and its applications ) 2000 The Order of the Sacred Treasure 2006 Kyoto Prize ( Major contribution to statistical science and modeling with the development of AIC) Fellow of ASA,,,, RSS, IMS, IEEE, JSS Research Organization of Information and Systems 4 Laureate of 22nd Kyoto Prize "Major contribution to statistical science and modeling with the development of the Akaike Information Criterion (AIC)” Research Organization of Information and Systems 5 Ontline 1. Launching period: Mathematical analysis and structural modeling (1950’s) 2. Frequency domain analysis (1960’s) 3. Time domain time series modeling (1970’s) 4. AIC and statistical modelingg( (1970’s) 5. Bayes modeling (1980’s) Research Organization of Information and Systems 6 1. Launching Period (1952-1960) Mathematical Analysis • DiiDecision process Bird cage • Evaluation of probability distribution • Monte Carlo method solving linear equation • Computation of eigenvalues • Converggpgence of optimum gradient method Convergence Analysis of Optimal Gradient Method AISM(1959) Analyygzed limiting behavior of the op pgtimal gradient method and showed the poor convergence property, known as bird cage. Became the fffoundation of nonlinear optimization methods such as Conjugate Gradient method and Quasi Newton method … Kowarik and Osbone (1968) Research Organization of Information and Systems 7 Structural Modeling At this time, he was rather interested in real-world problems. And f’from very early stage in the1950’, he realized the conventional linear stationary models are unrealistic and the importance of developing a model that fully takes into account of the structure of the process. Control of filature production process (Joint work with Dr. Shimazaki of the Sericultural Experiment Station) • Developed a control method based on gap process modeling • This method provided a reference process to detect abnormalities in actual reeling process. •Brouggght significant innovation for silk production in Ja pan Research Organization of Information and Systems 8 2. Frequency Domain Analysis: (1960-70) • By 1960, he established contacts with engineers It took almost 10 years for me to develop substantial contacts, and this happened largely by coincidence. Statistical Science (1995) • He realized that linear stationary modeling is utilized and many unsolved important problems are left for statisticians • Smoothing method for power spectrum estimation (Suspension system of a car, Dr. Kaneshige, Isuzu Motor Co.) Akaike windows •Estimation of frequency response function (Dr. Yamanouchi, Transportation Tech. Res. Inst.) Developed a method of estimating frequency response function from observations under normal steering (without using sinusoidal inputs) Research Organization of Information and Systems 9 2. Frequency Domain Analysis: (1960-70) Developed practical method of analyzing complex systems Organized workshop on practical use of time series method • Ship’s yawing, rolling (Yamanouchi, Kawashima) • Rolling of a car(Kawamura) • Engine of a car(Kaneshige) • Response of an air-plane to side wind(Takeda) • Hydroelectric power plant(Nakamura) • Estimation of underground structure through micro tremor • TiTsunami (Kinosita) • EEG analysis (Suhara) Research Organization of Information and Systems AISM supplement (1964) 10 Difficulty in Feedback Systems rotary kiln Cementrotary rotary kiln kiln Cement Rotary Kiln, Chichibu Cement Company rotary kiln •rotkilitary kiln is a comp litdlicated fdbktfeedback system, consitiisting of many variables such as raw material feed, fuel feed, gas damper angle, temperature at various locations, etc. • Conventional spectral analysis methods were useless to identify the source of fluctuation • In the frequency domain analysis, we cannot explicitly utilize the physical realizability of the system n1 A Limitation of frequency y n 2 y n1 domain approach Return to time domain modeling B Madison syypmposium (1967)(()1967) n2 Research Organization of Information and Systems 11 3. Time Domain Time Series Modeling (1968-) y ( y , , y )T Vector time series n n1 n wn1 Structural model B m 1 y ni kij ynk, j ni n1 ji k 1 A1 ni ki nk,i wni k 1 y n2 y n1 A2 VAR model representation n2 m y A y w n k nk n B2 k 1 Estimation of kij and ki through Ak wn2 Research Organization of Information and Systems 12 Spectrum & Power Contribution VAR model Power spectrum of yni m y A y w w ~ N(0,) 2 2 n k nk n n pii ( f ) | bij ( f ) | j k1 j1 Cross spectrum * p( f ) B( f ) B( f ) Power contribution m B( f ) (b ) A ep(exp(2i f ) 2 2 ij | bij ( f ) | j 1 r ij ( f ) pii ( f ) Order m is unknown! Proportion of the contribution from noise input e (n) in the Proper selection of the order j power spectrum of yi(n) at is crucial freqqyuency f. Research Organization of Information and Systems 13 Predictive Point of View and FPE Conventional fitting view point Predictive view point “True” “True” Distribution Model Distribution Estimation Estimation Prediction Future Present Present Model Data Data Data Evaluation Scalar AR model m 2 Multivariate case: MFPE, FPEC y n ak ynk wn , wn ~ N(0, m ) k 1 FPE is a forerunner of AIC FPE (Final Prediction Error) 2 nm1 n log FPEm nlogˆm nlog nm1 nm1 2 2 FPEm ˆm nlogˆ (m1) nm1 m Research Organization of Information and Systems Akaike (1969) 14 Realization of Statistical Control Cement rotary kiln VARX model m m yn Aj yn j Bjrn j n j1 j1 State –Space Representation xn Fxn1 Grn n yn Hxn Electric power plant Nonlinear control Japan Bailay Ozaki-Peng Criterion Function L T T J Exn Qxn rn Rrn Roll-reducing j1 Nakamura-Akaike(1981) control Mitsui ship building Statistical Optimal controller Autopilot Oda * rn Kxn Hrn1 Noise-adaptive controller Yokokawa Kitagawa-Akaike- TIMSAC Akaike-Nakagawa(1972) Ohtsu (2006) Ohtsu-Horigome-Kitagawa(1978) Research Organization of Information and Systems 1970 1980 2000 15 Analysis and Control of Feedback Systems Metabolism Mulivariate Wada AR model Ship’s Economics dynamics Tanaka-Sato Ohtsu-Horigome Finance, CDS Nuclear Tsuda Power power plant contribution Hitachi ((Fukunihsi) Man-machine system IhiIshiguro-Ohya Akaike’s linear time series modeling starts from multivariate case. 1968 1977 1993 2000 Research Organization of Information and Systems 16 Main Contributions in Time Domain Modeling 1. Developed practical method of VAR modeling • Analysis of feedback systems • Statistical controller 2. Use of state-space model • For optimal controller design • ML E sti mati on and id entifi cat ion o f VARMA mo de l • BAYSEA: Bayesian method for seasonal adjustment 3. Oder determination • FPE, MFPE, FPEC •AIC 4. Developed TIMSAC (Time Series Analysis and Control program package) Research Organization of Information and Systems 17 4. AIC and Statistical Modeling (1971-) Predictive Point of View “True” Distribution Prediction Predictive Estimation Future Pont of view Present Model Data Data Evaluation MFPE (1) Predictive point of view (2) Prediction by distribution AIC (3) Evaluation by K-L information Factor analysis Research Organization of Information and Systems 18 Predictive Point of View and AIC Model g(y): distribution of future data, f(y): model Kullback-Leibler Information g I(g; f ) EY log EY log g EY log f f Expected Log-Likelihood Unknown constant E log f (Y ) log f ( y)g( y)ddy Y Log-Likelihood x1 , , xn ~ g (x) 1 n log f ( X i ) EY log f (Y ) n i1 MLE ˆ ˆ () log f (X |) max ( ) (X ) Multi-model Situation: Bias correction AIC Research Organization of Information and Systems Akaike (1973,1974) 19 Bias Correction ˆ ˆ b(G) EX log f (X |(X )) nEY log f (Y |(X )) EX [D] Log-Likelihood Generic Information Criterion logfX ( | ( X )) D1 log f ( X |ˆ( X )) b(G) D D2 1 D b(G) trI(G)J(G) m 3 I(G): Fisher Information EEdxpected J(G): -(Expected Hessian) Log-Likelihood EY log f (Y |(X)) ˆ AIC 2log f (X |(X )) 2m * 2(log L(ˆ)) 2(dim of ) Research Organization of Information and Systems 20 Number of papers citing Akaike’s paper by Thomson (ISI) Year by year cites Cited Research Area 1200 5142 IEEEac(1974) AK(1973) 2907 Akad. Kiado (1973) 1000 2550 Math. Stat. IEEE(1974) 1115 Psychometrika (1987) 800 1987 Medical, Health 818 AISM(1969) 600 431 AISM(1970) 400 359 Math. Science (1978) 1868 Bio logy 200 281 Appl. Stat. (1977) 1833 Engineering 265 Bayesian Stat. (1980) 0 1973 1978 1983 1988 1993 1998 2003 2008 204 System Ident. (1980) 1755 Computer, Info. Sci. 135 Celebration (1986) 12000 Cumulative cites 1376 Psychology 127 AISM(1969) 10000 AK(1973) 123 AISM(1974) IEEE(1974) 1070 Econom ics 8000 13104 total 785 Social Science 6000 773 Environmental Sci. BByy GoogGooglele ScScholarholar 4000 673 Physics, Chemistry 14265 IEEE-ac (1974) 610 Geophysics 2000 8588 Acad.