Fused Adaptive Lasso for Spatial and Temporal Quantile Function Estimation

Total Page:16

File Type:pdf, Size:1020Kb

Fused Adaptive Lasso for Spatial and Temporal Quantile Function Estimation Technometrics ISSN: 0040-1706 (Print) 1537-2723 (Online) Journal homepage: http://www.tandfonline.com/loi/utch20 Fused Adaptive Lasso for Spatial and Temporal Quantile Function Estimation Ying Sun, Huixia J. Wang & Montserrat Fuentes To cite this article: Ying Sun, Huixia J. Wang & Montserrat Fuentes (2016) Fused Adaptive Lasso for Spatial and Temporal Quantile Function Estimation, Technometrics, 58:1, 127-137, DOI: 10.1080/00401706.2015.1017115 To link to this article: http://dx.doi.org/10.1080/00401706.2015.1017115 View supplementary material Accepted author version posted online: 01 Sep 2015. Published online: 22 Feb 2016. Submit your article to this journal Article views: 84 View related articles View Crossmark data Full Terms & Conditions of access and use can be found at http://www.tandfonline.com/action/journalInformation?journalCode=utch20 Download by: [North Carolina State University] Date: 26 April 2016, At: 14:04 Supplementary materials for this article are available online. Please go to http://www.tandfonline.com/r/TECH Fused Adaptive Lasso for Spatial and Temporal Quantile Function Estimation Ying SUN Huixia J. WANG CEMSE Division Department of Statistics King Abdullah University of Science and Technology George Washington University Thuwal 23955-6900, Saudi Arabia Washington, DC 20052 ([email protected]) ([email protected]) Montserrat FUENTES Department of Statistics North Carolina State University Raleigh, NC 27695 ([email protected]) Quantile functions are important in characterizing the entire probability distribution of a random vari- able, especially when the tail of a skewed distribution is of interest. This article introduces new quan- tile function estimators for spatial and temporal data with a fused adaptive Lasso penalty to accom- modate the dependence in space and time. This method penalizes the difference among neighboring quantiles, hence it is desirable for applications with features ordered in time or space without repli- cated observations. The theoretical properties are investigated and the performances of the proposed methods are evaluated by simulations. The proposed method is applied to particulate matter (PM) data from the Community Multiscale Air Quality (CMAQ) model to characterize the upper quantiles, which are crucial for studying spatial association between PM concentrations and adverse human health effects. KEY WORDS: Fused adaptive Lasso; Particulate matter; Quantile estimation; Spatial quantiles; Temporal quantiles. 1. INTRODUCTION while a local smoothing technique accounts for the dependence among qτ (i)’s. Moreover, only one realization of Y is needed to For a continuous univariate random variable Y, the quan- estimate qτ . q = F −1 τ ={y ∈ R F y = τ} tile function is defined as τ ( ) : ( ) , In the presence of right-skewed distributions, the usual Gaus- where F is the strictly increasing cumulative distribution func- sian assumption will underestimate the upper tail probability τ ∈ , tion (cdf) and (0 1) is the quantile level. Parzen (2004, and methods based on the extreme value theory or quantiles 2008) gave comprehensive discussions on quantiles from theory are more appropriate to characterize the tail distribution. Coo- to practice. The quantile function is important in characterizing ley et al. (2012) reviewed current methodology for the analysis the entire probability distribution, for example, the median is of spatial extremes, focusing on methods that account for de- a robust location estimator and upper quantiles are useful in pendence in extremes, such as max-stable processes and copula describing extreme events. Downloaded by [North Carolina State University] at 14:04 26 April 2016 models. Under these models, the statistical inference requires = Y ,...,Y T When data, Y ( 1 n) , are from a stochastic pro- repeated measurements, which are not generally available for cess, multivariate quantile function is not uniquely defined. We stochastic processes. In such a framework, the main goal of consider marginal quantiles and define the quantile function as the article consists of proposing nonparametric methods for = q ,...,q n T q i = F −1 τ F qτ ( τ (1) τ ( )) , where τ ( ) i ( ) with i denot- estimating upper quantile values in time or space without repli- Y ing the marginal cdf of i . In practice, estimating qτ is difficult cated observations. Our work is motivated by the need of bet- Y because i ’s are correlated and repeated measurements of the ter tools to characterize the tail distribution of the particulate random vector Y are usually not available. For example, for matter (PM), a mixture of pollutants. Since most air quality time series or spatial data, it is challenging to estimate the tem- data are right-skewed under stable meteorological conditions, poral quantile curves or spatial quantile surfaces due to the de- pendence among neighboring observations. Ignoring the depen- dence may underestimate the magnitude of the quantile function. © 2016 American Statistical Association and In this article, we propose a method to tackle this problem via the American Society for Quality a smoothing procedure to estimate qτ locally. The advantage TECHNOMETRICS, FEBRUARY 2016, VOL. 58, NO. 1 of this method is that the index-varying qτ (i), i = 1,...,n, DOI: 10.1080/00401706.2015.1017115 describes the changing behavior in the marginal distribution, Color versions of one or more of the figures in the article can be found online at www.tandfonline.com/r/tech. 127 128 YING SUN, HUIXIA J. WANG, AND MONTSERRAT FUENTES and are observed as a time series at a particular location, or 2. METHODOLOGY a spatial surface at a certain time, we need to characterize the upper quantiles using one time series or one set of spatial data 2.1 Background only. To estimate quantile functions, quantile regression, first pro- For stochastic processes, one way to create replication is via a posed by Koenker and Bassett (1978), has become an impor- stationarity assumption. For example, the second-order station- tant and widely used technique. Quantile regression can be arity means that the mean and the covariance of the process do used to estimate quantiles of a response variable conditional not change when shifted in time or space. Specifically, a time on the regressors or predictors, and thus provide a comprehen- {Y t } δ series, ( ) , is second-order stationary if, for all , and for sive picture of the conditional response. Suppose we observe all t, E{Y (t)}=μ, and cov{Y (t),Y(t + δ)}=K(δ). Under the p {(Xi ,Yi ),i = 1,...,n}, where Xi ∈ R is the vector of predic- second-order stationarity assumption, covariance functions play tors and Yi ∈ R is the continuous response. Let qτ (Xi )bethe an important role in interpreting the local behavior, and Gaus- conditional τth quantile of Yi , such that P (Yi ≤ qτ (Xi )|Xi ) = τ, sian process models are widely used. For skewed distributions, for 0 <τ<1. The quantile qτ (Xi ) is estimated by minimizing methodologies based on multivariate extreme value theory are n ρ Y − q , ρ u = u{τ − I u< } i=1 τ ( i τ (Xi )) where τ ( ) ( 0) . developed to model the tail dependence, but it becomes difficult Before introducing the regularization technique in quan- for high or infinite dimensions (Cooley et al. 2012). Another way tile regression model, we briefly review two types of Lasso to model the local behavior is through nonparametric smooth- penalties in the least squares regression model. The Lasso ing. In practice, it is often difficult to tell whether the true un- methodology, as proposed by Tibshirani (1996), is a reg- derlying process is nonstationary or stationary with long range ularization technique for simultaneous estimation and vari- dependence by only looking at the observed data. The local T able selection. In a linear regression model E(Yi |Xi ) = Xi β, features can be either interpreted as a result of the nonstation- β = β ,...,β T the coefficient ( 1 p) is estimated by minimizing arity or the long range dependence in a stationary process. For n T 2 p i= (Yi − Xi β) + λ j= |βj |, where λ is a tuning parame- example, Furrer and Nychka (2007) presented a framework to 1 1 ter. The L1 norm penalty shrinks the coefficients toward 0 as understand the asymptotic properties of kriging and splines and λ increases. The fused Lasso penalty (Tibshirani et al. 2005) proved the equivalent interpretation between kriging estimators enforces sparsity in both the coefficients and their successive and generalized splines based on Nychka (1995). Stein (1991) differences, which is desirable for applications with features or- showed that the predictions that a generalized covariance func- dered in some meaningful way. The penalty term can be written tion yields are identical to an order 2 thin plate smoothing spline. λ p |β |+λ p |β − β |. as 1 j=1 j 2 j=2 j j−1 Draghicescu, Guillas, and Wu (2009) addressed time-varying The Lasso-type penalties in the least squares regression model quantile curve estimation for a class of nonstationary time are widely used in many fields. For example, in comparative ge- series. nomic hybridization (CGH) analysis, Eilers (2003), Huang et al. In this article, we propose nonparametric methods assuming (2005), and Tibshirani and Wang (2008) employed the penal- nonstationarity to capture the local dependence by minimizing ized least squares method by shrinking the distances between a loss function with a penalty term. For example, when observa- signals at adjacent clone locations, taking the spatial depen- tions are temporally inhomogeneous, we assume the marginal dence into account. Nowak et al. (2011) considered multiple Y distribution of i varies over time to characterize the temporal arrays of the CGH data. Moreover, Storlie et al. (2011) studied dependence, and improve the quantile estimation by penalizing the Lasso penalty for surface estimation and variable selec- the roughness. We use the fused Lasso technique (Tibshirani tion in multivariate nonparametric regression. Das et al.
Recommended publications
  • Miguel De Carvalho
    School of Mathematics Miguel de Carvalho Contact M. de Carvalho T: +44 (0) 0131 650 5054 Information The University of Edinburgh B: [email protected] School of Mathematics : mb.carvalho Edinburgh EH9 3FD, UK +: www.maths.ed.ac.uk/ mdecarv Personal Born September 20, 1980 in Montijo, Lisbon. Details Portuguese and EU citizenship. Interests Applied Statistics, Biostatistics, Econometrics, Risk Analysis, Statistics of Extremes. Education Universidade de Lisboa, Portugal Habilitation in Probability and Statistics, 2019 Thesis: Statistical Modeling of Extremes Universidade Nova de Lisboa, Portugal PhD in Mathematics with emphasis on Statistics, 2009 Thesis: Extremum Estimators and Stochastic Optimization Advisors: Manuel Esqu´ıvel and Tiago Mexia Advisors of Advisors: Jean-Pierre Kahane and Tiago de Oliveira. Nova School of Business and Economics (Triple Accreditation), Portugal MSc in Economics, 2009 Thesis: Mean Regression for Censored Length-Biased Data Advisors: Jos´eA. F. Machado and Pedro Portugal Advisors of Advisors: Roger Koenker and John Addison. Universidade Nova de Lisboa, Portugal `Licenciatura'y in Mathematics, 2004 Professional Probation Period: Statistics Portugal (Instituto Nacional de Estat´ıstica). Awards & ISBA (International Society for Bayesian Analysis) Honours Lindley Award, 2019. TWAS (Academy of Sciences for the Developing World) Young Scientist Prize, 2015. International Statistical Institute Elected Member, 2014. American Statistical Association Young Researcher Award, Section on Risk Analysis, 2011. National Institute of Statistical Sciences j American Statistical Association Honorary Mention as a Finalist NISS/ASA Best y-BIS Paper Award, 2010. Portuguese Statistical Society (Sociedade Portuguesa de Estat´ıstica) Young Researcher Award, 2009. International Association for Statistical Computing ERS IASC Young Researcher Award, 2008. 1 of 13 p l e t t si a o e igulctoson Applied Statistical ModelingPublications 1.
    [Show full text]
  • Abbreviations of Names of Serials
    Abbreviations of Names of Serials This list gives the form of references used in Mathematical Reviews (MR). ∗ not previously listed E available electronically The abbreviation is followed by the complete title, the place of publication § journal reviewed cover-to-cover V videocassette series and other pertinent information. † monographic series ¶ bibliographic journal E 4OR 4OR. Quarterly Journal of the Belgian, French and Italian Operations Research ISSN 1211-4774. Societies. Springer, Berlin. ISSN 1619-4500. §Acta Math. Sci. Ser. A Chin. Ed. Acta Mathematica Scientia. Series A. Shuxue Wuli † 19o Col´oq. Bras. Mat. 19o Col´oquio Brasileiro de Matem´atica. [19th Brazilian Xuebao. Chinese Edition. Kexue Chubanshe (Science Press), Beijing. (See also Acta Mathematics Colloquium] Inst. Mat. Pura Apl. (IMPA), Rio de Janeiro. Math.Sci.Ser.BEngl.Ed.) ISSN 1003-3998. † 24o Col´oq. Bras. Mat. 24o Col´oquio Brasileiro de Matem´atica. [24th Brazilian §ActaMath.Sci.Ser.BEngl.Ed. Acta Mathematica Scientia. Series B. English Edition. Mathematics Colloquium] Inst. Mat. Pura Apl. (IMPA), Rio de Janeiro. Science Press, Beijing. (See also Acta Math. Sci. Ser. A Chin. Ed.) ISSN 0252- † 25o Col´oq. Bras. Mat. 25o Col´oquio Brasileiro de Matem´atica. [25th Brazilian 9602. Mathematics Colloquium] Inst. Nac. Mat. Pura Apl. (IMPA), Rio de Janeiro. § E Acta Math. Sin. (Engl. Ser.) Acta Mathematica Sinica (English Series). Springer, † Aastaraam. Eesti Mat. Selts Aastaraamat. Eesti Matemaatika Selts. [Annual. Estonian Heidelberg. ISSN 1439-8516. Mathematical Society] Eesti Mat. Selts, Tartu. ISSN 1406-4316. § E Acta Math. Sinica (Chin. Ser.) Acta Mathematica Sinica. Chinese Series. Chinese Math. Abh. Braunschw. Wiss. Ges. Abhandlungen der Braunschweigischen Wissenschaftlichen Soc., Acta Math.
    [Show full text]
  • Department of Statistics and Data Science Promotion and Tenure Guidelines
    Approved by Department 12/05/2013 Approved by Faculty Relations May 20, 2014 UFF Notified May 21, 2014 Effective Spring 2016 2016-17 Promotion Cycle Department of Statistics and Data Science Promotion and Tenure Guidelines The purpose of these guidelines is to give explicit definitions of what constitutes excellence in teaching, research and service for tenure-earning and tenured faculty. Research: The most common outlet for scholarly research in statistics is in journal articles appearing in refereed publications. Based on the five-year Impact Factor (IF) from the ISI Web of Knowledge Journal Citation Reports, the top 50 journals in Probability and Statistics are: 1. Journal of Statistical Software 26. Journal of Computational Biology 2. Econometrica 27. Annals of Probability 3. Journal of the Royal Statistical Society 28. Statistical Applications in Genetics and Series B – Statistical Methodology Molecular Biology 4. Annals of Statistics 29. Biometrical Journal 5. Statistical Science 30. Journal of Computational and Graphical Statistics 6. Stata Journal 31. Journal of Quality Technology 7. Biostatistics 32. Finance and Stochastics 8. Multivariate Behavioral Research 33. Probability Theory and Related Fields 9. Statistical Methods in Medical Research 34. British Journal of Mathematical & Statistical Psychology 10. Journal of the American Statistical 35. Econometric Theory Association 11. Annals of Applied Statistics 36. Environmental and Ecological Statistics 12. Statistics in Medicine 37. Journal of the Royal Statistical Society Series C – Applied Statistics 13. Statistics and Computing 38. Annals of Applied Probability 14. Biometrika 39. Computational Statistics & Data Analysis 15. Chemometrics and Intelligent Laboratory 40. Probabilistic Engineering Mechanics Systems 16. Journal of Business & Economic Statistics 41.
    [Show full text]
  • Rank Full Journal Title Journal Impact Factor 1 Journal of Statistical
    Journal Data Filtered By: Selected JCR Year: 2019 Selected Editions: SCIE Selected Categories: 'STATISTICS & PROBABILITY' Selected Category Scheme: WoS Rank Full Journal Title Journal Impact Eigenfactor Total Cites Factor Score Journal of Statistical Software 1 25,372 13.642 0.053040 Annual Review of Statistics and Its Application 2 515 5.095 0.004250 ECONOMETRICA 3 35,846 3.992 0.040750 JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION 4 36,843 3.989 0.032370 JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY 5 25,492 3.965 0.018040 STATISTICAL SCIENCE 6 6,545 3.583 0.007500 R Journal 7 1,811 3.312 0.007320 FUZZY SETS AND SYSTEMS 8 17,605 3.305 0.008740 BIOSTATISTICS 9 4,048 3.098 0.006780 STATISTICS AND COMPUTING 10 4,519 3.035 0.011050 IEEE-ACM Transactions on Computational Biology and Bioinformatics 11 3,542 3.015 0.006930 JOURNAL OF BUSINESS & ECONOMIC STATISTICS 12 5,921 2.935 0.008680 CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS 13 9,421 2.895 0.007790 MULTIVARIATE BEHAVIORAL RESEARCH 14 7,112 2.750 0.007880 INTERNATIONAL STATISTICAL REVIEW 15 1,807 2.740 0.002560 Bayesian Analysis 16 2,000 2.696 0.006600 ANNALS OF STATISTICS 17 21,466 2.650 0.027080 PROBABILISTIC ENGINEERING MECHANICS 18 2,689 2.411 0.002430 BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY 19 1,965 2.388 0.003480 ANNALS OF PROBABILITY 20 5,892 2.377 0.017230 STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT 21 4,272 2.351 0.006810 JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS 22 4,369 2.319 0.008900 STATISTICAL METHODS IN
    [Show full text]
  • Abbreviations of Names of Serials
    Abbreviations of Names of Serials This list gives the form of references used in Mathematical Reviews (MR). ∗ not previously listed The abbreviation is followed by the complete title, the place of publication x journal indexed cover-to-cover and other pertinent information. y monographic series Update date: January 30, 2018 4OR 4OR. A Quarterly Journal of Operations Research. Springer, Berlin. ISSN xActa Math. Appl. Sin. Engl. Ser. Acta Mathematicae Applicatae Sinica. English 1619-4500. Series. Springer, Heidelberg. ISSN 0168-9673. y 30o Col´oq.Bras. Mat. 30o Col´oquioBrasileiro de Matem´atica. [30th Brazilian xActa Math. Hungar. Acta Mathematica Hungarica. Akad. Kiad´o,Budapest. Mathematics Colloquium] Inst. Nac. Mat. Pura Apl. (IMPA), Rio de Janeiro. ISSN 0236-5294. y Aastaraam. Eesti Mat. Selts Aastaraamat. Eesti Matemaatika Selts. [Annual. xActa Math. Sci. Ser. A Chin. Ed. Acta Mathematica Scientia. Series A. Shuxue Estonian Mathematical Society] Eesti Mat. Selts, Tartu. ISSN 1406-4316. Wuli Xuebao. Chinese Edition. Kexue Chubanshe (Science Press), Beijing. ISSN y Abel Symp. Abel Symposia. Springer, Heidelberg. ISSN 2193-2808. 1003-3998. y Abh. Akad. Wiss. G¨ottingenNeue Folge Abhandlungen der Akademie der xActa Math. Sci. Ser. B Engl. Ed. Acta Mathematica Scientia. Series B. English Wissenschaften zu G¨ottingen.Neue Folge. [Papers of the Academy of Sciences Edition. Sci. Press Beijing, Beijing. ISSN 0252-9602. in G¨ottingen.New Series] De Gruyter/Akademie Forschung, Berlin. ISSN 0930- xActa Math. Sin. (Engl. Ser.) Acta Mathematica Sinica (English Series). 4304. Springer, Berlin. ISSN 1439-8516. y Abh. Akad. Wiss. Hamburg Abhandlungen der Akademie der Wissenschaften xActa Math. Sinica (Chin. Ser.) Acta Mathematica Sinica.
    [Show full text]
  • CURRICULUM VITAE YAAKOV MALINOVSKY May 3, 2020
    CURRICULUM VITAE YAAKOV MALINOVSKY May 3, 2020 University of Maryland, Baltimore County Email: [email protected] Department of Mathematics and Statistics Homepage: http://www.math.umbc.edu/ yaakovm 1000 Hilltop Circle Cell. No.: 240-535-1622 Baltimore, MD 21250 Education: Ph.D. 2009 The Hebrew University of Jerusalem, Israel Statistics M.A. 2002 The Hebrew University of Jerusalem, Israel Statistics with Operation Research B.A. 1999 The Hebrew University of Jerusalem, Israel Statistics (Summa Cum Laude) Experience in Higher Education: From July 1, 2017 , University of Maryland, Baltimore County, Associate Professor, Mathematics and Statistics. Fall 2011 { 2017, University of Maryland, Baltimore County, Assistant Professor, Mathematics and Statistics. September 2009 { August 2011, NICHD, Visiting Fellow. Spring 2003 { Spring 2007, The Hebrew University in Jerusalem, Instructor, Statistics. 2000{2004, The Open University, Israel, Instructor, Mathematics and Computer Science. Scholarships & Awards: 2014 IMS Meeting of New Researchers Travel award 2010 IMS Meeting of New Researchers Travel award 2003 Hebrew University Yochi Wax prize for PhD student 2001 Hebrew University Rector fellowship 1996 Hebrew University Dean's award 1995 Hebrew University Dean's list Academic visiting: Biostatistics Research Branch NIAID (June-August 2015), La Sapienza Rome Department of Mathematics (January 2018), Alfr´edR´enyi Institute of Mathematics Budapest (June 2018), Biostatistics Branch NCI (August 2017, August 2018, August 2019), Haifa University Department
    [Show full text]
  • Distributions You Can Count on . . . but What's the Point? †
    econometrics Article Distributions You Can Count On . But What’s the Point? † Brendan P. M. McCabe 1 and Christopher L. Skeels 2,* 1 School of Management, University of Liverpoo, Liverpool L69 7ZH, UK; [email protected] 2 Department of Economics, The University of Melbourne, Carlton VIC 3053, Australia * Correspondence: [email protected] † An early version of this paper was originally prepared for the Fest in Celebration of the 65th Birthday of Professor Maxwell King, hosted by Monash University. It was started while Skeels was visiting the Department of Economics at the University of Bristol. He would like to thank them for their hospitality and, in particular, Ken Binmore for a very helpful discussion. We would also like to thank David Dickson and David Harris for useful comments along the way. Received: 2 September 2019; Accepted: 25 February 2020; Published: 4 March 2020 Abstract: The Poisson regression model remains an important tool in the econometric analysis of count data. In a pioneering contribution to the econometric analysis of such models, Lung-Fei Lee presented a specification test for a Poisson model against a broad class of discrete distributions sometimes called the Katz family. Two members of this alternative class are the binomial and negative binomial distributions, which are commonly used with count data to allow for under- and over-dispersion, respectively. In this paper we explore the structure of other distributions within the class and their suitability as alternatives to the Poisson model. Potential difficulties with the Katz likelihood leads us to investigate a class of point optimal tests of the Poisson assumption against the alternative of over-dispersion in both the regression and intercept only cases.
    [Show full text]
  • Daniel S. Cooley Education Academic and Professional Experience
    Daniel S. Cooley Professor and Graduate Director Department of Statistics, Colorado State University, Fort Collins, CO 80523-1877 [email protected] www.stat.colostate.edu/∼cooleyd August 4, 2020 Education Ph.D. Applied Mathematics, University of Colorado at Boulder, 2005 M.S. Applied Mathematics, University of Colorado at Boulder, 2002 B.A. Mathematics, University of Colorado at Boulder, 1994 Academic and Professional Experience 2020-Present: Graduate Director; Department of Statistics, Colorado State University. 2018-Present: Professor; Department of Statistics, Colorado State University. 2017-2020: Associate Chair; Department of Statistics, Colorado State University. 2015-Present: Faculty Member; School of Environmental Sustainability, Colorado State University. 2012-2018: Associate Professor; Department of Statistics, Colorado State University. 2007-2012: Assistant Professor; Department of Statistics, Colorado State University. 2005-2007: Postdoctoral Researcher; Department of Statistics, Colorado State University (joint ap- pointment). 2005-2007: Postdoctoral Researcher; Geophysical Statistics Project, National Center for Atmo- spheric Research, (joint appointment). Visiting Appointments Fall 2014: Visiting Scholar; Department of Statistics, University of Washington, Seattle WA. Fall 2011: Research Fellow, Program on Uncertainty Quantification; SAMSI, Research Triangle Park, NC. June 2008: Invited Visitor; Institut de Recherche Math´ematiqueAvanc´ee,Universit´eLouis Pasteur Strasbourg, France. Research Interests Extreme value theory, modeling multivariate extremes, heavy tailed phenomena, spatial statistics, applications in atmospheric science and environmental science. Honors/Awards College of Natural Sciences, Professor Laureate, 2017-2019. ASA Section on Statistics and the Environment Young Researcher Award, 2012. ASA Section on Statistics and the Environment (ENVR) JSM Presentation Award for the paper \Spatial Prediction of Extreme Value Return Levels", August 2005. 1 Publications Research Articles, Refereed 1.
    [Show full text]
  • Dissecting the Multivariate Extremal Index and Tail Dependence
    REVSTAT – Statistical Journal Volume 18, Number 4, October 2020, 501–520 DISSECTING THE MULTIVARIATE EXTREMAL INDEX AND TAIL DEPENDENCE Authors: Helena Ferreira – Centro de Matem´atica e Aplica¸c˜oes(CMA-UBI), Universidade da Beira Interior, Avenida Marquˆesd’Avila e Bolama, 6200-001 Covilh˜a, Portugal [email protected] Marta Ferreira – Center of Mathematics of Minho University, Center for Computational and Stochastic Mathematics of University of Lisbon, Center of Statistics and Applications of University of Lisbon, Portugal [email protected] Received: April 2017 Revised: February 2018 Accepted: June 2018 Abstract: • A central issue in the theory of extreme values focuses on suitable conditions such that the well- known results for the limiting distributions of the maximum of i.i.d. sequences can be applied to stationary ones. In this context, the extremal index appears as a key parameter to capture the effect of temporal dependence on the limiting distribution of the maxima. The multivariate extremal index corresponds to a generalization of this concept to a multivariate context and affects the tail dependence structure within the marginal sequences and between them. As it is a function, the inference becomes more difficult, and it is therefore important to obtain characterizations, namely bounds based on the marginal dependence that are easier to estimate. In this work we present two decompositions that emphasize different types of information contained in the multivariate extremal index, an upper limit better than those found in the literature and we analyse its role in dependence on the limiting model of the componentwise maxima of a stationary sequence. We will illustrate the results with examples of recognized interest in applications.
    [Show full text]
  • A Study of Statistical Design Publications from 1968 Through 1971*
    A STUDY OF STATISTICAL DESIGN PUBLICATIONS FROM 1968 THROUGH 1971* by W. T. Federer and A. J. Federer BU-443-M Abstract A summarization was made of the number of articles and the number of pages devoted to publication of papers on the statistical design and analysis of experi­ ment designs, treatment designs, sample survey design~ sequential designs, and designs of model-building investigations. These articles were devoted mostly to the theoretical developments rather than to the use of a procedure in a specific investigation. Papers in 18 statistical journals for the four years 1968, 1969, 1970, and 1971 were included in the summarization. Six hundred fifty nine articles involving 7009 pages were published on statistical design and analysis during this four-year period. *Biometrics Unit, Cornell University, Ithaca, N. Y. 14850. 4/73 A STUDY OF STATISTICAL DESIGN PUBLICATIONS FROM 1968 THROUGH 1971* by W. T. Federer and A. J. Federer Some consideration has been given to founding an international journal de­ voted to the theory, application, and exposition of statistical design in surveys, experiments, model building, dynamic programming, systems analysis, and related areas. Reasons advanced for having a separate journal on statistical design and associated analyses were: 1. To focus attention on the broad and diverse aspects of statistical design which is one of the three major parts of the definition of statistics. 2. To bring together literature on statistical design which is presently scat­ tered throughout scientific journals. 3. To relieve some of the pressure on journals such as the Journal of Combina­ torial Theory. 4. To encourage publication of statements of unsolved problems and expository papers.
    [Show full text]
  • Curriculum Vita
    Curriculum vita Baolin Wu June 2014 Contact Information Address Division of Biostatistics School of Public Health, University of Minnesota A460 Mayo Building, MMC 303 420 Delaware St SE Minneapolis, MN 55455 Phone (612) 624-0647 Fax (612) 626-0660 Email [email protected] Education 1999-2004 Ph.D. in Biostatistics, Yale University, New Haven, CT. 1995-1999 B.S. in Probability and Statistics, Peking University, Beijing, P.R.China. Positions 2010-Present Tenured Associate Professor, Division of Biostatistics, School of Public Health, University of Minnesota 2004-2010 Tenure-track Assistant Professor, Division of Biostatistics, School of Public Health, University of Minnesota Student Advising • Wei Zhang (ongoing CS PhD): co-advisor. • Xiting Cao (PhD): graduated November 2011 and now worked for Merck. • Ran Li (PhD): graduated December 2011 and now worked for Abbott. • Sang Mee Lee (PhD): graduated July 2011 and now a Research Assistant Professor of Biostatistics at Department of Health Studies, University of Chicago. Won the student paper award at the 2010 ENAR meeting. • Fang Liu (MS): graduated May 2007 and now worked for UnitedHealth. • Jiaqi Yang (MS): graduated May 2008 and now worked for Schlumberger-Doll Research. 1 Professional Service Referee for journals Annals of Applied Statistics; Applied Statistics; Behavior Genetics; Bioinformatics; Biomet- rical Journal; Biometrics; Biometrika; Biostatistics; BMC Bioinformatics; Communications in Statistics - Simulation and Computation; Computational Statistics and Data Analysis; Genome Biology; IEEE/ACM Transactions on Computational Biology and Bioinformatics; Journal of Agricultural, Biological, and Environmental Statistics; Journal of Applied Statis- tics; Journal of Biological Systems; Journal of Biomedicine and Biotechnology; Journal of the American Statistical Association; Nucleic Acids Research; Physiological Genomics; Sta- tistical Applications in Genetics and Molecular Biology; Statistics in Biopharmaceutical Research; Statistics in Medicine; Statistica Sinica; Technometrics; Test.
    [Show full text]
  • Spatio-Temporal Analysis of Regional Unemployment Rates: a Comparison of Model Based Approaches
    REVSTAT – Statistical Journal Volume 16, Number 4, October 2018, 515–536 SPATIO-TEMPORAL ANALYSIS OF REGIONAL UNEMPLOYMENT RATES: A COMPARISON OF MODEL BASED APPROACHES Authors: Soraia Pereira – Faculdade de Ciˆencias, Universidade de Lisboa, Lisboa, Portugal [email protected] Feridun Turkman – Faculdade de Ciˆencias, Universidade de Lisboa, Lisboa, Portugal [email protected] Lu´ıs Correia – Instituto Nacional de Estat´ıstica, Lisboa, Portugal [email protected] Received: April 2016 Revised: November 2016 Accepted: November 2016 Abstract: • This study aims to analyze the methodologies that can be used to estimate the total number of unemployed, as well as the unemployment rates for 28 regions of Portugal, designated as NUTS III regions, using model based approaches as compared to the direct estimation methods currently employed by INE (National Statistical Institute of Portugal). Model based methods, often known as small area estimation methods (Rao, 2003), “borrow strength” from neighbouring regions and in doing so, aim to compensate for the small sample sizes often observed in these areas. Consequently, it is generally accepted that model based methods tend to produce estimates which have lesser variation. Other benefit in employing model based methods is the possibility of including auxiliary information in the form of variables of interest and latent random structures. This study focuses on the application of Bayesian hierarchical models to the Portuguese Labor Force Survey data from the 1st quarter of 2011 to the 4th quar- ter of 2013. Three different data modeling strategies are considered and compared: Modeling of the total unemployed through Poisson, Binomial and Negative Binomial models; modeling of rates using a Beta model; and modeling of the three states of the labor market (employed, unemployed and inactive) by a Multinomial model.
    [Show full text]