Thesis Reference

Total Page:16

File Type:pdf, Size:1020Kb

Thesis Reference Thesis Contributions to simulation-based estimation methods ORSO, Samuel Abstract The focus of this thesis is twofold. First, it delivers a new look at existing simulation-based methods for statistical inference in parametric problems. Emphasis is placed on finite sample theoretical properties and computational efficiency. In particular, a simple and computationally efficient method for inference is proposed. It is shown that exact inference may be claimed in theory in some situations even though sample size is kept fixed. Numerical examples demonstrate the wide applicability of this method. Second, a general class of flexible models for dependent random phenomena is studied. Emphasis is placed on problems of point estimations due to the presence of outliers or because of the underlying computational burden. To tackle these issues, a new multi-step robust and computationally efficient estimator is proposed. Asymptotic properties are studied along with illustrative examples. Reference ORSO, Samuel. Contributions to simulation-based estimation methods. Thèse de doctorat : Univ. Genève, 2019, no. GSEM 66 DOI : 10.13097/archive-ouverte/unige:121536 URN : urn:nbn:ch:unige-1215361 Available at: http://archive-ouverte.unige.ch/unige:121536 Disclaimer: layout of this document may differ from the published version. 1 / 1 Contributions to simulation-based estimation methods by Samuel Orso A thesis submitted to the Geneva School of Economics and Management, University of Geneva, Switzerland, in fulfillment of the requirements for the degree of PhD in Statistics Members of the thesis committee: Prof. Maria-Pia Victoria-Feser, Co-advisor, University of Geneva Prof. St´ephane Guerrier, Co-advisor, Pennsylvania State University Prof. Stefan Sperlich, Chair, University of Geneva Prof. Yanyuan Ma, Pennsylvania State University Thesis No. 66 January 2019 La Facult´ed’´economie et de management, sur pr´eavis du jury, a autoris´el’impression de la pr´esente th`ese, sans entendre, par-l`a, ´emettre aucune opinion sur les propositions qui s’y trouvent ´enonc´ees et qui n’engagent que la responsabilit´ede leur auteur. Gen`eve, le 29 janvier 2019 Le doyen, Marcelo OLARREAGA Impression d’apr`es le manuscrit de l’auteur. Acknowledgements This thesis has greatly benefitted from the academic and moral support of many, and as hard as it seems, I would like to express my gratitude to all of them. Foremost, I would like to thank my two supervisors Prof. Maria-Pia Victoria-Feser and Prof. St´ephane Guerrier. I would not be writting these lines if it was not for them. They made me discover statistics as an under-graduate student in business administration and have never let me down ever since. I am not sure I would have had their patience with someone who had such a poor background. St´ephane has been very influencial in my decisions to undertake studies in statistics and to pursue a PhD. He has always supported me and is very inspirational. Maria-Pia has never stopped me for trying my own way for research, better, she made it possible regardless of the circumstances. I admire her commitment and all the positive energies she puts into work. I would like to thank Prof. Stefan Sperlich, the chair of my jury, for the interest he took in my work, for his constructive feedbacks and for his professionalism. Likewise, I would like to thank Prof. Yanyuan Ma who honoured me by accepting to be my external expert and whose instructive comments improved the content of this thesis. I would also like to thank my incredible colleagues. I am forever indebted to Marco, a.k.a. Marc-small-oh, and Marc-Olivier, a.k.a. Marc-Big-Oh, for their valuable discus- sions from which this thesis has benefitted. I shared unforgettable moments with the “dˆıner de cons” team: Kustrim, Elise, Rose, Marc-Olivier, Mattia and Dany. I enjoyed sportive time with the “badminton” team: Rami, Walid, Mark, Haotian, Mattia and Dany. Throughout these years, I have always loved sharing ideas with my colleagues at the GSEM. Among them, I would like to acknowledge: Jean-Christophe, Jonathan, Ozgu, Roberto, Sebastian, Justine, Ingrid, Steffen, Mark, Pierre-Yves, Linda, Setareh, Laura, Guillaume, Cesare, Julien, Alban, Jiajun, Benjamin, Kasia, Mucyo, Yuming et al. My office mates Haotian and Ga¨etan were very supportive all along my dissertation and I specially want to thank them. I would like to thank my parents, Charly and Corine, my three sisters and their partners, Jo¨elle, Lucien, Myriam, Estelle and Sergio, for their love and for always being supportive. A special though goes to my two lovely little nieces Anouk and Justine. This thesis would have been impossible without the support of my in-laws Jordan, Margarita and Aleks. On a great deal of occasions, Margarita has volunteered to compensate my lack of availability to my family. This thesis was one of the last thing, if not the last, that I started before my son Isaac was born. He encouraged me a lot for example by saying: “Mon papa fait des statistiques” 1, or with the many drawings that covers half of my wall at the office. My daughter Tania was born later. Her arrival gave me the courage to attack the last marathon that represents the writing of a thesis. Her good sleeps were the encouragement I needed. Many of the ideas that came up to me during the thesis 1French for: “My dad does statistics” ii happened during the bedtime. These little reflections of myself have greatly impacted this thesis. Finally, I would like to thank my soulmate and wife Linda for all these years of sharing and giving. She shared my ups and down. She never stopped believing in me. She gave me all the forces that leads me to say today: “I have Ph.inisheD”. Abstract The focus of this thesis is twofold. First, it delivers a new look at existing simulation- based methods for statistical inference in parametric problems. Emphasis is placed on finite sample theoretical properties and computational efficiency. In particular, a simple and computationally efficient method for inference is proposed. It is shown that exact inference may be claimed in theory in some situations even though sample size is kept fixed. Numerical examples demonstrate the wide applicability of this method. Second, a general class of flexible models for dependent random phenomena is studied. Emphasis is placed on problems of point estimations due to the presence of outliers or because of the underlying computational burden. To tackle these issues, a new multi-step robust and computationally efficient estimator is proposed. Asymptotic properties are studied along with illustrative examples. R´esum´e Cette th`ese comporte deux parties. Premi`erement, elle d´elivre un nouveau regard sur diff´erentes m´ethodes par simulations d´evelopp´ees pour faire de l’inf´erence statistique sur des probl`emes param´etriques. Le focus est port´esur les propri´et´es th´eoriques en ´echantillon fini et les probl`emes computationnelles. En particulier, une m´ethode simple et computationnellement ´efficiente est propos´ee. Il est d´emontr´equ’il est possible d’obtenir une inf´erence exacte dans certaines situations tout en gardant la taille d’´echantillon fixe. Des examples num´eriques illustrent le vaste champ d’application de cette m´ethode. Deuxi`emement, une classe g´en´erale de mod`eles pour des variables al´eatoires d´ependantes est ´etudi´e. Les sujets principaux sont les probl`emes d’estimation ponctuelles dˆus `a la pr´esence de donn´ees aberrantes ou `acause de probl`emes computationnelles. Afin d’adresser ces questions, un nouvel estimateur robuste et computationnellement efficient est propos´e. Ses propri´et´es asymptotiques sont ´etudi´ees. Des examples viennent illustrer ces r´esultats. Contents Acknowledgements i Abstract iii R´esum´e v Introduction 1 List of notation 3 1 SwiZs: Switched Z-estimators 5 1.1 Introduction ................................... 5 1.2 Setup ....................................... 7 1.3 Equivalent methods ............................... 8 1.4 Exact frequentist inference in finite sample .................. 15 1.5 Asymptotic properties ............................. 20 1.6 Examples .................................... 25 1.7 Simulation study ................................ 33 Appendices 67 1.A Technical results ................................ 67 1.B Finite sample .................................. 68 1.C Asymptotics ................................... 72 1.D Additional simulation results .......................... 76 1.D.1 Lomax distribution ........................... 76 1.D.2 Random intercept and random slope linear mixed model ...... 83 1.D.3 M/G/1 queueing model ........................ 88 2 Bounded-Influence Robust Estimation of Copulae 91 2.1 Introduction ................................... 91 2.2 Framework .................................... 93 2.3 Asymptotic results ............................... 95 2.4 Some practical aspects for indirect inference procedure ...........100 2.4.1 Point estimator .............................100 2.4.2 Inference .................................101 2.5 Bounding the Influence Function .......................102 2.6 A weighted maximum likelihood indirect estimator .............104 2.7 Simulation Study ................................104 2.7.1 Bivariate Clayton copula ........................105 viii Contents 2.7.2 Factor Copula ..............................110 2.8 Application to income mobility ........................115 Appendices 131 2.A Intermediate results ...............................131 2.B Main proofs ...................................132 2.C Additional results
Recommended publications
  • Sequential Monte Carlo Methods for Inference and Prediction of Latent Time-Series
    SSStttooonnnyyy BBBrrrooooookkk UUUnnniiivvveeerrrsssiiitttyyy The official electronic file of this thesis or dissertation is maintained by the University Libraries on behalf of The Graduate School at Stony Brook University. ©©© AAAllllll RRRiiiggghhhtttsss RRReeessseeerrrvvveeeddd bbbyyy AAAuuuttthhhooorrr... Sequential Monte Carlo Methods for Inference and Prediction of Latent Time-series A Dissertation presented by Iñigo Urteaga to The Graduate School in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy in Electrical Engineering Stony Brook University August 2016 Stony Brook University The Graduate School Iñigo Urteaga We, the dissertation committee for the above candidate for the Doctor of Philosophy degree, hereby recommend acceptance of this dissertation. Petar M. Djuri´c,Dissertation Advisor Professor, Department of Electrical and Computer Engineering Mónica F. Bugallo, Chairperson of Defense Associate Professor, Department of Electrical and Computer Engineering Yue Zhao Assistant Professor, Department of Electrical and Computer Engineering Svetlozar Rachev Professor, Department of Applied Math and Statistics This dissertation is accepted by the Graduate School. Nancy Goroff Interim Dean of the Graduate School ii Abstract of the Dissertation Sequential Monte Carlo Methods for Inference and Prediction of Latent Time-series by Iñigo Urteaga Doctor of Philosophy in Electrical Engineering Stony Brook University 2016 In the era of information-sensing mobile devices, the Internet- of-Things and Big Data, research on advanced methods for extracting information from data has become extremely critical. One important task in this area of work is the analysis of time-varying phenomena, observed sequentially in time. This endeavor is relevant in many applications, where the goal is to infer the dynamics of events of interest described by the data, as soon as new data-samples are acquired.
    [Show full text]
  • Gareth Roberts
    Publication list: Gareth Roberts [1] Hongsheng Dai, Murray Pollock, and Gareth Roberts. Bayesian fusion: Scalable unification of distributed statistical analyses. arXiv preprint arXiv:2102.02123, 2021. [2] Christian P Robert and Gareth O Roberts. Rao-blackwellization in the mcmc era. arXiv preprint arXiv:2101.01011, 2021. [3] Dootika Vats, Fl´avioGon¸calves, KrzysztofLatuszy´nski,and Gareth O Roberts. Efficient Bernoulli factory MCMC for intractable likelihoods. arXiv preprint arXiv:2004.07471, 2020. [4] Marcin Mider, Paul A Jenkins, Murray Pollock, Gareth O Roberts, and Michael Sørensen. Simulating bridges using confluent diffusions. arXiv preprint arXiv:1903.10184, 2019. [5] Nicholas G Tawn and Gareth O Roberts. Optimal temperature spacing for regionally weight-preserving tempering. arXiv preprint arXiv:1810.05845, 2018. [6] Joris Bierkens, Kengo Kamatani, and Gareth O Roberts. High- dimensional scaling limits of piecewise deterministic sampling algorithms. arXiv preprint arXiv:1807.11358, 2018. [7] Cyril Chimisov, Krzysztof Latuszynski, and Gareth Roberts. Adapting the Gibbs sampler. arXiv preprint arXiv:1801.09299, 2018. [8] Cyril Chimisov, Krzysztof Latuszynski, and Gareth Roberts. Air Markov chain Monte Carlo. arXiv preprint arXiv:1801.09309, 2018. [9] Paul Fearnhead, Krzystof Latuszynski, Gareth O Roberts, and Giorgos Sermaidis. Continuous-time importance sampling: Monte Carlo methods which avoid time-discretisation error. arXiv preprint arXiv:1712.06201, 2017. [10] Fl´avioB Gon¸calves, Krzysztof GLatuszy´nski,and Gareth O Roberts. Exact Monte Carlo likelihood-based inference for jump-diffusion processes. arXiv preprint arXiv:1707.00332, 2017. [11] Andi Q Wang, Murray Pollock, Gareth O Roberts, and David Stein- saltz. Regeneration-enriched Markov processes with application to Monte Carlo. to appear in Annals of Applied Probability, 2020.
    [Show full text]
  • By Alex Reinhart Yosihiko Ogata
    STATISTICAL SCIENCE Volume 33, Number 3 August 2018 A Review of Self-Exciting Spatio-Temporal Point Processes and Their Applications .................................................................................Alex Reinhart 299 Comment on “A Review of Self-Exciting Spatiotemporal Point Process and Their Applications”byAlexReinhart.............................................Yosihiko Ogata 319 Comment on “A Review of Self-Exciting Spatio-Temporal Point Process and Their Applications”byAlexReinhart..........................................Jiancang Zhuang 323 Comment on “A Review of Self-Exciting Spatio-Temporal Point Processes and Their Applications”byAlexReinhart..................................Frederic Paik Schoenberg 325 Self-Exciting Point Processes: Infections and Implementations .............Sebastian Meyer 327 Rejoinder: A Review of Self-Exciting Spatio-Temporal Point Processes and Their Applications..................................................................Alex Reinhart 330 On the Relationship between the Theory of Cointegration and the Theory of Phase Synchronization..................Rainer Dahlhaus, István Z. Kiss and Jan C. Neddermeyer 334 Confidentiality and Differential Privacy in the Dissemination of Frequency Tables ......................Yosef Rinott, Christine M. O’Keefe, Natalie Shlomo and Chris Skinner 358 Piecewise Deterministic Markov Processes for Continuous-Time Monte Carlo .....................Paul Fearnhead, Joris Bierkens, Murray Pollock and Gareth O. Roberts 386 Fractionally Differenced Gegenbauer Processes
    [Show full text]
  • Network Cross-Validation by Edge Sampling
    Biometrika (2020), 107,2,pp. 257–276 doi: 10.1093/biomet/asaa006 Printed in Great Britain Advance Access publication 4 April 2020 Network cross-validation by edge sampling By TIANXI LI Department of Statistics, University of Virginia, B005 Halsey Hall, 148 Amphitheater Way, Charlottesville, Virginia 22904, U.S.A. [email protected] Downloaded from https://academic.oup.com/biomet/article/107/2/257/5816043 by guest on 01 July 2021 ELIZAVETA LEVINA AND JI ZHU Department of Statistics, University of Michigan, 459 West Hall, 1085 South University Avenue, Ann Arbor, Michigan 48105, U.S.A. [email protected] [email protected] Summary While many statistical models and methods are now available for network analysis, resampling of network data remains a challenging problem. Cross-validation is a useful general tool for model selection and parameter tuning, but it is not directly applicable to networks since splitting network nodes into groups requires deleting edges and destroys some of the network structure. In this paper we propose a new network resampling strategy, based on splitting node pairs rather than nodes, that is applicable to cross-validation for a wide range of network model selection tasks. We provide theoretical justification for our method in a general setting and examples of how the method can be used in specific network model selection and parameter tuning tasks. Numerical results on simulated networks and on a statisticians’ citation network show that the proposed cross-validation approach works well for model selection. Some key words: Cross-validation; Model selection; Parameter tuning; Random network. 1. Introduction Statistical methods for analysing networks have received a great deal of attention because of their wide-ranging applications in areas such as sociology, physics, biology and the medical sciences.
    [Show full text]
  • Meetings Inc
    Volume 47 • Issue 8 IMS Bulletin December 2018 International Prize in Statistics The International Prize in Statistics is awarded to Bradley Efron, Professor of Statistics CONTENTS and Biomedical Data Science at Stanford University, in recognition of the “bootstrap,” 1 Efron receives International a method he developed in 1977 for assess- Prize in Statistics ing the uncertainty of scientific results 2–3 Members’ news: Grace Wahba, that has had extraordinary and enormous Michael Jordan, Bin Yu, Steve impact across many scientific fields. Fienberg With the bootstrap, scientists were able to learn from limited data in a 4 President’s column: Xiao-Li Meng simple way that enabled them to assess the uncertainty of their findings. In 6 NSF Seeking Novelty in Data essence, it became possible to simulate Sciences Bradley Efron, a “statistical poet” a potentially infinite number of datasets 9 There’s fun in thinking just from an original dataset, and in looking at the differences, measure the uncertainty of one step more the result from the original data analysis. Made possible by computing, the bootstrap 10 Peter Hall Early Career Prize: powered a revolution that placed statistics at the center of scientific progress. It helped update and nominations to propel statistics beyond techniques that relied on complex mathematical calculations 11 Interview with Dietrich or unreliable approximations, and hence it enabled scientists to assess the uncertainty Stoyan of their results in more realistic and feasible ways. “Because the bootstrap is easy for a computer to calculate and is applicable in an 12 Recent papers: Statistical Science; Bernoulli exceptionally wide range of situations, the method has found use in many fields of sci- ence, technology, medicine, and public affairs,” says Sir David Cox, inaugural winner Radu Craiu’s “sexy statistics” 13 of the International Prize in Statistics.
    [Show full text]
  • Final Report (PDF)
    Bioinformatics, Genetics and Stochastic Computation: Bridging the Gap July 1-6 2007 Closing report The meeting that took place at BIRS, Banff, on July 1-6 2007, gathered 33 people from America (Canada and USA), Europe, Japan and Australia. It was quite successful, thanks to the superb organisation of the Center and the friendly help of the staff, and all invited participants showed up for the meeting. The talks were uniformly well-attended by an overwhelming number of the participants who stayed the whole week, with two exceptions only. There was no mountain accident or bear encounter to deplore, and the organisers were not aware of any complaint from the participants, during or after the meeting, but instead gathered many thanks and requests for a follow-up meeting. Several collaborations were initiated during the meeting as well. OBJECTIVES On the one hand, there is an explosion of complex statistical models appearing in genetics and bioin- formatics. These typically highly structured systems are unfortunately very difficult to fit. On the other hand, there has been recently significant advances on MC methods but most of these methods are unknown to the applied community and/or formulated in ways that are too theoretical for direct application. By providing scientists with improved inferential methods it would allow them to consider richer models, which are more realistic than those dictated by computational constraints. The exchanges between the applied and methodological communities remain surprisingly limited. We believe that this workshop would be an ideal place to (1) Gather people from different research communities and foster links between these communities (applied, methodological and theoretical).
    [Show full text]
  • From the New Editor: Xuming He
    Volume 36 • Issue 1 IMS Bulletin January/February 2007 From the New Editor: Xuming He or almost 20 years, I have counted of 2007 features new CONTENTS on the IMS Bulletin to provide assistant professors at 1 Editor’s Message me with news and information the Statistics depart- Fabout the IMS and about the things that ment, North Carolina 2–3 IMS Members’ News: the IMS members care about. When State University. Jianqing Fan, Eric van Zwet Xuming He, I wanted to find out something about I take special University of Illinois, Nominations for 3 career opportunities or meetings, I turned pride in the growth Urbana–Champaign Spiegelman Award to the Bulletin, either its handy hard copy and renewal of our 4 IMS Membership News or its convenient electronic copy at the profession, and I 6 Apply/nominate: Laha, IMS website. hope that the Bulletin will be part of the Carver Awards, IMS The Bulletin had a face-lift five years success. In the future issues, you will find Fellowship ago, when my predecessor Bernard contributed thoughts and ideas on train- 7 Obituary: H Samuel Wieand Silverman became the Editor. I have ing and mentoring, journal publications, heard of only positive comments about and other issues of interest to many of AOAS; Gift memberships 8 it ever since, so our new editorial team our readers. 10 Department profile: NCSU will set a modest goal to improve, not The success of the Bulletin itself to change, what the Bulletin has been depends on the support of all IMS 12 Department News: UC Irvine, Johns Hopkins doing.
    [Show full text]
  • Oxford University Department of Statistics a Brief History John Gittins
    Oxford University Department of Statistics A Brief History John Gittins 1 Contents Preface 1 Genesis 2 LIDASE and its Successors 3 The Department of Biomathematics 4 Early Days of the Department of Statistics 5 Rapid Growth 6 Mathematical Genetics and Bioinformatics 7 Other Recent Developments 8 Research Assessment Exercises 9 Location 10 Administrative, Financial and Secretarial 11 Computing 12 Advisory Service 13 External Advisory Panel 14 Mathematics and Statistics BA and MMath 15 Actuarial Science and the Institute of Actuaries (IofA) 16 MSc in Applied Statistics 17 Doctoral Training Centres 18 Data Base 19 Past and Present Oxford Statistics Groups 20 Honours 2 Preface It has been a pleasure to compile this brief history. It is a story of great achievement which needed to be told, and current and past members of the department have kindly supplied the anecdotal material which gives it a human face. Much of this achievement has taken place since my retirement in 2005, and I apologise for the cursory treatment of this important period. It would be marvellous if someone with first-hand knowledge was moved to describe it properly. I should like to thank Steffen Lauritzen for entrusting me with this task, and all those whose contributions in various ways have made it possible. These have included Doug Altman, Nancy Amery, Shahzia Anjum, Peter Armitage, Simon Bailey, John Bithell, Jan Boylan, Charlotte Deane, Peter Donnelly, Gerald Draper, David Edwards, David Finney, Paul Griffiths, David Hendry, John Kingman, Alison Macfarlane, Jennie McKenzie, Gilean McVean, Madeline Mitchell, Rafael Perera, Anne Pope, Brian Ripley, Ruth Ripley, Christine Stone, Dominic Welsh, Matthias Winkel and Stephanie Wright.
    [Show full text]
  • Publication List: Gareth Roberts
    Publication list: Gareth Roberts [1] Gareth O Roberts and Jeffrey S Rosenthal. Hitting time and convergence rate bounds for symmetric langevin diffusions. 2016. [2] Alain Durmus, Sylvain Le Corff, Eric Moulines, and Gareth O Roberts. Optimal scaling of the random walk metropolis algorithm under lp mean differentiability. arXiv preprint arXiv:1604.06664, 2016. [3] Joris Bierkens, Paul Fearnhead, and Gareth Roberts. The zig-zag pro- cess and super-efficient sampling for bayesian analysis of big data. arXiv preprint arXiv:1607.03188, 2016. [4] Alexandros Beskos, Gareth Roberts, Alexandre Thiery, and Natesh Pillai. Asymptotic analysis of the random-walk metropolis algorithm on ridged densities. arXiv preprint arXiv:1510.02577, 2015. [5] Alain Durmus, Gareth O Roberts, Gilles Vilmart, and Konstantinos C Zygalakis. Fast Langevin based algorithm for MCMC in high dimensions. arXiv preprint arXiv:1507.02166, 2015. [6] Paul J Birrell, Lorenz Wernisch, Brian D M Tom, Gareth O Roberts, Richard G Pebody, and Daniella De Angelis. Efficient real-time moni- toring of an emerging influenza epidemic: how feasible? arXiv preprint arXiv:1608.05292, 2015. [7] Murray Pollock, Paul Fearnhead, Adam M Johansen, and Gareth O Roberts. The scalable langevin exact algorithm: Bayesian inference for big data. arXiv preprint arXiv:1609.03436, 2016. [8] Sergios Agapiou, Gareth O Roberts, and Sebastian J Vollmer. Unbiased Monte Carlo: posterior estimation for intractable/infinite-dimensional models. Bernoulli, to appear, 2016. arXiv preprint arXiv:1411.7713. [9] Omiros Papaspiliopoulos, Gareth O Roberts, and Kasia B Taylor. Ex- act sampling of diffusions with a discontinuity in the drift. Advances in Applied Probability, 48 Issue A:249{259, 2016.
    [Show full text]
  • IMS Grows Internationally
    Volume 37 • Issue 5 IMS Bulletin June 2008 IMS Grows Internationally From the IMS President: Jianqing Fan CONTENTS My most frequently visited site is the 1 IMS Grows Internationally IMS website, http://imstat.org. It contains useful information about this academic 2 Members’ News: Tom Liggett, Adrian Smith organization, with significant improve- ments still under way. The first sentence Meeting report: Composite 4 on the membership page constantly Likelihood Methods reminds me of its mission: “The IMS is IMS China special articles: an international professional and scholarly 5 Louis Chen: Connecting society devoted to the development, dis- China semination, and application of statistics and probability”. This timeless mission 6 Nanny Wermuth: Contacts The IMS President, Jianqing Fan in China statement reflects the great vision and wisdom of IMS’s founders. Reading on, the second sentence states that, “The Institute Peter Hall: China and 7 currently has about 4,500 members in all parts of the world”. It pronounces proudly Mathematical Statistics that at least 4,500 probabilists and statisticians share our vision. I could not help but 8 Meet the IMS China wonder that there must be a lot more scholars in the world who share this vision, but Committee do not yet have proper means to join the IMS. 12 Zhi-Ming Ma: In Founded in 1935 in the United States, IMS has a strong base of constituents in the Conversation US, consistently delivering a glory history of excellence in scholarship. This can also 13 Rick Durrett: Remembering give the impression that IMS is US-based institution, which is somewhat at odds with its own mission.
    [Show full text]
  • Particle Filters for Mixture Models with an Unknown Number of Components
    Statistics and Computing 14: 11–21, 2004 C 2004 Kluwer Academic Publishers. Manufactured in The Netherlands. Particle filters for mixture models with an unknown number of components PAUL FEARNHEAD Department of Mathematics and Statistics, Lancaster University, Lancaster, LA1 4YF Received April 2002 and accepted July 2003 We consider the analysis of data under mixture models where the number of components in the mixture is unknown. We concentrate on mixture Dirichlet process models, and in particular we consider such models under conjugate priors. This conjugacy enables us to integrate out many of the parameters in the model, and to discretize the posterior distribution. Particle filters are particularly well suited to such discrete problems, and we propose the use of the particle filter of Fearnhead and Clifford for this problem. The performance of this particle filter, when analyzing both simulated and real data from a Gaussian mixture model, is uniformly better than the particle filter algorithm of Chen and Liu. In many situations it outperforms a Gibbs Sampler. We also show how models without the required amount of conjugacy can be efficiently analyzed by the same particle filter algorithm. Keywords: Dirichlet process, Gaussian mixture models, Gibbs sampling, MCMC, particle filters 1. Introduction which includes examples of their use in practice, is given by Doucet, de Freitas and Gordon (2001). Monte Carlo methods are commonly used tools for Bayesian Due to the success of particle filters for dynamic problems, inference. Markov chain Monte Carlo (MCMC; see Gilks et al. there is current interest in applying them to batch problems as 1996, for an introduction) is often the method of choice for batch a competitive alternative to MCMC (e.g.
    [Show full text]
  • Statistical Modelling of Citation Exchange Between Statistics Journals
    52 Discussion on the Paper by Varin, Cattelan and Firth Pengsheng Ji (University of Georgia, Athens), Jiashun Jin (Carnegie Mellon University, Pittsburgh) and Zheng Tracy Ke (University of Chicago) We congratulate Varin, Cattelan and Firth for a very stimulating paper. They use the Stigler model on cross-citation data and provide a mode-based method to rank statistical journals. Their approach allows for evaluation of uncertainty of rankings and sheds light on how to avoid overinterpretation of the insignificant difference between journal rankings. In a related context, we study social networks for authors (instead of journals) with a data set that we collect (based on all papers in the Annals of Statistics, Biometrika, the Journal of the American Statistical Association and the Journal of the Royal Statistical Society, Series B, 2003–2012). The data set will be publicly available soon. The data set provides a fertile ground for studying networks for statisticians. In Ji and Jin (2014) we have presented results including (a) ‘hot’ authors and papers, (b) many meaningful communities and (c) research trends. Here, we report results only on community detection of the citation network (for authors). Intuitively, network communities are groups of nodes that have more edges within than across (Jin, 2015). The goal of community detection is to identify such groups (i.e. clustering). We have analysed the citation network with the method of directed scores (Ji and Jin, 2014; Jin, 2015) and identified three meaningful communities; Fig. 16. The first
    [Show full text]