A Generalized Estimating Equations Approach to Capture-Recapture Closed Population Models: Methods and Applications

Total Page:16

File Type:pdf, Size:1020Kb

A Generalized Estimating Equations Approach to Capture-Recapture Closed Population Models: Methods and Applications A GENERALIZED ESTIMATING EQUATIONS APPROACH TO CAPTURE-RECAPTURE CLOSED POPULATION MODELS: METHODS AND APPLICATIONS Md. Abdus Salam Akanda Thesis presented to the University of Évora to obtain the Degree of Doctor in Mathematics Area of Specialization: Statistics SUPERVISOR: Professor Russell Gerardo Alpizar Jara, PhD ÉVORA, JUNE, 2014 INSTITUTE FOR ADVANCED STUDIES AND RESEARCH Abstract Wildlife population parameters, such as capture or detection probabilities, and density or population size, can be estimated from capture-recapture data. These estimates are of particular interest to ecologists and biologists who rely on ac- curate inferences for management and conservation of the population of interest. However, there are many challenges to researchers for making accurate inferences on population parameters. For instance, capture-recapture data can be considered as binary longitudinal observations since repeated measurements are collected on the same individuals across successive points in times, and these observations are often correlated over time. If these correlations are not taken into account when estimating capture probabilities, then parameter estimates will be biased, possibly producing misleading results. Also, an estimator of population size is generally biased under the presence of heterogeneity in capture probabilities. The use of covariates (or auxiliary variables), when available, has been proposed as an alter- native way to cope with the problem of heterogeneous capture probabilities. In this dissertation, we are interested in tackling these two main problems, (i) when capture probabilities are dependent among capture occasions in closed population capture-recapture models, and (ii) when capture probabilities are heterogeneous among individuals. Hence, the capture-recapture literature can be improved, if we could propose an approach to jointly account for these problems. In summary, this dissertation proposes: (i) a generalized estimating equations (GEE) approach to model possible effects in capture-recapture closed population studies due to cor- relation over time and individual heterogeneity; (ii) the corresponding estimating iv equations for each closed population capture-recapture model; (iii) a comprehen- sive analysis on various real capture-recapture data sets using classical, GEE and generalized linear mixed models (GLMM); (iv) an evaluation of the effect of ac- counting for correlation structures on capture-recapture model selection based on the ‘Quasi-likelihood Information Criterion (QIC)’; (v) a comparison of the per- formance of population size estimators using GEE and GLMM approaches in the analysis of capture-recapture data. The performance of these approaches is eval- uated by Monte Carlo (MC) simulation studies resembling real capture-recapture data. The proposed GEE approach provides a useful inference procedure for esti- mating population parameters, particularly when a large proportion of individuals are captured. For a low capture proportion, it is difficult to obtain reliable esti- mates for all approaches, but the GEE approach outperforms the other methods. Simulation results show that quasi-likelihood GEE provide lower standard error than partial likelihood based on generalized linear modelling (GLM) and GLMM approaches. The estimated population sizes vary on the nature of the existing correlation among capture occasions. Keywords: Capture-recapture experiment; Correlation structure; Generalized estimating equations; Generalized linear mixed models; Heterogeneity; Popula- tion size estimation; Quasi-likelihood information criterion. UMA ABORDAGEM DE EQUAC¸ OES˜ DE ESTIMAC¸ AO˜ GENERALIZADAS PARA MODELOS DE CAPTURA- RECAPTURA EM POPULAC¸ AO˜ FECHADAS: METODOS´ E APLICAC¸ OES˜ Resumo Parˆametros populacionais em esp´ecies de vida selvagens, como probabilidades de captura ou detec¸c˜ao, e abundˆancia ou densidade da popula¸c˜ao, podem ser estima- dos a partir de dados de captura-recaptura. Estas estimativas s˜ao de particular interesse para ecologistas e bi´ologos que dependem de inferˆencias precisas para a gest˜ao e conserva¸c˜ao das popula¸c˜oes. No entanto, h´amuitos desafios para os investigadores fazer inferˆencias precisas de parˆametros populacionais. Por ex- emplo, os dados de captura-recaptura podem ser considerados como observa¸c˜oes longitudinais bin´arias uma vez que s˜ao medi¸c˜oes repetidas coletadas nos mesmos indiv´ıduos em pontos sucessivos no tempo, e muitas vezes correlacionadas. Se essas correla¸c˜oes n˜ao s˜ao levadas em conta ao estimar as probabilidades de cap- tura, as estimativas dos parˆametros ser˜ao tendenciosas e possivelmente produzir˜ao resultados enganosos. Tamb´em, um estimador do tamanho de uma popula¸c˜ao ´e geralmente enviesado na presen¸ca de heterogeneidade das probabilidades de cap- tura. A utiliza¸c˜ao de co-vari´aveis (ou vari´aveis auxiliares), quando dispon´ıveis, tem sido proposta como uma forma de lidar com o problema de probabilidades de captura heterog´eneas. Nesta disserta¸c˜ao, estamos interessados em abordar dois problemas principais em mode1os de captura-recapturar para popula¸c˜ao fechadas, (i) quando as probabilidades de captura s˜ao dependentes entre ocasi˜oes de captura, e (ii) quando as probabilidades de captura s˜ao heterog´eneas entre os indiv´ıduos. Assim, a literatura de captura-recaptura pode ser melhorada, se pud´essemos pro- por uma abordagem conjunta para estes problemas. Em resumo, nesta disserta¸c˜ao vi prop˜oe-se: (i) uma abordagem de estima¸c˜ao de equa¸c˜oes generalizadas (GEE) para modelar poss´ıveis efeitos de correla¸c˜ao temporal e heterogeneidade individual nas probabilidades de captura; (ii) as correspondentes equa¸c˜oes de estima¸c˜ao gen- eralizadas para cada modelo de captura-recaptura em popula¸c˜ao fechadas; (iii) uma an´alise sobre v´arios conjuntos de dodas reais de captura-recaptura usando a abordagem cl´assica, GEE e modelos linear generalizados mistos (GLMM); (iv) uma avalia¸c˜ao do efeito das estruturas de correla¸c˜ao na selec¸c˜ao de modelos de captura-recaptura com base no ‘crit´erio de informa¸c˜ao da Quasi-verossimilhan¸ca (QIC)’; (v) uma compara¸c˜ao da performance das estimativas do tamanho da popula¸c˜ao usando GEE e GLMM. O desempenho destas abordagens ´eavaliado usando simula¸c˜oes Monte Carlo (MC) que se assemelham a dados de captura- recapture reais. A abordagem GEE proposto ´eum procedimento de inferˆencia ´util para estimar parˆametros populacionais, especialmente quando uma grande propor¸c˜ao de indiv´ıduos ´ecapturada. Para uma propor¸c˜ao baixa de capturas, ´e dif´ıcil obter estimativas fi´aveis para todas as abordagens aplicadas, mas GEE su- pera os outros m´etodos. Os resultados das simula¸c˜oes mostram que o m´etodo da quase-verossimilhan¸ca do GEE fornece estimativas do erro padr˜ao menor do que o m´etodo da verossimilhana parcial dos modelos lineares generalizados (GLM) e GLMM. As estimstivas do tamanho da popula¸c˜ao variam de acordo com a na- tureza da correla¸c˜ao existente entre as ocasi˜oes de captura. Palavras-chave: Captura-recaptura experiˆencia; Estrutura de correla¸c˜ao; Equa¸c˜oes de estima¸c˜ao generalizadas; Modelos lineares generalizados mistos; Heterogenei- dade; Estimativa de tamanho da popula¸c˜ao; Crit´erio de informa¸c˜ao quasi- verossim- ilhan¸ca. Dedicated to Shadiq Muntaqim Akanda (Maahin) Maksuda Khanam (Shiuli) Wasif Tahsin Akanda Acknowledgement First and foremost I would like to express my heartfelt gratitude to my supervisor Professor Russell Gerardo Alpizar Jara who gave me the opportunity to work with him. His insights and knowledge have been extremely valuable. He motivated me how to write a paper step by step. His patience, guidance, active support, con- tinuous encouragement, amiability and generosity turned a difficult task into an immersing and enjoyable experience. During these years I have learned so much academic and non-academic things in a hospitable style. His door was always open for me right from the beginning. He is not only my supervisor, but also one of my closest best friends. It was my privilege to work with him. Thanks a lot for all the discussions and for the exhaustive text revisions indeed. I am profoundly grateful to him. I express my sincere gratitude to the European Union for awarding me the Eras- mus Mundus Mobility with Asia-West (EMMA-WEST) scholarship. I am also very grateful to my employer University of Dhaka, Bangladesh who facilitated my enrolment in this program. I feel that I have greatly enhanced my skills and broadened my outlook. The investment was worthwhile and I eagerly look to apply back my skills and contribute to my country. I am thankful to the Re- search Centre of Mathematics and Application (CIMA-UE) at the University of Evora´ and the Funda¸c˜ao Nacional para a Ciˆencia e Tecnologia (FCT), Portugal for providing some financial support to participate in conferences under the project PEst-OE/MAT/UI0117/2011. I am also greatfully acknowledge the comments x and suggestions made by jury members. I am utmost grateful to EMMA-WEST coordinators Professor Md. Shafiqul Islam of University of Dhaka, Professor Imme van den Berg and Professor Jos´eCarlos Tiago de Oliveira of University of Evora´ for their cordial support during my study period. Much appreciation is also due to the Rector at the University of Evora´ for nurturing a wonderful environment that made life here such a stimulating and enjoyable experience. My thanks goes to everybody in the Mobility and Interna- tional Relations Office, Department of Mathematics, Academic Services and Social Services at the University of Evora´ for their active support. Much of the credit of the dissertation goes to my beloved sons Shadiq Muntaqim Akanda (Maahin) & Wasif Tahsin Akanda and my heartiest wife Maksuda Khanam (Shiuli). They supported me in innumerable ways and continues to inspire and encourage me along in life. I would not be here without their love and support. My faith makes all things possible, but their love and support make all things easy. The list of sacrifices that they has made during this research would not fit on the pages of this dissertation.
Recommended publications
  • Mark and Recapture for Event Data When Reporting Probabilities Are Less Than One
    No News is Good News?: Mark and Recapture for Event Data When Reporting Probabilities are Less than One. Cullen S. Hendrix Idean Salehyan* Korbel School of International Studies Department of Political Science University of Denver University of North Texas [email protected] [email protected] Abstract. We discuss a common, but often ignored, problem in event data: underreporting bias. When collecting data, it is often not the case that source materials capture all events of interest, leading to an undercount of the true number of events. To address this issue, we propose a common method first used to estimate the size of animal populations when a complete census is not feasible: mark and recapture. By taking multiple sources into consideration, one can estimate the rate of missing data across sources and come up with an estimate of the true number of events. To demonstrate the utility of the approach, we compare Associated Press and Agence France Press reports on conflict events, as contained in the Social Conflict in Africa Database. We show that these sources capture approximately 76% of all events in Africa, but that the non- detection rate declines dramatically when considering more significant events. We also show through regression analysis that deadly events, events of a larger magnitude, and events with government repression, among others, are significant predictors of overlapping reporting. Ultimately, the approach can be used to correct for undercounting in event data and to assess the quality of sources used. * We would like to thank Christian Davenport, James Nichols, and James Hines for comments on and assistance with earlier drafts.
    [Show full text]
  • Salmon Escapement
    THE DESIGN AND ANALYSIS OF SALMONID TAGGING STUDIES IN THE COLUMBIA BASIN VOLUME XXIV A Statistical Critique of Estimating Salmon Escapement in the Pacific Northwest Prepared by: Amber L. Parsons and John R. Skalski School of Aquatic and Fishery Sciences University of Washington 1325 Fourth Avenue, Suite 1820 Seattle, WA 98101-2509 Prepared for: U.S. Department of Energy Bonneville Power Administration Division of Fish and Wildlife P.O. Box 3621 Portland, OR 97208-3621 Project No. 1989-107-00 Contract No. 00039987 June 2009 The Design and Analysis of Salmonid Tagging Studies in the Columbia Basin Other Publications in this Series Volume I: Skalski, J. R., J. A. Perez-Comas, R. L. Townsend, and J. Lady. 1998. Assessment of temporal trends in daily survival estimates of spring chinook, 1994-1996. Technical report submitted to BPA, Project 89-107-00, Contract DE-BI79-90BP02341. 24 pp. plus appendix. Volume II: Newman, K. 1998. Estimating salmonid survival with combined PIT- CWT tagging. Technical report (DOE/BP-35885-11) to BPA, Project 91-051-00, Contract 87- BI-35885. Volume III: Newman, K. 1998. Experiment designs and statistical models to estimate the effect of transportation on survival of Columbia River system salmonids. Technical report (DOE/BP-35885-11a) to BPA, Project 91-051-00, Contract 87-BI-35885. Volume IV: Perez-Comas, J. A., and J. R. Skalski. Submitted. Preliminary assessment of the effects of pulsed flows on smolt migratory behavior. Technical report to BPA, Project 89-107-00, Contract DE-BI79-90BP02341. Volume V: Perez-Comas, J. A., and J. R.
    [Show full text]
  • Snakes on a Lane
    Trinity River Restoration Program Juvenile Salmonid Outmigrant Monitoring Evaluation, Phase II Final Technical Memorandum December 21, 2009 Prepared for: Trinity River Restoration Program 1313 S. Main Street P.O. Box 1300 Weaverville, California 96093 Prepared by: ESSA Technologies, Ltd. 1765 W 8th Avenue Vancouver, British Columbia V6J 5C6 Simon Fraser University Department of Statistics and Actuarial Science 8888 University Drive Burnaby, British Columbia V5A 1S6 North State Resources, Inc. 5000 Bechelli Lane, Suite 203 Redding, California 96002 (530) 222-5347 USBR TASK ORDER 06A0204097G NSR# 10116-02 © 2009 Trinity River Restoration Program Citation: Schwarz, C.J., D. Pickard, K. Marine and S.J. Bonner. 2009. Juvenile Salmonid Outmigrant Monitoring Evaluation, Phase II– December 2009. Final Technical Memorandum for the Trinity River Restoration Program, Weaverville, CA. 155 pp. + appendices. Executive Summary The Trinity River Restoration Program (hereafter called the Program) was initiated in 1984 to restore and maintain the fish and wildlife stocks of the Trinity River Basin to levels that existed just prior to construction of the CVP Trinity River Division. Using an adaptive management framework the Program hopes to determine the most effective restoration strategies to restore the Trinity River’s natural riverine processes and enhance fish and wildlife populations. The Program’s Integrated Assessment Plan (IAP) identifies key assessments that can be used to evaluate long-term progress toward achieving Program goals and objectives and/or provide short-term feedback to improve Program management actions by testing key hypotheses and reducing management uncertainties. This report addresses juvenile salmonid outmigrant monitoring (a key component of the IAP), evaluating tradeoffs between alternative monitoring methods for assessing smolt abundance, run timing, and condition.
    [Show full text]
  • Juvenile Salmonid Outmigrant Monitoring Evaluation, Phase II
    Trinity River Restoration Program Juvenile Salmonid Outmigrant Monitoring Evaluation, Phase II Final Technical Memorandum December 21, 2009 Prepared for: Trinity River Restoration Program 1313 S. Main Street P.O. Box 1300 Weaverville, California 96093 Prepared by: ESSA Technologies, Ltd. 1765 W 8th Avenue Vancouver, British Columbia V6J 5C6 Simon Fraser University Department of Statistics and Actuarial Science 8888 University Drive Burnaby, British Columbia V5A 1S6 North State Resources, Inc. 5000 Bechelli Lane, Suite 203 Redding, California 96002 (530) 222-5347 USBR TASK ORDER 06A0204097G NSR# 10116-02 © 2009 Trinity River Restoration Program Citation: Schwarz, C.J., D. Pickard, K. Marine and S.J. Bonner. 2009. Juvenile Salmonid Outmigrant Monitoring Evaluation, Phase II– December 2009. Final Technical Memorandum for the Trinity River Restoration Program, Weaverville, CA. 155 pp. + appendices. Executive Summary The Trinity River Restoration Program (hereafter called the Program) was initiated in 1984 to restore and maintain the fish and wildlife stocks of the Trinity River Basin to levels that existed just prior to construction of the CVP Trinity River Division. Using an adaptive management framework the Program hopes to determine the most effective restoration strategies to restore the Trinity River’s natural riverine processes and enhance fish and wildlife populations. The Program’s Integrated Assessment Plan (IAP) identifies key assessments that can be used to evaluate long-term progress toward achieving Program goals and objectives and/or provide short-term feedback to improve Program management actions by testing key hypotheses and reducing management uncertainties. This report addresses juvenile salmonid outmigrant monitoring (a key component of the IAP), evaluating tradeoffs between alternative monitoring methods for assessing smolt abundance, run timing, and condition.
    [Show full text]
  • Mark-Recapture Estimates of Stickleback Population Sizes in Paxton and Priest Lakes in 2016
    Mark-recapture estimates of stickleback population sizes in Paxton and Priest Lakes in 2016 Dolph Schluter, Marius Roesti, and Thor Veen Biodiversity Research Centre and Zoology Department University of British Columbia Introduction We conducted a mark-recapture survey of population (census) sizes of the stickleback species pairs in Priest and Paxton lakes in spring 2016. Trapping success over the years by the Schluter lab and people in other labs had suggested that total numbers were high. Nevertheless, current population sizes were unknown. The present project aimed to remedy this situation. Previous knowledge of population sizes of fish in these lakes comes from a mark-recapture study carried out in Paxton Lake in May, 2005 (M. Nomura and D. Schluter, unpublished). The results of this effort are shown in Table S1 of the Supplement. These estimates are now stale, being more than 10 years out of date. The estimates were based on relatively small samples, leading to highly uncertain estimates (wide confidence intervals). Population sizes of the Priest Lake (Vananda Creek) species pair have never been estimated directly (previous numbers are based on extrapolating from Paxton Lake estimates according to area differences between Priest Lake and Paxton Lake). Methods Study populations Population size was estimated in four populations from Texada Island, BC. These included the Paxton Lake Benthic Threespine (Gasterosteus aculeatus), the Paxton Lake Limnetic Threespine (Gasterosteus aculeatus), the Vananda Creek Benthic Threespine (Gasterosteus aculeatus) from Priest Lake, and the Vananda Creek Limnetic Threespine (Gasterosteus aculeatus) from Priest Lake. All four populations are federally listed as Endangered under the Species at Risk Act (SARA), Access difficulties prevented us from conducting similar studies in Spectacle (Balkwill) Lake and Emily Lake (Vananda Creek), which also contain stickleback species pairs.
    [Show full text]
  • Jaguar Survey and Monitoring Techniques and Methodologies: a Review
    Jaguar Survey and Monitoring Techniques and Methodologies: A Review A Submission to the U.S. Fish and Wildlife Service in Partial Fulfillment of Contract F13PX01563 © Daniel Alarcon John Polisar, Tim G. O’Brien, Sean M. Matthews, Jon P. Beckmann, Eric W. Sanderson, Octavio Cesar Rosas-Rosas, and Carlos A. López-González 27 March 2014 Table of Contents Executive Summary ...................................................................................................................... 1 Background ................................................................................................................................... 2 Survey Designs and Statistical Analyses Applied to Data Collected During Jaguar Surveys, Monitoring, and Research .................................................................................................... 5 Abundance and Occurrence ........................................................................................................ 6 Species Distribution Mapping: Modeling presence-only data .................................................... 7 Logistic Regression: Modeling presence-absence data ............................................................. 10 Occupancy: Modeling detection-non-detection data................................................................. 11 Kinds of Occupancy Models ................................................................................................. 14 Capture-Recapture Models for Density Estimation Using Replicated Count Data .................. 17 Poisson
    [Show full text]
  • Olvido, AE, and LS Blumer. 2005. Introduction to Mark-Recapture
    This article reprinted from: Olvido, A. E., and L. S. Blumer. 2005. Introduction to mark-recapture census methods using the seed beetle, Callosobruchus maculatus. Pages 197-211, in Tested Studies for Laboratory Teaching, Volume 26 (M.A. O'Donnell, Editor). Proceedings of the 26th Workshop/Conference of the Association for Biology Laboratory Education (ABLE), 452 pages. Compilation copyright © 2005 by the Association for Biology Laboratory Education (ABLE) ISBN 1-890444-08-1 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the copyright owner. Use solely at one’s own institution with no intent for profit is excluded from the preceding copyright restriction, unless otherwise noted on the copyright notice of the individual chapter in this volume. Proper credit to this publication must be included in your laboratory outline for each use; a sample citation is given above. Upon obtaining permission or with the “sole use at one’s own institution” exclusion, ABLE strongly encourages individuals to use the exercises in this proceedings volume in their teaching program. Although the laboratory exercises in this proceedings volume have been tested and due consideration has been given to safety, individuals performing these exercises must assume all responsibilities for risk. The Association for Biology Laboratory Education (ABLE) disclaims any liability with regards to safety in connection with the use of the exercises in this volume. The focus of ABLE is to improve the undergraduate biology laboratory experience by promoting the development and dissemination of interesting, innovative, and reliable laboratory exercises.
    [Show full text]