Confidence Distribution

Confidence Distribution

Confidence Distribution Xie and Singh (2013): Confidence distribution, the frequentist distribution estimator of a parameter: A Review Céline Cunen, 15/09/2014 Outline of Article ● Introduction ● The concept of Confidence Distribution (CD) ● A classical Definition and the History of the CD Concept ● A modern definition and interpretation ● Illustrative examples ● Basic parametric examples ● Significant (p-value) functions ● Bootstrap distributions ● Likelihood functions ● Asymptotically third-order accurate confidence distributions ● CD, Bootstrap, Fiducial and Bayesian approaches ● CD-random variable, Bootstrap estimator and fiducial-less interpretation ● CD, fiducial distribution and Belief function ● CD and Bayesian inference ● Inferences using a CD ● Confidence Interval ● Point estimation ● Hypothesis testing ● Optimality (comparison) of CDs ● Combining CDs from independent sources ● Combination of CDs and a unified framework for Meta-Analysis ● Incorporation of Expert opinions in clinical trials ● CD-based new methodologies, examples and applications ● CD-based likelihood caluculations ● Confidence curve ● CD-based simulation methods ● Additional examples and applications of CD-developments ● Summary CD: a sample-dependent distribution that can represent confidence intervals of all levels for a parameter of interest ● CD: a broad concept = covers all approaches that can build confidence intervals at all levels Interpretation ● A distribution on the parameter space ● A Distribution estimator = contains information for many types of inference ● An estimator FOR the parameter of interest, instead of an inherent distribution OF the parameter ● Purely frequentist: – The parameter is a fixed, non-random quantity – 95% CI: the true parameter value will be covered by the CIs 95% of the time Fiducial Inference ● “Fisher's biggest blunder”? ● Fiducial – Faith – A fiducial distribution: describes the level of “faith” attached to different values of the unknown parameter ● Sometimes normalized likelihood function are interpreted as distributions of the parameter – ~ the parameters are both fixed and random quantities ● Interpretation of 95% CI: there is a 95% probability that the parameter lies inside the CI. 2.2 Modern definition ● A function Hn() is called a confidence distribution for a parameter θ if: – R1: Hn() is a cummulative distribution function on the parameter space – R2: at the true parameter value θ = θ0 Hn(θ0) as a function of the sample x follows the uniform distribution U[0,1] ● R2 is important! 2.3.1: Basic parametric examples ● A sample: – With σ² known ● The function satisfies the requirements in the CD-definition – R1: it clearly is a cdf – R2: 2.3.2 p-value functions ● One-sided test H0: θ ≤ b vs H1: θ > b ● P-value function: ● Usually CDs or (aCDs): – Because cdf – And because when b=θ0, H0 is true and p-values are uniformly distributed when H0 is true. 2.3.3 Bootstrap distributions ● True (unknown) parameter ● Original sample, with estimator ● Bootstrap sample: a sample of equal size as the original sample, sampled with replacement from the original sample ● Compute the estimator on each bootstrap samples (get many ) --> the Bootstrap Distribution ● The Bootstrap Distribution is an aCD! 2.3.4 Likelihood functions ● Under some mild conditions: normalized likelihood functions are density functions of asymptotic normal CDs – Method for obtaining CDs from likelihoods ● CD-based inference ≈ likelihood inference 3.1 CD-random Variable, ... ● The CD-random variable: – The CD is not a distribution of θ! ● CD-random variable = Bootstrap estimator, and this is useful because it: – Help understanding CD-inference and develop new methods – Clarifies the interpretation of CDs: CDs are not distributions of θ, so therefore it is not a problem that a transformation g(θ) of θ does not generally lead to a CD for g(θ) 3.2 CD, Fiducial Distribution... ● Methods from fiducial reasoning (and from CDs) are supposed to have good statistical performance in the frequentist sense – CIs should have the exact coverage property – For tests: the actual rate of type I error is equal to the specified level of the test ● Many fiducial distributions are CDs ● Fiducial reasoning: provides a procedure for finding CDs 3.3 CD and Bayesian Inference ● Bayesian credible intervals to not possess the exact coverage property – But asymptotically they can obtain it – Then: posterior distributions are aCDs – Bayes methods can produce CDs! ● Benefits with CBs compared to bayesian methods: – Nuisance parameters 4 Inferences using a CD ● CDs contain information for any type of frequentist inference 4.1 Confidence Intervals ● CDs allows us to construct CIs for all levels of α ● CI constructed from CDs have the exact coverage property: 4.2 Point Estimation ● Median: ● Mean: ● Mode: ● Under some conditions, these are consistent estimators (= converging in probability to the true value) 4.3 Hypothesis Testing ● One-sided test: – The support on C: – Reject H0 if the support on C is less than α – The rejection region corresponds to a level α test – Support = p-value (often) ● Two-sided test – Same story, but with a more complicated rejection region 5 Optimality of CDs ● Can have multiple CDs for the same parameter ● A better CD = a CD more concentrated around the true parameter value 6.1 Combination of CDs ... ● k independent studies, estimate the same parameter of interest θ ● Study i with sample xi and CD Hi(.) ● Propose a general recipe for combining k independent CDs: Combining k p-values - “Fisher's method” ● k p-values from k independent studies ● Under H0: ● Remember that ● So that the teststatistic ● Will be distributed under H0 Example ● 2 studies aim to estimate the μ parameter from a normal model with known sigmas ● Study 1: n1=30, ● Study 2: n2=40, ● With CDs: ● We choose ● And then 6.2 Incorporation of Expert Opinions ● How can one incorporate existing knowledge in an analysis? – Bayesian approaches: prior = existing knowledge – In a frequentist setting: CD! ● CD-approach: – CDe: summarizes the existing information/opinions – CDd: from the data – Combines these two by the methods in the last section ● Advantages: – Easy to implement/ computationally cheap – No need for priors on nuisance parameter! – Avoids “the discrepant posterior phenomenon” 7 New methodologies, Examples... ● Different CD-related methods: – Obtaining likelihood functions from CDs – Presenting CDs: the confidence curve – CD-based simulation methods 8 Summary ● CDs is a broad concept which “contains” many well-known notions and results ● Most types of frequentist inference can be derived from CDs ● Advantages of the CD-approach: – Handles nuisance parameters well – Easy to combine information ● Problems (need of further study): – Multivariate CDs – Cases where it difficult to obtain CDs – Model uncertainty/ diagnosis/ selection .

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    26 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us