MATERIALES Y MÉTODOS - Impactación canina maxilar en pacientes clase III esqueletal a causa del

This section introduces the Bayesian inference approach to estimate a copula parameter. For didactic reasons, we concentrate first on the bivariate case with known margins, i.e. we introduce a method for estimation of the copula parameter of a random vector (X1,X2), where the marginsF1andF2ofX1andX2are

continuous and known. The multidimensional case with unknown margins will be described in Section 6.

For the remainder of the paper, we denote withρ(_·,_·) a fixed dependence measure, i.e.ρmaps a pair of random variables to a value inR(in most practical cases the interval of possible values is [₋1, 1] or [0, 1]), which is then called their “degree of dependence”. We assume that ρ is independent of the margins, i.e.ρ(X1,X2)=ρ(t1(X1),t2(X2)) for all random vectors (X1,X2) and strictly

increasing transformationst1andt2. Commonly used dependence measures

satisfying this condition are for example Kendall’s tau or asymptotic tail dependence. Other dependence measures can be found in McNeil et al. (2005).

Our method allows statistical inference in the following situation.

Situation 1. The following three sources of information are given.

1. A setO _{of N independent observations}₍_X_1,_n_,_X_2,_n₎_{, n}₌_{1, . . . ,}_{N , of}₍_X₁_,_X₂₎_. 2. From K experts, a setE _{of point estimates}_ϕ_k_of_ρ₍_X₁_,_X₂₎_{, k}₌_{1, . . . ,}_{K .} 3. An additional prior source of information (e.g. regulatory guidelines) which

provides an estimate ofρ(X1,X2).

Letθ₌ρ(X1,X2) be the unknown value of the dependence measure applied

to (X1,X2). The direct elicitation of the value of the canonical copula parameter

is not feasible as this quantity is not familiar to the expert in terms of the way he collects and evokes his knowledge. However, a question that asks for the value of a dependence measure can be formulated in a way such that substantial answers can be given even if experts are unfamiliar with probability theory. For that reason, we will parameterize the copula through the dependence measure and ask experts to estimate this dependence measure.

130 Paper C

Definition 2. LetC₂₌_{_C_θ_:_θ_∈Θ_}_{denote a family of absolutely continuous bi-} variate copulas Cθ: [0, 1]2→[0, 1], parameterized through the dependence mea-

sureρ. I.e.ρ(U1,U2)=θfor vectors(U1,U2)withP[U1≤u1,U2≤u2]=Cθ(u1,u2). We denote with c(_·|θ)the density and withΘ_⊂_R_{the set of admissible parame-} ters.

Most combinations of commonly used dependence measures (Kendall’s tau, Spearman’s rho, asymptotic tail dependence) and bivariate copula classes that are indexed by a real-valued parameter (Clayton, Gumbel, Frank, Gaussian, t with fixed degrees of freedom) satisfy the requirements of Definition 2. Note that the parametrization in terms of the dependence measure also guarantees that the copulas are identifiable.

The following proposition allows statistical inference onθ, where the Bayes- ian approach combines all available information given in Situation 1. We will assume that the copula of (X1,X2) is contained inC2and the experts give esti-

mates ofρ(X1,X2). As experts often base their judgement on common knowl-

edge, we allow experts to be dependent, too, and model their dependence struc- ture through a copula. We will fully describe the modeling of expert assessments in Section 5.

For a detailed introduction to Bayesian inference we refer the reader to Bernardo and Smith (1994). For ease of notation, we denote withp(_·) all (un)conditional densities of those random variables with no specifically designated density.

Proposition 3. Suppose Situation 1 holds along with the following assumptions. A1 Conditionally on the degree of dependence θ, (X1,X2) has a distribution given by the copula C ₌Cθ∈C2and the fixed, known margins F1and F2. I.e.,

P[X1≤x1,X2≤x2|θ]=Cθ(F1(x1),F2(x2)) .

A2 Conditionally on θ, experts point estimates ϕk have a joint distribution

given by

P[ϕ1≤x1, . . . ,ϕK ≤xK|θ]=CE(G1(x1|θ), . . . ,GK(xK|θ)) ,

where Gk(·|θ)is the conditional distribution of the k-th expert and CE de-

scribes the dependence between experts. Assume both CE and Gk have a

density, denoted by cE and gk(·|θ), respectively.

A3 Conditionally onθ,O _andE _{are independent.}

A4 The prior source of information can be translated into a prior density p(θ) :

Arbenz, Canestraro (2012) 131

Then the posterior density p(θ_|O_,E₎_of_θ_givenO_andE _satisfies

p(θ_|O_,E₎_∝_p₍_θ₎ N Y n₌1 c¡F1(X1,n),F2(X2,n) ¯ ¯θ¢ ·cE¡G1(ϕ1|θ), . . . ,GK(ϕK|θ) ¢_YK k=1 gk(ϕk|θ), (1)

where the symbol_∝denotes proportionality with respect toθ.

Proof. Bayes’ Theorem leads top(θ_|O_,E₎_p₍O_,E₎₌_p₍O_,E_|_θ₎_p₍_θ_{), which we}

can write asp(θ_|O_,E₎_∝_p₍O_,E_|_θ₎_p₍_θ_{). Due to A3, we get}_p₍O_,E_|_θ₎₌_p₍O_|_θ₎_p₍E_|_θ_).

As a consequence of A1 and A2 the quantitiesp(O_|_θ_{) and}_p₍E_|_θ_{), i.e. the likeli-}

hoods ofO _andE_{, given}_θ_{, can be written as}

p(O_|_θ₎₌ N Y n=1 c¡F1(X1,n),F2(X2,n) ¯ ¯θ¢, p(E_|_θ₎₌_cE¡_G₁₍_ϕ₁_|_θ_{), . . . ,}_G_K₍_ϕ_K_|_θ₎¢ K Y k=1 gk(ϕk|θ).

We make some remarks on Proposition 3 and its assumptions.

Any Bayesian point estimator (such as mean or median) can now be used to calculate a point estimate θb of the copula parameter from p(θ_|O_,E_{). We}

propose to use the posterior meanE[θ_|O_,E_{] in order to be consistent with the}

modeling of the expert assessments, which will be based on matching conditional moments. The uncertainty ofθbcan then be assessed through the variance var(θ_|O_,E_).

More details on the practical transformation of the prior information into the prior density are given in Section 3. If such a prior source of information is unavailable, an uninformative prior can be used. The modeling of the experts’ marginal conditional distribution (theGk(·|θ)) and the dependence be-

tween the experts (through the copulaCE) will be addressed in Section 4. If experts are conditionally independent, then thecE term in (1) drops out. Depen- dence between experts and observations is difficult to avoid, but it is unclear how this influences the expert assessments. An expert could tend to underes- timate due to inexistent joint observations or to overestimate by overcorrecting through unrealistically extreme scenarios.

For sensitivity analyses or in case no expert opinion or observations are available, we can calculate p(θ_|O_{) and}_p₍_θ_|E_{), the posterior of} _θ _{given either} O _orE_{, in which case}_p₍O_|_θ_{) or}_p₍E_|_θ_{) drops out of (1).}

In other applications of Bayesian inference, conjugate priors are often used, in which case both p(θ) and p(θ_|O_,E_{) belong to the same parametric class of}

distributions, see Bernardo and Smith (1994). We are not aware of any conjugate priors for copulas.

132 Paper C

The normalization factorp(O_,E_{) is not explicitly known in most cases. It}

is however possible to sample fromθ_|O_,E _{through Markov Chain Monte Carlo}

methods (MCMC), see Robert and Casella (2005). IfΘ_{is low dimensional (as it}

is the case in the present bivariate setting), direct grid discretizations may also be feasible.

As our method is intended to be used mainly with smallNandK, we refrain from proving the following asymptotic results on the convergence of θ_|O_,E_.

Proofs and more details concerning the following statements can be found in Section 10 of Van der Vaart (1998). ForN_{→ ∞}the information contained inO

increases, the influence ofp(θ)p(E_|_θ_{) is diminished, and the posterior is driven}

by the observations. The Bernstein-von Mises Theorem states asymptotic nor- mality ofθ_|O_,E _for_N_{→ ∞}_if_p₍_θ₎_p₍E_|_θ_{) is smooth and positive in a neighbor-}

hood of the true parameter. Therefore,θ_|O_,E _{and thus also}_θb_{converge in prob-}

ability with rate 1/pN. Furthermore, Bayesian point estimators are asymptotically efficient and asymptotically equivalent to maximum likelihood estimators. Modelling the prior with a shifted Beta distribution yields a smooth and positve prior p(θ) on Θ_{. Parametrizing the Beta distribution in terms of its}

mean to get the conditional expert’s densitygk(ϕk|θ) also satisfies the smooth-

ness conditions. We will later chooseCE as a Frank copula, which implies that

p(θ)p(E_|_θ_{) satisfies the conditions of the Bernstein-von Mises Theorem.}

In document Impactación canina maxilar en pacientes clase III esqueletal a causa del déficit de desarrollo maxilar (página 32-54)