2.2. Fundamentación teórica
2.2.3. Que es un manejo de objeciones
To construct our estimator, πopt, we make three standard causal inference as-
sumptions (Robins et al. 2000, Zhang et al. 2012b): (C1) consistency,
(Y, Z) = {Y∗(A), Z∗(A)}; (C2) positivity, for each a there exists > 0 such that
P(A =a∣X,W) ≥; (C3) ignorability, [{Y∗(a), Z∗(a)} ∶ a∈ {−1,1}] ⊥ A∣X,W. The
consistency assumption implies that the individual’s observed outcome is the potential outcome associated with the observed exposure; the positivity assumption implies there is a positive probability of receiving all possible treatments; the ignorability assump- tion implies that patients receiving all possible treatments have equal distributions of experiencing the potential outcomes. We further assume (C4) (A, Y, Z) ⊥ E∣X,W,
which states that the treatment assignment and responses are independent of a pa- tient’s preference. This important assumption is reasonable in the single stage scenario because the treatments are assigned randomly and if outcomes were affected by the preferences, treatments would be obsolete. Under these stated assumptions, it can be shown (Schulte et al. 2014) that the optimal individualized treatment can then be defined in terms of the predefined utility function:
πopt(x,w) =arg max
a∈{−1,1}E [U(Y, Z;E)∣X =x,W =w, A=a] =arg max a∈{−1,1}E [h(G)Y + {1−h(G)}Z∣X =x,W =w, A=a] =arg max a∈{−1,1} [E{h(G)∣X =x,W =w}E(Y∣X =x,W =w, A=a) + [1−E{h(G)∣X=x,W =w}]E(Z∣X=x,W =w, A=a) ] =arg max a∈{−1,1} [µ(x,w)QY(x,w,a) + [1−µ(x,w)]QZ(x,w, a)], where QY(x,w, a) = E(Y∣X = x,W = w, A = a), QZ(x,w, a) = E(Z∣X = x,W =
w, A=a)andµ(x,w) =E{h(G) ∣X =x,W =w}. Now letQ̂Y,n(x,w, a),Q̂Z,n(x,w, a)
and̂µn(x,w)be respective estimators ofQY(x,w,a),QZ(x,w, a)andµ(x,w). Then,
̂
πn(x,w) =arg max
a∈{−1,1}[̂µn
(x,w) ̂QY,n(x,w, a) + {1− ̂µn(x,w)} ̂QZ,n(x,w, a)]
This implies that estimationπopt(x,w)can be broken down into three estimators: ̂
QY,n(x,w, a),Q̂Z,n(x,w, a) and ̂µn(x,w)
To define Q̂
means that the itemized response information contained inW as it relates to the out- comes is sufficiently captured in the covariates X and need not be included in our estimator. Then, we construct linear working models as QY(x,w, a;γY) = x⊺Y,0γY,0 +
ax⊺
Y,1γY,1 and QZ(x,w, a;γZ) =x ⊺
Z,0γZ,0+ax ⊺
Z,1γZ,1, where x`,j for`=Y, Z and j =0,1
are known feature vectors constructed from x and γY, γZ are unknown parameter vectors. Note that we assume a linear working model, but this form is note re- quired and was just assumed for simplicity. Let ̂γY,n and ̂γZ, n be the maximum
likelihood estimates such that ̂γY,n = arg minγY E{Y −QY(X,W, A;γY)}2 and ̂γZ,n = arg minγZE{Z−QZ(X,W, A;γZ)}
2. Then, utilizing these estimates, we find that ̂
QY,n(x,w, a;̂γY)andQ̂Z,n(x,w, a;̂γZ)are the estimators ofQY(x,w, a)andQZ(x,w, a).
Constructing and estimator ̂µn(x,w) is sufficiently more difficult because E is
latent and therefore unobservable. The first step in defining this estimator is finding an estimate for Φ(E). To develop a latent preference model, we assume that E ⊥X∣W.
As we will see, this assumption simplifies the construction of the estimator and is reasonable ifX does not contain any additional information about patient preferences beyond that contained in W. Since Wj is a binary response variable, we can assume that, conditional on the unobservede,Wj are independent Bernoulli random variables. We define the generating model for W as P(Wj∣E = e) = 1+expexp{fj(e)}
{fj(e)}, where fj is
constrained to be a monotone increasing function for j = 1, . . . , p. To solve this we
employ an EM algorithm. In the estimation step we create starting values forE using the standard Rasch model from traditional item response theory (Rasch 1961; 1980), which we discussed at length in Section 3.2.2. We note that in the EM algorithm these starting values will be replaced with estimates of ên as the algorithm progresses
iteratively.
With these starting values, e0, we start the maximization step by finding f̂
j(e0)
{t1,e, . . . , tkE,e} are quantiles of e0. For computational and asymptotic simplicity, we
impose piecewise linear splines and let πj(e0) = {πj,1(e0), πj,2(e0), . . . , πj,K
E(e
0)}T be the set of splines of order 2. We then estimate fj by f̂j,n(e0) = πj(e0)T. Define the
linear component asπj,k =aj,k+b∗j,ke0 for k=1, . . . , KE. The linear parametersaj,k, bj,k
for a fixedj and allk=1, . . . , KE are simultaneously estimated by solving the following
equation using nonlinear programming (Ghalanos and Theussl 2015, Ye 1987):
πj,k(e0) =aj,k+b∗j,ke0 ∀ k∈ {1, . . . , KG} where logit{P(Wj =1∣E=e)} =πj(e(0))
s.t.
aj,k>0, bj,k >0 ∀ k∈ {1, . . . , KE}
aj,k+bj,ktk,E=aj,k+1+bj,k+1tk,E ∀ k∈ {1, . . . , KE}.
Because the Wj are independent for j ∈1, . . . , p, this process is repeated for allj.
Let βE = {a1,1, b1,1, . . . , a1,KE, b1,KE, . . . , ap,1, bp,1, . . . , ap,KE, bp,KE}. Once we have
estimates β̂
E for βE, we can create an estimator of e, ̂en. Given β̂E and a marginal distribution for the latent patient preferences, the conditional distribution of E given W =w is proportional to p(w∣h)ph(h). This can be approximated using a Metropolis
Hastings algorithm. However, because we assume thatµ(x,w)does not depend on x,
it is less computationally intensive to apply a method of moments estimator. We let
̂en denote the solution to ∑pj=1̂bj,lexpit(̂aj,l+ ̂bj,le) = ∑pj=1̂bj,lwj ∀ l =1, . . . , KE, where expit(u) = exp(u)/ {1+exp(u)}. This estimator for e, ̂en, provides similar estimates
burdensome.
Finally, recall that we only collect the contentment information, B, to develop our rule. We need to incorporate that information into the estimate of h( ̂G), which
serves as linear trade off weight. To do this, we assume the distribution of B is de- pendent on the composite outcome, which is a function combination of the outcomes and the preference information as U(Y, Z;̂en) = h( ̂G)Y + {1−h( ̂G)}Z. Hence, we
are interested in is an estimator for h( ̂G) that optimizes that patients contentment.
We assume that logit{P(B = 1)} =r{U(Y, Z;̂en)}, where r is a monotone increasing
function. Let Ĝ = Φ(̂e
n), then, similar to what we have previously done, we fit this
model using monotone spline smoothing except this time we simultaneously fit h( ̂G)
andr{U(Y, Z;̂en)}as piecewise linear splines. LetKGbe the number of knots such that
the knots {t1,G, . . . , tK
G,G} are quantiles of
̂
G. Under these specifications, we impose splines πh( ̂G) and πr{U(Y, Z;̂en)} where πh( ̂G) = {πh,1( ̂G), πh,2( ̂G), . . . , πh,KG( ̂G)}
and πr{U(Y, Z;̂en)} = [πr,1{U(Y, Z;̂en)}, πr,2{U(Y, Z;̂en)}, . . . , πr,KG{U(Y, Z;̂en)}] for
πr,k{U(Y, Z;̂en)} =δ1,k+δ2,kU(Y, Z;̂en) ∀ k∈ {1, . . . , KG} where logit{P (B=1)} =πr{U(Y, Z;̂en)} and U(Y, Z;̂en) =h( ̂G)Y + {1−h( ̂G)}Z, πh,k( ̂G) =α1,k+α2,kĜ ∀ k∈ {1, . . . , KG} where h( ̂G) =πh( ̂G) s.t. δ2,k >0, α2,k>0 ∀ k∈ {1, . . . , KG} α1,1=0 α1,KG+α2,KG =1 α1,k+α2,ktk,G =α1,k+1+α2,k+1tk,G ∀ k∈ {1, . . . , KG} δ1,k+δ2,ktk,G =δ1,k+1+δ2,k+1tk,G ∀ k∈ {1, . . . , KG}
Then, the estimator of µ(x,w) is ̂µn(x,w) = ̂h( ̂G).
5.3 Simulation Study
The results of the simulation study are included for 3 knots, which has been shown to be sufficient by He and Shi (1998). For completeness, similar results for 5 knots are included in the Appendix B.1. We chose 3 knots because additional knots do not provide enough additional information to justify a more complex model with a burdensome computation.