Conclusion - Essays on macroprudential and fiscal policy in a currency union

To present this algorithm we consider the generic equality constrained optimization problem,

minx F (x) (5.24)

s.t. H(x) = 0.

The AAO optimization problem, (5.23), can easily be written in this standard form, with x now representing the composite vector of design variables, targets, state variables, and slack variables, and F (x) and H(x) being defined appropriately.

The AAO trust-region algorithm decouples the constrained optimization into two parts, first a normal step, s^⊥_k, that decreases the constraint violation, and second, a tangential step, s^k_k, that decreases the objective function value while maintaining some of the constraint violation reduction from the normal step.³ The trust-region update step is accordingly the sum of the normal and tangential steps, s_k = s^⊥_k + s^k_k. The trust region step is accepted if the pair (F (x_k+ s_k), kH(x_k+ s_k)k) is Pareto dominant compared with all previous iterates x_k, and rejected otherwise. We will refer to the fully linear surrogate model at an iteration k for F (x) as ¯F_k(x), and the fully linear surrogates for H(x) as ¯H_k(x).

The first step of the algorithm is to find a normal step that reduces the constraint violation. To do this we compute a step minimizing the 2-norm of the constraint violation

3The terms normal and tangential do not imply that the steps are orthogonal [31, Page 658].

using the surrogate models,

We require that the normal step satisfies a condition slightly stricter than the fraction of Cauchy decrease condition, therefore the solution to Eq. 5.25 must satisfy,

k ¯H_k(x_k)k²₂− k ¯H_k(x_k+ s^⊥_k)k²₂ ≥ κ_amm−Hk ¯H_k(x_k)k²₂− k ¯H_k(x_k+ s^⊥∗_k )k²₂

(5.26)

for a constant κ_amm−H ∈ (0, 1], where s^⊥∗_k is the exact solution to Eq. 5.25. An immediate consequence of Eq. 5.26 is the fraction of Cauchy decrease condition,

k ¯Hk(xk)k²₂− k ¯Hk(xk+ s^⊥_k)k²₂ ≥ (5.27)

After computing the normal step to reduce the constraint violation we seek a tangential step that reduces the objective function value without giving up all of the progress achieved towards feasibility. To find the tangential step, s^k_k, we solve a trust-region problem that is based on the Celis-Dennis-Tapia trust-region algorithm [21]. We require at the complete step, normal plus tangential, that the fraction of Cauchy decrease condition for the constraint violation is still satisfied. Therefore, we define an additional constant, η ∈ (0, 1), in order to require k ¯H_k(x_k+ s^⊥_k + s^k_k)k²₂ ≤ k ¯H_k(x_k)k²₂− η k ¯H_k(x_k)k²₂− k ¯H_k(x_k+ s^⊥_k)k²₂. The constant, η, can be considered the weighting placed on whether the optimizer focuses on maintaining the constraint violation reduction, η ∼ 1, or reducing the objective function, η ∼ 0; however, regardless of the value chosen, the algorithm will converge to a feasible point that minimizes

the objective function. This tangential step trust-region subproblem is now,

An alternative tangential step trust region subproblem, which is easier to compute, is based on the Gauss-Newton approximation. The Gauss-Newton normal step using the sur-rogate models of the constraints, ˜s^⊥_k, is computed as

min

The value of the Gauss-Newton step is that the fraction of Cauchy decrease is the same for it as for (5.25) when the trust region is small. The fraction of Cauchy decrease requirement for (5.29) is

This slightly weaker fraction of Cauchy decrease condition is significantly less computation-ally expensive than the fraction of Cauchy decrease requirement in Eq. 5.27. The alternative

tangential step calculation is,

min

s^k_k

F¯_k(x_k+ s^⊥_k + s^k_k) (5.32)

s.t. b_k(s^⊥_k + s^k_k) ≤ b_k(0) − η[b_k(0) − b_k(s^⊥_k)]

ks^⊥_k + s^k_kk ≤ ∆_k.

For whichever tangential step trust region problem is used, we require that the tangential step optimization be solved such that,

F¯_k(x_k+ s^⊥_k) − ¯F_k(x_k+ s^⊥_k + s^k_k) ≥ κ_amm−F h ¯F_k(x_k+ s^⊥_k) − ¯F_k(x_k+ s^⊥_k + s^k∗_k )i

, (5.33)

where s^k∗_k is the exact solution to either Eq. 5.32 or Eq. 5.28, and κ_amm−F is a constant in (2/(2 +p1 − η²), 1).

The step acceptance and trust-region update criterion for this algorithm are a filter based on [40], and enhancements from [39, 41]. The filter uses ideas from multiobjective optimization and considers reducing the objective F (x) and constraint violation kH(x)k as two goals of the optimization. The filter stores the Pareto front of pairs (F (x_k), kH(x_k)k), and a trust-region step x_k+ s_k is acceptable to the filter if it is not dominated by a previous iterate, i.e., it will appear on the filter Pareto front. The filter algorithm also requires a sufficient decrease condition so filter points cannot be arbitrarily close. Therefore a point x_k+ s_k is non-dominated and acceptable for the filter if for all x_k visited (x_ˆ_k, ˆk = 0, . . . , k),

F (xk+ sk) ≤ F (xˆk) − β ˆ∆²_k or kH(xk+ sk)k ≤ kH(xˆk)k − β ˆ∆²_kkH(xˆk)k, (5.34)

∀ˆk ≤ k,

for a constant β ∈ (0,¹₂) and where ˆ∆_k = min{∆_k, 1}. We typically initiate the filter with the point (−∞, kH(x0)k) in lieu of (F (x0), kH(x0)k) to ensure the constraint violation remains bounded. The points acceptable to the filter are shown graphically in Figure 5-3. If a point is acceptable to the filter, it is added to the filter and any previous points in the filter that

are then dominated are removed.

Figure 5-3: Graphic of points acceptable to the filter.

The trust region updates are all based on the whether or not a trial point is acceptable to the filter. If the trial point x_k+s_kis acceptable to the filter, then the design vector is updated, x_k+1 = x_k+ s_k, but if it is not acceptable the step, s_k, is rejected. Similarly, if the step is accepted the size of the trust region is increased so that ∆_k+1 = min{γ₁∆_k, ∆_max} and if the step is rejected the size of the trust region is decreased so that ∆_k+1 = γ₀∆_k with γ₁ > 1 and γ₀ ∈ (0, 1). ¯F (x) and ¯H(x) are then updated to be fully linear on kx − x_k+1k ≤ ∆_k+1. We require that the surrogate models have the correct value at x_k+1, i.e. H(x_k+1) = ¯H(x_k+1) and F (x_k+1) = ¯F_k+1(x_k+1). The surrogate models for each constraint are all created concurrently.

Moreover, if no designs have been evaluated in the vicinity of the current trust region, then construction of a fully linear model requires n + 1 designs to be evaluated. All of these designs may also be evaluated concurrently.

To establish optimality of x_k and terminate the algorithm, it is necessary to verify that both the constraint violation and the projection of the objective function gradient onto the nullspace of the constraint Jacobian are sufficiently small. In the general gradient-free case, the second condition cannot be verified directly. Therefore, we require three criteria to terminate the algorithm, (1) kH(x_k)k ≤ , (2) k ¯P_k∇ ¯F_k(x_k)k ≤ , and (3) ∆_k ≤ , where

P¯_k = I − ∇ ¯H_k(x_k)^> ∇ ¯H_k(x_k)∇ ¯H_k(x_k)^>−1

∇ ¯H_k(x_k). (5.35)

Using fully linear surrogate models, conditions (2) and (3) are equivalent to the projection of

the objective function gradient onto the nullspace of the constraint Jacobian being sufficiently small [38]. The complete trust-region algorithm is summarized as Algorithm 5.2.

In document Essays on macroprudential and fiscal policy in a currency union (página 183-200)