Nonlinear Solution Methods

First-order linear solutions impose certainty equivalence –- agents behave as if shocks have zero variance. This rules out risk premia, precautionary savings, welfare costs of uncertainty, and asymmetric dynamics. Nonlinear methods capture all of these by retaining higher-order terms in the Taylor expansion of the policy function or by solving the functional equation globally. MacroEconometricModels.jl provides four families: higher-order perturbation (local, Schmitt-Grohe & Uribe 2004; Andreasen, Fernandez-Villaverde & Rubio-Ramirez 2018), Chebyshev projection (global polynomial, Judd 1992, 1998), policy function iteration (global iterative, Coleman 1990), and value function iteration (global Bellman, Stokey, Lucas & Prescott 1989). All three global solvers support Anderson acceleration (Walker & Ni 2011) and multi-threading. For model specification and linearization, see DSGE Models. For first-order solvers, see Linear Solvers.

using MacroEconometricModels, Random
Random.seed!(42)

<< @setup-block not executed in draft mode >>

Quick Start

Recipe 1: Second-order perturbation with pruned simulation

spec = @dsge begin
    parameters: β = 0.99, α = 0.36, δ = 0.025, ρ = 0.9, σ = 0.01
    endogenous: Y, C, K, A
    exogenous: ε_A
    Y[t] = A[t] * K[t-1]^α
    C[t] + K[t] = Y[t] + (1 - δ) * K[t-1]
    1 = β * (C[t] / C[t+1]) * (α * A[t+1] * K[t]^(α - 1) + 1 - δ)
    A[t] = ρ * A[t-1] + σ * ε_A[t]
    steady_state = begin
        A_ss = 1.0
        K_ss = (α * β / (1 - β * (1 - δ)))^(1 / (1 - α))
        Y_ss = K_ss^α
        C_ss = Y_ss - δ * K_ss
        [Y_ss, C_ss, K_ss, A_ss]
    end
end
spec = compute_steady_state(spec)

psol = perturbation_solver(spec; order=2)
Y_sim = simulate(psol, 1000)  # pruned simulation (Kim et al. 2008)
nothing # hide

<< @example-block not executed in draft mode >>

Recipe 2: Third-order perturbation with GIRFs

psol3 = perturbation_solver(spec; order=3)
Y_sim3 = simulate(psol3, 1000)
girf3 = irf(psol3, 40; irf_type=:girf, n_draws=100)
nothing # hide

<< @example-block not executed in draft mode >>

plot_result(girf3)

Recipe 3: Chebyshev projection and Euler errors

proj = collocation_solver(spec; degree=5, grid=:tensor, max_iter=200)
y = evaluate_policy(proj, proj.steady_state[proj.state_indices])
err = max_euler_error(proj)

<< @example-block not executed in draft mode >>

Recipe 4: PFI with damped updates

pfi = pfi_solver(spec; degree=5, damping=0.5, max_iter=200)
report(pfi)

<< @example-block not executed in draft mode >>

Recipe 5: VFI with Howard steps and Anderson acceleration

vfi = vfi_solver(spec; degree=5, howard_steps=5, anderson_m=3, max_iter=500)
report(vfi)

<< @example-block not executed in draft mode >>

Second-Order Perturbation

The first-order linear solution imposes certainty equivalence: agents behave identically regardless of shock variance. This produces four specific deficiencies: (1) zero risk premia on asset returns, (2) no precautionary savings motive, (3) zero welfare cost of business cycles, and (4) perfectly symmetric impulse responses to positive and negative shocks. The second-order perturbation (Schmitt-Grohe & Uribe 2004) resolves all four by retaining quadratic terms in the Taylor expansion of the policy function.

The second-order decision rule takes the form:

\[z_t = \bar{z} + f_v \, v_t + \tfrac{1}{2} f_{vv} (v_t \otimes v_t) + \tfrac{1}{2} f_{\sigma\sigma} \sigma^2\]

where:

$z_t$ is the $n \times 1$ vector of all endogenous variables (deviations from steady state)
$v_t = [x_{t-1}; \varepsilon_t]$ is the $n_v \times 1$ innovations vector (lagged states + current shocks)
$f_v$ is the $n \times n_v$ first-order coefficient matrix
$f_{vv}$ is the $n \times n_v^2$ second-order coefficient tensor (flattened Kronecker)
$f_{\sigma\sigma}$ is the $n \times 1$ variance correction (shifts the stochastic steady state)
$\sigma$ is the perturbation scaling parameter

psol = perturbation_solver(spec; order=2)
nothing # hide

<< @example-block not executed in draft mode >>

Keyword	Type	Default	Description
`order`	`Int`	`2`	Perturbation order (1, 2, or 3)
`method`	`Symbol`	`:gensys`	First-order solver (`:gensys` or `:blanchard_kahn`)

Algorithm

The computation follows Schmitt-Grohe & Uribe (2004) in six steps:

Solve the first-order system via Gensys or Blanchard-Kahn to obtain $G_1$ and the impact matrix
Partition variables into states ($x$) and controls ($y$) via the $\Gamma_1$ structure
Build the innovations vector $v = [x_{t-1}; \varepsilon_t]$ and the mapping matrices that relate each argument slot (current, lag, lead, shock) to $v$-space
Compute all 10 unique Hessian tensors via central finite differences across the four argument slots
Assemble the $n \times n_v^2$ right-hand side from contracting each Hessian with its slot-to-$v$ mapping matrices, then solve the Kronecker system: $(I_{n_v^2} \otimes f_c + \text{kron}(M,M)' \otimes f_f) \, \text{vec}(f_{vv}) = -\text{vec}(\text{RHS})$
Solve the $\sigma^2$ correction: $(f_c + f_f) \, f_{\sigma\sigma} = -f_f \, f_{vv} \, \text{vec}(\eta \eta')$

Technical Note

The stochastic steady state $\bar{z} + \tfrac{1}{2} f_{\sigma\sigma} \sigma^2$ differs from the deterministic steady state $\bar{z}$. This shift captures precautionary behavior: risk-averse agents accumulate more capital when facing uncertainty. The magnitude depends on the curvature of the utility function and the variance of shocks.

Third-Order Perturbation

The decision rule at third order adds cubic and variance-interaction terms (Andreasen, Fernandez-Villaverde & Rubio-Ramirez 2018):

\[z_t = \bar{z} + f_v \, v_t + \tfrac{1}{2} f_{vv} (v_t \otimes v_t) + \tfrac{1}{2} f_{\sigma\sigma} \sigma^2 + \tfrac{1}{6} f_{vvv} (v_t \otimes v_t \otimes v_t) + \tfrac{1}{2} f_{\sigma\sigma v} \, \sigma^2 \, v_t + \tfrac{1}{6} f_{\sigma\sigma\sigma} \, \sigma^3\]

where:

$f_{vvv}$ is the $n \times n_v^3$ third-order coefficient tensor (flattened triple Kronecker)
$f_{\sigma\sigma v}$ is the $n \times n_v$ interaction between uncertainty and the state
$f_{\sigma\sigma\sigma}$ is the $n \times 1$ cubic variance correction (zero for Gaussian shocks)

psol3 = perturbation_solver(spec; order=3)
nothing # hide

<< @example-block not executed in draft mode >>

Algorithm

The third-order computation extends the second-order procedure with six additional steps:

Solve first-order and second-order as prerequisites (Steps 1–6 above)
Compute all 20 unique third-derivative tensors via central finite differences across the four argument slots (current, lag, lead, shock), storing only canonical orderings
Accumulate the $n \times n_v^3$ right-hand side from two sources: (A) pure third derivatives contracted with mapping matrices across all slot triples, and (B) mixed Hessian-times-second-order interaction terms (3 cyclic permutations per Hessian block)
Solve the third-order Kronecker system: $(I_{n_v^3} \otimes f_c + \text{kron}(M,M,M)' \otimes f_f) \, \text{vec}(f_{vvv}) = -\text{vec}(\text{RHS}_3)$
Compute $f_{\sigma\sigma v}$ correction via contraction of $f_{vvv}$ with shock covariance $\eta \eta'$ plus Hessian-times-$f_{\sigma\sigma}$ interaction across all slot pairs
Set $f_{\sigma\sigma\sigma} = 0$ (zero for Gaussian shocks)

Technical Note

Third-order perturbation captures skewness in the ergodic distribution. The $f_{\sigma\sigma\sigma}$ correction is zero for Gaussian shocks but would be non-zero for non-Gaussian innovations. The $f_{\sigma\sigma v}$ term captures how the variance correction depends on the state –- agents' precautionary behavior varies with economic conditions.

Return Value (`PerturbationSolution{T}`)

Field	Type	Description
`order`	`Int`	Perturbation order (1, 2, or 3)
`gx`	`Matrix{T}`	$n_y \times n_v$ first-order controls
`hx`	`Matrix{T}`	$n_x \times n_v$ first-order states
`gxx`	`Union{Nothing, Matrix{T}}`	$n_y \times n_v^2$ second-order controls (order $\geq 2$)
`hxx`	`Union{Nothing, Matrix{T}}`	$n_x \times n_v^2$ second-order states (order $\geq 2$)
`g\sigma\sigma`	`Union{Nothing, Vector{T}}`	$n_y$ control variance correction (order $\geq 2$)
`h\sigma\sigma`	`Union{Nothing, Vector{T}}`	$n_x$ state variance correction (order $\geq 2$)
`gxxx`	`Union{Nothing, Matrix{T}}`	$n_y \times n_v^3$ third-order controls (order = 3)
`hxxx`	`Union{Nothing, Matrix{T}}`	$n_x \times n_v^3$ third-order states (order = 3)
`g\sigma\sigma x`	`Union{Nothing, Matrix{T}}`	$n_y \times n_v$ variance-state interaction (order = 3)
`h\sigma\sigma x`	`Union{Nothing, Matrix{T}}`	$n_x \times n_v$ variance-state interaction (order = 3)
`g\sigma\sigma\sigma`	`Union{Nothing, Vector{T}}`	$n_y$ cubic variance correction (order = 3)
`h\sigma\sigma\sigma`	`Union{Nothing, Vector{T}}`	$n_x$ cubic variance correction (order = 3)
`eta`	`Matrix{T}`	$n_v \times n_\varepsilon$ shock loading $[0; I]$ block
`steady_state`	`Vector{T}`	Deterministic steady state
`state_indices`	`Vector{Int}`	Indices of state variables
`control_indices`	`Vector{Int}`	Indices of control variables
`eu`	`Vector{Int}`	Existence/uniqueness from first-order
`spec`	`DSGESpec{T}`	Back-reference to specification
`linear`	`LinearDSGE{T}`	Linearized system

Pruning

Naive simulation of higher-order decision rules produces explosive sample paths because the Kronecker products $(v_t \otimes v_t)$ compound deviations multiplicatively –- a moderate deviation at time $t$ is squared, generating a larger deviation at $t+1$, which is squared again. Pruning (Kim, Kim, Schaumburg & Sims 2008) prevents this by tracking state components separately and using only first-order states in the Kronecker products.

Second-Order Pruning (Kim et al. 2008)

The pruned simulation decomposes the state into two components:

First-order state: $x_t^{(1)} = h_{x,\text{state}} \cdot x_{t-1}^{(1)} + \eta_x \cdot \varepsilon_t$
Second-order correction: $x_t^{(2)} = h_{x,\text{state}} \cdot x_{t-1}^{(2)} + \tfrac{1}{2} h_{xx} (v_t^{(1)} \otimes v_t^{(1)}) + \tfrac{1}{2} h_{\sigma\sigma}$

Total state: $x_t = x_t^{(1)} + x_t^{(2)}$

where $v_t^{(1)} = [x_{t-1}^{(1)}; \varepsilon_t]$ contains only first-order states. The key insight is that the Kronecker product $v_t^{(1)} \otimes v_t^{(1)}$ uses only the stable first-order component, preventing compounding.

Third-Order Pruning (Andreasen et al. 2018)

Third-order pruning adds a third component:

Third-order correction: $x_t^{(3)} = h_{x,\text{state}} \cdot x_{t-1}^{(3)} + h_{xx} (v_t^{(1)} \otimes v_t^{(2)}) + \tfrac{1}{6} h_{xxx} (v_t^{(1)} \otimes v_t^{(1)} \otimes v_t^{(1)}) + \tfrac{1}{2} h_{\sigma\sigma x} \, v_t^{(1)} + \tfrac{1}{6} h_{\sigma\sigma\sigma}$

where $v_t^{(2)} = [x_{t-1}^{(2)}; 0]$ contains the second-order state correction with zero shocks.

Total: $x_t = x_t^{(1)} + x_t^{(2)} + x_t^{(3)}$

psol3 = perturbation_solver(spec; order=3)
Y_sim = simulate(psol3, 1000)  # 3-component pruned simulation
nothing # hide

<< @example-block not executed in draft mode >>

Keyword	Type	Default	Description
`shock_draws`	`Union{Nothing, Matrix}`	`nothing`	Pre-drawn shocks (T x n_shocks); draws from N(0,1) if `nothing`
`rng`	`AbstractRNG`	`default_rng()`	Random number generator
`antithetic`	`Bool`	`false`	Antithetic variates for variance reduction

Explosive paths without pruning

Never simulate a second- or third-order perturbation solution without pruning. The naive simulation $z_t = \bar{z} + f_v v_t + \frac{1}{2} f_{vv} (v_t \otimes v_t) + \ldots$ using the total state in the Kronecker product diverges within 50–100 periods for most calibrations. All simulate(::PerturbationSolution, ...) calls use pruning automatically.

Generalized Impulse Responses

Analytical IRFs from the first-order solution miss second- and third-order effects. Generalized impulse response functions (GIRFs, Koop, Pesaran & Potter 1996) compute the expected difference between a shocked and baseline path via Monte Carlo simulation:

\[\text{GIRF}(h, \delta_j) = E\big[y_{t+h} \mid \varepsilon_{j,t} = \delta_j, \Omega_{t-1}\big] - E\big[y_{t+h} \mid \Omega_{t-1}\big]\]

where:

$\delta_j$ is the shock impulse to shock $j$
$\Omega_{t-1}$ is the information set at $t-1$
The expectation is computed by averaging over $n_{\text{draws}}$ simulated paths using pruned simulation

The GIRF captures nonlinear dynamics that the analytical first-order IRF misses: asymmetric responses to positive vs. negative shocks, state-dependent propagation, and variance-correction effects. For a first-order solution, the GIRF converges to the analytical IRF as $n_{\text{draws}} \to \infty$.

# Analytical IRFs (first-order only, fast)
irf_analytical = irf(psol3, 40)

# GIRFs (captures nonlinear dynamics, Monte Carlo)
girf = irf(psol3, 40; irf_type=:girf, n_draws=100)
nothing # hide

<< @example-block not executed in draft mode >>

plot_result(girf)

Keyword	Type	Default	Description
`irf_type`	`Symbol`	`:analytical`	`:analytical` for first-order, `:girf` for simulation-based
`n_draws`	`Int`	`500`	Number of Monte Carlo draws for GIRF
`shock_size`	`Real`	`1.0`	Impulse size in standard deviations

The FEVD is also available via fevd(psol, H), which uses the first-order decomposition from the underlying linear solution.

Chebyshev Projection

Chebyshev collocation (Judd 1992, 1998) approximates the policy function globally using Chebyshev polynomials on a tensor or Smolyak grid. Unlike perturbation, the approximation is accurate far from the steady state –- essential for models with large shocks, regime switches, or occasionally binding constraints. The solver finds coefficients such that the equilibrium conditions hold exactly at each collocation node.

The policy function approximation takes the form:

\[y_i(x) \approx \sum_{j=1}^{n_b} c_{i,j} \, T_j(x)\]

where:

$c_{i,j}$ are the Chebyshev coefficients ($n_{\text{vars}} \times n_b$ matrix)
$T_j(x)$ are the multivariate Chebyshev basis functions (products of univariate $T_k(x_d)$)
$n_b$ is the number of basis functions

Algorithm

The collocation solver proceeds in five steps:

Linearize to get the state/control partition and compute state bounds as $\bar{x}_i \pm \text{scale} \cdot \sigma_i$ from the first-order Lyapunov solution
Build grid: tensor product of Chebyshev extrema (Gauss-Lobatto) nodes on $[-1, 1]^{n_x}$; for high-dimensional models, use Smolyak sparse grid instead
Construct basis matrix by evaluating all multivariate Chebyshev polynomials at each collocation node
Initialize coefficients from the first-order perturbation solution via least-squares projection onto the Chebyshev basis
Newton iteration: solve $R(c) = 0$ where $R$ is the vector of equilibrium residuals evaluated at all nodes, with Gauss-Hermite or monomial quadrature for expectations of next-period variables. Uses Gauss-Newton with backtracking line search.

proj = collocation_solver(spec; degree=5, grid=:tensor, max_iter=200)
report(proj)

<< @example-block not executed in draft mode >>

Keyword	Type	Default	Description
`degree`	`Int`	`5`	Chebyshev polynomial degree
`grid`	`Symbol`	`:auto`	`:tensor`, `:smolyak`, or `:auto` (tensor if $n_x \leq 4$)
`smolyak_mu`	`Int`	`3`	Smolyak approximation level
`quadrature`	`Symbol`	`:auto`	`:gauss_hermite` or `:monomial` (auto based on $n_\varepsilon$)
`n_quad`	`Int`	`5`	Quadrature nodes per dimension
`scale`	`Real`	`3.0`	State bounds as multiples of unconditional std
`tol`	`Real`	$10^{-8}$	Newton convergence tolerance
`max_iter`	`Int`	`100`	Maximum Newton iterations
`initial_coeffs`	`Union{Nothing, Matrix}`	`nothing`	Warm-start coefficients from previous solve

Smolyak Sparse Grids

For models with $n_x > 4$ states, tensor grids become computationally infeasible. A tensor grid with degree 5 and 5 states requires $6^5 = 7{,}776$ nodes; with 8 states, $6^8 \approx 1.7 \times 10^6$. Smolyak sparse grids (Smolyak 1963; Judd, Maliar, Maliar & Valero 2014) select a subset of grid points that preserve polynomial exactness at a fraction of the cost. The grid=:smolyak option uses nested Clenshaw-Curtis points with the Smolyak selection rule $|\alpha|_1 \leq \mu + n_x$ for multi-index $\alpha$.

Technical Note

The grid=:auto option selects tensor grids for $n_x \leq 4$ and Smolyak grids for $n_x > 4$. Similarly, quadrature=:auto selects Gauss-Hermite for $n_\varepsilon \leq 2$ and monomial rules for $n_\varepsilon > 2$. Monomial rules scale linearly with $n_\varepsilon$ while Gauss-Hermite scales exponentially.

Evaluating the Policy Function

The evaluate_policy function maps state vectors to the full vector of endogenous variables using the stored Chebyshev coefficients:

y = evaluate_policy(proj, proj.steady_state[proj.state_indices])  # single point (vector)

<< @example-block not executed in draft mode >>

Euler Equation Errors

The Euler equation error measures the accuracy of the global approximation by evaluating residuals at random test points drawn uniformly within the state bounds:

err = max_euler_error(proj; n_test=1000)

<< @example-block not executed in draft mode >>

The error is reported in levels. Convert to $\log_{10}$ for the standard accuracy metric:

$\log_{10}$ error	Quality
$< -6$	Excellent
$-6$ to $-4$	Good
$-4$ to $-2$	Acceptable
$> -2$	Poor –- increase degree or tighten tolerance

Policy Function Iteration

Policy function iteration (PFI, Coleman 1990) solves for the equilibrium policy function by iterating on the Euler equation. At each step, the current policy guess determines expected future values via quadrature, and the equilibrium conditions at each grid point produce an updated policy via Newton's method. PFI tends to be more robust than collocation for models with kinks or near-kinks in the policy function, while collocation converges faster for smooth problems.

The algorithm iterates three sub-steps at each grid point $j$:

Expectation: compute $E[y_{t+1}]$ using the current policy coefficients and quadrature
Euler solve: given $y_{\text{lag}} = x_j$ (the grid point as lagged state) and $E[y_{t+1}]$, solve $F(y_t, y_{\text{lag}}, E[y_{t+1}], 0, \theta) = 0$ for $y_t$ via Newton iteration
Refit: project the updated policy values onto the Chebyshev basis via least squares

pfi = pfi_solver(spec; degree=5, damping=0.5, max_iter=200)
report(pfi)

<< @example-block not executed in draft mode >>

Keyword	Type	Default	Description
`degree`	`Int`	`5`	Chebyshev polynomial degree
`grid`	`Symbol`	`:auto`	`:tensor`, `:smolyak`, or `:auto`
`damping`	`Real`	`1.0`	Damping factor (0.5 for slow convergence, 1.0 for no damping)
`anderson_m`	`Int`	`0`	Anderson acceleration depth (0 = disabled; see Anderson Acceleration)
`threaded`	`Bool`	`false`	Multi-threaded grid-point Euler evaluation
`tol`	`Real`	$10^{-8}$	Sup-norm convergence tolerance
`max_iter`	`Int`	`500`	Maximum iterations
`initial_coeffs`	`Union{Nothing, Matrix}`	`nothing`	Warm-start from previous solve

Technical Note

PFI, Chebyshev collocation, and VFI all return the same ProjectionSolution{T} type. All three support evaluate_policy, simulate, irf, and max_euler_error. The method field distinguishes them: :projection for collocation, :pfi for policy function iteration, :vfi for value function iteration.

Return Value (`ProjectionSolution{T}`)

Field	Type	Description
`coefficients`	`Matrix{T}`	$n_{\text{vars}} \times n_b$ Chebyshev coefficients
`state_bounds`	`Matrix{T}`	$n_x \times 2$ state domain bounds
`grid_type`	`Symbol`	`:tensor` or `:smolyak`
`degree`	`Int`	Polynomial degree (tensor) or Smolyak level $\mu$
`collocation_nodes`	`Matrix{T}`	$n_{\text{nodes}} \times n_x$ grid points in $[-1, 1]$
`residual_norm`	`T`	Final $\|R\|$ residual (collocation) or sup-norm (PFI/VFI)
`n_basis`	`Int`	Number of basis functions
`multi_indices`	`Matrix{Int}`	$n_b \times n_x$ multi-index matrix
`quadrature`	`Symbol`	`:gauss_hermite` or `:monomial`
`converged`	`Bool`	Convergence flag
`iterations`	`Int`	Iterations until convergence
`state_indices`	`Vector{Int}`	State variable indices
`control_indices`	`Vector{Int}`	Control variable indices
`method`	`Symbol`	`:projection`, `:pfi`, or `:vfi`

Value Function Iteration

Value function iteration (VFI, Stokey, Lucas & Prescott 1989) solves the Bellman equation by iterating on the value function. At each iteration, the solver evaluates the Euler equation residuals at all grid points, updates the policy coefficients, and checks sup-norm convergence. VFI is the most general global method –- it converges under weaker regularity conditions than PFI or collocation –- but is typically slower per iteration. Two acceleration techniques reduce the iteration count: Howard improvement steps (Howard 1960) and Anderson acceleration (Walker & Ni 2011).

The algorithm proceeds in four steps:

Setup: Linearize the model, compute state bounds, build the Chebyshev grid and basis matrix (identical to PFI/collocation)
Euler evaluation: At each grid point $x_j$, compute expectations via quadrature and solve $F(y_t, x_j, E[y_{t+1}], 0, \theta) = 0$ for $y_t$ via Newton's method
Update: Project updated policy values onto the Chebyshev basis, apply damping and optional Howard/Anderson steps
Convergence: Check sup-norm of policy change; iterate until $\|y_{\text{new}} - y_{\text{old}}\|_\infty < \text{tol}$

vfi = vfi_solver(spec; degree=5, max_iter=500)
report(vfi)

<< @example-block not executed in draft mode >>

Keyword	Type	Default	Description
`degree`	`Int`	`5`	Chebyshev polynomial degree
`grid`	`Symbol`	`:auto`	`:tensor`, `:smolyak`, or `:auto`
`smolyak_mu`	`Int`	`3`	Smolyak approximation level
`quadrature`	`Symbol`	`:auto`	`:gauss_hermite` or `:monomial`
`n_quad`	`Int`	`5`	Quadrature nodes per dimension
`scale`	`Real`	`3.0`	State bounds as multiples of unconditional std
`damping`	`Real`	`1.0`	Coefficient mixing factor (1.0 = no damping)
`howard_steps`	`Int`	`0`	Howard improvement steps per iteration (0 = pure VFI)
`anderson_m`	`Int`	`0`	Anderson acceleration depth (0 = disabled; see Anderson Acceleration)
`threaded`	`Bool`	`false`	Multi-threaded grid-point evaluation
`tol`	`Real`	$10^{-8}$	Sup-norm convergence tolerance
`max_iter`	`Int`	`1000`	Maximum VFI iterations
`initial_coeffs`	`Union{Nothing, Matrix}`	`nothing`	Warm-start coefficients from previous solve

Howard Improvement Steps

Pure VFI updates the policy at every iteration, but each policy update requires solving a nonlinear system at every grid point. Howard improvement steps (Howard 1960; Santos & Rust 2003) amortize the cost: after each policy update, hold the policy fixed and re-evaluate the Euler equation $howard_steps$ times, updating only the value (Chebyshev coefficients). Because value evaluation is cheaper than policy optimization, Howard steps reduce the total iteration count at modest per-iteration cost.

vfi_howard = vfi_solver(spec; degree=5, howard_steps=5, max_iter=500)
report(vfi_howard)

<< @example-block not executed in draft mode >>

VFI vs PFI vs Collocation

All three global solvers return ProjectionSolution{T} and share the same post-solution API (evaluate_policy, simulate, irf, max_euler_error). The methods differ in convergence properties:

Collocation (Gauss-Newton on residuals): fastest for smooth problems, but can stall at local minima
PFI (fixed-point on the Euler equation): more robust for models with kinks, but requires good initialization
VFI (fixed-point on the Bellman operator): most general convergence guarantees (contraction under Blackwell conditions), but slowest without acceleration

sol_vfi = vfi_solver(spec; degree=5, max_iter=500)
sol_pfi = pfi_solver(spec; degree=5, damping=0.5, max_iter=200)
sol_proj = collocation_solver(spec; degree=5, max_iter=200)

# All three agree at the steady state
y_vfi = evaluate_policy(sol_vfi, sol_vfi.steady_state[sol_vfi.state_indices])
y_pfi = evaluate_policy(sol_pfi, sol_pfi.steady_state[sol_pfi.state_indices])
y_proj = evaluate_policy(sol_proj, sol_proj.steady_state[sol_proj.state_indices])
nothing # hide

<< @example-block not executed in draft mode >>

Anderson Acceleration

Anderson acceleration (Walker & Ni 2011) speeds convergence of fixed-point iterations by mixing the last $m$ iterates. Given iterates $x_k$ and residuals $r_k = g(x_k) - x_k$, the method solves:

\[\min_{\alpha} \left\| \sum_{i=1}^{m} \alpha_i \, r_i \right\|^2 \quad \text{s.t.} \quad \sum_{i=1}^{m} \alpha_i = 1\]

and returns the mixed iterate $x_{\text{new}} = \sum_{i} \alpha_i (x_i + r_i)$. The depth parameter $m$ controls how many previous iterates are used. Larger $m$ captures more history but increases the linear algebra cost. In practice, $m = 3$–$5$ works well.

Anderson acceleration is available for both PFI (anderson_m kwarg) and VFI (anderson_m kwarg). It operates on the vectorized Chebyshev coefficient matrix, treating the coefficient update as a fixed-point iteration.

# PFI with Anderson acceleration
pfi_anderson = pfi_solver(spec; degree=5, damping=0.5, anderson_m=3, max_iter=200)

# VFI with Anderson acceleration
vfi_anderson = vfi_solver(spec; degree=5, anderson_m=3, max_iter=500)
nothing # hide

<< @example-block not executed in draft mode >>

Multi-Threading

The three global solvers (collocation, PFI, VFI) support opt-in multi-threading via the threaded=true keyword. When enabled:

VFI / PFI: Grid-point Euler equation evaluations run in parallel via Threads.@threads
Collocation: Jacobian column computation runs in parallel

Threading requires Julia to be started with multiple threads (e.g., julia -t 4). On single-threaded Julia, threaded=true has no effect. The solutions are numerically identical regardless of the threaded setting.

# Sequential (default)
sol_seq = vfi_solver(spec; degree=5, threaded=false, max_iter=500)

# Threaded (requires julia -t N)
sol_par = vfi_solver(spec; degree=5, threaded=true, max_iter=500)
nothing # hide

<< @example-block not executed in draft mode >>

Analytical Moments

For first-order solutions, analytical_moments computes unconditional moments in closed form via the discrete Lyapunov equation:

\[\Sigma = G_1 \, \Sigma \, G_1' + \text{impact} \cdot \text{impact}'\]

where:

$\Sigma$ is the $n \times n$ unconditional covariance matrix
$G_1$ is the state transition matrix from the first-order solution
$\text{impact}$ is the $n \times n_\varepsilon$ shock impact matrix

The solve_lyapunov function solves this equation via Kronecker vectorization: $\text{vec}(\Sigma) = (I_{n^2} - G_1 \otimes G_1)^{-1} \, \text{vec}(\text{impact} \cdot \text{impact}')$. Autocovariances at lag $h$ follow from $\Gamma_h = G_1^h \, \Sigma$.

sol = solve(spec)

# Unconditional covariance matrix
Sigma = solve_lyapunov(sol.G1, sol.impact)

# Moment vector matching autocovariance_moments format
m = analytical_moments(sol; lags=2)

<< @example-block not executed in draft mode >>

The moment vector contains two blocks: (1) the upper triangle of the variance-covariance matrix ($k(k+1)/2$ elements) and (2) diagonal autocovariances at each lag ($k$ elements per lag). This format matches autocovariance_moments(data, lags) for direct comparison in Estimation.

Complete Example

This example solves the RBC model at all three perturbation orders, compares analytical and generalized IRFs, and validates a global projection solution with Euler equation errors.

# First-order: certainty equivalence
sol1 = perturbation_solver(spec; order=1)

# Second-order: risk correction
sol2 = perturbation_solver(spec; order=2)

# Third-order: skewness and state-dependent risk
sol3 = perturbation_solver(spec; order=3)

# Compare simulated ergodic means (stochastic SS shift)
Y1 = simulate(sol1, 10000)
Y2 = simulate(sol2, 10000)
Y3 = simulate(sol3, 10000)

# Generalized IRFs at third order
girf = irf(sol3, 40; irf_type=:girf, n_draws=100)

# Global solution via Chebyshev collocation
proj = collocation_solver(spec; degree=5, grid=:tensor, max_iter=200)
report(proj)

<< @example-block not executed in draft mode >>

# Euler equation accuracy
err = max_euler_error(proj; n_test=1000)

# PFI for comparison
pfi = pfi_solver(spec; degree=5, damping=0.5, max_iter=200)
err_pfi = max_euler_error(pfi; n_test=1000)

# VFI with Howard steps
vfi = vfi_solver(spec; degree=5, howard_steps=5, max_iter=500)
err_vfi = max_euler_error(vfi; n_test=1000)

# Analytical moments for estimation targets
m = analytical_moments(sol1; lags=2)

<< @example-block not executed in draft mode >>

plot_result(girf)

The stochastic steady state shifts from second-order perturbation reflect precautionary savings: the ergodic mean of capital exceeds the deterministic steady state because risk-averse agents over-accumulate capital as a buffer against productivity shocks. Third-order perturbation adds state-dependent risk premia –- the precautionary effect is stronger in recessions than expansions.

Common Pitfalls

Explosive simulation without pruning: Never extract the second-order coefficients and simulate manually. Always use simulate(psol, T), which applies Kim et al. (2008) pruning automatically.
Tensor grids in high dimensions: A tensor grid with degree 5 and $n_x = 6$ states requires $6^6 = 46{,}656$ nodes. Use grid=:smolyak for $n_x > 4$.
Poor Euler errors: If max_euler_error returns values above $10^{-2}$, increase the polynomial degree, widen the state bounds via the scale parameter, or switch from collocation to PFI.
Non-convergence of collocation: The Gauss-Newton solver uses backtracking line search but can stall at local minima. Warm-start with initial_coeffs from a lower-degree solution or from PFI.
Lyapunov equation instability: solve_lyapunov throws an error if the first-order solution has eigenvalues on or outside the unit circle. Check determinacy with is_determined(sol) before computing moments.
VFI convergence speed: Pure VFI is slow –- use howard_steps=5 or howard_steps=10 to reduce iteration count by 3–5x. Combine with anderson_m=3 for further acceleration.

References

Andreasen, M. M., Fernandez-Villaverde, J., & Rubio-Ramirez, J. F. (2018). The Pruned State-Space System for Non-Linear DSGE Models: Theory and Empirical Applications. Review of Economic Studies, 85(1), 1–49. DOI
Coleman, W. J. (1990). Solving the Stochastic Growth Model by Policy-Function Iteration. Journal of Business & Economic Statistics, 8(1), 27–29. DOI
Judd, K. L. (1992). Projection Methods for Solving Aggregate Growth Models. Journal of Economic Theory, 58(2), 410–452. DOI
Judd, K. L. (1998). Numerical Methods in Economics. MIT Press. ISBN: 978-0-262-10071-7.
Judd, K. L., Maliar, L., Maliar, S., & Valero, R. (2014). Smolyak Method for Solving Dynamic Economic Models: Lagrange Interpolation, Anisotropic Grids and Adaptive Domain. Journal of Economic Dynamics and Control, 44, 92–123. DOI
Kim, J., Kim, S., Schaumburg, E., & Sims, C. A. (2008). Calculating and Using Second-Order Accurate Solutions of Discrete Time Dynamic Equilibrium Models. Journal of Economic Dynamics and Control, 32(11), 3397–3414. DOI
Koop, G., Pesaran, M. H., & Potter, S. M. (1996). Impulse Response Analysis in Nonlinear Multivariate Models. Journal of Econometrics, 74(1), 119–147. DOI
Santos, M. S., & Rust, J. (2003). Convergence Properties of Policy Iteration. SIAM Journal on Control and Optimization, 42(6), 2094–2115. DOI
Schmitt-Grohe, S., & Uribe, M. (2004). Solving Dynamic General Equilibrium Models Using a Second-Order Approximation to the Policy Function. Journal of Economic Dynamics and Control, 28(4), 755–775. DOI
Stokey, N. L., Lucas, R. E., & Prescott, E. C. (1989). Recursive Methods in Economic Dynamics. Harvard University Press. ISBN: 978-0-674-75096-8.
Walker, H. F., & Ni, P. (2011). Anderson Acceleration for Fixed-Point Iterations. SIAM Journal on Numerical Analysis, 49(4), 1715–1735. DOI