Riemann manifold Langevin and Hamiltonian Monte Carlo methods

1. Summary

This paper extend two MCMC algorithms: Metropolis adjusted Langevin and Hamiltonian Monte Carlo from Euclidean space to Riemann manifold to resolve the shortcomings of existing Monte Carlo algorithms when sampling from target densities that may be high dimensional and exhibit strong correlations. This is done by utilizing the geometry information associated with probability density function.

2. Enlightenment from Metropolis Adjusted Langevin and HMC

2.1. Metropolis Adjusted Langevin

Based on a Langevin diffusion, the SDE with stationary distribution is

where and denotes a D-dimensional Brownian motion. A first-order Euler discretization of the SDE gives the proposal mechanism

where and is the integration step size. Notice that the error this discretization introduces can be corrected by employing a Metropolis acceptance probability after each integration step.

Although the drift term in the proposal mechanism for the Metropolis Adjusted Langevin defines the direction for the proposal based on the gradient information, it is clear that the isotropic diffusion will be inefficient for strongly correlated variables with widely differing variances forcing the step size to accommodate the variate with smallest variance. This issue can be circumvented by employing a preconditioning matrix such that

2.2. HMC

With a auxiliary variable , we can define the negative joint log-probability as

where the kinetic energy term where the covariance matrix denotes a mass matrix.

It can be shown that the discrete version of HMC (leapfrog method) is equivalent to a Metropolis Adjusted Langevin with precondition matrix .

3. Geometry Information in Probability

A distance metric based on a first order expression for the symmetric Kullback-Liebler divergence between two densities and is defined as

Therefore, up to a small term, we have

where is the Fisher Information matrix.

The parameter space of a statistical model is a Riemannian manifold. Therefore the natural geometric structure of the density model is defined by the Riemannian manifold and associated metric tensor. Given this geometric structure of the parameter space of statistical models the appropriate adoption of the position specific metric, , within an MCMC scheme should yield more effective transitions in the overall algorithm.

4. Manifold Metropolis adjusted Langevin algorithm

Given the geometric structure for probability models a Langevin diffusion with invariant measure can be defined directly upon the Riemannian manifold with metric tensor :

where

and

5. Riemann manifold Hamiltonian Monte Carlo methods

Similarly, we can replace the covariance matrix with :

and the marginal density of does not depend on .

results matching ""

    No results matching ""