Hamiltonian Monte Carlo with Blackjax

import matplotlib.pyplot as plt
%matplotlib inline
import matplotlib_inline
matplotlib_inline.backend_inline.set_matplotlib_formats('svg')
import seaborn as sns
sns.set_context("paper")
sns.set_style("ticks")

from functools import partial
import jax.numpy as jnp
from jax import lax, vmap
from jax.scipy.stats import multivariate_normal
import jax.random as jrandom
import blackjax

import numpy as np

key = jrandom.PRNGKey(123)

Let’s use the Hamiltonian Monte Carlo (HMC) algorithm to sample from a “banana”-shaped distribution (defined in Section 5.1.3 of Wang et al.).

Here is the probability density of the banana distribution with high curvature:

def banana_logdensity(x, a=1.15, b=0.5, rho=0.5):
    """A banana-shaped distribution. Comes from a nonlinear transformation of a correlated Gaussian."""
    x1, x2 = x
    u1 = x1/a
    u2 = a*(x2 - b*(u1**2 + a**2))
    return multivariate_normal.logpdf(jnp.array([u1, u2]), jnp.zeros(2), jnp.array([[1, rho], [rho, 1]]))

def plot_2d_function(f, alpha=1.0, plot_type="pcolormesh", ax=None, levels=None):
    x = jnp.linspace(-4, 4, 100)
    y = jnp.linspace(-1, 11, 100)
    X, Y = jnp.meshgrid(x, y)
    Z = vmap(f)(jnp.stack([X.flatten(), Y.flatten()], axis=1)).reshape(X.shape)

    if ax is None:
        _, ax = plt.subplots()
    if plot_type == "contour":
        ax.contour(X, Y, jnp.exp(Z), alpha=alpha, cmap="Greens", levels=levels)
    elif plot_type == "pcolormesh":
        ax.pcolormesh(X, Y, jnp.exp(Z), alpha=alpha)
    ax.set_xticks([-2, 0, 2])
    ax.set_yticks([0, 2, 4, 6])
    ax.set_aspect("equal")
    sns.despine(trim=True, left=True, bottom=True)
    return ax
banana_logdensity_high_curv = partial(banana_logdensity, a=1.15, b=1.0, rho=0.9)
plot_2d_function(banana_logdensity_high_curv);

Hamiltonian Monte Carlo recap

Intuitively, HMC works by simulating the dynamics of a particle moving in a potential energy field, where the potential energy is the negative log density of the target.

As an analogy, think of the probability density plot above as showing the elevation of a landscape, where the bright region is a valley. Imagine placing a ball somewhere on this landscape and kicking it in a random direction with a random amount of force. Let the ball roll for a while, then stop it and record its position. Repeat this process many times, and the positions you record will be samples from the target distribution. This is HMC in a nutshell.
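To connect the analogy to the algorithm, here is a minimal sketch of a single HMC transition (the helpers `leapfrog` and `hmc_proposal` are illustrative names, not BlackJax API): the random kick is a momentum drawn from a Gaussian, letting the ball roll is a handful of leapfrog steps of Hamiltonian dynamics, and the recorded position is accepted or rejected with a Metropolis correction that accounts for numerical integration error.

from jax import grad

def leapfrog(logdensity, position, momentum, step_size, num_steps):
    """Leapfrog integration of Hamiltonian dynamics (identity mass matrix)."""
    grad_logp = grad(logdensity)
    momentum = momentum + 0.5 * step_size * grad_logp(position)   # initial half step for momentum
    for _ in range(num_steps - 1):
        position = position + step_size * momentum                # full step for position
        momentum = momentum + step_size * grad_logp(position)     # full step for momentum
    position = position + step_size * momentum
    momentum = momentum + 0.5 * step_size * grad_logp(position)   # final half step for momentum
    return position, momentum

def hmc_proposal(key, logdensity, position, step_size=0.1, num_steps=20):
    """One HMC transition: random kick, roll, then Metropolis accept/reject."""
    key_kick, key_accept = jrandom.split(key)
    momentum = jrandom.normal(key_kick, position.shape)           # the random kick
    new_position, new_momentum = leapfrog(logdensity, position, momentum, step_size, num_steps)
    # Total energy = potential (-log density) + kinetic (0.5 * |momentum|^2)
    energy = -logdensity(position) + 0.5 * jnp.sum(momentum ** 2)
    new_energy = -logdensity(new_position) + 0.5 * jnp.sum(new_momentum ** 2)
    accept = jrandom.uniform(key_accept) < jnp.exp(energy - new_energy)
    return jnp.where(accept, new_position, position)

Calling hmc_proposal repeatedly with fresh keys, e.g. hmc_proposal(subkey, banana_logdensity_high_curv, current_position), traces out one chain. In practice we don’t write this ourselves.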

Let’s do it in BlackJax:

# HMC hyperparameters
step_size = 0.1
inverse_mass_matrix = jnp.eye(2)
num_integration_steps = 20

# Create the HMC kernel
hmc = blackjax.hmc(
    banana_logdensity_high_curv, step_size, inverse_mass_matrix, num_integration_steps
)

def step(state, _):
    """A single step of the Hamiltonian Monte Carlo sampler. Used with `lax.scan`."""
    key, kernel_state = state
    key, subkey = jrandom.split(key)
    kernel_state, info = hmc.step(subkey, kernel_state)
    return (key, kernel_state), (kernel_state.position, info)

def run_mcmc_chain(key, init_state, num_samples):
    """Run a chain of MCMC."""
    _, (samples, info) = lax.scan(step, (key, init_state), None, length=num_samples)
    return samples, info
num_chains = 5
num_samples_per_chain = 200

key, key_run, key_init = jrandom.split(key, 3)
keys = jrandom.split(key_run, num_chains)
init_state_spread = 2.0
init_state = vmap(hmc.init)(init_state_spread*jrandom.normal(key_init, (num_chains, 2)))
samples, info = vmap(run_mcmc_chain, in_axes=(0, 0, None))(keys, init_state, num_samples_per_chain)

Let’s make the trace plot (with the help of the arviz library):

import arviz as az
az.plot_trace(np.array(samples), compact=False, backend_kwargs=dict(figsize=(8,4), tight_layout=True));

Let’s look at \(\hat{R}\) to assess convergence (see the Metropolis-Hastings hands-on activity for more details):

compute_diagnostics_every = 10
rhats = []
for i in range(2, num_samples_per_chain, compute_diagnostics_every):
    rhat = blackjax.diagnostics.potential_scale_reduction(samples[:, :i])
    rhats.append(rhat)
rhats = jnp.array(rhats)
fig, ax = plt.subplots(figsize=(5,4))
ax.plot(range(2, num_samples_per_chain, compute_diagnostics_every), rhats[:,0], label=r"$\hat{R}$ for $x_1$")
ax.plot(range(2, num_samples_per_chain, compute_diagnostics_every), rhats[:,1], label=r"$\hat{R}$ for $x_2$")
ax.axhline(1.0, color="black", linestyle="--")
ax.set_xlabel("Number of samples")
ax.set_ylabel(r"$\hat{R}$")
ax.legend()
sns.despine(trim=True)

It looks like the chains converge fairly quickly.
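As a quick reminder of what \(\hat{R}\) measures, here is a bare-bones sketch of the classic (non-split) Gelman-Rubin statistic: it compares the between-chain variance to the within-chain variance, and values near 1 indicate the chains agree. Implementations differ in details (e.g., split chains), so the numbers need not match BlackJax exactly.

def gelman_rubin(chains):
    """Classic (non-split) Gelman-Rubin statistic for a (num_chains, num_draws) array."""
    n = chains.shape[1]
    chain_means = chains.mean(axis=1)
    W = chains.var(axis=1, ddof=1).mean()    # average within-chain variance
    B = n * chain_means.var(ddof=1)          # between-chain variance (scaled by chain length)
    var_hat = (n - 1) / n * W + B / n        # pooled estimate of the posterior variance
    return jnp.sqrt(var_hat / W)

# R-hat for each coordinate using all the samples
print(gelman_rubin(samples[:, :, 0]), gelman_rubin(samples[:, :, 1]))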

Let’s look at the effective sample size (ESS) to estimate how many effectively independent samples we have (again, see the Metropolis-Hastings hands-on activity for more details):

n_effs = []
for i in range(2, num_samples_per_chain, compute_diagnostics_every):
    n_eff = blackjax.diagnostics.effective_sample_size(samples[:, :i])
    n_effs.append(n_eff)
n_effs = jnp.array(n_effs)
Hide code cell source
fig, ax = plt.subplots(figsize=(5,4))
ax.plot(range(2, num_samples_per_chain, compute_diagnostics_every), n_effs[:,0], label=r"ESS for $x_1$")
ax.plot(range(2, num_samples_per_chain, compute_diagnostics_every), n_effs[:,1], label=r"ESS for $x_2$")
ax.set_xlabel("Number of samples")
ax.set_ylabel("Effective sample size")
ax.legend()
sns.despine(trim=True)

The ESS is high and grows steadily with the number of draws. This is good: it means our samples are only weakly correlated, and we don’t have to thin them out.
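As a quick, informal check of that claim (an extra step, not part of the original analysis), we can look at the lag-1 autocorrelation within each chain. The ESS estimate above accounts for autocorrelation at all lags, but lag 1 already gives a feel for how correlated consecutive draws are.

def lag1_autocorrelation(chain):
    """Lag-1 autocorrelation of a single 1D chain."""
    centered = chain - chain.mean()
    return jnp.sum(centered[:-1] * centered[1:]) / jnp.sum(centered ** 2)

# One value per chain and coordinate; values near 0 mean consecutive draws are nearly uncorrelated
print(vmap(lag1_autocorrelation)(samples[:, :, 0]))
print(vmap(lag1_autocorrelation)(samples[:, :, 1]))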

Finally, let’s plot the samples:

# The original shape of the `samples` array is (n_chains, n_samples, n_dim)
burn_in = 50  # Discard the first `burn_in` samples of each chain
thin = 1  # Keep only every `thin`-th sample
true_samples = samples[:, burn_in::thin]

# Concatenate the chains. Final shape is (n_chains * n_true_samples_per_chain, n_dim)
true_samples = true_samples.reshape(-1, 2)

ax = plot_2d_function(banana_logdensity_high_curv, alpha=0.4);
ax.set_title("MCMC samples from \nthe original distribution", fontsize=16)
ax.scatter(true_samples[:, 0], true_samples[:, 1], s=2, alpha=0.5);

They look good! Note that even though the distribution has high curvature, HMC is still able to efficiently sample from it.
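As a final sanity check (an extra step not in the original notebook), recall that the banana density was defined by transforming a correlated Gaussian, so we can invert that transformation to draw exact samples and compare moments. The parameters below are assumed to match banana_logdensity_high_curv (a=1.15, b=1.0, rho=0.9).

def sample_banana(key, num_samples, a=1.15, b=1.0, rho=0.9):
    """Exact samples from the banana distribution, by inverting the transformation in `banana_logdensity`."""
    cov = jnp.array([[1.0, rho], [rho, 1.0]])
    u = jrandom.multivariate_normal(key, jnp.zeros(2), cov, (num_samples,))
    x1 = a * u[:, 0]
    x2 = u[:, 1] / a + b * (u[:, 0] ** 2 + a ** 2)
    return jnp.stack([x1, x2], axis=1)

key, key_exact = jrandom.split(key)
exact_samples = sample_banana(key_exact, true_samples.shape[0])

# Compare the first two moments of the MCMC samples with the exact samples
print("MCMC mean:", true_samples.mean(axis=0), "exact mean:", exact_samples.mean(axis=0))
print("MCMC std: ", true_samples.std(axis=0), "exact std: ", exact_samples.std(axis=0))

If the HMC samples are good, the two sets of moments should be close.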