genjax.inference.smc¶
Sequential Monte Carlo methods for particle-based inference.
smc¶
Standard library of programmable inference algorithms for GenJAX.
This module provides implementations of common inference algorithms that can be composed with generative functions through the GFI (Generative Function Interface). Uses GenJAX distributions and modular_vmap for efficient vectorized computation.
References
[1] P. Del Moral, A. Doucet, and A. Jasra, "Sequential Monte Carlo samplers," Journal of the Royal Statistical Society: Series B (Statistical Methodology), vol. 68, no. 3, pp. 411–436, 2006.
ParticleCollection¶
Bases: Pytree
Result of importance sampling containing traces, weights, and statistics.
log_marginal_likelihood¶
Estimate log marginal likelihood using importance sampling.
Source code in src/genjax/inference/smc.py
estimate¶
Compute weighted estimate of a function applied to particle traces.
Properly accounts for importance weights to give unbiased estimates.
Examples:
>>> import jax.numpy as jnp
>>> # particles.estimate(lambda choices: choices["param"]) # Posterior mean
>>> # particles.estimate(lambda choices: choices["param"]**2) - mean**2 # Variance
>>> # particles.estimate(lambda choices: jnp.sin(choices["x"]) + choices["y"]) # Custom
Source code in src/genjax/inference/smc.py
effective_sample_size¶
Compute the effective sample size (ESS) from log importance weights.
The ESS measures the efficiency of importance sampling by estimating the number of independent samples that would provide equivalent statistical information. It quantifies particle degeneracy in SMC algorithms.
Mathematical Formulation
Given N particles with unnormalized weights w₁, ..., wₙ:
ESS = (Σᵢ wᵢ)² / Σᵢ wᵢ²
For normalized weights, Σᵢ wᵢ = 1 and this reduces to:
ESS = 1 / Σᵢ wᵢ²
Interpretation
- ESS = N: Perfect sampling (uniform weights)
- ESS = 1: Complete degeneracy (single particle has all weight)
- ESS/N: Efficiency ratio, often used to trigger resampling when < 0.5
Connection to Importance Sampling
The ESS approximates the variance inflation factor for importance sampling estimates. For the self-normalized importance sampling estimator f̂ of 𝔼_π[f]:
Var[f̂] ≈ (N / ESS) × Var_π[f] / N = Var_π[f] / ESS
where π is the target distribution; the estimator behaves like an average over ESS independent draws from π.
References
[1] Kong, A., Liu, J. S., & Wong, W. H. (1994). "Sequential imputations and Bayesian missing data problems". Journal of the American Statistical Association, 89(425), 278-288.
[2] Liu, J. S. (2001). "Monte Carlo strategies in scientific computing". Springer, Chapter 3.
[3] Doucet, A., de Freitas, N., & Gordon, N. (2001). "Sequential Monte Carlo methods in practice". Springer, Chapter 1.
Notes
- Computed in log-space for numerical stability
- Input weights need not be normalized (handled internally)
- Common resampling threshold: ESS < N/2 (Doucet et al., 2001)
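For illustration, a minimal log-space ESS computation in JAX, mirroring the formulation above (a sketch, not the module's exact source):
import jax.numpy as jnp
from jax.scipy.special import logsumexp

def ess(log_weights):
    # Normalize in log space: log w̄ᵢ = log wᵢ - logsumexp(log w)
    log_norm = log_weights - logsumexp(log_weights)
    # ESS = 1 / Σᵢ w̄ᵢ², computed stably as exp(-logsumexp(2 · log w̄))
    return jnp.exp(-logsumexp(2.0 * log_norm))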
Source code in src/genjax/inference/smc.py
systematic_resample¶
Systematic resampling from importance weights with minimal variance.
Implements the systematic resampling algorithm (Kitagawa, 1996), which has lower variance than multinomial resampling while maintaining unbiasedness. This is the preferred resampling method for particle filters.
Mathematical Formulation
Given normalized weights w₁, ..., wₙ and cumulative sum Cᵢ = Σⱼ≤ᵢ wⱼ:
- Draw U ~ Uniform(0, 1/M) where M is the output sample size
- For i = 1, ..., M:
- Set pointer position: uᵢ = (i-1)/M + U
- Select particle: Iᵢ = min{j : Cⱼ ≥ uᵢ}
Properties
- Unbiased: 𝔼[Nᵢ] = M × wᵢ where Nᵢ is count of particle i
- Lower variance than multinomial: Var[Nᵢ] ≤ M × wᵢ × (1 - wᵢ)
- Deterministic given U: reduces Monte Carlo variance
- Preserves particle order (stratified structure)
Time complexity: O(N + M) with a single sweep over the sorted pointers (per-pointer binary search gives O(N + M log N) instead). Space complexity: O(N) for the cumulative weights.
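For illustration, a self-contained JAX sketch of the index-selection step, following the formulation above (not the module's exact source):
import jax
import jax.numpy as jnp

def systematic_indices(key, log_weights, m):
    w = jax.nn.softmax(log_weights)          # normalized weights wᵢ
    cum = jnp.cumsum(w)                      # cumulative sums Cᵢ
    u = jax.random.uniform(key, minval=0.0, maxval=1.0 / m)
    pointers = u + jnp.arange(m) / m         # uᵢ = (i-1)/M + U
    return jnp.searchsorted(cum, pointers)   # Iᵢ = min{j : Cⱼ ≥ uᵢ}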
References
[1] Kitagawa, G. (1996). "Monte Carlo filter and smoother for non-Gaussian nonlinear state space models". Journal of Computational and Graphical Statistics, 5(1), 1-25.
[2] Doucet, A., & Johansen, A. M. (2009). "A tutorial on particle filtering and smoothing: Fifteen years later". Handbook of Nonlinear Filtering, 12(656-704), 3.
[3] Hol, J. D., Schön, T. B., & Gustafsson, F. (2006). "On resampling algorithms for particle filters". In IEEE Nonlinear Statistical Signal Processing Workshop (pp. 79-82).
Notes
- Systematic resampling is preferred over multinomial for most applications
- Maintains particle diversity better than multinomial resampling
- For theoretical analysis of resampling methods, see [3]
Source code in src/genjax/inference/smc.py
resample_vectorized_trace¶
resample_vectorized_trace(trace: Trace[X, R], log_weights: ndarray, n_samples: int, method: str = 'categorical') -> Trace[X, R]
Resample a vectorized trace using importance weights.
Uses categorical or systematic sampling to select indices and jax.tree_util.tree_map to index into the Pytree leaves.
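As a sketch of the indexing step (idx is a hypothetical array of selected indices, and every leaf of the vectorized trace is assumed to share a leading particle dimension):
import jax

resampled_trace = jax.tree_util.tree_map(lambda leaf: leaf[idx], trace)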
Source code in src/genjax/inference/smc.py
init¶
init(target_gf: GFI[X, R], target_args: tuple, n_samples: Const[int], constraints: X, proposal_gf: GFI[X, Any] | None = None) -> ParticleCollection
Initialize particle collection using importance sampling.
Uses either the target's default internal proposal or a custom proposal. Proposals use signature (constraints, *target_args).
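A minimal usage sketch against the signature above; the toy model and the genjax imports (gen, normal, const) are assumptions about the surrounding API:
from genjax import gen, normal, const
from genjax.inference.smc import init

@gen
def model():
    mu = normal(0.0, 1.0) @ "mu"    # latent
    return normal(mu, 0.5) @ "y"    # observed

particles = init(model, (), const(1000), {"y": 2.0})
lml = particles.log_marginal_likelihood()  # member documented above; call form assumed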
Source code in src/genjax/inference/smc.py
change¶
change(particles: ParticleCollection, new_target_gf: GFI[X, R], new_target_args: tuple, choice_fn: Callable[[X], X]) -> ParticleCollection
Change target move for particle collection.
Translates particles from one model to another by:
1. Mapping each particle's choices using choice_fn
2. Using generate with the new model to get new weights
3. Accumulating importance weights
Choice Function Specification
CRITICAL: choice_fn must be a bijection on address space only.
- If X is a scalar type (e.g., float): Must be identity function
- If X is dict[str, Any]: May remap keys but CANNOT modify values
- Values must be preserved exactly to maintain probability density
Valid Examples:
- lambda x: x (identity mapping)
- lambda d: {"new_key": d["old_key"]} (key remapping)
- lambda d: {"mu": d["mean"], "sigma": d["std"]} (multiple key remap)
Invalid Examples:
- lambda x: x + 1 (modifies scalar values - breaks assumptions)
- lambda d: {"key": d["key"] * 2} (modifies dict values - breaks assumptions)
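A usage sketch of change with a key-remapping choice_fn; new_model is a hypothetical target whose address "mu" replaces the old "mean":
from genjax.inference.smc import change

particles = change(
    particles,
    new_model,                    # hypothetical new target
    (),                           # its arguments
    lambda d: {"mu": d["mean"]},  # remap keys, preserve values exactly
)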
Source code in src/genjax/inference/smc.py
extend¶
extend(particles: ParticleCollection, extended_target_gf: GFI[X, R], extended_target_args: Any, constraints: X, extension_proposal: GFI[X, Any] | None = None) -> ParticleCollection
Extension move for particle collection.
Extends each particle by generating from the extended target model:
1. Without extension proposal: uses the extended target's generate with constraints directly
2. With extension proposal: samples the extension, merges it with constraints, then generates
The extended target model is responsible for recognizing and incorporating existing particle state through its internal structure.
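A usage sketch against the signature above; extended_model, the timestep argument, and the observation address are placeholders:
from genjax.inference.smc import extend

particles = extend(
    particles,
    extended_model,      # hypothetical extended target
    (t,),                # e.g. the next timestep
    {"obs": y_t},        # new observation supplied as constraints
)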
Source code in src/genjax/inference/smc.py
rejuvenate¶
rejuvenate(particles: ParticleCollection, mcmc_kernel: Callable[[Trace[X, R]], Trace[X, R]]) -> ParticleCollection
Rejuvenate move for particle collection.
Applies an MCMC kernel to each particle independently to improve particle diversity and reduce degeneracy. The importance weights and diagnostic weights remain unchanged due to detailed balance.
Mathematical Foundation
For an MCMC kernel satisfying detailed balance, the log incremental weight is 0:
log_incremental_weight = log[p(x_new | args) / p(x_old | args)] + log[q(x_old | x_new) / q(x_new | x_old)]
Where:
- p(x_new | args) / p(x_old | args) is the model density ratio
- q(x_old | x_new) / q(x_new | x_old) is the proposal density ratio
Detailed balance ensures: p(x_old) * q(x_new | x_old) = p(x_new) * q(x_old | x_new)
Therefore: p(x_new) / p(x_old) = q(x_new | x_old) / q(x_old | x_new)
The model density ratio and proposal density ratio exactly cancel: log[p(x_new) / p(x_old)] + log[q(x_old | x_new) / q(x_new | x_old)] = 0
This means the importance weight contribution from the MCMC move is 0, preserving the particle weights while improving sample diversity.
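A usage sketch; mh_move stands in for whichever MCMC kernel is available (only its Trace -> Trace shape matters here):
from genjax.inference.smc import rejuvenate

def kernel(trace):
    return mh_move(trace)    # hypothetical MCMC kernel call

particles = rejuvenate(particles, kernel)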
Source code in src/genjax/inference/smc.py
resample¶
Resample particle collection to combat degeneracy.
Computes log normalized weights for diagnostics before resampling. After resampling, weights are reset to uniform (zero in log space) and the marginal likelihood estimate is updated to include the average weight before resampling.
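A sketch of the common ESS-triggered pattern; the log_weights attribute and the resample call form are assumptions about the API:
import jax

ess = effective_sample_size(particles.log_weights)  # attribute name assumed
particles = jax.lax.cond(
    ess < n_particles / 2,   # common threshold (Doucet et al., 2001)
    resample,                # call form assumed: ParticleCollection -> ParticleCollection
    lambda p: p,
    particles,
)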
Source code in src/genjax/inference/smc.py
rejuvenation_smc¶
rejuvenation_smc(model: GFI[X, R], transition_proposal: GFI[X, Any] | None = None, mcmc_kernel: Const[Callable[[Trace[X, R]], Trace[X, R]]] | None = None, observations: X | None = None, initial_model_args: tuple | None = None, n_particles: Const[int] = const(1000), return_all_particles: Const[bool] = const(False), n_rejuvenation_moves: Const[int] = const(1)) -> ParticleCollection
Complete SMC algorithm with rejuvenation using jax.lax.scan.
Implements sequential Monte Carlo with particle extension, resampling, and MCMC rejuvenation. Uses a single model with feedback loop where the return value becomes the next timestep's arguments, creating sequential dependencies.
Note on Return Value
By default (return_all_particles=const(False)), this function returns only the final ParticleCollection after processing all observations; intermediate timesteps are computed but not returned. Passing return_all_particles=const(True) returns the per-timestep collections instead, which internally corresponds to:
final_particles, all_particles = jax.lax.scan(smc_step, particles, remaining_obs)
return all_particles  # vectorized ParticleCollection with time dimension
The all-timesteps result carries an additional leading time dimension in all its fields (traces, log_weights, etc.), giving access to the full particle trajectory across timesteps.
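A sketch requesting all timesteps via the flag in the signature above; model, kernel, and obs are placeholders:
from genjax import const
from genjax.inference.smc import rejuvenation_smc

all_particles = rejuvenation_smc(
    model,
    mcmc_kernel=const(kernel),
    observations=obs,                   # leading time dimension
    initial_model_args=(0.0,),
    n_particles=const(500),
    return_all_particles=const(True),
)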
Source code in src/genjax/inference/smc.py
init_csmc¶
init_csmc(target_gf: GFI[X, R], target_args: tuple, n_samples: Const[int], constraints: X, retained_choices: X, proposal_gf: GFI[X, Any] | None = None) -> ParticleCollection
Initialize particle collection for conditional SMC with retained particle.
Simple approach: run regular init and manually override particle 0.
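A usage sketch against the signature above; retained is a hypothetical choice map (e.g. from a previous trace) that particle 0 is pinned to:
from genjax import const
from genjax.inference.smc import init_csmc

particles = init_csmc(model, (), const(100), {"y": 2.0}, retained)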
Source code in src/genjax/inference/smc.py
extend_csmc¶
extend_csmc(particles: ParticleCollection, extended_target_gf: GFI[X, R], extended_target_args: Any, constraints: X, retained_choices: X, extension_proposal: GFI[X, Any] | None = None) -> ParticleCollection
Extension move for conditional SMC with retained particle.
Like extend() but ensures particle 0 follows retained trajectory.
Source code in src/genjax/inference/smc.py
Core Functions¶
importance_sampling¶
Basic importance sampling with multiple particles.
particle_filter¶
Sequential Monte Carlo for state-space models.
rejuvenation_smc¶
SMC with MCMC rejuvenation steps for better particle diversity.
Usage Examples¶
Importance Sampling¶
from genjax import const
from genjax.inference.smc import init

# Importance sampling with 1000 particles via the documented init()
particles = init(model, args, const(1000), constraints)

# Weighted posterior mean via the weight-aware estimate() method
posterior_mean = particles.estimate(lambda choices: choices["parameter"])
Particle Filter¶
from genjax import gen
from genjax.inference.smc import particle_filter

# For sequential data; distributions.normal follows the usage elsewhere
# in these docs and assumes a distributions namespace is in scope
@gen
def transition(prev_state, t):
    return distributions.normal(prev_state, 0.1) @ f"state_{t}"

@gen
def observation(state, t):
    return distributions.normal(state, 0.5) @ f"obs_{t}"

# Run the particle filter (call signature illustrative)
particles = particle_filter(
    initial_model,
    transition,
    observation,
    observations,
    n_particles=100,
    key=key,
)
SMC with Rejuvenation¶
from genjax import const
from genjax.inference.smc import rejuvenation_smc

# SMC with optional MCMC rejuvenation, matching the signature documented above
result = rejuvenation_smc(
    model,
    mcmc_kernel=const(kernel),       # optional: rejuvenation kernel
    observations=observations,
    n_particles=const(100),
    n_rejuvenation_moves=const(5),
)
Best Practices¶
- Particle Count: Use enough particles for stable estimates (typically 100-10000, depending on model dimensionality)
- Resampling: Monitor the effective sample size and resample when ESS/N falls below about 0.5
- Proposal Design: Prefer proposals close to the posterior; the target's internal proposal is the default
- Rejuvenation: Add MCMC moves after resampling to restore particle diversity