API Reference

Complete listing of all exported types and functions.

Abstract Types

DistanceDependentCRP.LikelihoodModel — Type

LikelihoodModel

Abstract supertype for all likelihood models in the DDCRP framework. Each model variant defines how cluster data contributes to the likelihood.

source

DistanceDependentCRP.PoissonModel — Type

PoissonModel <: LikelihoodModel

Abstract type for Poisson likelihood models. Concrete subtypes: PoissonClusterRates, PoissonClusterRatesMarg, PoissonPopulationRates

source

DistanceDependentCRP.BinomialModel — Type

BinomialModel <: LikelihoodModel

Abstract type for Binomial likelihood models. Concrete subtypes: BinomialClusterProb, BinomialClusterProbMarg

source

DistanceDependentCRP.GammaModel — Type

GammaModel <: LikelihoodModel

Abstract type for Gamma likelihood models. Concrete subtypes: GammaClusterShapeMarg

source

DistanceDependentCRP.AbstractMCMCState — Type

AbstractMCMCState{T<:Real}

Abstract supertype for MCMC state containers. Each model variant has its own state type.

source

DistanceDependentCRP.AbstractPriors — Type

AbstractPriors

Abstract supertype for prior specifications. Allows type-dispatched prior handling and validation.

source

DistanceDependentCRP.AbstractMCMCSamples — Type

AbstractMCMCSamples

Abstract supertype for MCMC output containers.

source

Data Containers

DistanceDependentCRP.AbstractObservedData — Type

AbstractObservedData

Abstract supertype for observed data containers in the DDCRP framework. Encapsulates response data (y), distance matrix (D), and optional trials data.

source

DistanceDependentCRP.CountData — Type

CountData{Ty, Td} <: AbstractObservedData

Observed count data for Poisson and Binomial models.

Fields

y::Ty: Observed counts (AbstractVector)
D::Td: Distance matrix (AbstractMatrix)

source

DistanceDependentCRP.CountDataWithTrials — Type

CountDataWithTrials{Ty, Tn, Td} <: AbstractObservedData

Observed count data with number of trials for Binomial models.

Fields

y::Ty: Observed successes (AbstractVector)
N::Tn: Number of trials (scalar Int or AbstractVector{Int})
D::Td: Distance matrix (AbstractMatrix)

source

DistanceDependentCRP.CountDataWithPopulation — Type

CountDataWithPopulation{Ty, Tp, Td} <: AbstractObservedData

Observed count data with population/exposure offsets for Poisson/NB population models.

Fields

y::Ty: Observed counts (AbstractVector)
P::Tp: Population or exposure (scalar or AbstractVector{<:Real})
D::Td: Distance matrix (AbstractMatrix)
missing_mask::BitVector: true for indices with missing observations (default: all false)

source

DistanceDependentCRP.ContinuousData — Type

ContinuousData{Ty, Td} <: AbstractObservedData

Observed continuous data for continuous-valued models (e.g. Gamma).

Fields

y::Ty: Observed values (AbstractVector{<:Real})
D::Td: Distance matrix (AbstractMatrix)

source

DistanceDependentCRP.observations — Function

Return the observations vector.

source

DistanceDependentCRP.distance_matrix — Function

Return the distance matrix.

source

DistanceDependentCRP.trials — Function

Return the number of trials (only for CountDataWithTrials).

source

DistanceDependentCRP.has_trials — Function

Check if data has trials information.

source

DistanceDependentCRP.population — Function

Return the population/exposure vector (only for CountDataWithPopulation).

source

DistanceDependentCRP.has_population — Function

Check if data has population/exposure information.

source

DistanceDependentCRP.get_missing_mask — Function

Return the missing data mask (BitVector of length n); nothing for other data types.

source

DistanceDependentCRP.has_missing — Function

Return true if any observations are missing.

source

DistanceDependentCRP.nobs — Function

Number of observations.

source

DistanceDependentCRP.requires_trials — Function

requires_trials(model::LikelihoodModel) -> Bool

Returns true if the model requires data with trials/exposure (N or P).

source

DistanceDependentCRP.requires_population — Function

requires_population(model::LikelihoodModel) -> Bool

Returns true if the model requires data with population/exposure offsets (P).

source

DDCRP Parameters and Options

DistanceDependentCRP.DDCRPParams — Type

DDCRPParams{T<:Real}

DDCRP hyperparameters (shared across models).

Fields

α::T: Concentration parameter (self-link probability)
scale::T: Distance decay scale parameter
decay_fn::Function: Decay function (default: exponential)
α_a::Union{T,Nothing}: Gamma shape prior for α (nothing = don't infer)
α_b::Union{T,Nothing}: Gamma rate prior for α
s_a::Union{T,Nothing}: Gamma shape prior for scale s (nothing = don't infer)
s_b::Union{T,Nothing}: Gamma rate prior for scale s

source

DistanceDependentCRP.MCMCOptions — Type

MCMCOptions

Configuration for MCMC sampling. Birth proposals and fixed-dimension proposals are passed directly to mcmc as arguments, not through options.

Fields

n_samples::Int: Number of MCMC iterations (default: 10000)
verbose::Bool: Print progress (default: false)
infer_params::Dict{Symbol, Bool}: Parameters to explicitly disable inference for (default: empty — all parameters inferred)
prop_sds::Dict{Symbol, Float64}: Proposal standard deviations for MH parameter updates
track_diagnostics::Bool: Track acceptance rates (default: true)
track_pairwise::Bool: Track pairwise proposals (default: false)

source

DistanceDependentCRP.should_infer — Function

should_infer(opts::MCMCOptions, param::Symbol) -> Bool

Check if a parameter should be inferred. Defaults to true if not specified.

source

DistanceDependentCRP.get_prop_sd — Function

get_prop_sd(opts::MCMCOptions, param::Symbol; default=0.5) -> Float64

Get the proposal standard deviation for a parameter.

source

Birth Proposals

DistanceDependentCRP.BirthProposal — Type

BirthProposal

Abstract supertype for RJMCMC birth proposal distributions. Controls how new cluster parameters are proposed when clusters split. Proposal objects are passed directly to mcmc and carry their own configuration.

source

DistanceDependentCRP.PriorProposal — Type

PriorProposal <: BirthProposal

Sample new cluster parameters from the prior distribution.

source

DistanceDependentCRP.ConjugateProposal — Type

ConjugateProposal <: BirthProposal

Marker type indicating the model has conjugate cluster parameters. When used, update_c! dispatches to Gibbs sampling for assignments instead of RJMCMC, and cluster parameters are resampled from their conjugate posteriors after assignment updates.

source

DistanceDependentCRP.MomentMatchedProposal — Type

MomentMatchedProposal <: BirthProposal

Abstract supertype for data-informed birth proposals that use empirical moments of the moving set to construct the proposal distribution.

source

DistanceDependentCRP.NormalMomentMatch — Type

NormalMomentMatch <: MomentMatchedProposal

Sample new cluster parameters from truncated Normal centered at empirical mean.

Fields

σ::Vector{Float64}: One proposal std per cluster parameter

source

DistanceDependentCRP.InverseGammaMomentMatch — Type

InverseGammaMomentMatch <: MomentMatchedProposal

Fit InverseGamma to data in moving set via method of moments. Falls back to prior if moment matching fails.

Fields

min_size::Int: Minimum cluster size to attempt moment matching

source

DistanceDependentCRP.LogNormalMomentMatch — Type

LogNormalMomentMatch <: MomentMatchedProposal

Sample on log-scale using moment-matched LogNormal proposal. For each parameter, proposes log(θ) ~ Normal(log(θest), σ) where θest is a moment-based estimate.

Fields

σ::Vector{Float64}: One proposal std per cluster parameter (on log-scale)
min_size::Int: Minimum cluster size for moment estimation

source

DistanceDependentCRP.FixedDistributionProposal — Type

FixedDistributionProposal <: BirthProposal

Sample new cluster parameters from user-specified fixed distributions.

Fields

dists::Vector{UnivariateDistribution}: One distribution per cluster parameter

source

DistanceDependentCRP.MixedProposal — Type

MixedProposal{T<:NamedTuple} <: BirthProposal

Compose per-parameter birth proposals. Each cluster parameter can use a different proposal strategy. The proposals field is a NamedTuple mapping parameter names (e.g. :λ, :α) to individual BirthProposal instances.

Dispatches to sample_birth_param and birth_param_logpdf for each parameter, which are implemented per (model, parameter, proposal) combination in each model file.

Example

MixedProposal(
    λ = LogNormalMomentMatch(0.5),
    α = NormalMomentMatch(0.5)
)

source

Fixed-Dimension Proposals

DistanceDependentCRP.FixedDimensionProposal — Type

FixedDimensionProposal

Abstract supertype for RJMCMC fixed-dimension proposal distributions. Controls how cluster parameters are updated when the moving set S_i transfers between existing clusters without changing the total number of clusters K.

source

DistanceDependentCRP.NoUpdate — Type

NoUpdate <: FixedDimensionProposal

Keep existing cluster parameters unchanged during fixed-dimension moves. The acceptance probability depends solely on the posterior ratio.

source

DistanceDependentCRP.WeightedMean — Type

WeightedMean <: FixedDimensionProposal

Deterministically update parameters as weighted averages of cluster contents. For a parameter ρ, the augmented cluster gets a weighted mean incorporating the moving set, and the depleted cluster is adjusted accordingly. The update is deterministic (lpr = 0) so the Jacobian is unity.

source

DistanceDependentCRP.Resample — Type

Resample{P<:BirthProposal} <: FixedDimensionProposal

Stochastically resample cluster parameters for the modified clusters using an inner BirthProposal. Reuses sample_birth_param/birth_param_logpdf applied to the new cluster memberships (remaining depleted, augmented). The Hastings ratio accounts for the forward and reverse proposal densities.

Fields

proposal::P: The birth proposal to use for resampling

Example

Resample(NormalMomentMatch(0.5, 0.3, 0.5))  # moment-matched resampling
Resample()                                    # prior-based resampling

source

DistanceDependentCRP.MixedFixedDim — Type

MixedFixedDim{T<:NamedTuple} <: FixedDimensionProposal

Compose per-parameter fixed-dimension proposals. Each cluster parameter can use a different update strategy. The proposals field is a NamedTuple mapping parameter names to individual FixedDimensionProposal instances. Unspecified parameters default to NoUpdate.

Example

MixedFixedDim(ξ = WeightedMean(), ω = NoUpdate(), α = NoUpdate())

source

Poisson Models

DistanceDependentCRP.PoissonClusterRates — Type

PoissonClusterRates <: PoissonModel

Poisson model with explicit cluster-specific rates. Rates λ_k are maintained and updated via conjugate Gibbs sampling.

Parameters:

c: Customer assignments
λ_k: Cluster rates (cluster-level)

source

DistanceDependentCRP.PoissonClusterRatesState — Type

PoissonClusterRatesState{T<:Real} <: AbstractMCMCState{T}

State for PoissonClusterRates model.

Fields

c::Vector{Int}: Customer assignments (link representation)
λ_dict::Dict{Vector{Int}, T}: Table -> cluster rate mapping

source

DistanceDependentCRP.PoissonClusterRatesPriors — Type

PoissonClusterRatesPriors{T<:Real} <: AbstractPriors

Prior specification for PoissonClusterRates model.

Fields

λ_a::T: Gamma shape parameter for rate λ
λ_b::T: Gamma rate parameter for rate λ

source

DistanceDependentCRP.PoissonClusterRatesSamples — Type

PoissonClusterRatesSamples{T<:Real} <: AbstractMCMCSamples

MCMC samples container for PoissonClusterRates model.

Fields

c::Matrix{Int}: Customer assignments (nsamples x nobs)
λ::Matrix{T}: Cluster rates per observation (nsamples x nobs)
logpost::Vector{T}: Log-posterior values (n_samples)

source

DistanceDependentCRP.PoissonClusterRatesMarg — Type

PoissonClusterRatesMarg <: PoissonModel

Poisson model with cluster rates marginalised out. Uses Gamma-Poisson conjugacy for closed-form marginal likelihood.

Parameters:

c: Customer assignments only

source

DistanceDependentCRP.PoissonClusterRatesMargState — Type

PoissonClusterRatesMargState <: AbstractMCMCState{Float64}

State for PoissonClusterRatesMarg model.

Fields

c::Vector{Int}: Customer assignments (link representation)

source

DistanceDependentCRP.PoissonClusterRatesMargPriors — Type

PoissonClusterRatesMargPriors{T<:Real} <: AbstractPriors

Prior specification for PoissonClusterRatesMarg model.

Fields

λ_a::T: Gamma shape parameter for rate λ
λ_b::T: Gamma rate parameter for rate λ

source

DistanceDependentCRP.PoissonClusterRatesMargSamples — Type

PoissonClusterRatesMargSamples{T<:Real} <: AbstractMCMCSamples

MCMC samples container for PoissonClusterRatesMarg model.

Fields

c::Matrix{Int}: Customer assignments (nsamples x nobs)
logpost::Vector{T}: Log-posterior values (n_samples)

source

DistanceDependentCRP.PoissonPopulationRates — Type

PoissonPopulationRates <: PoissonModel

Poisson model with population/exposure adjustment. Rate for observation i in cluster k is λi = Pi * ρ_k.

Parameters:

c: Customer assignments
ρ_k: Cluster rate multipliers (cluster-level)

Requires exposure data P_i for each observation.

source

DistanceDependentCRP.PoissonPopulationRatesState — Type

PoissonPopulationRatesState{T<:Real} <: AbstractMCMCState{T}

State for PoissonPopulationRates model.

Fields

c::Vector{Int}: Customer assignments (link representation)
ρ_dict::Dict{Vector{Int}, T}: Table -> cluster rate multiplier mapping

source

DistanceDependentCRP.PoissonPopulationRatesPriors — Type

PoissonPopulationRatesPriors{T<:Real} <: AbstractPriors

Prior specification for PoissonPopulationRates model.

Fields

ρ_a::T: Gamma shape parameter for rate multiplier ρ
ρ_b::T: Gamma rate parameter for rate multiplier ρ

source

DistanceDependentCRP.PoissonPopulationRatesSamples — Type

PoissonPopulationRatesSamples{T<:Real} <: AbstractMCMCSamples

MCMC samples container for PoissonPopulationRates model.

Fields

c::Matrix{Int}: Customer assignments (nsamples x nobs)
ρ::Matrix{T}: Cluster rate multipliers per observation (nsamples x nobs)
logpost::Vector{T}: Log-posterior values (n_samples)

source

DistanceDependentCRP.PoissonPopulationRatesMarg — Type

PoissonPopulationRatesMarg <: PoissonModel

Poisson model with population/exposure offsets and cluster rates marginalised out. Uses Gamma-Poisson conjugacy for a closed-form marginal likelihood.

Missing observations are excluded from the likelihood; their cluster assignments are updated using only the ddCRP prior.

Parameters:

c: Customer assignments only

Requires population data P_i for each observation via CountDataWithPopulation.

source

DistanceDependentCRP.PoissonPopulationRatesMargState — Type

PoissonPopulationRatesMargState <: AbstractMCMCState{Float64}

State for PoissonPopulationRatesMarg model.

Fields

c::Vector{Int}: Customer assignments (link representation)

source

DistanceDependentCRP.PoissonPopulationRatesMargPriors — Type

PoissonPopulationRatesMargPriors{T<:Real} <: AbstractPriors

Prior specification for PoissonPopulationRatesMarg model.

Fields

ρ_a::T: Gamma shape parameter for cluster rate multiplier ρ
ρ_b::T: Gamma rate parameter for cluster rate multiplier ρ

source

DistanceDependentCRP.PoissonPopulationRatesMargSamples — Type

PoissonPopulationRatesMargSamples{T<:Real} <: AbstractMCMCSamples

MCMC samples container for PoissonPopulationRatesMarg model.

Fields

c::Matrix{Int}: Customer assignments (nsamples × nobs)
logpost::Vector{T}: Log-posterior values (n_samples)
α_ddcrp::Vector{T}: DDCRP concentration samples (n_samples)
s_ddcrp::Vector{T}: DDCRP decay scale samples (n_samples)

source

Binomial Models

DistanceDependentCRP.BinomialClusterProb — Type

BinomialClusterProb <: BinomialModel

Binomial model with explicit cluster-specific success probabilities. Probabilities p_k are maintained and updated via conjugate Gibbs sampling.

Parameters:

c: Customer assignments
p_k: Cluster probabilities (cluster-level)

source

DistanceDependentCRP.BinomialClusterProbState — Type

BinomialClusterProbState{T<:Real} <: AbstractMCMCState{T}

State for BinomialClusterProb model.

Fields

c::Vector{Int}: Customer assignments (link representation)
p_dict::Dict{Vector{Int}, T}: Table -> cluster probability mapping

source

DistanceDependentCRP.BinomialClusterProbPriors — Type

BinomialClusterProbPriors{T<:Real} <: AbstractPriors

Prior specification for BinomialClusterProb model.

Fields

p_a::T: Beta α parameter for probability p
p_b::T: Beta β parameter for probability p

source

DistanceDependentCRP.BinomialClusterProbSamples — Type

BinomialClusterProbSamples{T<:Real} <: AbstractMCMCSamples

MCMC samples container for BinomialClusterProb model.

Fields

c::Matrix{Int}: Customer assignments (nsamples x nobs)
p::Matrix{T}: Cluster probabilities per observation (nsamples x nobs)
logpost::Vector{T}: Log-posterior values (n_samples)

source

DistanceDependentCRP.BinomialClusterProbMarg — Type

BinomialClusterProbMarg <: BinomialModel

Binomial model with cluster probabilities marginalised out. Uses Beta-Binomial conjugacy for closed-form marginal likelihood.

Parameters:

c: Customer assignments only

source

DistanceDependentCRP.BinomialClusterProbMargState — Type

BinomialClusterProbMargState <: AbstractMCMCState{Float64}

State for BinomialClusterProbMarg model.

Fields

c::Vector{Int}: Customer assignments (link representation)

source

DistanceDependentCRP.BinomialClusterProbMargPriors — Type

BinomialClusterProbMargPriors{T<:Real} <: AbstractPriors

Prior specification for BinomialClusterProbMarg model.

Fields

p_a::T: Beta α parameter for probability p
p_b::T: Beta β parameter for probability p

source

DistanceDependentCRP.BinomialClusterProbMargSamples — Type

BinomialClusterProbMargSamples{T<:Real} <: AbstractMCMCSamples

MCMC samples container for BinomialClusterProbMarg model.

Fields

c::Matrix{Int}: Customer assignments (nsamples x nobs)
logpost::Vector{T}: Log-posterior values (n_samples)

source

Gamma Models

DistanceDependentCRP.GammaClusterShapeMarg — Type

GammaClusterShapeMarg <: GammaModel

Gamma model with cluster-specific shape parameters (αk). Rate parameters (βk) are marginalised out using Gamma-Gamma conjugacy.

Parameters:

α_k: Cluster shape parameters (cluster-level, explicit)
c: Customer assignments

Marginalised: β_k (cluster rate parameters integrated out analytically)

source

DistanceDependentCRP.GammaClusterShapeMargState — Type

GammaClusterShapeMargState{T<:Real} <: AbstractMCMCState{T}

State for GammaClusterShapeMarg model.

Fields

c::Vector{Int}: Customer assignments (link representation)
α_dict::Dict{Vector{Int}, T}: Table -> cluster shape parameter mapping

source

DistanceDependentCRP.GammaClusterShapeMargPriors — Type

GammaClusterShapeMargPriors{T<:Real} <: AbstractPriors

Prior specification for GammaClusterShapeMarg model.

Fields

α_a::T: Gamma shape parameter for α prior (shape of shape)
α_b::T: Gamma rate parameter for α prior
β_a::T: Gamma shape parameter for β prior (used in marginal likelihood)
β_b::T: Gamma rate parameter for β prior (used in marginal likelihood)

source

DistanceDependentCRP.GammaClusterShapeMargSamples — Type

GammaClusterShapeMargSamples{T<:Real} <: AbstractMCMCSamples

MCMC samples container for GammaClusterShapeMarg model.

Fields

c::Matrix{Int}: Customer assignments (nsamples x nobs)
α::Matrix{T}: Shape per observation (nsamples x nobs) - stores cluster α
logpost::Vector{T}: Log-posterior values (n_samples)

source

Main MCMC Entry Point

DistanceDependentCRP.mcmc — Function

mcmc(model, data, ddcrp_params, priors, proposal; fixed_dim_proposal, opts)

Main MCMC entry point. Dispatches based on model type.

Arguments

model::LikelihoodModel: The likelihood model (determines parameter structure)
data::AbstractObservedData: Observed data container
ddcrp_params::DDCRPParams: DDCRP hyperparameters
priors::AbstractPriors: Prior specification
proposal::BirthProposal: Birth proposal for RJMCMC (or ConjugateProposal for Gibbs)

Keyword Arguments

fixed_dim_proposal::FixedDimensionProposal: Fixed-dimension proposal (default: NoUpdate())
opts::MCMCOptions: MCMC configuration

Returns

Model-specific *Samples struct (subtype of AbstractMCMCSamples)
MCMCDiagnostics (optional): If opts.track_diagnostics is true

source

Convenience: CountData models (Poisson, NegBin) with separate y, D.

source

Convenience: CountDataWithTrials models (Binomial) or CountDataWithPopulation models with separate y, N/P, D.

source

Convenience: ContinuousData models (Gamma) with separate y, D.

source

Model Interface Methods

DistanceDependentCRP.initialise_state — Function

initialise_state(model::PoissonClusterRates, data, ddcrp_params, priors)

Create initial MCMC state for the model.

source

initialise_state(model::PoissonClusterRatesMarg, data, ddcrp_params, priors)

Create initial MCMC state for the model.

source

initialise_state(model::PoissonPopulationRates, data, ddcrp_params, priors)

Create initial MCMC state for the model.

source

initialise_state(model::PoissonPopulationRatesMarg, data, ddcrp_params, priors)

Create initial MCMC state. Assignments are drawn from the ddCRP prior.

source

initialise_state(model::BinomialClusterProb, data, ddcrp_params, priors)

Create initial MCMC state for the model.

source

initialise_state(model::BinomialClusterProbMarg, data, ddcrp_params, priors)

Create initial MCMC state for the model.

source

initialise_state(model::GammaClusterShapeMarg, data, ddcrp_params, priors)

Create initial MCMC state for the model. Initialises shape parameters using method of moments.

source

DistanceDependentCRP.allocate_samples — Function

allocate_samples(model::PoissonClusterRates, n_samples, n)

Allocate storage for MCMC samples.

source

allocate_samples(model::PoissonClusterRatesMarg, n_samples, n)

Allocate storage for MCMC samples.

source

allocate_samples(model::PoissonPopulationRates, n_samples, n)

Allocate storage for MCMC samples.

source

allocate_samples(model::PoissonPopulationRatesMarg, n_samples, n)

Allocate storage for MCMC samples.

source

allocate_samples(model::BinomialClusterProb, n_samples, n)

Allocate storage for MCMC samples.

source

allocate_samples(model::BinomialClusterProbMarg, n_samples, n)

Allocate storage for MCMC samples.

source

allocate_samples(model::GammaClusterShapeMarg, n_samples, n)

Allocate storage for MCMC samples.

source

DistanceDependentCRP.extract_samples! — Function

extract_samples!(model::PoissonClusterRates, state, samples, iter)