`parameters` macro for specifying trace manually (aka Turing's `VarInfo`) #2492

yebai · 2025-02-24T12:45:09Z

It has been known for a while that one can write Turing models as callable objects/functors, in the form of

struct MyModel
    a::Int
end
@model function (f::MyModel)(x)
    m ~ Normal(f.a, 1)
    return x ~ Normal(m, 1)
end

Suppose one further assumes that all model parameters (i.e. LHS of tilde) have to be a field of MyModel. In that case, the type definition of MyModel manually specifies the tracing data structure for Turing models.

However, the above example doesn't yet use MyModel as a substitute for TypedVarInfo. For that to happen, one can introduce a new @parameters macro to manually specify TypedVarInfo for models. Here is an example:

@parameters struct Demo{T}
   m::T
   s²::T
   x::Vector{T}
end

@model function (gdemo::Demo)()
   (;m, s², x) = gdemo
   m ~ Normal(0, sqrt(s²))
   s² ~ InverseGamma(2, 3)
   x .~ Normal(m, sqrt(s²))
end

Here, all the model parameters are passed to models transparently via the (;m, s², x) = gdemo. This avoids the use of VarInfo ultimately (we will need VarInfo for other purposes in a lightweight manner, e.g. for accumulating logp . If a sampler opts to leave a parameter unspecified (i.e. sample_from_prior / sample_from_proposal), the model evaluator can still sample these parameters from their conditional prior. Otherwise, samplers must specify each parameter with a concrete value or mark it as missing or predict so the model evaluator handles them accordingly. In addition, users could manually specify parameter constraints and bijector transforms on struct Demo (note this happens without running the model once). All these properties allow more clarity, reduce the complexity of DynamicPPL, and improve robustness by avoiding edge cases of VarInfo.

This approach offers many advantages, e.g.

simplifying the complex code in DynamicPPL to obtain a TypedVarInfo (see Simplify the context mechanism in DynamicPPL #2424 and Significantly simplify the tilde-pipeline in DynamicPPL #2422), which requires running the model once
interact more nicely with operations like fix/condition/biject_transform/missing/predict/sample_from_prior: we could mark fields of MyModel as one of these operations without requiring DynamicPPL's model evaluation context (see, e.g. Convenience macros to use within @model DynamicPPL.jl#714).

A note on the practicals for DynamicPPL: as I mentioned in a previous meeting, one could refactor the DynamicPPL functionality for building TypedVarInfo and provide an API for users to define @parameters struct Demo{T} automatically. This would involve running the model once, collecting all the model parameters and their support constraints, and then automatically generating a Julia struct definition.

This new design (passing tracing data structure explicitly and assume it is immutable) will resolve the following:

The text was updated successfully, but these errors were encountered:

mhauru · 2025-02-24T15:55:41Z

As discussed previously, I like this proposal as long as we can find a way to make most @models generate their @parameters automatically, so that a) simple models will continue to work as they do now, b) we don't need to maintain the new struct-based VarInfo in parallel with the current one.

I wonder what the best syntax is for connecting a particular parameter struct with the variables of a model. With

@model function (gdemo::Demo)()
   (;m, s², x) = gdemo
   m ~ Normal(0, sqrt(s²))
   s² ~ InverseGamma(2, 3)
   x .~ Normal(m, sqrt(s²))
end

it's not obvious to me how we would match e.g. the m in m ~ Normal(0, sqrt(s²)) to the field gdemo.m. We could make gdemo.m return some type of our own for this purpose, but then we would lose the nice property that in Turing models during execution the random variables are just regular Julia variables with regular values like Float64. Relying on matching names of fields and variables feels a bit hacky and magical too.

@model function (gdemo::Demo)()
   gdemo.m ~ Normal(0, sqrt(s²))
   gdemo.s² ~ InverseGamma(2, 3)
   gdemo.x .~ Normal(m, sqrt(s²))
end

would work but gets a bit verbose.

yebai · 2025-02-25T11:46:07Z

We could make gdemo.m return some type of our own for this purpose

A quick note for future discussion: we could use a type similar to ForwardDiff's Dual to carry extra sampler/inference information. This can be made very generic and robust. For probabilistic programming, Dual's implementation is often more straightforward than differentiable programming.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`parameters` macro for specifying trace manually (aka Turing's `VarInfo`) #2492

`parameters` macro for specifying trace manually (aka Turing's `VarInfo`) #2492

yebai commented Feb 24, 2025 •

edited

Loading

mhauru commented Feb 24, 2025

yebai commented Feb 25, 2025

parameters macro for specifying trace manually (aka Turing's VarInfo) #2492

parameters macro for specifying trace manually (aka Turing's VarInfo) #2492

Comments

yebai commented Feb 24, 2025 • edited Loading

mhauru commented Feb 24, 2025

yebai commented Feb 25, 2025

`parameters` macro for specifying trace manually (aka Turing's `VarInfo`) #2492

`parameters` macro for specifying trace manually (aka Turing's `VarInfo`) #2492

yebai commented Feb 24, 2025 •

edited

Loading