9.3.4. Replicating graphs

Now let us learn the second capability of bundles: graph generation and replication. We will use the energy resolution transformation as an example. In short, it smears the input histogram bin by bin: each bin is smeared with a Gaussian whose width is computed from the bin center by a formula that depends on three parameters: \(a\), \(b\) and \(c\).
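To make the smearing concrete, here is a minimal NumPy sketch that builds a smearing matrix and applies it to a histogram. It is not the GNA implementation. The width formula \(\sigma(E)=E\sqrt{a^2+b^2/E+c^2/E^2}\) is an assumption (this section only states that the width depends on \(a\), \(b\) and \(c\)), and the function names resolution_sigma and smearing_matrix are hypothetical.

```python
import numpy as np

def resolution_sigma(energy, a, b, c):
    """Absolute Gaussian width at a given energy (assumed formula, see the text above)."""
    return energy * np.sqrt(a**2 + b**2 / energy + c**2 / energy**2)

def smearing_matrix(edges, a, b, c):
    """Return matrix M with M[i, j] = fraction of true bin j seen in bin i."""
    centers = 0.5 * (edges[1:] + edges[:-1])
    widths = edges[1:] - edges[:-1]
    sigma = resolution_sigma(centers, a, b, c)
    # Gaussian density centered at true bin j, evaluated at the center of bin i
    # and multiplied by the width of bin i (midpoint approximation)
    delta = centers[:, None] - centers[None, :]
    density = np.exp(-0.5 * (delta / sigma[None, :])**2) / (np.sqrt(2.0 * np.pi) * sigma[None, :])
    return density * widths[:, None]

edges = np.linspace(0.0, 12.0, 241)
data = np.zeros(240)
data[[20, 120, 200]] = 1.0          # peaks at 1, 6 and 10 MeV

matrix = smearing_matrix(edges, a=0.01, b=0.09, c=0.03)
smeared = matrix @ data             # bin-by-bin smearing is a matrix product
```

In this sketch the matrix depends only on the bin edges and the parameters, not on the histogram contents, so one matrix can be reused for several input histograms.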

The energy resolution object contains two or more transformations: the matrix transformation computes the smearing matrix from the parameter values, and at least one smear transformation applies the smearing matrix to an input histogram.

We will now define a bundle which:

Defines the parameters \(a\), \(b\) and \(c\).
- If a major index is specified, a separate set of parameters is defined for each major iteration.
- Minor indices are ignored.
Defines the energy resolution object for each major iteration.
- The matrix transformation depends on the current parameters \(a\), \(b\) and \(c\). The bundle provides an input of the matrix transformation for each major iteration. The bin edges output should be connected to it.
- A new smear transformation is added for each minor iteration.
- The bundle provides an input/output pair for each combination of major and minor iterations.
Optional features:
- Label formats.
- Merged transformations: instead of creating a new transformation for each minor index, the same transformation processes all the inputs. The procedure is explained here.
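The replication scheme in the list above can be pictured with plain Python, without GNA. This is only a sketch: the dictionary layout and the helper combinations are hypothetical, and the index values are borrowed from the replicated example later in this section.

```python
from itertools import product

# Hypothetical index definition: short name -> values
indices = {'d': ['D1', 'D2', 'D3'], 'z': ['z1', 'z2']}
major = ['z']
minor = [name for name in indices if name not in major]

def combinations(names):
    """All combinations of values for the given indices (one empty tuple if none)."""
    return list(product(*(indices[name] for name in names)))

layout = {}
for it_major in combinations(major):
    # One set of parameters and one smearing matrix per major combination
    parameters = ['.'.join(('eres',) + it_major + (name,)) for name in ('a', 'b', 'c')]
    # One smear input/output pair per (major, minor) combination
    pairs = [it_major + it_minor for it_minor in combinations(minor)]
    layout[it_major] = {'parameters': parameters, 'pairs': pairs}

for it_major, content in layout.items():
    print(it_major, content['parameters'], len(content['pairs']), 'input/output pairs')
```

With no indices at all, combinations() returns a single empty tuple, which corresponds to the case of one object with one input/output pair.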

9.3.4.1. Energy resolution bundle

First, let us create a draft bundle detector_eres.

detector_eres_ex01.py

from gna.bundle import *
from gna import constructors as C  # provides C.EnergyResolution used in build()

class detector_eres_ex01(TransformationBundle):
    def __init__(self, *args, **kwargs):
        TransformationBundle.__init__(self, *args, **kwargs)

    def build(self):
        self.objects = []
        for it_major in self.nidx_major:
            vals = it_major.current_values(name=self.cfg.parameter)
            names = [ '.'.join(vals+(name,)) for name in self.names ]

            eres = C.EnergyResolution(names, ns=self.namespace)
            self.objects.append(eres)

            eres.print()

            self.set_input('eres_matrix', it_major, eres.matrix.Edges, argument_number=0)

            trans = eres.smear
            for i, it_minor in enumerate(self.nidx_minor):
                it = it_major + it_minor
                eres.add_input()

                self.set_input('eres', it, trans.inputs.back(), argument_number=0)
                self.set_output('eres', it, trans.outputs.back())

    def define_variables(self):
        parname = self.cfg.parameter
        parscfg = self.cfg.pars
        self.names = None

        for it_major in self.nidx_major:
            major_values = it_major.current_values()
            pars = parscfg[major_values]

            if self.names is None:
                self.names = tuple(pars.keys())
            else:
                assert self.names == tuple(pars.keys())

            for i, (name, unc) in enumerate(pars.items()):
                it=it_major

                par = self.reqparameter(parname, it, cfg=unc, extra=name)

The method define_variables() is called to define the variables, similarly to the tutorial on a bundle for parameters. The only difference is that three parameters are defined for each major iteration. An extra check ensures that the parameter names are the same for every major iteration.

Let us now look at the second method build() that creates a part of the computational graph.

        self.objects = []
        for it_major in self.nidx_major:
            vals = it_major.current_values(name=self.cfg.parameter)
            names = [ '.'.join(vals+(name,)) for name in self.names ]

            eres = C.EnergyResolution(names, ns=self.namespace)
            self.objects.append(eres)

            eres.print()

            self.set_input('eres_matrix', it_major, eres.matrix.Edges, argument_number=0)

            trans = eres.smear
            for i, it_minor in enumerate(self.nidx_minor):
                it = it_major + it_minor
                eres.add_input()

                self.set_input('eres', it, trans.inputs.back(), argument_number=0)
                self.set_output('eres', it, trans.outputs.back())

We start by iterating over the combinations of major indices:

        self.objects = []
        for it_major in self.nidx_major:
            vals = it_major.current_values(name=self.cfg.parameter)
            names = [ '.'.join(vals+(name,)) for name in self.names ]

On each iteration we make a list of parameter names to be passed to the energy resolution constructor:

            eres = C.EnergyResolution(names, ns=self.namespace)
            self.objects.append(eres)

The namespace is passed to ensure that the energy resolution object refers to the correct parameters. Here are the contents of the EnergyResolution object:

[obj] EnergyResolution: 2 transformation(s), 3 variables
[trans] matrix: 1 input(s), 1 output(s)
[in]  Edges <- ...
[out] FakeMatrix: invalid
[trans] smear: 2 input(s), 1 output(s)
[in]  FakeMatrix <- [out] FakeMatrix: invalid
[in]  Ntrue <- ...
[out] Nrec: invalid

As one can see, and as described in the EnergyResolution documentation, we need to bind a histogram that defines the bin edges to matrix.Edges and the histogram to be smeared to smear.Ntrue. Since the aim of the current bundle is not to bind the inputs and outputs but to provide them for later use, we declare the input with the following command:

            self.set_input('eres_matrix', it_major, eres.matrix.Edges, argument_number=0)

The signature is set_input('name', nidx, input, argument_number). The input will be located in bundle.context.inputs.eres_matrix. The full path after bundle.context.inputs is defined by the current format of the major index. The argument number is appended to the end of the path; it distinguishes several inputs declared under the same name.
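As a rough mental model (a toy sketch, not the GNA implementation), declaring an input amounts to storing it in a nested dictionary under the name, then the index values, then the argument number. The helper declare_input is hypothetical, and the two-digit key format is an assumption inferred from the 00 keys in the context printout later in this section.

```python
def declare_input(context, name, index_values, obj, argument_number=0):
    """Store obj at context['inputs'][name][<index values...>][<argument number>]."""
    node = context.setdefault('inputs', {}).setdefault(name, {})
    for value in index_values:
        node = node.setdefault(value, {})
    node['%02d' % argument_number] = obj   # assumed two-digit key, e.g. '00'

context = {}
declare_input(context, 'eres_matrix', (), 'Edges')             # empty major index
declare_input(context, 'eres', ('D1', 'z1'), 'Ntrue (D1, z1)')
declare_input(context, 'eres', ('D1', 'z2'), 'Ntrue (D1, z2)')
print(context)
```

The strings stand in for the actual input objects; only the path layout is of interest here.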

We then iterate over each minor index,

            for i, it_minor in enumerate(self.nidx_minor):
                it = it_major + it_minor
                eres.add_input()

                self.set_input('eres', it, trans.inputs.back(), argument_number=0)
                self.set_output('eres', it, trans.outputs.back())

add a new input/output pair and declare the input/output pair with methods set_input() and set_output().

Now let us use the bundle in the following script:

05_bundle_eres.py

import load
from gna.bundle import execute_bundle
from gna.configurator import NestedDict, uncertaindict, uncertain
from gna.env import env
from gna.bindings import common
from gna import constructors as C
import numpy as np
from matplotlib import pyplot as plt

cfg = NestedDict(
    bundle = dict(
        name='detector_eres',
        version='ex01',
        ),
    parameter = 'eres',
    pars = uncertaindict(
        [
         ('a', 0.01),
         ('b', 0.09),
         ('c', 0.03),
         ],
        mode='percent',
        uncertainty = 30.0
        ),
)
b = execute_bundle(cfg)
env.globalns.printparameters(labels=True); print()

#
# Prepare inputs
#
emin, emax = 0.0, 12.0
nbins = 240
edges = np.linspace(emin, emax, nbins+1, dtype='d')
data  = np.zeros(nbins, dtype='d')
data[20]=1.0  # 1 MeV
data[120]=1.0 # 6 MeV
data[200]=1.0 # 10 MeV
hist = C.Histogram(edges, data)
hist.hist.setLabel('Input histogram')

#
# Bind outputs
#
hist >> b.context.inputs.eres_matrix.values()
hist >> b.context.inputs.eres.values()
print( b.context )

#
# Plot
#
savegraph(hist, tutorial_image_name('png', suffix='graph'), rankdir='TB')

fig = plt.figure()
ax = plt.subplot(111, xlabel='E, MeV', ylabel='', title='Energy smearing')
ax.minorticks_on()
ax.grid()

hist.hist.hist.plot_hist(label='Original histogram')
b.context.outputs.eres.plot_hist(label='Smeared histogram')

ax.legend(loc='upper right')

savefig(tutorial_image_name('png'))
plt.show()

The configuration and execution should be familiar to the user after the tutorial on a bundle for parameters. The loaded parameters are the following:

Variables in namespace 'eres':
  a                    =       0.01 │        0.01±       0.003 [         30%] │
  b                    =       0.09 │        0.09±       0.027 [         30%] │
  c                    =       0.03 │        0.03±       0.009 [         30%] │
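The absolute uncertainties in the printout follow from the relative uncertainty of 30% given in the configuration. A short plain-Python check (the variable names are illustrative only):

```python
central_values = {'a': 0.01, 'b': 0.09, 'c': 0.03}
relative_uncertainty_percent = 30.0

# Convert the relative uncertainty (in percent) to an absolute one
absolute = {name: value * relative_uncertainty_percent / 100.0
            for name, value in central_values.items()}

for name, value in central_values.items():
    print('%s = %g +- %g' % (name, value, absolute[name]))
```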

After executing the bundle let us make an input:

emin, emax = 0.0, 12.0
nbins = 240
edges = np.linspace(emin, emax, nbins+1, dtype='d')
data  = np.zeros(nbins, dtype='d')
data[20]=1.0  # 1 MeV
data[120]=1.0 # 6 MeV
data[200]=1.0 # 10 MeV
hist = C.Histogram(edges, data)
hist.hist.setLabel('Input histogram')

Here we have defined a histogram for energies between 0 and 12 MeV with three peaks: at 1 MeV, 6 MeV and 10 MeV. The histogram output is then bound to the inputs as follows:

hist >> b.context.inputs.eres_matrix.values()
hist >> b.context.inputs.eres.values()
print( b.context )

The last line prints the contents of the context:

${
  inputs : ${
    eres_matrix : ${
      00 : [in]  Edges <- [out] hist: hist, 240 bins, edges 0.0->12.0, width 0.05,
    },
    eres : ${
      00 : [in]  Ntrue <- [out] hist: hist, 240 bins, edges 0.0->12.0, width 0.05,
    },
  },
  objects : ${},
  outputs : ${
    eres_matrix : [out] FakeMatrix: array 2d, shape 240x240, size 57600,
    eres : [out] Nrec: hist, 240 bins, edges 0.0->12.0, width 0.05,
  },
}

The context is a nested dictionary with the declared inputs eres_matrix.00 and eres.00 and the outputs eres_matrix and eres, just as declared in the bundle for the case of an empty major iterator. When binding, we used the .values() method, which returns an iterator over all the values, to avoid typing 00.

When an empty multi-index is provided, the resulting graph looks as follows:

../_images/05_bundle_eres_graph.png: The resulting graph of the energy resolution bundle for the case of an empty index.

It contains the matrix transformation, which is defined by the histogram binning. The matrix is then used to smear the histogram via the smear transformation. As noted in the EnergyResolution documentation, the input of the matrix transformation is used only to define the matrix shape: it does not read the histogram and does not propagate the taint flag.

The example produces the following plot:

../_images/05_bundle_eres.png: A histogram with bins at 1 MeV, 6 MeV and 10 MeV smeared with the EnergyResolution bundle.

9.3.4.2. Energy resolution replicated

Before trying a more complex example, we add some new features to the detector_eres bundle. We change the version mark to ex02 to keep both versions available. The updates include:

A method to label transformations and parameters based on the configuration.
An option split_transformations (True by default), which changes the graph topology.

The newer version is available in detector_eres_ex02.py.

Here is a script:

05_bundle_eres_upd.py

import load
from gna.bundle import execute_bundle
from gna.configurator import NestedDict, uncertaindict, uncertain
from gna import constructors as C
from gna.env import env
import numpy as np
from matplotlib import pyplot as plt
from gna.bindings import common

cfg = NestedDict(
    bundle = dict(
        name='detector_eres',
        version='ex02',
        nidx=[ ('d', 'detector', ['D1', 'D2', 'D3']),
               ('z', 'zone', ['z1', 'z2'])],
        major=['z'],
        names=dict(
            eres_matrix='smearing_matrix',
            )
        ),
    parameter = 'eres',
    pars = uncertaindict(
        [
         ('z1.a', (0.0, 'fixed')),
         ('z1.b', (0.05, 30, 'percent')),
         ('z1.c', (0.0, 'fixed')),
         ('z2.a', (0.0, 'fixed')),
         ('z2.b', (0.10, 30, 'percent')),
         ('z2.c', (0.0, 'fixed')),
         ('z3.a', (0.0, 'fixed')),
         ('z3.b', (0.15, 30, 'percent')) ,
         ('z3.c', (0.0, 'fixed')),
         ]
        ),
    labels = dict(
        matrix    = 'Smearing\nmatrix\n{autoindex}',
        smear     = 'Energy\nresolution\n{autoindex}',
        parameter = '{description} (zone {autoindex})'
        ),
    split_transformations = True
)
b = execute_bundle(cfg)
env.globalns.printparameters(labels=True); print()

#
# Prepare inputs
#
emin, emax = 0.0, 12.0
nbins = 240
edges = np.linspace(emin, emax, nbins+1, dtype='d')
data1  = np.zeros(nbins, dtype='d')
data1[20]=1.0  # 1 MeV
data1[120]=1.0 # 6 MeV
data1[200]=1.0 # 10 MeV

data2  = np.zeros(nbins, dtype='d')
data2[40]=1.0  # 2 MeV
data2[140]=1.0 # 7 MeV
data2[220]=1.0 # 11 MeV

data3  = np.zeros(nbins, dtype='d')
data3[60]=1.0  # 3 MeV
data3[160]=1.0 # 8 MeV
data3[239]=1.0 # 12 MeV

hist1 = C.Histogram(edges, data1)
hist1.hist.setLabel('Input histogram 1')

hist2 = C.Histogram(edges, data2)
hist2.hist.setLabel('Input histogram 2')

hist3 = C.Histogram(edges, data3)
hist3.hist.setLabel('Input histogram 3')

#
# Bind outputs
#
suffix = '' if cfg.split_transformations else 'merged_'
savegraph(b.context.outputs.smearing_matrix.values(), tutorial_image_name('png', suffix=suffix+'graph0'), rankdir='TB')

hist1   >> b.context.inputs.smearing_matrix.values(nested=True)
hist1   >> b.context.inputs.eres.D1.values(nested=True)
hist2   >> b.context.inputs.eres.D2.values(nested=True)
hist3   >> b.context.inputs.eres.D3.values(nested=True)
print( b.context )

savegraph(hist1, tutorial_image_name('png', suffix=suffix+'graph1'), rankdir='TB')

#
# Plot
#
fig = plt.figure(figsize=(12,12))

hists = [hist1, hist2, hist3]
for i, det in enumerate(['D1', 'D2', 'D3']):
    ax = plt.subplot(221+i, xlabel='E, MeV', ylabel='', title='Energy smearing in '+det)
    ax.minorticks_on()
    ax.grid()

    hists[i].hist.hist.plot_hist(label='Original histogram')
    # Use a separate counter so that the detector index i is not overwritten
    for j, out in enumerate(b.context.outputs.eres[det].values(nested=True)):
        out.plot_hist(label='Smeared histogram (%i)'%j)

    ax.legend(loc='upper right')

savefig(tutorial_image_name('png'))
plt.show()

Let us look at the configuration in more detail. First of all, we have defined the indices.

        nidx=[ ('d', 'detector', ['D1', 'D2', 'D3']),
               ('z', 'zone', ['z1', 'z2'])],
        major=['z'],

We assume that there are three detectors D1, D2 and D3 with the same energy resolution parameters. At the same time, each detector has two zones, z1 and z2, each with its own parameters. The zone index z is thus major, while the detector index d is minor. The parameter configuration contains a value and an uncertainty for each of the parameters \(a\), \(b\) and \(c\) for each of the zones.

    pars = uncertaindict(
        [
         ('z1.a', (0.0, 'fixed')),
         ('z1.b', (0.05, 30, 'percent')),
         ('z1.c', (0.0, 'fixed')),
         ('z2.a', (0.0, 'fixed')),
         ('z2.b', (0.10, 30, 'percent')),
         ('z2.c', (0.0, 'fixed')),
         ('z3.a', (0.0, 'fixed')),
         ('z3.b', (0.15, 30, 'percent')) ,
         ('z3.c', (0.0, 'fixed')),
         ]
        ),

Note that the configuration contains parameters for more zones than are used (z3 is also listed), but only the ones selected by the indices will be read.
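The selection can be illustrated with plain dictionaries. This is a sketch only: it mimics the effect of looking up the parameters by the values of the major index and does not use the uncertaindict class.

```python
all_pars = {
    'z1.a': 0.0, 'z1.b': 0.05, 'z1.c': 0.0,
    'z2.a': 0.0, 'z2.b': 0.10, 'z2.c': 0.0,
    'z3.a': 0.0, 'z3.b': 0.15, 'z3.c': 0.0,
}
zones = ['z1', 'z2']  # values of the major index 'z'

selected = {}
for zone in zones:
    prefix = zone + '.'
    # Keep only the entries of the current zone and strip the zone prefix
    selected[zone] = {key[len(prefix):]: value
                      for key, value in all_pars.items() if key.startswith(prefix)}

# Same kind of check as in define_variables(): every zone defines the same names
names = [tuple(pars) for pars in selected.values()]
assert all(current == names[0] for current in names)
print(selected)
```

The entries for z3 are never touched because z3 is not among the values of the zone index.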

Also, the bundle normally defines input/output pairs for eres and eres_matrix. These names may be overridden via dictionary names in the bundle configuration:

        names=dict(
            eres_matrix='smearing_matrix',
            )

Here we have defined a new name, smearing_matrix, for eres_matrix. The name substitution is handled by GNA itself.

Using labels, we now define the labels for the matrix and smear transformations and for the parameters. The format strings include the field {autoindex}, which is substituted by the current value of the multi-index. There is also a {description} field, which is substituted by the meaning of the parameter. The labels are set by the detector_eres bundle of version ex02.
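The substitution behaves like ordinary Python string formatting. The sketch below uses str.format directly; the parameter descriptions are copied from the parameter printout of this example, and the assumption that {autoindex} expands to the bare zone name for the transformation labels is ours.

```python
labels = dict(
    matrix    = 'Smearing\nmatrix\n{autoindex}',
    smear     = 'Energy\nresolution\n{autoindex}',
    parameter = '{description} (zone {autoindex})',
)
# Descriptions as they appear in the parameter printout of this example
descriptions = {'a': 'spatial/temporal resolution',
                'b': 'photon statistics',
                'c': 'dark noise'}

for zone in ('z1', 'z2'):
    print(repr(labels['matrix'].format(autoindex=zone)))
    for name, description in descriptions.items():
        print(name, '->', labels['parameter'].format(description=description, autoindex=zone))

label_b_z1 = labels['parameter'].format(description=descriptions['b'], autoindex='z1')
```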

Executing the bundle we produce the following graph:

../_images/05_bundle_eres_upd_graph0.png: The resulting graph of the energy resolution bundle (version ex02). Inputs are open.

As one may see, there are two smearing matrices, one for each zone. Each matrix is shared by all the detectors.

The parameters are defined for each of the zones:

Variables in namespace 'eres.z1':
  a                    =          0 │                 [fixed]                 │ spatial/temporal resolution (zone z1)
  b                    =       0.05 │        0.05±       0.015 [         30%] │ photon statistics (zone z1)
  c                    =          0 │                 [fixed]                 │ dark noise (zone z1)
Variables in namespace 'eres.z2':
  a                    =          0 │                 [fixed]                 │ spatial/temporal resolution (zone z2)
  b                    =        0.1 │         0.1±        0.03 [         30%] │ photon statistics (zone z2)
  c                    =          0 │                 [fixed]                 │ dark noise (zone z2)

We now define the input histograms in the same way as in the previous example, with one difference: there is now a separate histogram for each detector, with the peaks in different positions.

emin, emax = 0.0, 12.0
nbins = 240
edges = np.linspace(emin, emax, nbins+1, dtype='d')
data1  = np.zeros(nbins, dtype='d')
data1[20]=1.0  # 1 MeV
data1[120]=1.0 # 6 MeV
data1[200]=1.0 # 10 MeV

data2  = np.zeros(nbins, dtype='d')
data2[40]=1.0  # 2 MeV
data2[140]=1.0 # 7 MeV
data2[220]=1.0 # 11 MeV

data3  = np.zeros(nbins, dtype='d')
data3[60]=1.0  # 3 MeV
data3[160]=1.0 # 8 MeV
data3[239]=1.0 # 12 MeV

hist1 = C.Histogram(edges, data1)
hist1.hist.setLabel('Input histogram 1')

hist2 = C.Histogram(edges, data2)
hist2.hist.setLabel('Input histogram 2')

hist3 = C.Histogram(edges, data3)
hist3.hist.setLabel('Input histogram 3')

We bind the first histogram to the matrix inputs. The histograms hist1, hist2 and hist3 are then bound to all the inputs of detectors D1, D2 and D3 respectively.

hist1   >> b.context.inputs.smearing_matrix.values(nested=True)
hist1   >> b.context.inputs.eres.D1.values(nested=True)
hist2   >> b.context.inputs.eres.D2.values(nested=True)
hist3   >> b.context.inputs.eres.D3.values(nested=True)

We have used the nested=True keyword, which returns all the values in all the nested dictionaries regardless of the structure. The full graph now looks as follows:

../_images/05_bundle_eres_upd_graph1.png: The resulting graph of the energy resolution bundle (version ex02). Inputs are bound.
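The nested lookup used when binding can be pictured as a depth-first walk over a nested dictionary. The following is a minimal sketch with plain dicts: nested_values is a hypothetical helper, not the NestedDict method, and the layout of the inputs dictionary is assumed for illustration.

```python
def nested_values(tree):
    """Yield every leaf of a nested dictionary, depth first."""
    for value in tree.values():
        if isinstance(value, dict):
            yield from nested_values(value)
        else:
            yield value

# Assumed layout: detector -> zone -> argument number -> input
inputs = {
    'D1': {'z1': {'00': 'Ntrue D1.z1'}, 'z2': {'00': 'Ntrue D1.z2'}},
    'D2': {'z1': {'00': 'Ntrue D2.z1'}, 'z2': {'00': 'Ntrue D2.z2'}},
}
print(list(nested_values(inputs['D1'])))
```

Binding one histogram to every value yielded for D1 connects it to the inputs of both zones at once, whatever the depth of the dictionary.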

The result of the smearing is plotted below:

../_images/05_bundle_eres_upd.png: Three histograms (D1, D2 and D3), each smeared with two smearing matrices (z1 and z2).

9.3.4.3. Energy resolution replicated (and merged)

The computational chain topology is discussed in the tutorial Lazy evaluation and graph structure. The detector_eres bundle has a parameter split_transformations, which is True by default. This means that each minor iteration produces an input in a new transformation, so the taint flag of each input is propagated separately. If split_transformations is set to False, all the inputs for the same major iteration are handled by the same transformation, as shown in the following graph.

../_images/05_bundle_eres_upd_merged_graph0.png: The resulting graph of the energy resolution bundle (version ex02) with split_transformations=False. Inputs are open.

When the histograms are bound to the inputs, the graph looks as follows:

../_images/05_bundle_eres_upd_merged_graph1.png: The resulting graph of the energy resolution bundle (version ex02) with split_transformations=False. Inputs are bound.
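The practical consequence of the two topologies can be illustrated with a toy model of lazy evaluation. This is a sketch under simplifying assumptions, not GNA code: the class ToyTransformation and its single shared taint flag are hypothetical stand-ins for a transformation that recomputes all of its outputs whenever any of its inputs changes.

```python
class ToyTransformation:
    """Toy transformation: one shared taint flag for all of its inputs."""
    def __init__(self):
        self.inputs = []
        self.cache = []
        self.tainted = True
        self.processed = 0   # number of inputs processed so far

    def add_input(self, value):
        self.inputs.append(value)
        self.tainted = True

    def set_input(self, position, value):
        self.inputs[position] = value
        self.tainted = True   # the whole transformation is marked for update

    def outputs(self):
        if self.tainted:      # lazy evaluation: recompute only when tainted
            self.cache = [2.0 * value for value in self.inputs]
            self.processed += len(self.inputs)
            self.tainted = False
        return self.cache

def build(values, split):
    """One transformation per input if split, a single shared one otherwise."""
    groups = [[value] for value in values] if split else [list(values)]
    transformations = []
    for group in groups:
        trans = ToyTransformation()
        for value in group:
            trans.add_input(value)
        transformations.append(trans)
    return transformations

def processed_after_one_change(split):
    transformations = build([1.0, 2.0, 3.0], split)
    for trans in transformations:
        trans.outputs()                       # initial evaluation
    before = sum(trans.processed for trans in transformations)
    transformations[0].set_input(0, 10.0)     # change the first input only
    for trans in transformations:
        trans.outputs()
    return sum(trans.processed for trans in transformations) - before

print('split :', processed_after_one_change(split=True))
print('merged:', processed_after_one_change(split=False))
```

In this toy model the split layout reprocesses only the changed input, while the merged layout reprocesses all three. The merged layout in turn needs fewer transformation objects.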