Specific IMNN classes¶
The available modules are:
SimulatorIMNN¶
-
class imnn.SimulatorIMNN(n_s, n_d, n_params, n_summaries, input_shape, θ_fid, model, optimiser, key_or_state, simulator)¶
Information maximising neural network fit with simulations on-the-fly
Defines the function to get simulations and compress them using an XLA compilable simulator.
The outline of the fitting procedure is that a set of \(i\in[1, n_s]\) random number generators are generated and used to generate a set of \(n_s\) simulations, \({\bf d}^i={\rm simulator}({\rm seed}^i, \theta^{\rm fid})\), at the fiducial model parameters, \(\theta^{\rm fid}\). These are passed directly through a network \(f_{{\bf w}}({\bf d})\) with network parameters \({\bf w}\) to obtain network outputs \({\bf x}^i\), and autodifferentiation is used to get the derivative of \(n_d\) of these outputs with respect to the physical model parameters, \(\partial{{\bf x}^i}/\partial\theta_\alpha\), where \(\alpha\) labels the physical parameter. With \({\bf x}^i\) and \(\partial{{\bf x}^i}/\partial\theta_\alpha\) the covariance
\[C_{ab} = \frac{1}{n_s-1}\sum_{i=1}^{n_s}(x^i_a-\mu_a) (x^i_b-\mu_b)\]and the derivative of the mean of the network outputs with respect to the model parameters
\[\frac{\partial\mu_a}{\partial\theta_\alpha} = \frac{1}{n_d} \sum_{i=1}^{n_d}\frac{\partial{x^i_a}}{\partial\theta_\alpha}\]can be calculated and used to form the Fisher information matrix
\[F_{\alpha\beta} = \frac{\partial\mu_a}{\partial\theta_\alpha} C^{-1}_{ab}\frac{\partial\mu_b}{\partial\theta_\beta}.\]The loss function is then defined as
\[\Lambda = -\log|{\bf F}| + r(\Lambda_2) \Lambda_2\]Since any linear rescaling of a sufficient statistic is also a sufficient statistic, the negative logarithm of the determinant of the Fisher information matrix needs to be regularised to fix the scale of the network outputs. We choose to fix this scale by constraining the covariance of network outputs as
\[\Lambda_2 = ||{\bf C}-{\bf I}|| + ||{\bf C}^{-1}-{\bf I}||\]This constraint forces the covariance to be approximately parameter independent, which justifies choosing the covariance-independent Gaussian Fisher information as above. To avoid having a dual optimisation objective, we use a smooth and dynamic regularisation strength which turns off the regularisation to focus on maximising the Fisher information once the covariance has set the scale
\[r(\Lambda_2) = \frac{\lambda\Lambda_2}{\Lambda_2-\exp(-\alpha\Lambda_2)}.\]Once the loss function is calculated, its automatic gradient is computed and used to update the network parameters via the optimiser function.
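The equations above translate almost line-for-line into jax. The following is a minimal, illustrative sketch of the loss computation from a set of network outputs and their parameter derivatives; the function and argument names are hypothetical and this is not the class's actual private implementation:

    import jax.numpy as np

    def sketch_loss(summaries, derivatives, λ, α):
        # summaries: float(n_s, n_summaries)
        # derivatives: float(n_d, n_summaries, n_params)
        n_s = summaries.shape[0]
        μ = np.mean(summaries, axis=0)
        # Covariance of the network outputs, C_ab
        C = (summaries - μ).T @ (summaries - μ) / (n_s - 1)
        invC = np.linalg.inv(C)
        # Derivative of the mean of the outputs wrt model parameters
        dμ_dθ = np.mean(derivatives, axis=0)
        # Gaussian Fisher information, F_αβ = ∂μ_a/∂θ_α C^{-1}_ab ∂μ_b/∂θ_β
        F = dμ_dθ.T @ invC @ dμ_dθ
        # Regularisation Λ2 = ||C - I|| + ||C^{-1} - I|| (Frobenius norms)
        I = np.eye(C.shape[0])
        Λ2 = np.linalg.norm(C - I) + np.linalg.norm(invC - I)
        # Dynamic regularisation strength r(Λ2), as defined above
        r = λ * Λ2 / (Λ2 - np.exp(-α * Λ2))
        # Loss Λ = -log|F| + r(Λ2) Λ2
        return -np.linalg.slogdet(F)[1] + r * Λ2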
-
simulator:
A function for generating a simulation on-the-fly (XLA compilable)
Public Methods:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
get_summaries(w, key[, validate]) – Gets all network outputs and derivatives wrt model parameters
Inherited from _IMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
fit(λ, ε[, rng, patience, min_iterations, …]) – Fitting routine for the IMNN
get_α(λ, ε) – Calculate rate parameter for regularisation from closeness criterion
set_F_statistics([w, key, validate]) – Set necessary attributes for calculating score compressed summaries
get_summaries(w, key[, validate]) – Gets all network outputs and derivatives wrt model parameters
get_estimate(d) – Calculate score compressed parameter estimates from network outputs
plot([ax, expected_detF, colour, figsize, …]) – Plot fitting history
Private Methods:
_get_fitting_keys(rng) – Generates random numbers for simulation
Inherited from _IMNN:
_initialise_parameters(n_s, n_d, n_params, …) – Performs type checking and initialisation of class attributes
_initialise_model(model, optimiser, key_or_state) – Initialises neural network parameters or loads optimiser state
_initialise_history() – Initialises history dictionary attribute
_set_history(results) – Places results from fitting into the history dictionary
_set_inputs(rng, max_iterations) – Builds list of inputs for the XLA compilable fitting routine
_get_fitting_keys(rng) – Generates random numbers for simulation
_fit(inputs, λ=None, α=None[, min_iterations]) – Single iteration fitting algorithm
_fit_cond(inputs, patience, max_iterations) – Stopping condition for the fitting loop
_update_loop_vars(inputs) – Updates input parameters if max_detF is increased
_check_loop_vars(inputs, min_iterations) – Updates patience_counter if max_detF not increased
_update_history(inputs, history, counter, ind) – Puts current fitting statistics into history arrays
_slogdet(matrix) – Combined summed logarithmic determinant
_construct_derivatives(derivatives) – Builds derivatives of the network outputs wrt model parameters
_get_F_statistics([w, key, validate]) – Calculates the Fisher information and returns all statistics used
_calculate_F_statistics(summaries, derivatives) – Calculates the Fisher information matrix from network outputs
_get_regularisation_strength(Λ2, λ, α) – Coupling strength of the regularisation (amplified sigmoid)
_get_regularisation(C, invC) – Difference of the covariance (and its inverse) from identity
_get_loss(w, λ, α[, key]) – Calculates the loss function and returns auxiliary variables
_calculate_loss(summaries, derivatives, λ, α) – Calculates the loss function from network summaries and derivatives
_setup_plot([ax, expected_detF, figsize]) – Builds axes for history plot
-
_get_fitting_keys(rng)¶
Generates random numbers for simulation
- Parameters
rng (int(2,)) – A random number generator
- Returns
A new random number generator and random number generators for fitting (and validation)
- Return type
int(2,), int(2,), int(2,)
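In jax this plausibly amounts to a three-way key split; a hedged sketch of the equivalent call (the exact splitting scheme inside the class is an assumption):

    import jax

    rng = jax.random.PRNGKey(0)
    # One generator to carry forward, one for fitting and one for validation
    rng, fitting_key, validation_key = jax.random.split(rng, num=3)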
-
get_summaries(w, key, validate=False)¶
Gets all network outputs and derivatives wrt model parameters
A random seed for each simulation is obtained and n_d of them are used to calculate the network outputs of each of these simulations as well as the derivative of these network outputs with respect to the model parameters, calculated using jax autodifferentiation. The remaining n_s - n_d network outputs are then calculated and concatenated to those already calculated.
- Parameters
w (list or None, default=None) – The network parameters if wanting to calculate the Fisher information with a specific set of network parameters
key (int(2,) or None, default=None) – A random number generator for generating simulations on-the-fly
validate (bool, default=False) – Whether to get summaries of the validation set
- Returns
float(n_s, n_summaries) – The set of all network outputs used to calculate the covariance
float(n_d, n_summaries, n_params) – The set of all network output derivatives wrt model parameters
-
get_summary:
Return a single network output
-
get_derivatives:
Return the Jacobian of the network outputs wrt model parameters
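A minimal usage sketch for SimulatorIMNN, assuming a toy Gaussian simulator and a jax version where stax and optimizers live under jax.example_libraries (older versions export them from jax.experimental); all sizes and the fitting hyperparameters λ and ε are illustrative:

    import jax
    import jax.numpy as np
    from jax.example_libraries import stax, optimizers
    import imnn

    # Toy simulator: 10 draws from N(μ, Σ), with θ = (μ, Σ)
    def simulator(key, θ):
        return θ[0] + np.sqrt(θ[1]) * jax.random.normal(key, shape=(10,))

    rng = jax.random.PRNGKey(0)
    model_key, fit_key = jax.random.split(rng)

    imnn_obj = imnn.SimulatorIMNN(
        n_s=1000, n_d=1000, n_params=2, n_summaries=2, input_shape=(10,),
        θ_fid=np.array([0., 1.]),
        model=stax.serial(stax.Dense(64), stax.Relu, stax.Dense(2)),
        optimiser=optimizers.adam(step_size=1e-3),
        key_or_state=model_key, simulator=simulator)
    imnn_obj.fit(λ=10., ε=0.1, rng=fit_key)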
-
AggregatedSimulatorIMNN¶
-
class imnn.AggregatedSimulatorIMNN(n_s, n_d, n_params, n_summaries, input_shape, θ_fid, model, optimiser, key_or_state, simulator, host, devices, n_per_device)¶
Information maximising neural network fit with simulations on-the-fly
Defines the function to get simulations and compress them using an XLA compilable simulator.
The outline of the fitting procedure is that a set of \(i\in[1, n_s]\) random number generators are generated and used to generate a set of \(n_s\) simulations, \({\bf d}^i={\rm simulator}({\rm seed}^i, \theta^{\rm fid})\), at the fiducial model parameters, \(\theta^{\rm fid}\). These are passed directly through a network \(f_{{\bf w}}({\bf d})\) with network parameters \({\bf w}\) to obtain network outputs \({\bf x}^i\), and autodifferentiation is used to get the derivative of \(n_d\) of these outputs with respect to the physical model parameters, \(\partial{{\bf x}^i}/\partial\theta_\alpha\), where \(\alpha\) labels the physical parameter. With \({\bf x}^i\) and \(\partial{{\bf x}^i}/\partial\theta_\alpha\) the covariance
\[C_{ab} = \frac{1}{n_s-1}\sum_{i=1}^{n_s}(x^i_a-\mu_a) (x^i_b-\mu_b)\]and the derivative of the mean of the network outputs with respect to the model parameters
\[\frac{\partial\mu_a}{\partial\theta_\alpha} = \frac{1}{n_d} \sum_{i=1}^{n_d}\frac{\partial{x^i_a}}{\partial\theta_\alpha}\]can be calculated and used to form the Fisher information matrix
\[F_{\alpha\beta} = \frac{\partial\mu_a}{\partial\theta_\alpha} C^{-1}_{ab}\frac{\partial\mu_b}{\partial\theta_\beta}.\]The loss function is then defined as
\[\Lambda = -\log|{\bf F}| + r(\Lambda_2) \Lambda_2\]Since any linear rescaling of a sufficient statistic is also a sufficient statistic, the negative logarithm of the determinant of the Fisher information matrix needs to be regularised to fix the scale of the network outputs. We choose to fix this scale by constraining the covariance of network outputs as
\[\Lambda_2 = ||{\bf C}-{\bf I}|| + ||{\bf C}^{-1}-{\bf I}||\]This constraint forces the covariance to be approximately parameter independent, which justifies choosing the covariance-independent Gaussian Fisher information as above. To avoid having a dual optimisation objective, we use a smooth and dynamic regularisation strength which turns off the regularisation to focus on maximising the Fisher information once the covariance has set the scale
\[r(\Lambda_2) = \frac{\lambda\Lambda_2}{\Lambda_2-\exp(-\alpha\Lambda_2)}.\]To enable the use of large data (or networks) the whole procedure is aggregated. This means that the generation and passing of the simulations through the network is farmed out to the desired XLA devices, and recollected, n_per_device inputs at a time. These are then used to calculate the automatic gradient of the loss function with respect to the calculated summaries and derivatives, \(\partial\Lambda/\partial{\bf x}^i\) (which is a fairly small computation as long as n_summaries, n_s and n_d are not huge). Once this is calculated, the simulations are passed through the network again, this time calculating the Jacobian of the network outputs with respect to the network parameters, \(\partial{\bf x}^i/\partial{\bf w}\), which is combined via the chain rule to get
\[\frac{\partial\Lambda}{\partial{\bf w}} = \frac{\partial\Lambda}{\partial{\bf x}^i} \frac{\partial{\bf x}^i}{\partial{\bf w}}\]This can then be passed to the optimiser.
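The reshaping described by batch_shape and remaining_batch_shape below implies a divisibility constraint; a standalone sketch of it (the helper itself is hypothetical, not part of the class):

    import jax

    def batch_shapes(n_s, n_d, devices, n_per_device):
        # Simulations are farmed out n_per_device at a time to each device,
        # so both n_d and n_s - n_d must spread evenly over the devices
        chunk = len(devices) * n_per_device
        if (n_d % chunk != 0) or ((n_s - n_d) % chunk != 0):
            raise ValueError(
                "n_d and n_s - n_d must be divisible by n_devices * n_per_device")
        return ((n_d // chunk, len(devices), n_per_device),
                ((n_s - n_d) // chunk, len(devices), n_per_device))

    # e.g. 2000 fiducial simulations, 1000 with derivatives, 100 per pass,
    # using just the first available XLA device here
    print(batch_shapes(2000, 1000, jax.devices()[:1], 100))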
- Parameters
n_remaining (int) – The number of simulations where only the fiducial simulations are calculated. This is zero if n_s is equal to n_d.
n_iterations (int) – Number of iterations through the main summarising loop
n_remaining_iterations (int) – Number of iterations through the remaining simulations used for quick loops with no derivatives
batch_shape (tuple) – The shape which n_d should be reshaped to for aggregating: (n_d // (n_devices * n_per_device), n_devices, n_per_device)
remaining_batch_shape (tuple) – The shape which n_s - n_d should be reshaped to for aggregating: ((n_s - n_d) // (n_devices * n_per_device), n_devices, n_per_device)
-
simulator:
A function for generating a simulation on-the-fly (XLA compilable)
Public Methods:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
get_summary(input, w, θ[, derivative, gradient]) – Returns a single summary of a simulation or its gradient
get_summaries(w, key[, validate]) – Gets all network outputs and derivatives wrt model parameters
get_gradient(dΛ_dx, w[, key]) – Aggregates gradients together to update the network parameters
Inherited from _AggregatedIMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
fit(λ, ε[, rng, patience, min_iterations, …]) – Fitting routine for the IMNN
get_summaries(w, key[, validate]) – Gets all network outputs and derivatives wrt model parameters
get_gradient(dΛ_dx, w[, key]) – Aggregates gradients together to update the network parameters
Inherited from SimulatorIMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
get_summaries(w, key[, validate]) – Gets all network outputs and derivatives wrt model parameters
Inherited from _IMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
fit(λ, ε[, rng, patience, min_iterations, …]) – Fitting routine for the IMNN
get_α(λ, ε) – Calculate rate parameter for regularisation from closeness criterion
set_F_statistics([w, key, validate]) – Set necessary attributes for calculating score compressed summaries
get_summaries(w, key[, validate]) – Gets all network outputs and derivatives wrt model parameters
get_estimate(d) – Calculate score compressed parameter estimates from network outputs
plot([ax, expected_detF, colour, figsize, …]) – Plot fitting history
Private Methods:
_set_shapes() – Calculates the shapes for batching over different devices
_collect_input(key[, validate]) – Returns the keys for generating simulations on-the-fly
_split_dΛ_dx(dΛ_dx) – Returns the gradient of loss function wrt summaries (derivatives)
Inherited from _AggregatedIMNN:
_set_devices(devices, n_per_device) – Checks that devices exist and that reshaping onto devices can occur
_set_batch_functions() – Creates jitted functions placed on desired XLA devices
_set_shapes() – Calculates the shapes for batching over different devices
_setup_progress_bar(print_rate, max_iterations) – Construct progress bar
_update_progress_bar(pbar, counter, …[, close]) – Updates (and closes) progress bar
_collect_input(key[, validate]) – Returns the keys for generating simulations on-the-fly
_get_batch_summaries(inputs, w, θ[, …]) – Vectorised batch calculation of summaries or gradients
_split_dΛ_dx(dΛ_dx) – Returns the gradient of loss function wrt summaries (derivatives)
_construct_gradient(layers[, aux, func]) – Multiuse function to iterate over tuple of network parameters
Inherited from SimulatorIMNN:
_get_fitting_keys(rng) – Generates random numbers for simulation
Inherited from _IMNN:
_initialise_parameters(n_s, n_d, n_params, …) – Performs type checking and initialisation of class attributes
_initialise_model(model, optimiser, key_or_state) – Initialises neural network parameters or loads optimiser state
_initialise_history() – Initialises history dictionary attribute
_set_history(results) – Places results from fitting into the history dictionary
_set_inputs(rng, max_iterations) – Builds list of inputs for the XLA compilable fitting routine
_get_fitting_keys(rng) – Generates random numbers for simulation
_fit(inputs, λ=None, α=None[, min_iterations]) – Single iteration fitting algorithm
_fit_cond(inputs, patience, max_iterations) – Stopping condition for the fitting loop
_update_loop_vars(inputs) – Updates input parameters if max_detF is increased
_check_loop_vars(inputs, min_iterations) – Updates patience_counter if max_detF not increased
_update_history(inputs, history, counter, ind) – Puts current fitting statistics into history arrays
_slogdet(matrix) – Combined summed logarithmic determinant
_construct_derivatives(derivatives) – Builds derivatives of the network outputs wrt model parameters
_get_F_statistics([w, key, validate]) – Calculates the Fisher information and returns all statistics used
_calculate_F_statistics(summaries, derivatives) – Calculates the Fisher information matrix from network outputs
_get_regularisation_strength(Λ2, λ, α) – Coupling strength of the regularisation (amplified sigmoid)
_get_regularisation(C, invC) – Difference of the covariance (and its inverse) from identity
_get_loss(w, λ, α[, key]) – Calculates the loss function and returns auxiliary variables
_calculate_loss(summaries, derivatives, λ, α) – Calculates the loss function from network summaries and derivatives
_setup_plot([ax, expected_detF, figsize]) – Builds axes for history plot
-
_collect_input(key, validate=False)¶
Returns the keys for generating simulations on-the-fly
- Parameters
key (None or int(2,)) – Random number generators for generating simulations
validate (bool, default=False) – Whether to return the set for validation or for fitting (always False)
-
_set_shapes()¶
Calculates the shapes for batching over different devices
- Raises
ValueError – If the difference between n_s and n_d won’t scale over xla devices
-
_split_dΛ_dx(dΛ_dx)¶
Returns the gradient of loss function wrt summaries (derivatives)
The gradient of the loss function with respect to the network outputs and their derivatives with respect to model parameters has to be reshaped and aggregated onto each XLA device, matching the format in which the keys for generating the simulations are produced.
- Parameters
dΛ_dx (tuple) –
dΛ_dx float(n_s, n_params, n_summaries) – the derivative of the loss function with respect to network summaries
d2Λ_dxdθ float(n_d, n_summaries, n_params) – the derivative of the loss function with respect to the derivative of network summaries with respect to model parameters
-
get_gradient(dΛ_dx, w, key=None)¶
Aggregates gradients together to update the network parameters
To avoid having to calculate the gradient with respect to all the simulations at once, we aggregate the gradient calculation by addition, looping over the keys for generating the simulations again and combining the results with the derivative of the loss function with respect to the network outputs (and their derivatives with respect to the model parameters). Whilst this is expensive, it is necessary since we cannot make an accurate stochastic estimate of the Fisher information and therefore need to use all the available simulations - which are probably too many to fit in memory.
- Parameters
dΛ_dx (tuple) –
dΛ_dx float(n_s, n_params, n_summaries) – the derivative of the loss function with respect to network summaries
d2Λ_dxdθ float(n_d, n_summaries, n_params) – the derivative of the loss function with respect to the derivative of network summaries with respect to model parameters
w (list) – Network parameters
key (None or int(2,)) – Random number generator
- Returns
The gradient of the loss function with respect to the network parameters calculated by aggregating
- Return type
list
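A sketch of the two-pass chain rule just described, using jax.vjp to pull the known dΛ/dx back through the network for one batch (apply_fn and the batch layout are assumptions; in the real class this is aggregated over devices):

    import jax

    def batch_gradient(apply_fn, w, batch, dΛ_dx_batch):
        # Second pass through the network, recording the vjp: x = f_w(batch)
        x, vjp_fn = jax.vjp(lambda w_: apply_fn(w_, batch), w)
        # Pull dΛ/dx back through the network to get this batch's dΛ/dw
        (dΛ_dw,) = vjp_fn(dΛ_dx_batch)
        return dΛ_dw

    # The total gradient is the sum of these contributions over all batches,
    # e.g. accumulated with jax.tree_util.tree_map(lambda a, b: a + b, g1, g2)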
-
get_summaries(w, key, validate=False)¶
Gets all network outputs and derivatives wrt model parameters
Loops through the generated random number generators on each XLA device to generate the simulations and passes them through the network to get the network outputs. These are then pushed back to the host for the computation of the loss function.
The fiducial simulations which have a derivative with respect to the model parameters counterpart are processed first and then the remaining fiducial simulations are processed and concatenated.
- Parameters
w (list or None, default=None) – The network parameters if wanting to calculate the Fisher information with a specific set of network parameters
key (int(2,)) – A random number generator for generating simulations on-the-fly
validate (bool, default=False) – Whether to get summaries of the validation set (always False)
- Returns
float(n_s, n_summaries) – The set of all network outputs used to calculate the covariance
float(n_d, n_summaries, n_params) – The set of all the derivatives of the network outputs with respect to model parameters
-
get_summary(input, w, θ, derivative=False, gradient=False)¶
Returns a single summary of a simulation or its gradient
- Parameters
input (float(input_shape) or tuple) – A random number generator for a single simulation to pass through the network, or a tuple of either (if gradient and not derivative):
dΛ_dx float(input_shape, n_params) – the derivative of the loss function with respect to a network summary
key int(2,) – A random number generator for a single simulation
or (if gradient and derivative):
tuple (gradients):
dΛ_dx float(input_shape, n_params) – the derivative of the loss function with respect to a network summary
d2Λ_dxdθ float(input_shape, n_params) – the derivative of the loss function with respect to the derivative of a network summary with respect to model parameters
key int(2,) – A random number generator for a single simulation
w (list) – The network parameters
θ (float(n_params,)) – The value of the parameters to generate the simulation at (fiducial), unused if not simulating on the fly
derivative (bool, default=False) – Whether a derivative of the simulation with respect to model parameters is also passed
gradient (bool, default=False) – Whether to calculate the gradient with respect to model parameters
- Returns
tuple (if gradient) – The gradient of the loss function with respect to model parameters
float(n_summaries,) (if not gradient) – The output of the network
float(n_summaries, n_params) (if derivative and not gradient) – The derivative of the output of the network wrt model parameters
GradientIMNN¶
-
class imnn.GradientIMNN(n_s, n_d, n_params, n_summaries, input_shape, θ_fid, model, optimiser, key_or_state, fiducial, derivative, validation_fiducial=None, validation_derivative=None)¶
Information maximising neural network fit using known derivatives
The outline of the fitting procedure is that a set of \(i\in[1, n_s]\) simulations \({\bf d}^i\), originally generated at fiducial model parameter \({\bf\theta}^{\rm fid}\), and their derivatives \(\partial{\bf d}^i/\partial\theta_\alpha\) with respect to model parameters are used. The fiducial simulations, \({\bf d}^i\), are passed through a network to obtain summaries, \({\bf x}^i\), and the jax automatic derivative of these summaries with respect to the inputs is calculated, \(\partial{\bf x}^i/\partial{\bf d}^j\delta_{ij}\). The chain rule is then used to calculate
\[\frac{\partial{\bf x}^i}{\partial\theta_\alpha} = \frac{\partial{\bf x}^i}{\partial{\bf d}^j} \frac{\partial{\bf d}^j}{\partial\theta_\alpha}\]With \({\bf x}^i\) and \(\partial{{\bf x}^i}/\partial\theta_\alpha\) the covariance
\[C_{ab} = \frac{1}{n_s-1}\sum_{i=1}^{n_s}(x^i_a-\mu_a) (x^i_b-\mu_b)\]and the derivative of the mean of the network outputs with respect to the model parameters
\[\frac{\partial\mu_a}{\partial\theta_\alpha} = \frac{1}{n_d} \sum_{i=1}^{n_d}\frac{\partial{x^i_a}}{\partial\theta_\alpha}\]can be calculated and used to form the Fisher information matrix
\[F_{\alpha\beta} = \frac{\partial\mu_a}{\partial\theta_\alpha} C^{-1}_{ab}\frac{\partial\mu_b}{\partial\theta_\beta}.\]The loss function is then defined as
\[\Lambda = -\log|{\bf F}| + r(\Lambda_2) \Lambda_2\]Since any linear rescaling of a sufficient statistic is also a sufficient statistic, the negative logarithm of the determinant of the Fisher information matrix needs to be regularised to fix the scale of the network outputs. We choose to fix this scale by constraining the covariance of network outputs as
\[\Lambda_2 = ||{\bf C}-{\bf I}|| + ||{\bf C}^{-1}-{\bf I}||\]This constraint forces the covariance to be approximately parameter independent, which justifies choosing the covariance-independent Gaussian Fisher information as above. To avoid having a dual optimisation objective, we use a smooth and dynamic regularisation strength which turns off the regularisation to focus on maximising the Fisher information once the covariance has set the scale
\[r(\Lambda_2) = \frac{\lambda\Lambda_2}{\Lambda_2-\exp(-\alpha\Lambda_2)}.\]Once the loss function is calculated, its automatic gradient is computed and used to update the network parameters via the optimiser function.
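The chain rule above can be sketched with jax.jvp, pushing each parameter's known simulation derivative forward through the network (apply_fn and the shapes are assumptions, not the class's actual internals):

    import jax

    def summary_and_derivative(apply_fn, w, d, dd_dθ):
        # d: float(input_shape), dd_dθ: float(input_shape, n_params)
        f = lambda d_: apply_fn(w, d_)
        # ∂x/∂θ_α = ∂x/∂d · ∂d/∂θ_α as a jacobian-vector product
        push = lambda tangent: jax.jvp(f, (d,), (tangent,))[1]
        x = f(d)                                           # float(n_summaries,)
        dx_dθ = jax.vmap(push, in_axes=-1, out_axes=-1)(dd_dθ)
        return x, dx_dθ                                    # (n_summaries, n_params)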
- Parameters
fiducial (float(n_s, input_shape)) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for fitting)
derivative (float(n_d, input_shape, n_params)) – The derivative of the simulations with respect to the model parameters (for fitting)
validation_fiducial (float(n_s, input_shape) or None) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for validation)
validation_derivative (float(n_d, input_shape, n_params) or None) – The derivative of the simulations with respect to the model parameters (for validation)
Public Methods:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
Inherited from _IMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
fit(λ, ε[, rng, patience, min_iterations, …]) – Fitting routine for the IMNN
get_α(λ, ε) – Calculate rate parameter for regularisation from closeness criterion
set_F_statistics([w, key, validate]) – Set necessary attributes for calculating score compressed summaries
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
get_estimate(d) – Calculate score compressed parameter estimates from network outputs
plot([ax, expected_detF, colour, figsize, …]) – Plot fitting history
Private Methods:
_set_data(fiducial, derivative, …) – Checks and sets data attributes with the correct shape
Inherited from _IMNN:
_initialise_parameters(n_s, n_d, n_params, …) – Performs type checking and initialisation of class attributes
_initialise_model(model, optimiser, key_or_state) – Initialises neural network parameters or loads optimiser state
_initialise_history() – Initialises history dictionary attribute
_set_history(results) – Places results from fitting into the history dictionary
_set_inputs(rng, max_iterations) – Builds list of inputs for the XLA compilable fitting routine
_get_fitting_keys(rng) – Generates random numbers for simulation generation if needed
_fit(inputs, λ=None, α=None[, min_iterations]) – Single iteration fitting algorithm
_fit_cond(inputs, patience, max_iterations) – Stopping condition for the fitting loop
_update_loop_vars(inputs) – Updates input parameters if max_detF is increased
_check_loop_vars(inputs, min_iterations) – Updates patience_counter if max_detF not increased
_update_history(inputs, history, counter, ind) – Puts current fitting statistics into history arrays
_slogdet(matrix) – Combined summed logarithmic determinant
_construct_derivatives(derivatives) – Builds derivatives of the network outputs wrt model parameters
_get_F_statistics([w, key, validate]) – Calculates the Fisher information and returns all statistics used
_calculate_F_statistics(summaries, derivatives) – Calculates the Fisher information matrix from network outputs
_get_regularisation_strength(Λ2, λ, α) – Coupling strength of the regularisation (amplified sigmoid)
_get_regularisation(C, invC) – Difference of the covariance (and its inverse) from identity
_get_loss(w, λ, α[, key]) – Calculates the loss function and returns auxiliary variables
_calculate_loss(summaries, derivatives, λ, α) – Calculates the loss function from network summaries and derivatives
_setup_plot([ax, expected_detF, figsize]) – Builds axes for history plot
-
_set_data(fiducial, derivative, validation_fiducial, validation_derivative)¶
Checks and sets data attributes with the correct shape
- Parameters
fiducial (float(n_s, input_shape)) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for fitting)
derivative (float(n_d, input_shape, n_params)) – The derivative of the simulations with respect to the model parameters (for fitting)
validation_fiducial (float(n_s, input_shape) or None, default=None) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for validation). Sets validate = True attribute if provided
validation_derivative (float(n_d, input_shape, n_params) or None) – The derivative of the simulations with respect to the model parameters (for validation). Sets validate = True attribute if provided
-
get_summaries(w, key=None, validate=False)¶
Gets all network outputs and derivatives wrt model parameters
Selects either the fitting or validation sets and passes them through the network to get the network outputs.
- Parameters
w (list or None, default=None) – The network parameters if wanting to calculate the Fisher information with a specific set of network parameters
key (int(2,) or None, default=None) – A random number generator for generating simulations on-the-fly
validate (bool, default=False) – Whether to get summaries of the validation set
- Returns
float(n_s, n_summaries) – The set of all network outputs used to calculate the covariance
float(n_d, n_summaries, n_params) – The derivative of the network outputs wrt the model parameters
-
get_derivatives:
Calculates the Jacobian of the network output and its value
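A minimal instantiation sketch for GradientIMNN, with random stand-ins for the precomputed simulations and derivatives (in practice these come from your own pipeline); module locations and sizes are assumptions as in the SimulatorIMNN example above:

    import jax
    import jax.numpy as np
    from jax.example_libraries import stax, optimizers
    import imnn

    n_s, n_d, n_params, n_summaries, input_shape = 1000, 1000, 2, 2, (10,)
    rng = jax.random.PRNGKey(1)
    data_key, model_key = jax.random.split(rng)

    # Stand-ins for precomputed data: float(n_s, input_shape) simulations at
    # θ_fid and float(n_d, input_shape, n_params) derivatives wrt parameters
    fiducial = jax.random.normal(data_key, (n_s,) + input_shape)
    derivative = jax.random.normal(data_key, (n_d,) + input_shape + (n_params,))

    imnn_obj = imnn.GradientIMNN(
        n_s=n_s, n_d=n_d, n_params=n_params, n_summaries=n_summaries,
        input_shape=input_shape, θ_fid=np.array([0., 1.]),
        model=stax.serial(stax.Dense(64), stax.Relu, stax.Dense(n_summaries)),
        optimiser=optimizers.adam(step_size=1e-3),
        key_or_state=model_key, fiducial=fiducial, derivative=derivative)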
AggregatedGradientIMNN¶
-
class imnn.AggregatedGradientIMNN(n_s, n_d, n_params, n_summaries, input_shape, θ_fid, model, optimiser, key_or_state, fiducial, derivative, host, devices, n_per_device, validation_fiducial=None, validation_derivative=None, prefetch=None, cache=False)¶
Information maximising neural network fit using known derivatives
The outline of the fitting procedure is that a set of \(i\in[1, n_s]\) simulations \({\bf d}^i\), originally generated at fiducial model parameter \({\bf\theta}^{\rm fid}\), and their derivatives \(\partial{\bf d}^i/\partial\theta_\alpha\) with respect to model parameters are used. The fiducial simulations, \({\bf d}^i\), are passed through a network to obtain summaries, \({\bf x}^i\), and the jax automatic derivative of these summaries with respect to the inputs is calculated, \(\partial{\bf x}^i/\partial{\bf d}^j\delta_{ij}\). The chain rule is then used to calculate
\[\frac{\partial{\bf x}^i}{\partial\theta_\alpha} = \frac{\partial{\bf x}^i}{\partial{\bf d}^j} \frac{\partial{\bf d}^j}{\partial\theta_\alpha}\]With \({\bf x}^i\) and \(\partial{{\bf x}^i}/\partial\theta_\alpha\) the covariance
\[C_{ab} = \frac{1}{n_s-1}\sum_{i=1}^{n_s}(x^i_a-\mu_a) (x^i_b-\mu_b)\]and the derivative of the mean of the network outputs with respect to the model parameters
\[\frac{\partial\mu_a}{\partial\theta_\alpha} = \frac{1}{n_d} \sum_{i=1}^{n_d}\frac{\partial{x^i_a}}{\partial\theta_\alpha}\]can be calculated and used to form the Fisher information matrix
\[F_{\alpha\beta} = \frac{\partial\mu_a}{\partial\theta_\alpha} C^{-1}_{ab}\frac{\partial\mu_b}{\partial\theta_\beta}.\]The loss function is then defined as
\[\Lambda = -\log|{\bf F}| + r(\Lambda_2) \Lambda_2\]Since any linear rescaling of a sufficient statistic is also a sufficient statistic, the negative logarithm of the determinant of the Fisher information matrix needs to be regularised to fix the scale of the network outputs. We choose to fix this scale by constraining the covariance of network outputs as
\[\Lambda_2 = ||{\bf C}-{\bf I}|| + ||{\bf C}^{-1}-{\bf I}||\]This constraint forces the covariance to be approximately parameter independent, which justifies choosing the covariance-independent Gaussian Fisher information as above. To avoid having a dual optimisation objective, we use a smooth and dynamic regularisation strength which turns off the regularisation to focus on maximising the Fisher information once the covariance has set the scale
\[r(\Lambda_2) = \frac{\lambda\Lambda_2}{\Lambda_2-\exp(-\alpha\Lambda_2)}.\]To enable the use of large data (or networks) the whole procedure is aggregated. This means that the passing of the simulations through the network is farmed out to the desired XLA devices, and recollected, n_per_device inputs at a time. These are then used to calculate the automatic gradient of the loss function with respect to the calculated summaries and derivatives, \(\partial\Lambda/\partial{\bf x}^i\) (which is a fairly small computation as long as n_summaries, n_s and n_d are not huge). Once this is calculated, the simulations are passed through the network again, this time calculating the Jacobian of the network outputs with respect to the network parameters, \(\partial{\bf x}^i/\partial{\bf w}\), which is combined via the chain rule to get
\[\frac{\partial\Lambda}{\partial{\bf w}} = \frac{\partial\Lambda}{\partial{\bf x}^i} \frac{\partial{\bf x}^i}{\partial{\bf w}}\]This can then be passed to the optimiser.
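A sketch of the extra arguments that control the aggregation; the values are illustrative, and prefetch/cache simply tune the internal tf.data pipeline that serves simulations n_per_device at a time:

    import jax
    import tensorflow as tf

    host = jax.devices("cpu")[0]   # device on which statistics are collected
    devices = jax.devices()        # XLA devices that compress the simulations
    n_per_device = 100             # simulations per device per pass
    # Both n_d and n_s - n_d must be divisible by len(devices) * n_per_device

    prefetch = tf.data.AUTOTUNE    # overlap serving the next batch with compute
    cache = True                   # keep the simulations in memory between loops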
- Parameters
fiducial (float(n_s, input_shape)) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for fitting)
derivative (float(n_d, input_shape, n_params)) – The derivative of the simulations with respect to the model parameters (for fitting)
validation_fiducial (float(n_s, input_shape) or None) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for validation)
validation_derivative (float(n_d, input_shape, n_params) or None) – The derivative of the simulations with respect to the model parameters (for validation)
main (list of tf.data.Dataset().as_numpy_iterators()) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs and their derivatives with respect to the physical model parameters (for fitting). These are served n_per_device at a time as a numpy iterator from a TensorFlow dataset.
remaining (list of tf.data.Dataset().as_numpy_iterators()) – The n_s - n_d simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs without a derivative counterpart (for fitting). These are served n_per_device at a time as a numpy iterator from a TensorFlow dataset.
validation_main (list of tf.data.Dataset().as_numpy_iterators()) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs and their derivatives with respect to the physical model parameters (for validation). These are served n_per_device at a time as a numpy iterator from a TensorFlow dataset.
validation_remaining (list of tf.data.Dataset().as_numpy_iterators()) – The n_s - n_d simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs without a derivative counterpart (for validation). Served n_per_device at a time as a numpy iterator from a TensorFlow dataset.
n_remaining (int) – The number of simulations where only the fiducial simulations are calculated. This is zero if n_s is equal to n_d.
n_iterations (int) – Number of iterations through the main summarising loop
n_remaining_iterations (int) – Number of iterations through the remaining simulations used for quick loops with no derivatives
batch_shape (tuple) – The shape which n_d should be reshaped to for aggregating: (n_d // (n_devices * n_per_device), n_devices, n_per_device)
remaining_batch_shape (tuple) – The shape which n_s - n_d should be reshaped to for aggregating: ((n_s - n_d) // (n_devices * n_per_device), n_devices, n_per_device)
Public Methods:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
get_summary(input, w, θ[, derivative, gradient]) – Returns a single summary of a simulation or its gradient
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
get_gradient(dΛ_dx, w[, key]) – Aggregates gradients together to update the network parameters
Inherited from _AggregatedIMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
fit(λ, ε[, rng, patience, min_iterations, …]) – Fitting routine for the IMNN
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
get_gradient(dΛ_dx, w[, key]) – Aggregates gradients together to update the network parameters
Inherited from GradientIMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
Inherited from _IMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
fit(λ, ε[, rng, patience, min_iterations, …]) – Fitting routine for the IMNN
get_α(λ, ε) – Calculate rate parameter for regularisation from closeness criterion
set_F_statistics([w, key, validate]) – Set necessary attributes for calculating score compressed summaries
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
get_estimate(d) – Calculate score compressed parameter estimates from network outputs
plot([ax, expected_detF, colour, figsize, …]) – Plot fitting history
Private Methods:
_set_shapes() – Calculates the shapes for batching over different devices
_set_dataset(prefetch, cache) – Collates data into loopable tensorflow dataset iterations
_collect_input(key[, validate]) – Returns validation or fitting sets
_split_dΛ_dx(dΛ_dx) – Returns the gradient of loss function wrt summaries (derivatives)
Inherited from _AggregatedIMNN:
_set_devices(devices, n_per_device) – Checks that devices exist and that reshaping onto devices can occur
_set_batch_functions() – Creates jitted functions placed on desired XLA devices
_set_shapes() – Calculates the shapes for batching over different devices
_setup_progress_bar(print_rate, max_iterations) – Construct progress bar
_update_progress_bar(pbar, counter, …[, close]) – Updates (and closes) progress bar
_collect_input(key[, validate]) – Returns validation or fitting sets
_get_batch_summaries(inputs, w, θ[, …]) – Vectorised batch calculation of summaries or gradients
_split_dΛ_dx(dΛ_dx) – Returns the gradient of loss function wrt summaries (derivatives)
_construct_gradient(layers[, aux, func]) – Multiuse function to iterate over tuple of network parameters
Inherited from GradientIMNN:
_set_data(fiducial, derivative, …) – Checks and sets data attributes with the correct shape
Inherited from _IMNN:
_initialise_parameters(n_s, n_d, n_params, …) – Performs type checking and initialisation of class attributes
_initialise_model(model, optimiser, key_or_state) – Initialises neural network parameters or loads optimiser state
_initialise_history() – Initialises history dictionary attribute
_set_history(results) – Places results from fitting into the history dictionary
_set_inputs(rng, max_iterations) – Builds list of inputs for the XLA compilable fitting routine
_get_fitting_keys(rng) – Generates random numbers for simulation generation if needed
_fit(inputs, λ=None, α=None[, min_iterations]) – Single iteration fitting algorithm
_fit_cond(inputs, patience, max_iterations) – Stopping condition for the fitting loop
_update_loop_vars(inputs) – Updates input parameters if max_detF is increased
_check_loop_vars(inputs, min_iterations) – Updates patience_counter if max_detF not increased
_update_history(inputs, history, counter, ind) – Puts current fitting statistics into history arrays
_slogdet(matrix) – Combined summed logarithmic determinant
_construct_derivatives(derivatives) – Builds derivatives of the network outputs wrt model parameters
_get_F_statistics([w, key, validate]) – Calculates the Fisher information and returns all statistics used
_calculate_F_statistics(summaries, derivatives) – Calculates the Fisher information matrix from network outputs
_get_regularisation_strength(Λ2, λ, α) – Coupling strength of the regularisation (amplified sigmoid)
_get_regularisation(C, invC) – Difference of the covariance (and its inverse) from identity
_get_loss(w, λ, α[, key]) – Calculates the loss function and returns auxiliary variables
_calculate_loss(summaries, derivatives, λ, α) – Calculates the loss function from network summaries and derivatives
_setup_plot([ax, expected_detF, figsize]) – Builds axes for history plot
-
_collect_input(key, validate=False)¶
Returns validation or fitting sets
- Parameters
key (None or int(2,)) – Random number generators not used in this case
validate (bool) – Whether to return the set for validation or for fitting
- Returns
list of tf.data.Dataset().as_numpy_iterators – The iterators for the main loop including simulations and their derivatives for fitting or validation
list of tf.data.Dataset().as_numpy_iterators – The iterators for the remaining loop simulations for fitting or validation
-
_set_dataset(prefetch, cache)¶
Collates data into loopable tensorflow dataset iterations
- Parameters
prefetch (tf.data.AUTOTUNE or int or None) – How many simulations to prefetch in the tensorflow dataset
cache (bool) – Whether to cache simulations in the tensorflow datasets
- Raises
ValueError – If cache is None
TypeError – If cache is wrong type
-
_set_shapes()¶
Calculates the shapes for batching over different devices
- Raises
ValueError – If the difference between n_s and n_d won’t scale over xla devices
-
_split_dΛ_dx(dΛ_dx)¶
Returns the gradient of loss function wrt summaries (derivatives)
The gradient of the loss function with respect to the network outputs and their derivatives with respect to model parameters has to be reshaped and aggregated onto each XLA device, matching the format in which the tensorflow dataset feeds simulations.
- Parameters
dΛ_dx (tuple) –
dΛ_dx float(n_s, n_params, n_summaries) – the derivative of the loss function with respect to network summaries
d2Λ_dxdθ float(n_d, n_summaries, n_params) – the derivative of the loss function with respect to the derivative of network summaries with respect to model parameters
-
get_gradient(dΛ_dx, w, key=None)¶
Aggregates gradients together to update the network parameters
To avoid having to calculate the gradient with respect to all the simulations at once, we aggregate the gradient calculation by addition, looping over the simulations again and combining them with the derivative of the loss function with respect to the network outputs (and their derivatives with respect to the model parameters). Whilst this is expensive, it is necessary since we cannot make an accurate stochastic estimate of the Fisher information and therefore need to use all the available simulations - which are probably too many to fit in memory.
- Parameters
dΛ_dx (tuple) –
dΛ_dx float(n_s, n_params, n_summaries) – the derivative of the loss function with respect to network summaries
d2Λ_dxdθ float(n_d, n_summaries, n_params) – the derivative of the loss function with respect to the derivative of network summaries with respect to model parameters
w (list) – Network parameters
key (None or int(2,)) – Random number generator used in SimulatorIMNN
- Returns
The gradient of the loss function with respect to the network parameters calculated by aggregating
- Return type
list
-
get_summaries(w, key=None, validate=False)¶
Gets all network outputs and derivatives wrt model parameters
Selects either the fitting or validation set and loops through its iterator on each XLA device, passing the simulations through the network to get the network outputs. These are then pushed back to the host for the computation of the loss function.
The fiducial simulations which have a derivative with respect to the model parameters counterpart are processed first and then the remaining fiducial simulations are processed and concatenated.
- Parameters
w (list or None, default=None) – The network parameters if wanting to calculate the Fisher information with a specific set of network parameters
key (int(2,) or None, default=None) – A random number generator for generating simulations on-the-fly
validate (bool, default=False) – Whether to get summaries of the validation set
- Returns
float(n_s, n_summaries) – The set of all network outputs used to calculate the covariance
float(n_d, n_summaries, n_params) – The set of all the derivatives of the network outputs with respect to model parameters
-
get_summary(input, w, θ, derivative=False, gradient=False)¶
Returns a single summary of a simulation or its gradient
- Parameters
input (float(input_shape) or tuple) – A single simulation to pass through the network, or a tuple of either (if gradient and not derivative):
dΛ_dx float(input_shape, n_params) – the derivative of the loss function with respect to a network summary
d float(input_shape) – a simulation to compress with the network
or (if gradient and derivative):
tuple (gradients):
dΛ_dx float(input_shape, n_params) – the derivative of the loss function with respect to a network summary
d2Λ_dxdθ float(input_shape, n_params) – the derivative of the loss function with respect to the derivative of a network summary with respect to model parameters
tuple (simulations):
d float(input_shape) – a simulation to compress with the network
dd_dθ float(input_shape, n_params) – the derivative of a simulation with respect to model parameters
or (if derivative and not gradient):
d float(input_shape) – a simulation to compress with the network
dd_dθ float(input_shape, n_params) – the derivative of a simulation with respect to model parameters
w (list) – The network parameters
θ (float(n_params,)) – The value of the parameters to generate the simulation at (fiducial), unused if not simulating on the fly
derivative (bool, default=False) – Whether a derivative of the simulation with respect to model parameters is also passed
gradient (bool, default=False) – Whether to calculate the gradient with respect to model parameters
- Returns
tuple (if gradient) – The gradient of the loss function with respect to model parameters
float(n_summaries,) (if not gradient) – The output of the network
float(n_summaries, n_params) (if derivative and not gradient) – The derivative of the output of the network wrt model parameters
NumericalGradientIMNN¶
-
class imnn.NumericalGradientIMNN(n_s, n_d, n_params, n_summaries, input_shape, θ_fid, model, optimiser, key_or_state, fiducial, derivative, δθ, validation_fiducial=None, validation_derivative=None)¶
Information maximising neural network fit using numerical derivatives
The outline of the fitting procedure is that a set of \(i\in[1, n_s]\) simulations \({\bf d}^i\), originally generated at fiducial model parameter \({\bf\theta}^{\rm fid}\), is used together with a set of \(i\in[1, n_d]\) simulations, \(\{{\bf d}_{\alpha^-}^i, {\bf d}_{\alpha^+}^i\}\), generated with the same seed at each \(i\) as the fiducial simulations but at model parameters equal to \({\bf\theta}^{\rm fid}\) apart from at parameter label \(\alpha\), with values
\[\theta_{\alpha^-} = \theta_\alpha^\rm{fid}-\delta\theta_\alpha\]and
\[\theta_{\alpha^+} = \theta_\alpha^\rm{fid}+\delta\theta_\alpha\]where \(\delta\theta_\alpha\) is a \(n_{params}\) length vector with the \(\alpha\) element having a value which perturbs the parameter \(\theta^{\rm fid}_\alpha\). This means there are \(2\times n_{params}\times n_d\) simulations used to calculate the numerical derivatives (this is extremely cheap compared to other machine learning methods). All these simulations are passed through a network \(f_{{\bf w}}({\bf d})\) with network parameters \({\bf w}\) to obtain network outputs \({\bf x}^i\) and \(\{{\bf x}_{\alpha^-}^i,{\bf x}_{\alpha^+}^i\}\). These perturbed values are combined to obtain
\[\frac{\partial{{\bf x}^i}}{\partial\theta_\alpha} = \frac{{\bf x}_{\alpha^+}^i - {\bf x}_{\alpha^-}^i} {\delta\theta_\alpha}\]With \({\bf x}^i\) and \(\partial{{\bf x}^i}/\partial\theta_\alpha\) the covariance
\[C_{ab} = \frac{1}{n_s-1}\sum_{i=1}^{n_s}(x^i_a-\mu_a) (x^i_b-\mu_b)\]and the derivative of the mean of the network outputs with respect to the model parameters
\[\frac{\partial\mu_a}{\partial\theta_\alpha} = \frac{1}{n_d} \sum_{i=1}^{n_d}\frac{\partial{x^i_a}}{\partial\theta_\alpha}\]can be calculated and used to form the Fisher information matrix
\[F_{\alpha\beta} = \frac{\partial\mu_a}{\partial\theta_\alpha} C^{-1}_{ab}\frac{\partial\mu_b}{\partial\theta_\beta}.\]The loss function is then defined as
\[\Lambda = -\log|{\bf F}| + r(\Lambda_2) \Lambda_2\]Since any linear rescaling of a sufficient statistic is also a sufficient statistic, the negative logarithm of the determinant of the Fisher information matrix needs to be regularised to fix the scale of the network outputs. We choose to fix this scale by constraining the covariance of network outputs as
\[\Lambda_2 = ||{\bf C}-{\bf I}|| + ||{\bf C}^{-1}-{\bf I}||\]This constraint forces the covariance to be approximately parameter independent, which justifies choosing the covariance-independent Gaussian Fisher information as above. To avoid having a dual optimisation objective, we use a smooth and dynamic regularisation strength which turns off the regularisation to focus on maximising the Fisher information once the covariance has set the scale
\[r(\Lambda_2) = \frac{\lambda\Lambda_2}{\Lambda_2-\exp(-\alpha\Lambda_2)}.\]Once the loss function is calculated, its automatic gradient is computed and used to update the network parameters via the optimiser function.
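Generating the derivative dataset amounts to re-running the simulator with matched seeds at the perturbed parameter values; a sketch following the document's \(\theta_{\alpha^\pm} = \theta^{\rm fid}_\alpha \pm \delta\theta_\alpha\) convention (the helper and shapes are illustrative, assuming keys of shape int(n_d, 2)):

    import jax
    import jax.numpy as np

    def perturbed_simulations(simulator, keys, θ_fid, δθ):
        # Returns float(n_d, 2, n_params, *input_shape): for each seed, the
        # simulation at θ_fid - δθ_α (index 0) and θ_fid + δθ_α (index 1)
        n_params = θ_fid.shape[0]
        def one_seed(key):
            def one_param(α):
                perturbation = np.zeros(n_params).at[α].set(δθ[α])
                return np.stack([simulator(key, θ_fid - perturbation),
                                 simulator(key, θ_fid + perturbation)])
            return np.stack([one_param(α) for α in range(n_params)], axis=1)
        return jax.vmap(one_seed)(keys)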
- Parameters
δθ (float(n_params,)) – Size of perturbation to model parameters for the numerical derivative
fiducial (float(n_s, input_shape)) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for fitting)
derivative (float(n_d, 2, n_params, input_shape)) – The simulations generated at parameter values perturbed from the fiducial used to calculate the numerical derivative of network outputs with respect to model parameters (for fitting)
validation_fiducial (float(n_s, input_shape) or None) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for validation)
validation_derivative (float(n_d, 2, n_params, input_shape) or None) – The simulations generated at parameter values perturbed from the fiducial used to calculate the numerical derivative of network outputs with respect to model parameters (for validation)
Public Methods:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
Inherited from _IMNN:
__init__(n_s, n_d, n_params, n_summaries, …) – Constructor method
fit(λ, ε[, rng, patience, min_iterations, …]) – Fitting routine for the IMNN
get_α(λ, ε) – Calculate rate parameter for regularisation from closeness criterion
set_F_statistics([w, key, validate]) – Set necessary attributes for calculating score compressed summaries
get_summaries(w[, key, validate]) – Gets all network outputs and derivatives wrt model parameters
get_estimate(d) – Calculate score compressed parameter estimates from network outputs
plot([ax, expected_detF, colour, figsize, …]) – Plot fitting history
Private Methods:
_set_data(δθ, fiducial, derivative, …) – Checks and sets data attributes with the correct shape
_collect_input(key[, validate]) – Returns validation or fitting sets
_construct_derivatives(x_mp) – Builds derivatives of the network outputs wrt model parameters
Inherited from _IMNN:
_initialise_parameters(n_s, n_d, n_params, …) – Performs type checking and initialisation of class attributes
_initialise_model(model, optimiser, key_or_state) – Initialises neural network parameters or loads optimiser state
_initialise_history() – Initialises history dictionary attribute
_set_history(results) – Places results from fitting into the history dictionary
_set_inputs(rng, max_iterations) – Builds list of inputs for the XLA compilable fitting routine
_get_fitting_keys(rng) – Generates random numbers for simulation generation if needed
_fit(inputs, λ=None, α=None[, min_iterations]) – Single iteration fitting algorithm
_fit_cond(inputs, patience, max_iterations) – Stopping condition for the fitting loop
_update_loop_vars(inputs) – Updates input parameters if max_detF is increased
_check_loop_vars(inputs, min_iterations) – Updates patience_counter if max_detF not increased
_update_history(inputs, history, counter, ind) – Puts current fitting statistics into history arrays
_slogdet(matrix) – Combined summed logarithmic determinant
_construct_derivatives(x_mp) – Builds derivatives of the network outputs wrt model parameters
_get_F_statistics([w, key, validate]) – Calculates the Fisher information and returns all statistics used
_calculate_F_statistics(summaries, derivatives) – Calculates the Fisher information matrix from network outputs
_get_regularisation_strength(Λ2, λ, α) – Coupling strength of the regularisation (amplified sigmoid)
_get_regularisation(C, invC) – Difference of the covariance (and its inverse) from identity
_get_loss(w, λ, α[, key]) – Calculates the loss function and returns auxiliary variables
_calculate_loss(summaries, derivatives, λ, α) – Calculates the loss function from network summaries and derivatives
_setup_plot([ax, expected_detF, figsize]) – Builds axes for history plot
-
_collect_input(key, validate=False)¶
Returns validation or fitting sets
- Parameters
key (None or int(2,)) – Random number generators not used in this case
validate (bool) – Whether to return the set for validation or for fitting
- Returns
float(n_s, input_shape) – The fiducial simulations for fitting or validation
float(n_d, 2, n_params, input_shape) – The derivative simulations for fitting or validation
-
_construct_derivatives(x_mp)¶
Builds derivatives of the network outputs wrt model parameters
The network outputs from the simulations generated with model parameter values above and below the fiducial are subtracted from each other and divided by the perturbation size in each model parameter value. The axes are swapped such that the derivatives with respect to parameters are in the last axis.
\[\frac{\partial{\bf x}^i}{\partial\theta_\alpha} = \frac{{\bf x}^i_{\alpha^+}-{\bf x}^i_{\alpha^-}}{\delta\theta_\alpha}\]
- Parameters
x_mp (float(n_d, 2, n_params, n_summaries)) – The outputs of the network from simulations made at perturbed parameter values, used to construct the derivative of the network outputs with respect to the model parameters numerically
- Returns
The numerical derivatives of the network outputs with respect to the model parameters
- Return type
float(n_d, n_summaries, n_params)
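A sketch of this computation, assuming the lower and upper perturbations sit in axis 1 as (minus, plus); the function name mirrors the method but this is not the class's actual implementation:

    import jax.numpy as np

    def construct_derivatives(x_mp, δθ):
        # x_mp: float(n_d, 2, n_params, n_summaries), δθ: float(n_params,)
        # (x_plus - x_minus) / δθ_α for each parameter α
        dx_dθ = (x_mp[:, 1] - x_mp[:, 0]) / δθ[np.newaxis, :, np.newaxis]
        # Swap axes so parameters sit last: float(n_d, n_summaries, n_params)
        return np.swapaxes(dx_dθ, 1, 2)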
-
_set_data(δθ, fiducial, derivative, validation_fiducial, validation_derivative)¶
Checks and sets data attributes with the correct shape
- Parameters
δθ (float(n_params,)) – Size of perturbation to model parameters for the numerical derivative
fiducial (float(n_s, input_shape)) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for fitting)
derivative (float(n_d, 2, n_params, input_shape)) – The simulations generated at parameter values perturbed from the fiducial used to calculate the numerical derivative of network outputs with respect to model parameters (for fitting)
validation_fiducial (float(n_s, input_shape) or None, default=None) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for validation). Sets validate = True attribute if provided
validation_derivative (float(n_d, 2, n_params, input_shape) or None) – The simulations generated at parameter values perturbed from the fiducial used to calculate the numerical derivative of network outputs with respect to model parameters (for validation). Sets validate = True attribute if provided
-
get_summaries(w, key=None, validate=False)¶
Gets all network outputs and derivatives wrt model parameters
Selects either the fitting or validation sets and passes them through the network to get the network outputs. For the numerical derivatives, the array is first flattened along the batch axis before being passed through the model.
- Parameters
w (list or None, default=None) – The network parameters if wanting to calculate the Fisher information with a specific set of network parameters
key (int(2,) or None, default=None) – A random number generator for generating simulations on-the-fly
validate (bool, default=False) – Whether to get summaries of the validation set
- Returns
float(n_s, n_summaries) – The set of all network outputs used to calculate the covariance
float(n_d, 2, n_params, n_summaries) – The outputs of the network of simulations made at perturbed parameter values to construct the derivative of the network outputs with respect to the model parameters numerically
AggregatedNumericalGradientIMNN¶
-
class imnn.AggregatedNumericalGradientIMNN(n_s, n_d, n_params, n_summaries, input_shape, θ_fid, model, optimiser, key_or_state, fiducial, derivative, δθ, host, devices, n_per_device, validation_fiducial=None, validation_derivative=None, prefetch=None, cache=False)¶
Information maximising neural network fit using numerical derivatives
The outline of the fitting procedure is that a set of \(i\in[1, n_s]\) simulations \({\bf d}^i\), originally generated at fiducial model parameter \({\bf\theta}^{\rm fid}\), is used together with a set of \(i\in[1, n_d]\) simulations, \(\{{\bf d}_{\alpha^-}^i, {\bf d}_{\alpha^+}^i\}\), generated with the same seed at each \(i\) as the fiducial simulations but at model parameters equal to \({\bf\theta}^{\rm fid}\) apart from at parameter label \(\alpha\), with values
\[\theta_{\alpha^-} = \theta_\alpha^\rm{fid}-\delta\theta_\alpha\]and
\[\theta_{\alpha^+} = \theta_\alpha^\rm{fid}+\delta\theta_\alpha\]where \(\delta\theta_\alpha\) is a \(n_{params}\) length vector with the \(\alpha\) element having a value which perturbs the parameter \(\theta^{\rm fid}_\alpha\). This means there are \(2\times n_{params}\times n_d\) simulations used to calculate the numerical derivatives (this is extremely cheap compared to other machine learning methods). All these simulations are passed through a network \(f_{{\bf w}}({\bf d})\) with network parameters \({\bf w}\) to obtain network outputs \({\bf x}^i\) and \(\{{\bf x}_{\alpha^-}^i,{\bf x}_{\alpha^+}^i\}\). These perturbed values are combined to obtain
\[\frac{\partial{{\bf x}^i}}{\partial\theta_\alpha} = \frac{{\bf x}_{\alpha^+}^i - {\bf x}_{\alpha^-}^i} {\delta\theta_\alpha}\]With \({\bf x}^i\) and \(\partial{{\bf x}^i}/\partial\theta_\alpha\) the covariance
\[C_{ab} = \frac{1}{n_s-1}\sum_{i=1}^{n_s}(x^i_a-\mu_a) (x^i_b-\mu_b)\]and the derivative of the mean of the network outputs with respect to the model parameters
\[\frac{\partial\mu_a}{\partial\theta_\alpha} = \frac{1}{n_d} \sum_{i=1}^{n_d}\frac{\partial{x^i_a}}{\partial\theta_\alpha}\]can be calculated and used to form the Fisher information matrix
\[F_{\alpha\beta} = \frac{\partial\mu_a}{\partial\theta_\alpha} C^{-1}_{ab}\frac{\partial\mu_b}{\partial\theta_\beta}.\]The loss function is then defined as
\[\Lambda = -\log|{\bf F}| + r(\Lambda_2) \Lambda_2\]Since any linear rescaling of a sufficient statistic is also a sufficient statistic, the negative logarithm of the determinant of the Fisher information matrix needs to be regularised to fix the scale of the network outputs. We choose to fix this scale by constraining the covariance of network outputs as
\[\Lambda_2 = ||{\bf C}-{\bf I}|| + ||{\bf C}^{-1}-{\bf I}||\]The reason for choosing this constraint is that it forces the covariance to be approximately parameter independent, which justifies choosing the covariance-independent Gaussian Fisher information as above. To avoid having a dual optimisation objective, we use a smooth and dynamic regularisation strength which turns off the regularisation to focus on maximising the Fisher information once the covariance has set the scale
\[r(\Lambda_2) = \frac{\lambda\Lambda_2}{\Lambda_2-\exp (-\alpha\Lambda_2)}.\]To enable the use of large data (or networks) the whole procedure is aggregated. This means that the passing of the simulations through the network is farmed out to the desired XLA devices, and recollected, n_per_device inputs at a time. These are then used to calculate the automatic gradient of the loss function with respect to the calculated summaries and derivatives, \(\partial\Lambda/\partial{\bf x}^i\) (which is a fairly small computation as long as n_summaries and n_s (and n_d) are not huge). Once this is calculated, the simulations are passed through the network again, this time calculating the Jacobian of the network outputs with respect to the network parameters, \(\partial{\bf x}^i/\partial{\bf w}\), which is combined via the chain rule to get\[\frac{\partial\Lambda}{\partial{\bf w}} = \frac{\partial\Lambda}{\partial{\bf x}^i} \frac{\partial{\bf x}^i}{\partial{\bf w}}\]This can then be passed to the optimiser.
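As a concrete illustration of the loss defined above, here is a minimal sketch in jax.numpy. This is not the library's implementation; the shapes and the λ and α values are purely illustrative (in the library α is set from λ and the closeness criterion ε via get_α):

import jax.numpy as np

def loss(x, dx_dθ, λ=10.0, α=0.95):
    # x: (n_s, n_summaries) network outputs at the fiducial parameters
    # dx_dθ: (n_d, n_summaries, n_params) derivatives of the outputs
    C = np.cov(x, rowvar=False)                     # C_ab
    invC = np.linalg.inv(C)
    dμ_dθ = np.mean(dx_dθ, axis=0)                  # ∂μ_a/∂θ_α
    F = np.einsum("ai,ab,bj->ij", dμ_dθ, invC, dμ_dθ)
    I = np.eye(C.shape[0])
    Λ2 = np.linalg.norm(C - I) + np.linalg.norm(invC - I)
    r = λ * Λ2 / (Λ2 - np.exp(-α * Λ2))
    return -np.linalg.slogdet(F)[1] + r * Λ2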
- Parameters
δθ (float(n_params,)) – Size of perturbation to model parameters for the numerical derivative
fiducial (list of tf.data.Dataset().as_numpy_iterator()) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for fitting). These are served n_per_device at a time as a numpy iterator from a TensorFlow dataset (see the sketch after this list).
derivative (list of tf.data.Dataset().as_numpy_iterator()) – The simulations generated at parameter values perturbed from the fiducial used to calculate the numerical derivative of network outputs with respect to model parameters (for fitting). These are served n_per_device at a time as a numpy iterator from a TensorFlow dataset.
validation_fiducial (list of tf.data.Dataset().as_numpy_iterator()) – The simulations generated at the fiducial model parameter values used for calculating the covariance of network outputs (for validation). These are served n_per_device at a time as a numpy iterator from a TensorFlow dataset.
validation_derivative (list of tf.data.Dataset().as_numpy_iterator()) – The simulations generated at parameter values perturbed from the fiducial used to calculate the numerical derivative of network outputs with respect to model parameters (for validation). These are served n_per_device at a time as a numpy iterator from a TensorFlow dataset.
fiducial_iterations (int) – The number of iterations over the fiducial dataset
derivative_iterations (int) – The number of iterations over the derivative dataset
derivative_output_shape (tuple) – The shape of the output of the derivatives from the network
fiducial_batch_shape (tuple) – The shape of each batch of fiducial simulations (without input or summary shape)
derivative_batch_shape (tuple) – The shape of each batch of derivative simulations (without input or summary shape)
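A minimal sketch of building such iterators with TensorFlow, assuming the simulations are already held in memory (the variable names and sizes are illustrative, not the library's internals):

import tensorflow as tf

n_s, n_devices, n_per_device = 1000, 2, 100
data = tf.random.normal((n_s, 10))   # placeholder fiducial simulations

# One numpy iterator per XLA device, each serving n_per_device
# simulations at a time
fiducial = [
    tf.data.Dataset.from_tensor_slices(chunk)
    .batch(n_per_device)
    .as_numpy_iterator()
    for chunk in tf.split(data, n_devices)]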
-
_collect_input
(key, validate=False)¶ Returns validation or fitting sets
- Parameters
key (None or int(2,)) – Random number generators not used in this case
validate (bool) – Whether to return the set for validation or for fitting
- Returns
list of tf.data.Dataset().as_numpy_iterator() – The iterators for fiducial simulations for fitting or validation
list of tf.data.Dataset().as_numpy_iterator() – The iterators for derivative simulations for fitting or validation
-
_set_batch_functions
()¶ Creates jitted functions placed on desired XLA devices
So that each set of summaries is calculated on the correct device, the jitted functions are predefined on each of these devices (a sketch follows).
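A minimal sketch of such predefinition, assuming a hypothetical model_apply(w, d) network function. The device argument to jax.jit pins compilation to a given device; note that recent JAX releases deprecate it in favour of explicit sharding:

import jax

def model_apply(w, d):
    # placeholder network: any function mapping a batch of simulations
    # to summaries would do here
    return d[..., :2]

# one jitted copy of the compression function per available XLA device
batch_fns = [jax.jit(model_apply, device=device)
             for device in jax.devices()]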
-
_set_dataset
(prefetch, cache)¶ Transforms the data into lists of tensorflow dataset iterators
- Parameters
prefetch (tf.data.AUTOTUNE or int or None) – How many simulations to prefetch in the tensorflow dataset (a sketch of this transformation follows below)
cache (bool) – Whether to cache simulations in the tensorflow datasets
- Raises
ValueError – If cache and/or prefetch is None
TypeError – If cache and/or prefetch is the wrong type
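A minimal sketch of the kind of transformation described here; the exact private implementation and its validation will differ:

import tensorflow as tf

def make_iterator(data, prefetch=None, cache=False):
    if not isinstance(cache, bool):
        raise TypeError("cache must be a boolean")
    dataset = tf.data.Dataset.from_tensor_slices(data)
    if cache:
        dataset = dataset.cache()    # keep simulations in memory after the first pass
    if prefetch is not None:
        if prefetch is not tf.data.AUTOTUNE and not isinstance(prefetch, int):
            raise TypeError("prefetch must be tf.data.AUTOTUNE or an int")
        dataset = dataset.prefetch(prefetch)  # overlap loading with computation
    return dataset.as_numpy_iterator()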
-
_set_shapes
()¶ Calculates the shapes for batching over different devices
-
_split_dΛ_dx
(dΛ_dx)¶ Returns the gradient of loss function wrt summaries (derivatives)
The gradient of the loss function with respect to the network outputs and their derivatives with respect to the model parameters has to be reshaped and aggregated onto each XLA device, matching the format in which the tensorflow dataset feeds simulations (a sketch follows the return description below).
- Parameters
dΛ_dx (tuple) –
dΛ_dx float(n_s, n_params, n_summaries) – the derivative of the loss function with respect to network summaries
d2Λ_dxdθ float(n_d, 2, n_params, n_summaries) – the derivative of the loss function with respect to the derivative of network summaries with respect to model parameters
- Returns
list – a list of sets of derivatives of the loss function with respect to network summaries placed on each XLA device
list – a list of sets of derivatives of the loss function with respect to the derivative of network summaries with respect to model parameters
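A minimal sketch of that reshaping and placement, with illustrative shapes and names (this is not the private method itself):

import jax
import jax.numpy as np

n_s, n_summaries, n_per_device = 1000, 2, 100
devices = jax.devices()
dΛ_dx = np.zeros((n_s, n_summaries))    # placeholder loss gradient

# one slab per device, reshaped into n_per_device-sized batches to
# match the order in which the dataset iterators serve simulations
split_dΛ_dx = [
    jax.device_put(slab.reshape((-1, n_per_device, n_summaries)), device)
    for slab, device in zip(np.split(dΛ_dx, len(devices)), devices)]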
-
get_gradient
(dΛ_dx, w, key=None)¶ Aggregates gradients together to update the network parameters
To avoid having to calculate the gradient with respect to all the simulations at once, the gradient calculation is aggregated by addition: the simulations are looped over again and combined with the derivative of the loss function with respect to the network outputs (and their derivatives with respect to the model parameters). Whilst this is expensive, it is necessary because an accurate stochastic estimate of the Fisher information cannot be made, so all the available simulations must be used, which is probably too many to fit in memory at once (a sketch follows the return description below).
- Parameters
dΛ_dx (tuple) –
dΛ_dx float(n_s, n_params, n_summaries) – the derivative of the loss function with respect to network summaries
d2Λ_dxdθ float(n_d, 2, n_params, n_summaries) – the derivative of the loss function with respect to the derivative of network summaries with respect to model parameters
w (list) – Network parameters
key (None or int(2,)) – Random number generator used in SimulatorIMNN
- Returns
The gradient of the loss function with respect to the network parameters calculated by aggregating
- Return type
list
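A minimal sketch of that aggregation using a vector-Jacobian product, assuming a hypothetical model_apply(w, d) network function; jax.vjp contracts \(\partial\Lambda/\partial{\bf x}\) with \(\partial{\bf x}/\partial{\bf w}\) without ever forming the full Jacobian:

import jax

def accumulate_gradient(w, batches, dΛ_dx_batches, model_apply):
    total = None
    for d, dΛ_dx in zip(batches, dΛ_dx_batches):
        # second pass through the network: vjp applies the chain rule
        # (∂Λ/∂x)(∂x/∂w) for this batch of simulations
        _, vjp = jax.vjp(lambda w_: model_apply(w_, d), w)
        (grad,) = vjp(dΛ_dx)
        # aggregate by addition across batches
        total = grad if total is None else jax.tree_util.tree_map(
            lambda a, b: a + b, total, grad)
    return total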
-
get_summaries
(w, key=None, validate=False)¶ Gets all network outputs and derivatives wrt model parameters
Selects either the fitting or validation set and loops through the iterators on each XLA device, passing the simulations through the network to get the network outputs. These are then pushed back to the host for the computation of the loss function (a sketch follows the return description below).
The fiducial simulations are processed first, followed by the simulations varied with respect to the model parameters for the derivatives.
- Parameters
w (list or None, default=None) – The network parameters if wanting to calculate the Fisher information with a specific set of network parameters
key (int(2,) or None, default=None) – A random number generator for generating simulations on-the-fly
validate (bool, default=False) – Whether to get summaries of the validation set
- Returns
float(n_s, n_summaries) – The set of all network outputs used to calculate the covariance
float(n_d, 2, n_params, n_summaries) – The outputs of the network of simulations made at perturbed parameter values to construct the derivative of the network outputs with respect to the model parameters numerically
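A minimal sketch of that loop, assuming the per-device jitted functions and iterators from the earlier sketches:

import jax
import jax.numpy as np

def collect_summaries(w, batch_fns, iterators, n_iterations):
    summaries = []
    for fn, iterator in zip(batch_fns, iterators):
        for _ in range(n_iterations):
            # compress one batch on its device, then pull it back to host
            summaries.append(jax.device_get(fn(w, next(iterator))))
    return np.concatenate(summaries)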
-
get_summary
(inputs, w, θ, derivative=False, gradient=False)¶ Returns a single summary of a simulation or its gradient
- Parameters
inputs (float(input_shape) or tuple) – A single simulation to pass through the network, or a tuple of
dΛ_dx float(input_shape, n_params) – the derivative of the loss function with respect to a network summary
d float(input_shape) – a simulation to compress with the network
w (list) – The network parameters
θ (float(n_params,)) – The value of the parameters to generate the simulation at (fiducial), unused if not simulating on the fly
derivative (bool, default=False) – Whether a derivative of the simulation with respect to model parameters is also passed. This must be False for NumericalGradientIMNN
gradient (bool, default=False) – Whether to calculate the gradient with respect to model parameters