Bayesian Model Inference ¶

This chapter describes the use of SPM’s Bayesian Model Inference capabilities. For a fuller background on this topic see (Penny et al. 2010). We illustrate the methods using a DCM for fMRI study of the language system.

Background¶

The neuroimaging data derive from an fMRI study on the cortical dynamics of intelligible speech (Leff et al. 2008). This study applied dynamic causal modelling of fMRI responses to investigate activity among three key multimodal regions: the left posterior and anterior superior temporal sulcus (subsequently referred to as regions P and A respectively) and pars orbitalis of the inferior frontal gyrus (region F). The aim of the study was to see how connections among regions depended on whether the auditory input was intelligible speech or time-reversed speech.

The basic DCM, from which all other models were derived, is shown in figure 1.1. Auditory input enters region P and the three areas have full intrinsic connectivity. The modulatory input, encoding whether the auditory stimulus was speech or reversed speech, was then allowed to modulate a subset of connections in the model. These are the forward and backward connections between P and F, and the forward and backward connections between P and A. As the modulation of each of these four connections is either present or absent, this results in $2^4=16$ different DCMs.
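The size of the model space follows from treating each of the four candidate connections as a binary switch. The following is a minimal MATLAB sketch of that counting argument; the connection labels and the ordering of the rows are illustrative assumptions and need not match the model numbering used in the archive.

```matlab
% Enumerate all on/off combinations of the four modulated connections.
% Labels and row order are illustrative; they are not taken from the archive.
conn    = {'P->F', 'F->P', 'P->A', 'A->P'};
nconn   = numel(conn);
combos  = dec2bin(0:2^nconn - 1) - '0';   % [16 x 4] matrix of 0/1 flags
nmodels = size(combos, 1);

for m = 1:nmodels
    active = conn(logical(combos(m, :)));
    if isempty(active)
        label = '(no modulation)';
    else
        label = strjoin(active, ', ');
    end
    fprintf('model %2d: %s\n', m, label);
end
```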

Figure 1.1: All DCMs were fully connected, i.e. there were endogenous connections between all three regions (dotted lines): (i) left posterior temporal sulcus (region P), (ii) left anterior superior temporal sulcus (region A) and (iii) pars orbitalis of the inferior frontal gyrus (region F). Auditory input enters region P. The sixteen models differ in their modulatory connectivity (solid lines).

Data¶

An archive containing 16 DCMs for each of 12 subjects can be downloaded from the SPM web page¹. This archive is called dcm_bms.zip. When you extract the data onto your computer a number of subdirectories will be created - one for each of the 12 subjects. The 16 DCMs for each subject are then available in these subject-specific directories. You can load one of these into SPM and examine the information contained therein.

These DCM files contain the usual information, e.g. the original time series from each region of interest are available in DCM.xY(1) for region 1, wherein DCM.xY(1).name='PSTS_6' indicates this is the posterior temporal region. The estimated parameter values are available in DCM.Ep. You should note that these DCMs were specified and estimated using SPM revision 3894 (from May 2010) and that these DCM structures differ from earlier SPM releases.

Also in the Zip archive is a file called model_space.mat. If you load model_space, you will see that it contains a data structure called subj with subfields ‘sess’ and then ‘model’. If you type e.g. subj(1).sess(1).model(1) you will see four further subfields containing information about the first DCM for subject 1. These comprise the filename (fname), the free energy approximation to the model evidence (F), posterior mean parameters (Ep), and the posterior covariance of parameters (Cp).
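To see how the nested subj structure maps onto a subjects-by-models matrix of log evidences, the sketch below builds a small stand-in structure with the same field layout and collects the F values. All numbers and filenames here are made up for illustration; with the real data you would instead load model_space.mat and skip the first loop.

```matlab
% Build a toy stand-in for the subj structure (all values are made up).
nsub = 3;
nm   = 4;
for n = 1:nsub
    for m = 1:nm
        subj(n).sess(1).model(m).fname = sprintf('sub%02d/DCM_m%d.mat', n, m);
        subj(n).sess(1).model(m).F     = -100*n - m;   % fake log evidence
    end
end

% Collect the log evidences into an [nsub x nm] matrix.
F = zeros(nsub, nm);
for n = 1:nsub
    F(n, :) = [subj(n).sess(1).model.F];
end
disp(F)
```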

Using a ‘model space’ file makes SPM’s Bayesian model comparison (BMC) routines much simpler to use. If this file is not specified it will be automatically generated (from the DCM files) the first time you use the BMC routines (see below). Alternatively, you can easily create your own model space file. To get the current file to work on your system you will need to change all of the filenames (fname) so that they correspond to the positions of the DCM files in your filespace. You can do this with the model_space_filenames function (also provided in the Zip archive).

Analysis¶

After unzipping the archive, correct the model space filenames using the command subj=model_space_filenames(subj,new_base_dir), where new_base_dir is the name of the directory where you have unzipped the archive. This should be something like 'C:\blah\blah\blah\dcm-base-files'. Then save subj back in the model space file using save model_space subj.

Single Family¶

Now open SPM and in the Menu window go to Batch, SPM, DCM, Bayesian Model Selection, Model Inference. This will open SPM’s batch editor. Select an appropriate directory (e.g. where you unzipped the archive), highlight Load model space and select the model_space.mat file. For inference method select ‘FFX’. Save the batch job as ffx_all_models.mat, then press the green play button to run the job. This will produce figure 1.2, showing that model 6 is the best model.
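Under FFX inference with a uniform prior over models, the posterior model probabilities follow from the log evidences summed over subjects (Penny et al. 2010). The following is a minimal sketch of that calculation on a made-up log-evidence matrix, not the tutorial data; the names F and SF mirror the fields stored in BMS.mat, but the numbers are illustrative only.

```matlab
% Made-up log evidences: rows are subjects, columns are models.
F = [-310 -305 -300;
     -412 -409 -410;
     -205 -199 -201];

SF = sum(F, 1);                 % summed log evidence per model
w  = exp(SF - max(SF));         % subtract the maximum for numerical stability
post = w / sum(w);              % posterior model probabilities (uniform prior)

[~, best] = max(post);
fprintf('best model: %d (posterior probability %.3f)\n', best, post(best));
```

Subtracting the maximum before exponentiating leaves the normalised result unchanged and avoids underflow when the summed log evidences are large and negative.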

We can now go back and load the ffx_all_models.mat job in the batch editor (press the Batch button) and change the inference method to RFX. Running the job will produce something like the results in figure 1.3 (note that the RFX approach uses a sampling procedure with a different random initial seed on each run, so the results can vary slightly from run to run). Again, model 6 is the best model, but not by much. These RFX results will be stored in the same BMS.mat file as the FFX results.
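The exceedance probability of a model is the probability that it is more frequent in the population than any other model considered. Given samples of the model frequencies, it can be estimated as the fraction of samples in which that model has the largest frequency. The sketch below illustrates this on toy samples drawn from a Dirichlet density with made-up integer counts; it is not SPM's sampling routine, and the variable names only echo the fields described later in this chapter.

```matlab
% Toy Dirichlet samples standing in for sampled model frequencies r.
% alpha holds made-up integer counts, so Gamma(alpha,1) variates can be
% built from sums of exponential variates without any toolbox.
alpha = [6 2 1];
nm    = numel(alpha);
nsamp = 5000;
g = zeros(nsamp, nm);
for m = 1:nm
    g(:, m) = -sum(log(rand(nsamp, alpha(m))), 2);
end
r_samp = bsxfun(@rdivide, g, sum(g, 2));       % each row sums to one

exp_r = mean(r_samp, 1);                       % expected model frequencies
[~, winner] = max(r_samp, [], 2);              % most frequent model per sample
xp = accumarray(winner, 1, [nm 1])' / nsamp;   % exceedance probabilities
disp([exp_r; xp])
```

Because the samples are random, the printed values change slightly from run to run, which is the same effect noted above for the RFX results.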

Bayesian Model Averaging¶

Now go back into the batch editor and reload the ffx_all_models.mat job. Highlight BMA, and select Choose family (instead of ‘Do not compute’). Accept the ‘Winning Family’ option. The BMA results will be saved in the same BMS.mat file as the previous analyses. Now go ahead and press the green play button. SPM will do the FFX model inference (again), but will also implement a weighted average of the model parameters where the weights are given by the evidence for each model, as described in (Penny et al. 2010). After the averaging is complete, SPM will report the number of models in Occam’s window. This should be 10 models (models 5,6,7,8,11,12,13,14,15,16).
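The idea of evidence-weighted averaging within Occam's window can be sketched as follows. This is a simplified illustration that averages point estimates directly, whereas SPM averages by sampling from the posterior densities; the rule used here to define the window (keep models whose posterior probability is at least a fraction oddsr of the best model's) and all numbers are assumptions made for the example.

```matlab
% Made-up posterior model probabilities and per-model estimates of one
% modulatory parameter (illustrative numbers, not the tutorial results).
post  = [0.60 0.30 0.08 0.02];
b     = [0.80 0.50 0.00 0.20];
oddsr = 0.05;                       % hypothetical odds-ratio threshold

% Assumed rule: keep models whose posterior is at least oddsr times the best.
in_window = post >= oddsr * max(post);
Nocc      = nnz(in_window);

% Renormalise the weights within the window and average.
w     = post(in_window) / sum(post(in_window));
b_bma = sum(w .* b(in_window));
fprintf('%d models in window, averaged parameter = %.4f\n', Nocc, b_bma);
```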

To look at the BMA results, go to the Menu window and press the Dynamic Causal Modelling button. Then select Average, select BMA, and then the BMS.mat file just created. If you then highlight the tab (top left) to select the modulatory variables you should get the plot shown in figure 1.4.

Figure 1.4: Bayesian model averaging over all 16 models.

Family level inference¶

The results so far have made no use of SPM’s family inference procedure. Or rather, they have, but have assumed that all models belong to the same family.

Open the ffx_all_models.mat batch file again, highlight Family inference and select Load family. Highlight Load family and select the pf_family.mat file contained in the Zip archive. This comprises two families: (i) those with a forward connection from P to F (‘PF’), and (ii) those without it (‘No PF’). Set the BMA option to ‘Do not compute’. Select a new directory you have created for this analysis (e.g. pf-family) and run the job. SPM will create the family level inference plot shown in figure 1.5. This gives a 90% posterior probability to models with the P to F connection.
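Family level inference under FFX can be thought of as summing posterior model probabilities within each family, with a uniform prior over families shared evenly among each family's models (Penny et al. 2010). The sketch below shows this with four made-up models in two families; the numbers are illustrative, and the name partition simply echoes the field of the same name stored in BMS.mat.

```matlab
SF        = [-10 -12 -11 -15];   % made-up summed log evidences, one per model
partition = [1 1 2 2];           % family index of each model
K         = max(partition);      % number of families

% Uniform prior over families, split evenly among each family's models.
Nk      = accumarray(partition(:), 1)';      % number of models per family
prior_m = 1 ./ (K * Nk(partition));

% Posterior over models, then summed within each family.
w        = exp(SF - max(SF)) .* prior_m;
post_m   = w / sum(w);
post_fam = accumarray(partition(:), post_m(:))';
disp(post_fam)
```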

We will now repeat the analysis but with RFX inference. You should see a result similar to that shown in figure 1.6.

Summary Statistics and Group Analyses¶

The group mean DCM parameters can be easily obtained from the MATLAB command window by loading the BMS.mat file and then typing: BMS.DCM.ffx.bma.Ep.

The subject specific mean DCM parameters can be obtained as follows: BMS.DCM.ffx.bma.Eps(n), where $n$ is the subject number. For random-effects change ffx to rfx.

If we are interested in the modulatory connection from region 1 to region 3 (that is modulated by the second input), then the mean value of this for Subject 10 is given by BMS.DCM.ffx.bma.Eps(10).B(3,1,2) (which should be 0.7475). The mean connection values for all 12 subjects can be obtained with the MATLAB syntax for i=1:12, b(i) = BMS.DCM.ffx.bma.Eps(i).B(3,1,2); end.

These subject specific mean parameters can then act as summary statistics for a standard group analysis. For example, to look for significant differences in a modulatory parameter between e.g. a control group and a patient group, one would implement a two-sample t-test on the corresponding per-subject values (such as the entries of the vector b computed above). Similarly, if one has 3 groups one would use a 3-level ANOVA.
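As an illustration of the summary-statistic approach, the sketch below runs a pooled-variance two-sample t-test on two sets of per-subject parameter values. The group values are made up for the example (the tutorial data contain a single group of 12 subjects), and the test is written out by hand using only base MATLAB functions so that no toolbox is required.

```matlab
% Hypothetical per-subject values of one modulatory parameter.
b_controls = [0.70 0.55 0.81 0.62 0.75 0.68];
b_patients = [0.40 0.52 0.35 0.47 0.44 0.50];

n1 = numel(b_controls);
n2 = numel(b_patients);
df = n1 + n2 - 2;

% Pooled variance and t-statistic (equal-variance two-sample t-test).
sp2 = ((n1 - 1)*var(b_controls) + (n2 - 1)*var(b_patients)) / df;
t   = (mean(b_controls) - mean(b_patients)) / sqrt(sp2*(1/n1 + 1/n2));

% Two-sided p-value via the regularised incomplete beta function.
p = betainc(df / (df + t^2), df/2, 0.5);
fprintf('t(%d) = %.3f, p = %.4g\n', df, t, p);
```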

BMS.mat file¶

The BMS structure saved in the BMS.mat file contains the following variables²:

BMS.DCM.ffx/rfx¶

(fixed-effects (FFX) / random-effects (RFX) analysis)


.data	path to model_space.mat file (see below).
.F_fname	path to file containing the log evidence matrix, F, (if this option is specified).
.F	matrix of log model evidences for all subjects and models, [nsub $\times$ nm].
.SF	vector of summed log evidences over subjects [1 $\times$ nm].
.model	results from model level inference (see below).
.family	results from family level inference (see below).
.bma	results from Bayesian model averaging (see below).

Model level results¶

Fixed-effects:


model
	.prior	model priors, $p(m)$, [1 $\times$ nm].
	.subj_lme	log model evidence matrix, [nsub $\times$ nm].
	.like	model likelihoods, $p(Y \mid m)$, [1 $\times$ nm].
	.posts	model posterior probabilities, $p(m \mid Y)$, [1 $\times$ nm].

Random-effects (different from fixed-effects):


model
	.alpha0	initial Dirichlet parameters (prior counts), $\alpha_0$, [1 $\times$ nm].
	.exp_r	model posterior means, $\langle r \mid Y \rangle$, [1 $\times$ nm].
	.xp	model exceedance probabilities, $\psi_m$, [1 $\times$ nm].
	.r_samp	samples from the model posterior density, $p(r \mid Y)$, [nsamp $\times$ nm].
	.g_post	posterior model probabilities for subject n and model m, $p(m_n \mid Y)$, [nsub $\times$ nm].

Family level results¶

Fixed-effects:


family
	.names	family names, e.g. $\{$‘F1’, ‘F2’, ‘F3’$\}$.
	.partition	partition vector assigning each model to a family [1 $\times$ nm].
	.infer	inference method (‘ffx’ or ‘rfx’).
	.prior	family priors, $p(f_k)$, [1 $\times$ nfam].
	.post	family posterior probabilities, $p(f_k \mid Y)$, [1 $\times$ nfam].
	.like	family likelihoods, $p(Y \mid f_k)$, [1 $\times$ nfam].

Random-effects (different from fixed-effects):


family
	.Nsamp	number of samples used in Gibbs sampling (default = 20000).
	.prior	family type of priors (‘F-unity’, $\alpha_0=1$, for each family, is the default;
		other option, ‘M-unity’, $\alpha_0=1$, for each model) .
	.alpha0	initial values of the Dirichlet parameters (prior counts), $\alpha_{prior}(m)$, [1 $\times$ nfam].
	.s_samp	samples from family posterior density, $p(s \mid Y)$, [nsamp $\times$ nfam].
	.exp_r	family posterior means, $\langle s_k \mid Y \rangle$, [1 $\times$ nfam].
	.xp	family exceedance probabilities, $\psi_k$, [1 $\times$ nfam].

Bayesian model averaging (BMA)¶

Fixed-effects:


bma
	.nsamp	number of samples used to average parameters (default = 10000).
	.oddsr	posterior odds ratio, $\pi_{OCC}$, (number of models in Occam’s window,
		default = 0).
	.Nocc	number of models in Occam’s window.
	.Mocc	index of models in Occam’s window, [1 $\times$ nm].
	.indx	index of models in Occam’s window (different for each subject in RFX),
		[1 $\times$ nm].
	.a	samples from posterior density over DCM.a parameters [dima $\times$ nsamp].
	.b	samples from posterior density over DCM.b parameters [dimb $\times$ nsamp].
	.c	samples from posterior density over DCM.c parameters [dimc $\times$ nsamp].
	.d	samples from posterior density over DCM.d parameters [dimd $\times$ nsamp].
	.mEp	mean DCM parameters [1 $\times$ 1 struct].
	.sEp	standard deviation of DCM parameters [1 $\times$ 1 struct].
	.mEps	mean DCM parameters per subject [1 $\times$ nsub struct].
	.sEps	standard deviation DCM parameters per subject [1 $\times$ nsub struct].

Random-effects - same variables as in fixed-effects.

model_space.mat file¶

This structure is created automatically if it doesn’t exist in the chosen directory and can be loaded for subsequent analyses as a faster alternative to reading the DCM.mat files. The model_space.mat file contains the following structure:


subj(nsub).sess(nsess).model(nm)
	.fname	path to DCM.mat file.
	.F	log-evidence (free energy).
	.Ep	parameter estimates: conditional expectation,
		[np $\times$ 1].
	.Cp	parameter estimates: conditional covariance,
		[np $\times$ np].

For a detailed description of all the variables and methods please see (Penny et al. 2010) and (Stephan et al. 2009).

Leff, A. P., T. M. Schofield, K. E. Stephan, J. T. Crinion, K. J. Friston, and C. J. Price. 2008. "The Cortical Dynamics of Intelligible Speech." *J. Neurosci.* 28: 13209--15.

Penny, W. D., K. E. Stephan, J. Daunizeau, M. J. Rosa, K. J. Friston, T. M. Schofield, and A. P. Leff. 2010. "Comparing Families of Dynamic Causal Models." *PLoS Comput Biol* 6 (3): e1000709.

Stephan, K. E., W. D. Penny, J. Daunizeau, R. Moran, and K. J. Friston. 2009. "Bayesian Model Selection for Group Studies." *NeuroImage* 46 (4): 1004--1017.

¹ Bayesian comparison of Dynamic Causal Models: http://www.fil.ion.ucl.ac.uk/spm/data/dcm_bms/
² nm = number of models; nfam = number of families; nsub = number of subjects; nsamp = number of samples; dima/b/c/d = dimensions of a/b/c/d DCM parameters; np = number of model parameters; nsess = number of sessions.

