Elsevier

NeuroImage

Volume 56, Issue 2, 15 May 2011, Pages 616-626
NeuroImage

Decoding brain states from fMRI connectivity graphs

https://doi.org/10.1016/j.neuroimage.2010.05.081Get rights and content

Abstract

Functional connectivity analysis of fMRI data can reveal synchronised activity between anatomically distinct brain regions. Here, we extract the characteristic connectivity signatures of different brain states to perform classification, allowing us to decode the different states based on the functional connectivity patterns. Our approach is based on polythetic decision trees, which combine powerful discriminative ability with interpretability of results. We also propose to use ensemble of classifiers within specific frequency subbands, and show that they bring systematic improvement in classification accuracy. Exploiting multi-band classification of connectivity graphs is also proposed, and we explain theoretical reasons why the technique could bring further improvement in classification performance. The choice of decision trees as classifier is shown to provide a practical way to identify a subset of connections that distinguishes best between the conditions, permitting the extraction of very compact representations for differences between brain states, which we call discriminative graphs. Our experimental results based on strict train/test separation at all stages of processing show that the method is applicable to inter-subject brain decoding with relatively low error rates for the task considered.

Research highlights

► Whole-brain functional connectivity can be used for inter-subject decoding. ► Embedding connectivity graphs in vector space is effective for graph matching. ► Ensembling techniques help discrimination, both within and between frequency subbands.

Introduction

Traditional fMRI analysis consists of univariate statistical hypothesis testing to assess changes in the activity of each brain voxel induced by the stimulation paradigm (Frackowiak et al., 1997). More recently, approaches derived from supervised machine learning – commonly termed “brain decoding” in the field of neuroimaging – have shown that it is possible to exploit more subtle relationships in voxels' intensity patterns (Haxby et al., 2001, Haynes & Rees, 2006, Norman et al., 2006). These methods rely on a classifier to predict the subject's brain state from the BOLD responses in a set of selected voxels, such as visual (Cox & Savoy, 2003, Haynes & Rees, 2005, Kamitani & Tong, 2005, Thirion et al., 2006, Kay et al., 2008, Miyawaki et al., 2008) or auditory cortices (Ethofer et al., 2009). Results from brain decoding are often remarkable since they clearly reach beyond the possibilities of univariate techniques, but also because they are able to uncover information from fine-grained cortical activity despite the relatively low spatial resolution of fMRI. Furthermore, instead of a fixed spatial window, one can also apply classification to a so-called “searchlight” that slides over the whole-brain data, such that the classification success for each position of the spotlight can then be mapped to show brain regions that carry discriminative information between different conditions (Kriegeskorte et al., 2006). Another interesting approach is to use spatiotemporal observations as an input to the classifier (Mitchell et al., 2004, Mourao-Miranda et al., 2007).

The study of functional connectivity is concerned with the temporal coherence between neurophysiological events observed in spatially remote brain regions. In early work, correlation with a seed voxel was investigated and revealed bilateral coactivation between sensory cortices (Biswal et al., 1995, Lowe et al., 1998). Further advances have been driven by unsupervised methods such as source separation – mainly principal components (Friston et al., 1993) and independent components analysis (McKeown et al., 1998, Calhoun et al., 2002, Beckmann & Smith, 2004) which allow to identify large-scale cortical networks – and by other methods such as dynamic causal modelling, which tries to establish effective connectivity and requires prior information about the neurological network to investigate Friston et al. 2003. A recent method related to our work proposed to use resting-state correlations between regions of interest as features for an SVM classifier (Craddock et al., 2009). Another attractive methodology to investigate functional networks is to rely on mathematical graph theory; i.e., constructing the (undirected) graph from temporal correlation matrices and computing related measures; e.g., node degree, hubbiness, and so on (Sporns et al., 2000, Salvador et al., 2005). This methodology has brought new insights in functional connectivity at resting state, such as the small-world organisation of cortical networks at low temporal frequencies (Achard et al., 2006).

Here, we bring together brain decoding and graph representations based on functional connectivity measures. These measures, such as temporal correlation, are performed over a given period of time and reflect the timecourse resemblances between different regions. Moreover, we estimate connectivity at different temporal scales using the wavelet transform as a preprocessing step. Then, we build a classifier trained on functional connectivity graphs of a group of subjects to distinguish between different brain states of an unseen subject. The aim of our approach is to identify the connections that are most discriminative between brain states, and to obtain relevant visual representation of the data for neuroscience studies.

Previous brain decoding techniques primarily rely on linear support vector machines (SVMs), which use a soft-margin hyperplane to separate classes. In the present work, we propose polythetic decision trees that fit a hyperplane using the most discriminative features (i.e., connections) at each level, such that potentially complex and non-linear class boundaries can be obtained by multilevel trees. Efficient and effective learning of this type of decision trees relies on recent advances in pattern recognition (Friedman et al., 2000, Gama, 2004, Landwehr et al., 2005) and provides embedded feature selection which yields a compact discriminant function whose parameters are amenable to interpretation. Their variance properties make them good candidates for ensembling, steering the classification strategy towards slightly weaker but simpler (lower capacity) classifiers, which is a desirable behaviour in high-dimensional learning, a classical situation in fMRI where the number of dimensions is much higher than the number of training examples.

Our paper is organised as follows. In the Methods section, we describe our data processing pipeline together with the details of the proposed methodology. Next, we illustrate the feasibility of our approach by a proof-of-concept; i.e., an fMRI experiment with block-based stimulation paradigm (watching short movies) with long resting periods. We extract the set of connections that is the most discriminative between rest and stimulation at different temporal scales. The method is able to correctly classify the conditions in a leave-one-subject-out cross-validation setting. Interestingly, we find that the low-frequency correlations of the BOLD signal (below 0.11 Hz) are the most informative. The discriminative network also confirms a differential modulation of sensory areas, in particular within the visual system, and midline brain areas during movie and rest conditions.

Section snippets

Preprocessing and data representation

The preprocessing steps are illustrated schematically in Fig. 1, and explained in detail below.

After realignment of the functional volumes using SPM5,1 we use the IBASPM toolbox (Tzourio-Mazoyer et al., 2002, Alemán-Gómez et al., 2006) to build an individual brain atlas based on the structural MRI, containing M = 90 anatomical regions. While this is a relatively coarse atlas, it is an essential step to allow for inter-subject variability and enable

Subjects and data acquisition

The N = 15 subjects (4 males, 11 females) were aged between 18 and 36 years old, without history of neurological disorders. They had given written informed consent to participate in the study, which was performed in accordance with the local Ethics Committee of the University of Geneva. Scanning was performed on a Siemens 3 T Tim Trio. Functional imaging data were acquired in two sessions using gradient-echo echo-planar imaging (TR/TE/FA = 1.1 s/27 ms/90°, matrix = 64 × 64, voxel size = 3.75 × 3.75 × 4.2 mm3, 21

Classification

Classifier training and testing are performed using the leave-one-subject-out procedure outlined in Classification section of Methods. Stratified classification results are given in Table 1 (column L = 1, α = 100%): in each of the 15 cross-validation folds, 2 tests are performed, one for the resting and one for the movies condition of a single subject. Thus, the granularity of results is about 3%. One striking result is that lower-frequency subbands (3 and 4) have much more discriminative power

Classification

The relatively high number of connections that are retained by statistical feature selection (Graph embedding and feature selection) in the low-frequency subbands, with respect to the number retained in higher-frequency subbands, hints at the presence of resting-state networks that are consistent across subjects (Damoiseaux et al., 2006, Mantini et al., 2007), which in turn yields relatively low inter-subject standard deviation on the weights of graph edges that these networks comprise. Not

Conclusion

In summary, we have proposed a classification approach to infer brain states from functional connectivity graphs, instead of the commonly used brain voxel activation values. We have shown that the approach is applicable to inter-subject brain decoding with good results, and that interpretable output can be generated. We have demonstrated the feasibility using a cognitive task and compared the discriminative connectivity graph with SPM-style activation patterns. The potential of the proposed

Acknowledgments

The authors wish to thank the anonymous reviewers for their valuable input. This work was supported in part by the Swiss National Science Foundation (grant PP00P2-123438), in part by the Société Académique de Genève and the FOREMANE foundation, and in part by the Center for Biomedical Imaging (CIBM) of the Geneva and Lausanne Universities, EPFL, and the Leenaards and Louis-Jeantet foundations.

References (73)

  • A. Zalesky et al.

    Whole-brain anatomical networks: does the choice of nodes matter?

    Neuroimage

    (2010)
  • S. Achard et al.

    A resilient, low-frequency, small-world human brain functional network with highly connected association cortical hubs

    The Journal of Neuroscience

    (2006)
  • Y. Alemán-Gómez et al.

    IBASPM: toolbox for automatic parcellation of brain structures

  • G. Battle

    A block spin construction of ondelettes. Part I: Lemarié functions

    Communications in Mathematical Physics

    (1987)
  • C. Beckmann et al.

    Probabilistic independent component analysis for functional magnetic resonance imaging

    IEEE Transactions on Medical Imaging

    (2004)
  • Benjamini Y., Hochberg Y., 1995. Controlling the false discovery rate: a practical and powerful approach to multiple...
  • B. Biswal et al.

    Functional connectivity in the motor cortex of resting human brain using echo-planar MRI

    Magnetic Resonance in Medicine

    (1995)
  • Bluhm R.L., Miller J., Lanius R.A., Osuch E.A., Boksman, K., Neufeld, R.W.J., Théberge, J., Schaefer, B., Williamson,...
  • P.J. Boland

    Majority system and the Condorcet jury theorem

    Statistician

    (1989)
  • Bray S., Chang C., Hoeft F., 2009. Applications of multivariate pattern classification analyses in developmental...
  • Breiman L., Aug. 1996. Bagging predictors. Machine Learning 24 (2), 123–140. URL...
  • L. Breiman et al.

    Classification and Regression Trees

    (1984)
  • Buckner, R., Andrews-Hanna, J., Schacter, D., 2008. The brain's default network: anatomy, function, and relevance to...
  • Bunke H., Shearer K., Mar. 1998. A graph distance metric based on the maximal common subgraph. Pattern Recognition...
  • V.D. Calhoun et al.

    Independent component analysis of fMRI data in the complex domain

    Magnetic Resonance in Medicine

    (2002)
  • D. Conte et al.

    Thirty years of graph matching in pattern recognition

    International Journal of Pattern Recognition and Artificial Intelligence

    (2004)
  • Craddock R.C., Holtzheimer P.E., Hu, X.P., Mayberg H.S., Dec 2009. Disease state prediction from resting state...
  • Dale A.M., Fischl B., Sereno M.I., Feb 1999. Cortical surface-based analysis. I. segmentation and surface...
  • J.S. Damoiseaux et al.

    Consistent resting-state networks across healthy subjects

    Proceedings of the National Academy of Science

    (2006)
  • R.O. Duda et al.

    Pattern Classification

    (2001)
  • Fair D.A., Schlaggar B.L., Cohen A.L., Miezin F.M., Dosenbach N.U., Wenger K.K., Fox M.D., Snyder A.Z., Raichle M.E.,...
  • Fischl B., Sereno M.I., Dale A.M., Feb 1999. Cortical surface-based analysis. II: Inflation, flattening, and a...
  • R. Frackowiak et al.

    Human Brain Function

    (1997)
  • Friedman J., Hastie T., Tibshirani R., 2000. Additive logistic regression: a statistical view of boosting. Annals of...
  • Friston K., Frith C., Liddle P., Frackowiak R., 1993. Functional connectivity: the principal-component analysis of...
  • Gama J., Jun. 2004. Functional trees. Machine Learning 55 (3), 219–250. URL...
  • Cited by (251)

    View all citing articles on Scopus
    View full text