Multivariate techniques including principal component analysis (PCA) have been used to identify signature patterns of regional change in functional brain images. We have developed an algorithm to identify reproducible network biomarkers for the diagnosis of neurodegenerative disorders, assessment of disease progression, and objective evaluation of treatment effects in patient populations.
The scaled subprofile model (SSM)1-4 is a multivariate PCA-based algorithm that identifies major sources of variation in patient and control group brain image data while rejecting lesser components (Figure 1). Applied directly to voxel-by-voxel covariance data of steady-state multimodality images, an entire group image set can be reduced to a few significant linearly independent covariance patterns and corresponding subject scores. Each pattern, termed a group invariant subprofile (GIS), is an orthogonal principal component that represents a spatially distributed network of functionally interrelated brain regions. Large global mean scalar effects that can obscure smaller network-specific contributions are removed by the inherent logarithmic conversion and mean centering of the data2,5,6. Subjects express each of these patterns to a variable degree represented by a simple scalar score that can correlate with independent clinical or psychometric descriptors7,8. Using logistic regression analysis of subject scores (i.e. pattern expression values), linear coefficients can be derived to combine multiple principal components into single disease-related spatial covariance patterns, i.e. composite networks with improved discrimination of patients from healthy control subjects5,6. Cross-validation within the derivation set can be performed using bootstrap resampling techniques9. Forward validation is easily confirmed by direct score evaluation of the derived patterns in prospective datasets10. Once validated, disease-related patterns can be used to score individual patients with respect to a fixed reference sample, often the set of healthy subjects that was used (with the disease group) in the original pattern derivation11. These standardized values can in turn be used to assist in differential diagnosis12,13 and to assess disease progression and treatment effects at the network level7,14-16. We present an example of the application of this methodology to FDG PET data of Parkinson’s Disease patients and normal controls using our in-house software to derive a characteristic covariance pattern biomarker of disease.
Neurodegenerative disorders have been extensively studied using techniques that localize and quantify abnormalities of brain metabolism as well as non-inferential methods that study regional interactions17. Data-driven multivariate analytical strategies such as principal component analysis (PCA)1,2,4,18 and independent component analysis (ICA)19,20, as well as supervised techniques such as partial least squares (PLS)21 and ordinal trends canonical variates analysis (OrT/CVA)22 can reveal characteristic patterns or “networks” of interrelated activity. The basics of multivariate procedures, particularly the scaled subprofile model (SSM)1,2,4-6,18 have been previously described in JoVE3. This PCA-based approach was originally developed to examine abnormal functional covariance relationships between brain regions in steady-state single volume images of cerebral blood flow and metabolism acquired in the resting state of modalities such as PET and SPECT that exhibit high signal-to-noise characteristics. Disease-specific SSM patterns are imaging biomarkers that reflect overall differences in the regional topography in patients compared to normal subjects7,16 and may reflect a single network process or the assimilation of several complex abnormal functions23. Metabolic covariance pattern brain networks are associated with expression values (subject scores) that can distinguish between normal and disease groups and provide network-based measures that correlate with clinical ratings of disease severity. Typically, subject scores for such patterns increase with disease progression and may even be expressed before symptom onset14,24. Indeed, disease-related network biomarkers have been characterized for neurodegenerative disorders such as Parkinson’s disease10 (PD), Huntington’s disease25(HD), and Alzheimer’s disease8 (AD). Importantly, disease-related metabolic topographies have also been identified for atypical parkinsonian movement disorders such as multiple system atrophy (MSA) and progressive supranuclear palsy (PSP). These patterns have been used in concert for the differential diagnosis of individuals with clinically similar “look-alike” syndromes12,13,26.
In contrast, typical fMRI voxel-based univariate methods assess the significance of differences between patients and controls in isolated brain clusters. More recently, methods have been developed to measure functional connectivity between variously defined brain regions27-29. This definition of functional connectivity is restricted to subject and region specific interactions and deviates from the original SSM/PCA concept that refers to the cross-sectional interconnectivity of intrinsic spatially distributed brain network regions1,2,23,30. To their advantage, MRI platforms are easily installed, widely available, non-invasive and typically require shorter scanning time than traditional radiotracer imaging modalities such as PET or SPECT resulting in an upsurge of potential methodologies described in recent literature. However, the resulting time-dependent fMRI signals provide indirect measures of local neural activity31,32. The generally complex analytical algorithms employed have been limited by the large size of datasets, the physiological noise inherent in fMRI signals, as well as the high variability in brain activity that exists between subjects and regions19,23. Although interesting information regarding brain organization can be inferred from the properties of fMRI “networks”, they have not been sufficiently stable to be used as reliable disease biomarkers. Furthermore, the resulting network topographies are not necessarily equivalent to those identified using established functional imaging methodologies such as SSM/PCA. For the most part, rigorous cross-validation of the resulting fMRI topographies has been lacking with few examples of successful forward application of derived patterns in prospective scan data from single cases.
An advantage of PCA covariance analysis lies in its capacity to identify the most significant sources of data variation in the first few principal components but it is ineffective if the prominent eigenvectors represent random noise factors rather than actual intrinsic network response. By selecting only the first few eigenvectors and limiting to those that show significant differences in patient versus normal control scores, we greatly reduce the influence of noise elements. However, for the basic approach described here, these measures may not be adequate to generate robust estimators in a typical fMRI dataset with the exception of the modalities described below.
Thus, because of the stable direct relationship of regional glucose metabolism and synaptic activity33, this methodology has been applied primarily to the analysis of rest state FDG PET data. However, given that cerebral blood flow (CBF) is closely coupled to metabolic activity in the resting state10,11,34, SPECT35,36 and more recently arterial spin labeling (ASL) MRI perfusion imaging methods37,38, have been used to assess abnormal metabolic activity in individual cases. That said, the derivation of reliable spatial covariance patterns with resting state fMRI (rsfMRI) is as previously noted not straightforward 31,32. Even so, preliminary SSM/PCA analysis of rsfMRI data from PD patients and control subjects has revealed some topographical homologies between disease-related patterns identified using the two modalities, PET and amplitude of low-frequency fluctuations (ALFF) of BOLD fMRI39,40. Lastly, we also note that this approach has been applied successfully in voxel based morphometry (VBM) structural MRI data41,42, revealing distinctive spatial covariance patterns associated with age-related volume loss and in further comparisons of VBM and ASL patterns in the same subjects43. The relationship between SSM/PCA spatial covariance topographies and analogous brain networks identified using different analytical approaches and imaging platforms is a topic of ongoing investigation.
1. Data Collection and Preprocessing
2. Perform Multivariate SSM/PCA
Operations for multivariate SSM/PCA (Figure 4) can be performed by external software. The steps itemized below reflect the procedures performed for the most part automatically by our in-house routines (Figure 5) (scanvp/ssmpca, www.feinsteinneuroscience.org also available as an SPM toolbox ssm_pca).
3. Pattern Biomarker Derivation
4. Single-subject Prospective Score Evaluation using a Predetermined Biomarker
A simple application of multivariate SSM/PCA analysis to derive a neuroimaging biomarker pattern for PD is illustrated below. PET FDG images of ten clinically diagnosed PD patients (6M/4F, 59y ± 7y sd) of variable diseased duration (9y ± 5y sd) and ten age and gender matched normal controls (6M/4F, 58y ±7y sd) were analyzed using our ssmpca routine. The twenty corresponding spatially pre-normalized images were initially selected under the categories disease subjects or controls along with the default mask (Figure 3). The program output the first 16 principal component image files and a file listing of their associated scores along with a scree plot of sequential eigenvalues (Figure 6). Sequential PC images and associated bar and scatter graphs were displayed on the screen for initial review as depicted in Figures 7a and 7b. Note that the first PC component accounted for the largest vaf (24%) and its associated subject scores significantly discriminate patients from controls (p=0.0002) based on two-sample t-test comparison and ROC characteristics (AUC=0.99, sensitivity=0.93 for specificity=0.95). For these reasons it could be considered a biomarker by itself and is in fact highly spatially correlated with a previously derived PD biomarker PDRP46,47 (R2=84%, p < 0.001) validated in numerous prospective datasets5. The second pattern (p=0.81, AUC=0.54) and third pattern (p=0.38, AUC=0.63) did not discriminate significantly and may originate from normal processes that are not highly affected by disease. The fourth pattern (vaf=7 %) discriminates to a far lesser degree than the first PC (p=0.13, AUC=0.71) and may be associated with less prominent disease factors. Further PCs were ignored because of decreasing vaf values (< 7%) combined with non-significant discrimination of normal and patient scores (p > 0.4). The degeneration of eigenvalue differences is illustrated in Figure 6.
The first and fourth patterns were vector normalized and linearly combined using our software to determine associated coefficients (.78, .63) for optimum patient/control score discrimination using a logistic discrimination model. The composite GIS results in improved discrimination as indicated by the p-value (p=10-5, Figure 7b) and a perfect AUC value of 1.0 for the derivation group. The composite vaf (17.3%) associated with this pattern is evaluated as the sum of the individual vafs (24%, 7%) modified by the squares of the linear coefficients. However, note that this model achieved higher discrimination by attributing a disproportionately high coefficient (in comparison to vaf values) to the less significant PC4. Even higher discrimination within the derivation group may be achieved using additional components with the same reservation. In our case there was only slight improvement by adding PC3 (p=3×10-6) (Figure 7c) and no improvement by including all four PCs (p=10-5) because of the non-discriminant capacity of PC2. All of the combined patterns had perfect ROC characteristics (AUC=1 and sensitivity=1, specificity=1 for a low Z-score threshold). However, these values are specific to the derivation group. The validity of the final biomarkers has to be verified in prospective evaluation as performed here for an independent group of 22 patients (15M/7F, 57y ± 9y sd; disease duration 10y± 4y sd) and 22 controls (4M/18F, 55y ± 15y sd) (Figure 7d). Although significantly higher differentiation was achieved in the derivation group by combining more than one PC, the same relative advantage was not maintained in the test group although all four combinations performed well. The first PC demonstrated higher mean differentiation in the prospective group data (p=5×10-8, AUC=0.95) than the combined patterns PC1_3_4(p=2×10-7, AUC=0.95) and PC1_2_3_4 (p=6×10-7, AUC=0.92) attributed to its innate validity. A minor advantage was achieved by the additive PC1_4 (p=3.2×10-8, AUC=0.96). Although the prospective AUC value was slightly higher for PC1_4, the sensitivity for a specificity of 0.95 appears to decrease with additive PCs from a value of approximately 0.8 for PC1 to 0.5 for PC1_2_3_4. Clearly more extensive sampling would be necessary to more accurately predict these values judging from the increasing irregularity of the ROC plot. In addition, higher accuracy could have been achieved by using a larger derivation group as demonstrated in previous publications. However, it is apparent from these displays that the same discriminant accuracy attained in the derivation group does not always generalize to independent group evaluation, necessitating prospective validation of the derivation patterns.
Thus, our original dataset of 20 subject images was reduced to one essential PC image pattern PC1 that performed well as a diagnostic biomarker of disease in 44 prospective subjects.
Figure 1. Schematic of SSM/PCA Modeling Strategy. Normal and patient scanner data is processed by the SSM/PCA algorithms to derive a neuroimaging biomarker and associated subject scores. The mean difference between patient and normal subject scores is significant. Click here to view larger figure.
Figure 2. Preprocessing and Masking of Data. Raw scanner images are spatially normalized to map voxels to a common stereotaxic space for all subjects and smoothed using a Guasian kernel. The masking operation removes ventricular space and noise.
Figure 3. This default 0/1 gray matter mask was created from the processing of 72 normal subjects. Individual spatially normalized data was thresholded at 38% of their maximum value to create individual masks that were multiplied together. The multiplicative mask was flipped left to right through the origin and multiplied with the original to create the final symmetrized mask.
Figure 4. Flow Path of SSM/PCA Processing. a) Subject vector formulation of data matrix. Log transformation. Row centering. Evaluation of the GMP vector. b) Column centering. Derivation of SRP. Construction of the covariance matrix C. c) Principal Component Analysis. Derivation of PC score vectors and voxel patterns. d) Prospective single subject score calculation. Click here to view larger figure.
Figure 5. Software. To perform SSM/PCA the user can select the voxel based SSM/PCA option from the ScAnVP menu, select patient image files and normal controls, a preferred mask and then process. Log transformation is performed by default. After reviewing output and screen displayed plots and images, the user can choose to linearly combine selected PCs using logistic regression or other models to create a biomarker with higher patient/normal score discrimination. Click here to view larger figure.
Figure 6. Scree Plot. This Scree plot of sequential eigenvalues of the non-normalized subject by subject covariance matrix resulted from the SSM/PCA analysis of 10 PD patients and 10 normal controls. Notice the sharp drop in the eigenvalues after the first PC. Differences between eigenvalues are small after the 4th PC.
Figure 7. SSM/PCA program output. a) Top: Axial display (Z=0) of the first four principal component images (PC1 to 4). Bottom: Orthogonal views through origin (X,Y,Z=0,0,0) of PC1 and combined PCs 1 and 4 (PC1_4) displayed over a structural MRI image. PC voxel values represent positive and negative Z-score deviations relative to the mean voxel loading. Hot colors indicate relative increases in metabolic activity within the PC’s contribution to the overall SRP whereas cold colors indicate relative metabolic decreases. For PC’s that discriminate patients from controls these deviations can be attributed to disease. b) bar graphs and scatterplots of the derivation subject Z-scores of PC1 and PC4. Only the first principal component significantly discriminates patients from controls (p=0.0002) while the fourth demonstrates a trend (p=0.13). The linear logistic combination of PC1 and PC4 (coefficients .78, .63) improves discrimination (p=0.00001). The ROC AUC value and the sensitivity for a specificity of 0.95 are also displayed for each pattern. The combined pattern PC1_4 demonstrates perfect separation at a Z-score threshold of 0.9 (AUC=1). c) Orthogonal views of combined PC1_3_4 and PC1_2_3_4 and corresponding subject bar graphs for derivation subject Z-scores. Discrimination increased (p=3×10-6 and 10-5, respectively) compared to that of PC1 for both of these different combinations. AUC characteristics indicated perfect separation in both cases (AUC=1, sensitivity=1, specificity=1). See note in legend 7a regarding the color scale. d) Prospective score evaluation of patterns PC1, PC1_4, PC1_3_4 and PC1_2_3_4 on 22 normal controls and 22 PD patients. Bar and scatter-plot graphs as well as the ROC plot for subject Z-scores are displayed and AUC values as well as the sensitivity at a specificity of 0.95 is indicated. The AUC values and sensitivity appear to decrease for more than two combined principal components while p-values tend to become less robust. An insignificant improvement was noted for PC1_4 in AUC and p values invalidating the significant difference predicted in the derivation sample. Click here to view larger figure.
The SSM/PCA model originally presented by Moeller et al.4 has evolved1-3 into a straightforward and robust technique for the analysis of neuroimaging data. However, there have been ambiguities in the application of this methodology that we have attempted to clarify here and in previous publications5-7,10. Some of these issues have been addressed in the text but are reemphasized here because of their importance. As detailed in the Introduction, SSM/PCA is primarily effective in resting state FDG PET data, but has also been successful using other single volume imaging techniques and scanning platforms including H215O PET and 99mTc-ECD SPECT, as well as perfusion ASL and volumetric VBM MRI. Its successful application to time-series rsfMRI necessitates the future development of greatly enhanced techniques48.
The main strength of SSM lies in its ability to identify sources of variation in the data and to separate them into spatially orthogonal components. In most studies, only a few significant components are attributable to disease effects, thus dramatically reducing the relevant data set. To identify disease-specific components, we ideally include equal numbers of disease subjects with gender and age-matched healthy control subjects in a combined group analysis2,5. In this situation, deviations from normal values due to disease are generally small in absolute terms and obscured by larger global factors that do not discriminate patients from controls2,5,6. PCA is an effective analytical tool when the distribution of data is near-Guassian49. The SSM procedure performs log transformation which basically converts the data into a log normal distribution50 and separates multiplicative factors into additive terms. Subsequent subject mean centering (row mean subtraction) is a necessary step that removes any subject global scaling factors from the log data. This implies that preliminary subject global mean normalization has no effect in SSM even though it is generally a necessary preprocessing step in analytical neuroimaging procedures. Similarly, pre-normalization by any regional mean has no effect.
The mean centered log images are averaged over subjects to generate the GMP image that can be considered an invariant characteristic image of similar cohorts. The subsequent column centering over subjects removes regional means (GMP) to create subject residual SRP images. The SRP vectors are entered into the PCA analysis to generate a set of orthogonal PC patterns and reference group scores for each PC. In prospective single subject score evaluation, group analysis is not necessary; the subject’s SRP is evaluated by the subtraction of the subject mean and reference group GMP and the subject’s score is then determined as the scalar inner product of the SRP vector and the prederived PC pattern vector. This score usually falls within the range of the reference scores provided that the new data is obtained in a similar manner as in the original derivation. In a different setting, the PC vector may still be a valid pattern even for a different modality imager but to obtain an interpretable score, a recalibration procedure must be performed to evaluate new reference scores. The recalibration procedure does not require PCA analysis but is a simple determination of single subject scores for a complete new cohort of patients and controls using the predetermined pattern and a newly determined GMP.
It should be noted that even though a specific modality may reflect certain physiological parameters, derived SSM/PCA patterns cannot be interpreted as absolute measures of such quantities. Firstly, they are derived from the interregional covariance of voxel differences from mean values. These differences may result from various network functions and may represent composite effects. Hence, the PCA pattern is a mathematical construct that may reflect covariance within a single network or more complex processes. By performing correlation analysis of pattern scores with clinical measures of disease severity or normal psychosomatic measures, and by anatomically identifying highly weighted pattern areas, we can indirectly interpret the significance of these patterns. Furthermore, composite PC patterns are constructed as linear combinations of PCs with coefficients derived using logistic regression, with Akaike or other information theoretic criteria imposed on the corresponding score values. Although these may result in better discrimination, they may incorporate less significant components with disproportionately higher coefficients. Thus, covariance topographies designed for accurate differential diagnosis of clinically similar “look-a-like” conditions (“diagnostic biomarkers”) may be composites of complex physio-chemical processes, not necessarily corresponding to easily interpretable physical networks13. By contrast, other patterns may be optimal for capturing the topographical hallmarks of specific disease manifestations such as parkinsonian tremor or cognitive dysfunction15,51,52, or for monitoring disease progression7,16. The specific objective of the biomarker’s function should be considered in making a final selection.
It is essential to validate a potential biomarker by evaluating prospective subject scores in independent cohorts10. Preferably, cross-validation should be initially performed in the derivation sample to determine the reliability of the pattern topography on a voxel-by-voxel basis and to exclude potential outlier effects. To this end, bootstrapping procedures8,9 are routinely implemented. This approach requires iterative pattern derivation in resampled derivation data to identify the most robust voxels, i.e. those with the lowest standard deviation in voxel weight. By optionally masking the original data to remove less reliable voxels, higher discrimination can be achieved for a rederived pattern that must then be validated in prospective data sets.
As previously noted, once an SSM/PCA biomarker has been established it can be applied to prospective individual or group data. Derived scores can be used to monitor a patient’s status by gauging network progression in longitudinal imaging data, or to evaluate treatment effects. In complex cases of undetermined diagnosis, sets of subject scores can be entered in logistic regression analysis (or in other discriminant function models) to distinguish between different alternatives12,13. This approach can be vitally important given the high rate of clinical misdiagnosis in individuals with early disease, and the varying prognosis and treatment outcome associated with the different underlying pathologies. As a final point, SSM pattern-based group score evaluation can be a valuable tool in clinical trials of experimental therapies for brain disorders15,53,54.
The authors have nothing to disclose.
This work was supported by Grant No. P50NS071675 (Morris K. Udall Center of Excellence in Parkinson’s Disease Research at The Feinstein Institute for Medical Research) to D.E. from the National Institute of Neurological Disorders and Stroke. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of Neurological Disorders and Stroke or the National Institutes of Health. The sponsor did not play a role in study design, collection, analysis and interpretation of data, writing of the report or in the decision to submit the paper for publication.
Name of Equipment | Company | Catalog Number | Comments |
Image Acquisition | |||
PET Scanner | GE Medical Systems | GE Advance | Any PET, PET/CT and PET/MRI Scanners from GE, Siemens and Philips |
PC Workstations | Lenovo | Any | http://www.lenovo.com/us/en/ |
Radiopharmaceuticals | |||
[18F]-fluorodeoxyglucose | Feinstein Institute for Medical Research | Routine Production | Also distributed by Cardinal Health http://www.cardinal.com/ |
Software | |||
ScanVP | Feinstein Institute for Medical Research | Version 5.9.1, Version 6.2, To be released | www.feinsteinneuroscience.org |
SPM | The UCL Institute of Neurology | spm99-spm8 | http://www.fil.ion.ucl.ac.uk/spm |
Windows | Microsoft | Any | |
Matlab | Mathworks | Matlab Version 7.0, 7.3 | http://www.mathworks.com/ |
JMP | SAS | Version 5 | http://www.jmp.com/ |