Skip to main content

Structural connectome alterations between individuals with autism and neurotypical controls using feature representation learning

Abstract

Autism spectrum disorder is one of the most common neurodevelopmental conditions associated with sensory and social communication impairments. Previous neuroimaging studies reported that atypical nodal- or network-level functional brain organization in individuals with autism was associated with autistic behaviors. Although dimensionality reduction techniques have the potential to uncover new biomarkers, the analysis of whole-brain structural connectome abnormalities in a low-dimensional latent space is underinvestigated. In this study, we utilized autoencoder-based feature representation learning for diffusion magnetic resonance imaging-based structural connectivity in 80 individuals with autism and 61 neurotypical controls that passed strict quality controls. We generated low-dimensional latent features using the autoencoder model for each group and adopted an integrated gradient approach to assess the contribution of the input data for predicting latent features during the encoding process. Subsequently, we compared the integrated gradient values between individuals with autism and neurotypical controls and observed differences within the transmodal regions and between the sensory and limbic systems. Finally, we identified significant associations between integrated gradient values and communication abilities in individuals with autism. Our findings provide insights into the whole-brain structural connectome in autism and may help identify potential biomarkers for autistic connectopathy.

Introduction

Autism spectrum disorder is a pervasive condition that occurs during development. Individuals with autism show deficits in sensory processing and social communication skills [1, 2]. To identify the pathological and behavioral associations of autism, previous neuroscience studies have investigated alterations in large-scale functional brain networks [3,4,5] and abnormalities in microcircuit functions, such as excitation/inhibition imbalances [6,7,8,9,10]. Studies have suggested that autism is associated with altered macroscale functional brain organization, as well as atypical neural circuits and cognitive functions.

Recent neuroimaging studies based on magnetic resonance imaging (MRI) have adopted dimensionality reduction techniques to study high-dimensional connectome data with multiple low-dimensional eigenvectors [11, 12]. The core of these techniques is the generation of low-dimensional features that sufficiently represent whole-brain connectome data by estimating the principal axes of the brain. Functional and microstructural MRI studies have applied dimensionality reduction techniques to connectivity data and observed cortical hierarchical patterns expanding from low-level sensory to higher-order default mode networks [12,13,14]. Moreover, these techniques have been adopted to investigate structural and functional connectome disorganization in individuals with autism, which consistently suggests altered connectivity in the sensory and default mode regions [3, 5]. In addition to these connectome manifold approaches, feature representation learning based on deep learning techniques is a notable method for generating representative latent features from the original data. In particular, an autoencoder reduces high-dimensional data through multiple nonlinear operations and generates latent vectors in a hidden bottleneck layer. These low-dimensional features are then used to reconstruct the original data [15, 16]. The generated latent features can be used to develop disease diagnosis models; for example, to classify healthy controls and patients with Alzheimer’s disease [17, 18] or schizophrenia [19, 20]. However, one challenge of the autoencoder is the interpretation of latent vectors. To address this issue, a recent study introduced an integrated gradient technique that computes the contribution of the input data to predict features in the hidden layer [21]. Thus, the extent to which the structural connectivity contributes to predicting the low-dimensional latent vectors can be calculated. Here, we hypothesized that the low-dimensional latent features derived from the autoencoder model might contribute differently to the reconstruction of the original connectome data between typically and atypically developing brains.

In this study, we investigated the structural network disorganization in individuals with autism using autoencoder-based feature representation learning. First, we constructed autoencoders to reconstruct the structural connectivity of neurotypical controls and individuals with autism, and generated low-dimensional latent features for each group. Next, the integrated gradient approach was used to assess the contribution of the features in reconstructing the original data, and the integrated gradient values of the control and autism groups were compared. Finally, we evaluated the associations between the symptom severity of autism, measured using the Autism Diagnostic Observation Schedule (ADOS), and the integrated gradient values.

Method

Study participants

We obtained T1-weighted and diffusion MRI data from three independent sites (New York University Langone Medical Center [NYU], Trinity College Dublin [TCD], and San Diego State University [SDSU]) of the Autism Brain Imaging Data Exchange-II (ABIDE-II) database [22]. We included the sites that (i) included children and adults with autism and neurotypical controls, with ≥ 10 individuals per group, (ii) who had T1-weighted and diffusion MRI available, (iii) sufficient MRI data quality (i.e., scanned with 3 T scanner). Of the 178 participants, 141 participants, including 61 neurotypical controls (mean ± standard deviation [SD] age = 13.2 ± 4.0 years) and 80 individuals with autism (12.1 ± 4.9 years), were included in the study (Table 1). Individuals with autism were diagnosed with ADOS [23] and/or Autism Diagnostic Interview-Revised [24], and the neurotypical controls did not have any history of mental illness. ABIDE data collection was performed in accordance with the local Institutional Review Board guidelines. In accordance with the Health Insurance Portability and Accountability Act (HIPAA) guidelines and 1000 Functional Connectomes Project/INDI protocols, all ABIDE datasets were fully anonymized, with no protected health information included.

Table 1 Demographic information of study participants

MRI data acquisition

T1-weighted and diffusion MRI from three independent sites, NYU, TCD, and SDSU, were scanned with 3 T Siemens Allegra, 3 T Philips Achieva, and 3 T GE MR7550 scanners, respectively. At the NYU site, T1-weighted images were acquired using a 3D magnetization-prepared rapid acquisition gradient echo (MPRAGE) sequence (repetition time [TR] = 2530 ms; echo time [TE] = 3.25 ms; inversion time [TI] = 1,100 ms; flip angle [FA] = 7˚; matrix size = 256 × 192; and voxel size = 1.3 × 1.0 × 1.3 mm3). Diffusion MRI data were obtained using a 2D spin-echo echo-planar imaging (SE-EPI) sequence (TR = 5200 ms; TE = 78 ms; matrix size = 64 × 64; voxel size = 3 mm3 isotropic; 64 directions; b-value = 1000 s/mm2; and 1 b0 image). At the TCD site, T1-weighted data were obtained using a 3D MPRAGE sequence (TR = 8,400 ms; TE = 3.90 ms; TI = 1,150 ms; FA = 8˚; matrix = 256 × 256; and voxel size = 0.9 mm3 isotropic). The diffusion MRI data were acquired using a 2D SE-EPI (TR = 20,244 ms; TE = 7.9 ms; matrix size = 124 × 124; voxel size = 1.94 × 1.94 × 2 mm3; 61 directions; b-value = 1,500 s/mm2; and 1 b0 image). At the SDSU site, the T1-weighted images were acquired using a 3D standard fast spoiled gradient echo (SPGR) sequence (TR = 8.136 ms; TE = 3.172 ms; TI = 600 ms; FA = 8˚; matrix size = 256 × 192; and voxel size = 1 mm3 isotropic). Diffusion MRI data were obtained using a 2D SE-EPI sequence (TR = 8,500 ms; TE = 84.9 ms; matrix size = 128 × 128; voxel size = 1.875 × 1.875 × 2 mm3; 61 directions; b-value = 1000 s/mm2; and 1 b0 image).

MRI data preprocessing and structural connectivity construction

T1-weighted MRI data were preprocessed using a conventional recon-all process in FreeSurfer [25]. The process included gradient non-uniformity correction, non-brain tissue removal, intensity normalization, tissue segmentation, and surface reconstruction. The cortical surfaces were then topology corrected and inflated. Subsequently, a spherical registration to the fsaverage template space was performed. Diffusion MRI data were preprocessed using MRtrix3 [26], which corrected for susceptibility distortions, head motion, and eddy currents. Based on probabilistic tractography, we constructed structural connectomes from the preprocessed diffusion MRI data. Different tissue types, including cortical and subcortical gray matter, white matter, and cerebrospinal fluid, were defined from the T1-weighted data using anatomically-constrained tractography [27] and registered onto the native diffusion MRI space with boundary-based registration. Multishell and multitissue response functions were estimated [28], and constrained spherical deconvolution and intensity normalization were performed [29]. A tractogram was generated with 40 million streamlines, with a maximum tract length of 250, and a fractional anisotropy cutoff of 0.06. Then, spherical-deconvolution informed filtering of tractograms (SIFT2) was applied to reconstruct whole-brain streamlines weighted by cross-section multipliers [30]. The streamlines were mapped onto the Schaefer atlas with 200 parcels [31] and log-transformed to generate a structural connectivity matrix.

Feature representation learning based on autoencoder

We selected an autoencoder model to generate low-dimensional latent features from the input structural connectivity matrix [15, 16]. We controlled for age, sex, and site from the structural connectivity matrix using a linear regression model and entered the controlled data into the autoencoder by considering the left and right hemispheres separately. The autoencoder reconstructs the original data via encoding and decoding processes as follows:

$$z=E\left(x\right)={\text{tanh}}\left(Wx+b\right)$$
(1)
$$y=D\left(z\right)={\text{tanh}}\left(Wz+b\right)=D\left(E\left(x\right)\right)$$
(2)
$$L\left(x,y\right)=\sum \frac{{\left|x-y\right|}^{2}}{n}=\sum \frac{{\left|x-D\left(E\left(x\right)\right)\right|}^{2}}{n}$$
(3)

where E is the encoder that generates the latent feature z from the input data \(x\), with weight \(W\) and bias \(b\). Then, the latent feature \(z\) is entered into the decoder \(D\) to reconstruct the input data and generate \(y\). \(L\left(x,y\right)\) is the loss function defined by the sum of the mean square errors between the input \(x\) and output \(y\), and the model is trained to minimize the \(L\left(x,y\right)\). Our autoencoder model consists of four encoder layers, one bottleneck layer, and four decoder layers (Fig. 1A). The layers of the encoder and decoder had 7700; 5500; 2930; and 900 units, respectively, and the bottleneck layer contained 200 units. A dropout rate of 0.3 was applied to the input layer, and the hypertangent activation function was used for all layers. An average stochastic gradient descent optimizer [32] with a learning rate of 0.00008 was used. We constructed an autoencoder model for autism and neurotypical control groups, respectively. The data were randomly divided into the training (autism/control = 45 /33), validation (n = 20/16), and test (n = 15/12) datasets. Among the 500 epochs, we selected the weights that exhibited the minimum loss in the training and validation datasets and applied them to the test dataset. The performance of the model was assessed using the test dataset by calculating Pearson’s correlation between the original and reconstructed structural connectivity matrices. We repeated this process 100 times with different training, validation, and test datasets to minimize the subject selection bias. As the sensitivity analysis, we assessed the reconstruction performance of the autoencoder by changing the hyperparameters of the network architecture. First, we changed the number of units to (i) 7000; 5000; 2930; and 900, and (ii) 8000; 5000; 1000; and 500. Second, we changed the dropout rate to 0.1 and 0.5. Third, we constructed the architecture using five layers by including one additional layer with 4,400 units between layers 2 and 3. In addition, we constructed the architecture with three layers by removing layer 4. Lastly, we varied the learning rate to 0.0001 and 0.001.

Fig. 1
figure 1

The autoencoder model of neurotypical controls and individuals with autism. A A schematic of the autoencoder model consisting of four encoder, one bottleneck, and four decoder layers. The diffusion tractography-based structural connectivity (SC) is entered into the autoencoder. B The upper panels show the loss graphs of training and validation datasets according to epochs for neurotypical controls (left) and individuals with autism (right). The scatter plots on the bottom represent correlations between the actual and reconstructed structural connectivity of the test data

Contribution of latent features and between-group differences

We hypothesized that individuals with autism and neurotypical controls might show different reconstruction performances and that these differences may be associated with altered connectivity among different brain regions. We quantitatively assessed the extent of the contribution of structural connectivity while reconstructing the original data using the integrated gradient technique [21]. Briefly, it computes the attribution of each element of the structural connectivity matrix to predict the latent features of the bottleneck layer by progressively increasing the intensity of the input values from a zero-information baseline to a particular intact input level and averaging the attributions. The integrated gradient from \({i}^{{\text{th}}}\) neuron is defined as follows:

$$I{ntegrated \, Gradient }_{i}\left(x\right)=\left({x}_{i}-{m}_{i}\right)\times \underset{\alpha =0}{\overset{1}{\int }}\frac{\delta f\left(m+\alpha \left({x}-{m}\right)\right)}{\delta {x}_{i}}d\alpha,$$
(4)

where \(x\) is the input, \(m\) is the baseline, and \(\alpha\) the interpolation constant. This can be represented by a summation using the Riemann approximation of the integral as follows:

$$I{ntegrated \, Gradient }_{i}\left(x\right)=\left({x}_{i}-{m}_{i}\right)\times \frac{1}{M}{\sum }_{k=1}^{M}\frac{\delta f\left(m+\frac{k}{M}\left({x}-m\right)\right)}{\delta {x}_{i}},$$
(5)

where m and M are the number of steps in the scaled feature perturbation constant and the approximation of the integral, respectively. Thus, it provides information on how each element of the connectivity matrix contributes significantly to the encoding processes. We compared the integrated gradient values of each individual between individuals with autism and neurotypical controls after the z-normalization of these values. Between-group differences were assessed using two-sample t-tests with 1000 permutation tests by randomly assigning group indices. A null distribution was constructed, and if the real t-statistic did not belong to the 5% of the null distribution, it was considered significant. The p-values were corrected using a false discovery rate (FDR) < 0.05 [33]. To assess network-level differences, we summarized the t-statistic values based on seven intrinsic functional communities as follows: visual, somatomotor, dorsal attention, ventral attention, limbic, frontoparietal, and default mode networks [34]. We additionally assessed the effects of between-group differences using the structural connectivity instead of the integrated gradient values between neurotypical controls and individuals with autism.

Symptom severity associations

To determine whether the integrated gradient values were associated with the symptom severity of autism measured by the ADOS, which included the social cognition, communication, and repeated behavior/interest sub-scores, as well as the total score [3], we adopted canonical correlation analysis (CCA) [35]. CCA finds a canonical coordinate space that maximizes the correlations between the projections of different datasets onto the space [36]. Here, we projected the integrated gradient values (X) and ADOS scores (Y) onto each dimension of the canonical space and obtained the canonical components u and v such that these components had maximum correlations. The optimal number of canonical components was determined using a five-fold cross-validation. For each cross-validation, we selected components that showed significant (FDR < 0.05) correlations between u and v. The explained variance assesses how much of the variance of the data is explained by each component and is defined as follows:

$$Explained \, Variance \left(x\right)= \frac{Var\left\{x\right\}-Var\{x- \widehat{x}\}}{Var\{x\}},$$

where x is the original data, and \(\widehat{x}\) is the predicted data.

Additionally, we estimated the explained variance of each component and summarized the explained variance of the integrated gradients within and between the networks using seven functional communities [34]. Furthermore, we assessed the multivariate associations between the ADOS scores and structural connectivity data or the latent vectors extracted from the bottleneck layer of the autoencoder to evaluate which feature is more useful for explaining the symptoms of autism. The symptom severity association analysis was performed using the data from NYU and TCD sites because the SDSU site did not provide the ADOS score.

Result

The autoencoder model and reconstruction performance

The structural connectivity matrix was entered into an autoencoder and trained to reconstruct the original data (Fig. 1A). We observed decreasing loss values in both the training and validation datasets across epochs (Fig. 1B). The weights of the latest epoch that exhibited the best performance were applied to the test dataset. The reconstruction performance based on the correlations between the original and reconstructed connectivity matrix was significant for control (mean ± SD across individuals and 100 bootstraps, r = 0.427 ± 0.165, p < 0.001) and autism groups (autism: r = 0.271 ± 0.133, p = 0.023; Fig. 1B). When we tested the reconstruction performance with different settings of hyperparameters, the performances were lower than our model (Additional file 1: Table S1).

Between-group differences in the integrated gradients

The integrated gradient method was applied to assess the contribution of the structural connectivity to predict latent features of the hidden layer (Fig. 2A). We found that within the visual network and between the limbic and frontoparietal networks showed particularly large attributions in neurotypical controls, while within the limbic network and between the limbic and frontoparietal networks showed large effects in individuals with autism. We assessed between-group differences in the integrated gradient values and summarized the t-statistic values of the connections that showed significant between-group differences according to the seven functional communities [34]. We observed high effects within the default mode and frontoparietal networks, and between the visual and frontoparietal/ventral attention networks as well as between the somatomotor and limbic networks (Fig. 2B). The small effects were observed within the visual network and between the somatomotor and dorsal attention networks. Furthermore, we examined between-group differences in the structural connectivity between neurotypical controls and individuals with autism, and only two elements (visual-default mode and frontoparietal-default mode networks) showed significant effects, suggesting lower sensitivity of the structural connectivity in assessing connectome distortions in individuals with autism than integrated gradient values.

Fig. 2
figure 2

Between-group differences in the integrated gradient values. A Shown is the schema of the integrated gradient technique (left). We summarized the integrated gradient values of neurotypical controls and individuals with autism based on seven functional communities (right). B We compared the integrated gradient values between individual with autism and neurotypical controls after the z-normalization. We assessed two-sample t-tests with 1000 permutation tests by shuffling group indices. We then applied a false discovery rate (FDR) < 0.05. The t-statistics of the between-group differences are shown (left). Within- and between-network effects are plotted based on the t-statistics of the between-group differences (right). The highest effects of between-network were shown in red lines. The high effects were observed within the default mode and frontoparietal networks, and between the visual and frontoparietal/ventral attention networks as well as between the somatomotor and limbic networks

Associations between symptom severity and the integrated gradients

We assessed the associations between the integrated gradient values and symptom severity of autism measured by the ADOS [23] using CCA [35]. Five-fold cross-validation showed that the optimal number of canonical components was three. Each canonical component showed significant correlations (1st: r = 0.726, p < 0.001; 2nd: r = 0.732, p < 0.001; 3rd: r = 0.647, p < 0.001; Fig. 3). When we estimated the explained variance, the ADOS communication sub-score was associated with the integrated gradient values within the default mode network and between the somatomotor-visual/limbic/frontoparietal networks. These findings suggest that sensory and transmodal (i.e., default mode and frontoparietal) regions may be critically associated with communication skills in autism. When we conducted CCA using structural connectivity, the correlations between the canonical components and ADOS scores showed lower performance than when we used the integrated gradient values. In addition, the correlations based on the latent vectors did not show significant associations (Additional file 1: Table S2). Together, the findings indicate that the CCA analysis based on the integrated gradient values is more beneficial than using structural connectivity or latent vectors.

Fig. 3
figure 3

Canonical correlation analysis between symptom severity of autism and the integrated gradient values. Correlations between the canonical components u and v (left) are shown. Circle plots represent the explained variance of ADOS total, social cognition, communication, and repeated behavior/interest scores (middle). The size and color of the circles indicate the magnitude of the explained variance of each ADOS score. The explained variance of the integrated gradients of within- and between-networks is shown with spider plots (right). The ADOS communication sub-score, within the default mode network, and between the somatomotor-visual/limbic/frontoparietal networks showed relatively high explained variance. Abbreviations: ADOS, Autism Diagnostic Observation Schedule; T, total; S, social cognition; C, communication; R, repeated behavior/interest

Discussion

Understanding whole-brain structural connectome disorganization in individuals with autism may complement previous functional findings; however, studies identifying principal markers related to autism pathophysiology are relatively underinvestigated owing to the lack of interpretability of deep learning techniques. In this study, we systematically investigated the structural connectome abnormalities in individuals with autism using feature representation learning combined with an integrated gradient approach. We observed that the low-dimensional features of structural connectivity within the transmodal regions, including the default mode and frontoparietal networks, and between the sensory and limbic systems, were altered in individuals with autism. Additionally, we found that these features were associated with the communication abilities of individuals with autism, suggesting their clinical implications. Our findings provide an understanding of the atypical structural connectivity in individuals with autism and its association with their clinical phenotypes.

The core of our study is feature representation learning, which generates low-dimensional latent features [15, 16]. In neuroscience, the gradient approach, which estimates low-dimensional eigenvectors from connectome data, is widely adopted to investigate the whole-brain connectome organization of brain structure and function [12, 37,38,39,40,41,42,43] and their relations [44,45,46]. However, these studies were primarily based on nodal-level analysis and did not consider the interconnected links between nodes. To fill this gap, we generated latent vectors from the connectivity matrix using an autoencoder model to assess edge-level effects. However, a crucial point to consider when using deep learning-based models is the uncertainty of interpretation. Here, we opted for the integrated gradient method, which calculates the contribution of the input features in predicting the latent vectors [21]. We found that feature contributions differed between individuals with autism and neurotypical controls, particularly in connectivity within the transmodal regions and between the sensory and limbic networks. Sensorimotor and default mode networks show connectome idiosyncrasies and a decrease in the number of neurotransmitter receptors in individuals with autism [3, 4, 47]. In addition, abnormal structural connectomes in these systems are associated with excitation/inhibition imbalances in autism [5]. Our work expands upon prior studies by providing insights into the understanding of low-dimensional representations of structural connectivity in autism.

To assess the behavioral associations of the low-dimensional features of whole-brain structural connectivity in individuals with autism, we used a multivariate association technique called CCA. Unlike conventional association analyses based on linear correlations or regression analyses, CCA determines the canonical coordinate space that maximizes the correlation between independent and dependent variables. We found that the integrated gradient values within the default mode region and between the sensory and transmodal regions were highly associated with communication skills in individuals with autism. The sensory and transmodal regions are involved in the perception and processing of language and nonverbal information [48, 49]. Individuals with autism show decreased activation in the inferior frontal cortex [50], anterior insula, and premotor cortex [51], and these patterns are associated with alterations in social information integration [52]. Behavioral studies have also found that individuals with autism show lower verbal and nonverbal abilities than neurotypical controls [53, 54]. Together, these studies suggest that altered connectome organization in sensory and transmodal areas may be related to communication and social impairments in individuals with autism [55], and that the integrated gradient values derived from the autoencoder model could serve as an indicator for describing the symptom severity of autism.

In this study, we identified structural connectivity differences in individuals with autism using feature representation learning combined with integrated gradient techniques, which may provide potential biomarkers for the diagnosis of autism. However, this study has several limitations. First, our study had a small sample size owing to the nature of the ABIDE database and strict quality control procedures. In the future, additional data must be collected from independent databases to improve the reliability of our findings. Second, we used the integrated gradient method to interpret the internal operations of the autoencoder. In future studies, we need to link integrated gradient data to biologically plausible mechanisms for better neuroscientific insights. Finally, the estimated integrated gradient values may be variable according to the baseline. Although the zero-informed baseline is commonly used for the image data, it should be noted that the different baselines may yield varying results and thus need careful interpretation.

Availability of data and materials

Imaging and phenotypic data were provided, in part, by the Autism Brain Imaging Data Exchange initiative (ABIDE-II; https://fcon_1000.projects.nitrc.org/indi/abide). Codes for the autoencoder and integrated gradients are available at https://github.com/CAMIN-neuro/caminopen/tree/master/Feature_representation_learning_autism.

References

  1. Christensen DL, Braun KVN, Baio J, Bilder D, Charles J, Constantino JN, et al. Prevalence and characteristics of autism spectrum disorder among children aged 8 years—autism and developmental disabilities monitoring network, 11 sites, United States, 2012. MMWR Surveill Summ. 2018;65:1–23.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Mottron L, Dawson M, Soulières I, Hubert B, Burack J. Enhanced perceptual functioning in autism: an update, and eight principles of autistic perception. J Autism Dev Disord. 2006;36:27–43.

    Article  PubMed  Google Scholar 

  3. Hong S-J, Vos de Wael R, Bethlehem RAI, Lariviere S, Paquola C, Valk SL, et al. Atypical functional connectome hierarchy in autism. Nat Commun. 2019;10:1022.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Nunes AS, Peatfield N, Vakorin V, Doesburg SM. Idiosyncratic organization of cortical networks in autism spectrum disorder. Neuroimage. 2019;190:182–90.

    Article  PubMed  Google Scholar 

  5. Park B, Hong S-J, Valk SL, Paquola C, Benkarim O, Bethlehem RAI, et al. Differences in subcortico-cortical interactions identified from connectome and microcircuit models in autism. Nat Commun. 2021. https://doi.org/10.1038/s41467-021-21732-0.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Jou RJ, Jackowski AP, Papademetris X, Rajeevan N, Staib LH, Volkmar FR. Diffusion tensor imaging in autism spectrum disorders: preliminary evidence of abnormal neural connectivity. Aust N Z J Psychiatry. 2011;45:153–62.

    Article  PubMed  Google Scholar 

  7. Sohal VS, Rubenstein JLR. Excitation-inhibition balance as a framework for investigating mechanisms in neuropsychiatric disorders. Mol Psychiatry. 2019;24:1248–57.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Lee E, Lee J, Kim E. Excitation/inhibition imbalance in animal models of autism spectrum disorders. Biol Psychiatry. 2017;81:838–47.

    Article  PubMed  Google Scholar 

  9. Nair A, Treiber JM, Shukla DK, Shih P, Müller R-A. Impaired thalamocortical connectivity in autism spectrum disorder: a study of functional and anatomical connectivity. Brain. 2013;136:1942–55.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Nelson SB, Valakh V. Excitatory/inhibitory balance and circuit homeostasis in autism spectrum disorders. Neuron. 2015;87:684–98.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  11. Haak KV, Marquand AF, Beckmann CF. Connectopic mapping with resting-state fMRI. Neuroimage. 2018;170:83–94.

    Article  PubMed  Google Scholar 

  12. Margulies DS, Ghosh SS, Goulas A, Falkiewicz M, Huntenburg JM, Langs G, et al. Situating the default-mode network along a principal gradient of macroscale cortical organization. Proc Natl Acad Sci. 2016;113:12574–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Park B, Park H, Morys F, Kim M, Byeon K, Lee H, et al. Inter-individual body mass variations relate to fractionated functional brain hierarchies. Commun Biol. 2021;4:735.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Vos de Wael R, Benkarim O, Paquola C, Lariviere S, Royer J, Tavakol S, et al. BrainSpace: a toolbox for the analysis of macroscale gradients in neuroimaging and connectomics datasets. Commun Biol. 2020. https://doi.org/10.1038/s42003-020-0794-7.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Hinton GE, Salakhutdinov RR. Reducing the dimensionality of data with neural networks. Science. 1979;2006(313):504–7.

    Google Scholar 

  16. Vincent P, Larochelle H, Bengio Y, Manzagol P-A. Extracting and composing robust features with denoising autoencoders. Proceedings of the 25th international conference on Machine learning - ICML ’08. New York, New York, USA: ACM Press; 2008. p. 1096–103.

  17. Suk H-I, Lee S-W, Shen D. Latent feature representation with stacked auto-encoder for AD/MCI diagnosis. Brain Struct Funct. 2015;220:841–59.

    Article  PubMed  Google Scholar 

  18. Suk H-I, Wee C-Y, Lee S-W, Shen D. State-space model with deep learning for functional dynamics estimation in resting-state fMRI. Neuroimage. 2016;129:292–307.

    Article  PubMed  Google Scholar 

  19. Zeng L-L, Wang H, Hu P, Yang B, Pu W, Shen H, et al. Multi-site diagnostic classification of schizophrenia using discriminant deep learning with functional connectivity MRI. EBioMedicine. 2018;30:74–85.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Kim J, Calhoun VD, Shim E, Lee J-H. Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: Evidence from whole-brain resting-state functional connectivity patterns of schizophrenia. Neuroimage. 2016;124:127–46.

    Article  PubMed  Google Scholar 

  21. Sundararajan M, Taly A, Yan Q. Axiomatic attribution for deep networks. arXiv; 2017. arXiv:1703.01365.

  22. Di Martino A, O’Connor D, Chen B, Alaerts K, Anderson JS, Assaf M, et al. Enhancing studies of the connectome in autism using the autism brain imaging data exchange II. Sci Data. 2017. https://doi.org/10.1038/sdata.2017.10.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Lord C, Risi S, Lambrecht L, Cook EH Jr, Leventhal BL, DiLavore PC, et al. The autism diagnostic observation schedule—generic: a standard measure of social and communication deficits associated with the spectrum of autism. J Autism Dev Disord. 2000;30:205–23.

    Article  CAS  PubMed  Google Scholar 

  24. Lord C, Rutter M, Le Couteur A. Autism diagnostic interview-revised: a revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. J Autism Dev Disord. 1994;24:659–85.

    Article  CAS  PubMed  Google Scholar 

  25. Fischl B. FreeSurfer. Neuroimage. 2012;62:774–81.

    Article  PubMed  Google Scholar 

  26. Tournier J-D, Smith R, Raffelt D, Tabbara R, Dhollander T, Pietsch M, et al. MRtrix3: a fast, flexible and open software framework for medical image processing and visualisation. Neuroimage. 2019;202: 116137.

    Article  PubMed  Google Scholar 

  27. Smith RE, Tournier J-D, Calamante F, Connelly A. Anatomically-constrained tractography: improved diffusion MRI streamlines tractography through effective use of anatomical information. Neuroimage. 2012;62:1924–38.

    Article  PubMed  Google Scholar 

  28. Christiaens D, Reisert M, Dhollander T, Sunaert S, Suetens P, Maes F. Global tractography of multi-shell diffusion-weighted imaging data using a multi-tissue model. Neuroimage. 2015;123:89–101.

    Article  PubMed  Google Scholar 

  29. Jeurissen B, Tournier J-D, Dhollander T, Connelly A, Sijbers J. Multi-tissue constrained spherical deconvolution for improved analysis of multi-shell diffusion MRI data. Neuroimage. 2014;103:411–26.

    Article  PubMed  Google Scholar 

  30. Smith RE, Tournier J-D, Calamante F, Connelly A. SIFT2: enabling dense quantitative assessment of brain white matter connectivity using streamlines tractography. Neuroimage. 2015;119:338–51.

    Article  PubMed  Google Scholar 

  31. Schaefer A, Kong R, Gordon EM, Laumann TO, Zuo X-N, Holmes AJ, et al. Local-global parcellation of the human cerebral cortex from intrinsic functional connectivity MRI. Cerebral Cortex. 2018;28:3095–114.

    Article  PubMed  Google Scholar 

  32. Stich SU, Cordonnier J-B, Jaggi M. Sparsified SGD with memory. In: 32nd conference on neural information processing systems. 2018. p. 1–12.

  33. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B (Methodol). 1995;57:289–300.

    Google Scholar 

  34. Thomas Yeo BT, Krienen FM, Sepulcre J, Sabuncu MR, Lashkari D, Hollinshead M, et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. J Neurophysiol. 2011;106:1125–65.

    Article  PubMed Central  Google Scholar 

  35. Hardoon DR, Szedmak S, Shawe-Taylor J. Canonical correlation analysis: an overview with application to learning methods. Neural Comput. 2004;16:2639–64.

    Article  PubMed  Google Scholar 

  36. Bilenko NY, Gallant JL. Pyrcca: regularized kernel canonical correlation analysis in python and its applications to neuroimaging. Front Neuroinform. 2016;10:49.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Bethlehem RAI, Paquola C, Seidlitz J, Ronan L, Bernhardt B, Consortium C-C, et al. Dispersion of functional gradients across the adult lifespan. Neuroimage. 2020;222: 117299.

    Article  PubMed  Google Scholar 

  38. Mckeown B, Strawson WH, Wang H-T, Karapanagiotidis T, Vosde Wael R, Benkarim O, et al. The relationship between individual variation in macroscale functional gradients and distinct aspects of ongoing thought. Neuroimage. 2020;220: 117072.

    Article  PubMed  Google Scholar 

  39. Murphy C, Wang H-T, Konu D, Lowndes R, Margulies DS, Jefferies E, et al. Modes of operation: a topographic neural gradient supporting stimulus dependent and independent cognition. Neuroimage. 2019;186:487–96.

    Article  PubMed  Google Scholar 

  40. Kharabian Masouleh S, Plachti A, Hoffstaedter F, Eickhoff S, Genon S. Characterizing the gradients of structural covariance in the human hippocampus. Neuroimage. 2020;218: 116972.

    Article  PubMed  Google Scholar 

  41. Paquola C, Vos De Wael R, Wagstyl K, Bethlehem RAI, Hong S-J, Seidlitz J, et al. Microstructural and functional gradients are increasingly dissociated in transmodal cortices. PLoS Biol. 2019;17: e3000284.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. Park B, Hong S-J, Valk SL, Paquola C, Benkarim O, Bethlehem RAI, et al. Differences in subcortico-cortical interactions identified from connectome and microcircuit models in autism. Nat Commun. 2021;12:2225.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Park B, Bethlehem RA, Paquola C, Larivière S, Rodríguez-Cruces R, Vos de Wael R, et al. An expanding manifold in transmodal regions characterizes adolescent reconfiguration of structural connectome organization. Elife. 2021;10:e64694.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Larivière S, Vos de Wael R, Hong S-J, Paquola C, Tavakol S, Lowe AJ, et al. Multiscale structure-function gradients in the neonatal connectome. Cereb Cortex. 2020;30:47–58.

    Article  PubMed  Google Scholar 

  45. Park B, Vosde Wael R, Paquola C, Larivière S, Benkarim O, Royer J, et al. Signal diffusion along connectome gradients and inter-hub routing differentially contribute to dynamic human brain function. Neuroimage. 2021;224: 117429.

    Article  PubMed  Google Scholar 

  46. Vázquez-Rodríguez B, Suárez LE, Markello RD, Shafiei G, Paquola C, Hagmann P, et al. Gradients of structure–function tethering across neocortex. Proc Natl Acad Sci. 2019;116:21219–27.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Oblak AL, Gibbs TT, Blatt GJ. Reduced GABAA receptors and benzodiazepine binding sites in the posterior cingulate cortex and fusiform gyrus in autism. Brain Res. 2011;1380:218–28.

    Article  CAS  PubMed  Google Scholar 

  48. Hiser J, Koenigs M. The multifaceted role of the ventromedial prefrontal cortex in emotion, decision making, social cognition, and psychopathology. Biol Psychiatry. 2018;83:638–47.

    Article  PubMed  Google Scholar 

  49. Hocking J, Price CJ. The role of the posterior superior temporal sulcus in audiovisual processing. Cereb Cortex. 2008;18:2439–49.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Hadjikhani N, Joseph RM, Manoach DS, Naik P, Snyder J, Dominick K, et al. Body expressions of emotion do not trigger fear contagion in autism spectrum disorder. Soc Cogn Affect Neurosci. 2009;4:70–8.

    Article  PubMed  PubMed Central  Google Scholar 

  51. Grèzes J, Wicker B, Berthoz S, de Gelder B. A failure to grasp the affective meaning of actions in autism spectrum disorder subjects. Neuropsychologia. 2009;47:1816–25.

    Article  PubMed  Google Scholar 

  52. Groen WB, Tesink C, Petersson KM, van Berkum J, van der Gaag RJ, Hagoort P, et al. Semantic, factual, and social language comprehension in adolescents with autism: an FMRI study. Cereb Cortex. 2010;20:1937–45.

    Article  CAS  PubMed  Google Scholar 

  53. Freitag CM, Kleser C, von Gontardf A. Imitation and language abilities in adolescents with autism spectrum disorder without language delay. Eur Child Adolesc Psychiatry. 2006;15:282–91.

    Article  PubMed  Google Scholar 

  54. Lam YG, Yeung SSS. Towards a convergent account of pragmatic language deficits in children with high-functioning autism: depicting the phenotype using the pragmatic rating scale. Res Autism Spectr Disord. 2012;6:792–7.

    Article  Google Scholar 

  55. Klin A, Saulnier CA, Sparrow SS, Cicchetti DV, Volkmar FR, Lord C. Social and communication abilities and disabilities in higher functioning individuals with autism spectrum disorders: the Vineland and the ADOS. J Autism Dev Disord. 2007;37:748–59.

    Article  PubMed  Google Scholar 

Download references

Funding

Dr. Bo-yong Park was supported by the National Research Foundation of Korea (NRF-2021R1F1A1052303; NRF-2022R1A5A7033499) and the Institute for Information and Communications Technology Planning and Evaluation (IITP), funded by the Korean Government (MSIT) (No. 2022-0-00448, Deep Total Recall: Continual Learning for Human-Like Recall of Artificial Neural Networks; No. RS-2022-00155915, Artificial Intelligence Convergence Innovation Human Resources Development (Inha University)). Drs. Hyunjin Park and Bo-yong Park were jointly supported by the IITP funded by the Korean Government (MSIT) (No. 2021-0-02068, Artificial Intelligence Innovation Hub) and the Institute for Basic Science (IBS-R015-D1).

Author information

Authors and Affiliations

Authors

Contributions

Y.J. and B.P. designed the study, analyzed data, and wrote the manuscript. H.C. and S.Y. aided in performing the experiments. H.P. reviewed the manuscript. B.P. is the corresponding author of this study and is responsible for the integrity of data analysis.

Corresponding author

Correspondence to Bo-yong Park.

Ethics declarations

Ethics approval and consent to participate

ABIDE data collection was performed in accordance with the local Institutional Review Board guidelines. In accordance with the Health Insurance Portability and Accountability Act (HIPAA) guidelines and 1000 Functional Connectomes Project/INDI protocols, all ABIDE datasets were fully anonymized, with no protected health information included.

Competing interests

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Reconstruction performance of the autoencoder with different hyperparameter settings. The performance was assessed by calculating Pearson’s correlation between the actual and reconstructed structural connectivity of the test data. Table S2. Correlations between the canonical components of the ADOS scores and various features.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jang, Y., Choi, H., Yoo, S. et al. Structural connectome alterations between individuals with autism and neurotypical controls using feature representation learning. Behav Brain Funct 20, 2 (2024). https://doi.org/10.1186/s12993-024-00228-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12993-024-00228-z

Keywords