Large-Scale Network Analysis of the Cerebrospinal Fluid Proteome Reveals Molecular Signatures of Frontotemporal Lobar Degeneration

Analysis of Large-Scale Network Study of the CSF Proteome in FTLD — Unlocking Molecular Signatures of Neurodegenerative Diseases

I. Academic Background and Motivation

Frontotemporal Lobar Degeneration (FTLD) is one of the most common causes of early-onset dementia (under age 65), triggering a range of progressive behavioral, language, and even motor impairments, and significantly threatening patients’ quality of life while imposing huge societal and economic burdens. Although the molecular mechanisms of FTLD are being gradually unveiled, current understanding of the intrinsic driving forces behind its pathological progression and the in vivo detectable biomarkers remains very limited. Commonly used molecular biomarkers in the clinic, such as Neurofilament Light Chain (NFL) or Alzheimer’s disease markers, primarily serve as nonspecific indicators of neurodegeneration and fail to fully reflect the complex molecular pathology of FTLD.

The pathogenic mechanisms of FTLD are highly heterogeneous, dominated primarily by two pathological subtypes: the aggregation of abnormally phosphorylated tau protein (FTLD-Tau subtype) and the accumulation of Transactive Response DNA-binding Protein 43 kDa (TDP-43, FTLD-TDP subtype). In familial FTLD cases, the most common causative genes are C9orf72, GRN (Progranulin), and MAPT (microtubule-associated protein tau). Direct relatives carrying susceptible mutations provide a unique opportunity for tracking the disease’s molecular mechanism and developing biomarkers.

In the field of neurodegenerative diseases, the large-scale, high-throughput development of proteomics has greatly promoted the discovery of novel molecular biomarkers and underlying disease mechanism networks. However, unbiased, large-scale proteomic analysis of the cerebrospinal fluid (CSF) from living FTLD patients has thus far progressed slowly.

The core motivation of this study is to leverage large-scale proteomic analysis techniques to reveal the CSF protein expression networks associated with FTLD progression, mining for “molecular signature clusters” (molecular signatures) that can serve as early diagnostic markers and therapeutic targets. The study also conducts multi-cohort, multi-platform, and multi-disease cross-validation of these discoveries to empower research on neurodegenerative disease biomarkers and pathological mechanisms as a whole.

II. Source of the Paper and Author Information

This paper, titled “Large-scale network analysis of the cerebrospinal fluid proteome identifies molecular signatures of frontotemporal lobar degeneration,” was written by Rowan Saloner (corresponding author, email: rowan.saloner@ucsf.edu) and his team. The authors are from several neurological research centers, including the University of California, San Francisco (UCSF). The paper was published in June 2025 in the prestigious journal Nature Aging.

III. Research Design and Complete Procedure

1. Overall Research Strategy

This study adopts an innovative, large-scale, network-based analysis of the CSF proteome to systematically map the molecular changes in familial FTLD, and utilizes a series of independent cohorts and disease models for cross-validation to ensure the universality and biological reproducibility of the discovered molecular mechanism networks. The main steps are as follows:

  • Sample Recruitment and Grouping: A total of 116 familial FTLD pathogenic mutation carriers (C9orf72 n=47, GRN n=32, MAPT n=37) and 39 family non-carrier controls were enrolled.
  • Proteomics Profiling: Using the aptamer-based SOMAscan platform, each CSF sample was subject to high-throughput quantitative analysis of 4,138 proteins.
  • Network Construction: Weighted Gene Correlation Network Analysis (WGCNA) was employed to separate the high-dimensional protein data into 31 functional modules, which were annotated for biological function by GO enrichment and cell type enrichment.
  • Association Analysis and Stratified Validation: Module expression was correlated with multiple indicators, such as clinical severity (CDR+NACC-FTLD score), imaging-based frontotemporal volume, NFL levels, and cognitive decline rate, to characterize disease progression at the molecular level.
  • Cross-Cohort/Platform/Disease Validation: Results were replicated in independent cohorts—PSP (progressive supranuclear palsy-Richardson syndrome) and BioFINDER-2 (containing FTLD, AD, and controls)—using both SOMAscan and Olink platforms, and compared with published AD and PD proteome network data.

2. Study Subjects, Samples, and Laboratory Procedures

(1) Sample Recruitment and Clinical Data

  • Familial FTLD Cohort: 116 pathogenic mutation carriers and 39 family controls. Genotypes included C9orf72, GRN, and MAPT mutations. Clinical assessment showed that 47% were asymptomatic carriers. Baseline data such as age, gender, and education years were adjusted to ensure comparability among groups.
  • Independent Replication Cohorts: PSP-RS (n=39, 4RTNI cohort); BioFINDER 2 (FTLD n=29, AD n=87, control n=248).

(2) CSF Proteomics Profiling

  • SOMAscan Platform: Utilizing Slow Off-rate Modified Aptamer (SOMAmer) technology, 4,138 CSF proteins were quantified with high sensitivity. Each experimental step incorporated randomization and blinding.
  • Olink Platform: Used in cross-validation cohorts for proteomic profiling, ensuring traceability and cross-platform reproducibility.

(3) Network Analysis and Algorithmic Innovation

  • WGCNA Algorithm: After data preprocessing, elimination of outliers, and multivariate regression adjustment, WGCNA clustering was used to integrate protein expression into 31 functional modules, each annotated by GO and cell-type-specific markers.
  • Synthetic Eigenprotein: For cross-cohort/platform validation, synthetic eigenproteins were calculated for different cohorts based on the original module’s membership, indirectly evaluating the network’s transferability.
  • Differential Expression and ROC Analysis: Differentially expressed proteins within and across modules were analyzed, and ROC curves were used to assess their discriminative power for FTLD subtypes.
  • Modeling the Relationship Between Cognitive Trajectories and Protein Modules: Annual cognitive change rates were measured by linear mixed-effects models and related to the expression strength of each module and its hub proteins.

(4) Cross-Validation and Functional Overlap Analysis

  • Module preservation analysis: The PSP-RS cohort used original SOMAscan data to reconstruct the WGCNA network and assess the structural conservation of CSF modules between familial FTLD and PSP.
  • Functional module overlap analysis: Cross-comparison with AD, PD, and other neurodegenerative disease proteome networks, using overrepresentation analysis (ORA) and Fisher’s exact test to quantify module overlap.

IV. Detailed Explanation of Main Research Results

1. Construction of Protein Co-expression Networks and Phenotypic Associations

  • A total of 31 protein co-expression modules were identified (each with 48–360 members); 28 modules had principal function annotation (e.g., RNA splicing, synapse, ECM, protein degradation, ion transport, etc.).
  • Disease progression-linked modules mainly included RNA splicing (m26 spliceosome), presynapse (m2 presynapse), synapse assembly/axon (m28 synapse assembly/axon), and autophagy (m22 autophagy).

2. Molecular Signature Clusters Correlate with Clinical Severity, Structural Loss, and NFL

  • m26 spliceosome (RNA splicing): Significantly increased in the CSF of symptomatic carriers and positively correlated with functional impairment (ρ=0.41); it was also negatively correlated with NFL levels and frontotemporal volume, suggesting this molecular pathway has a key role in FTLD progression.
  • m2, m28, m3 (synapse-related modules): All decreased in symptomatic carriers, and changed in concert with cognitive preservation, structural integrity, and NFL levels, indicating that reduction of synaptic proteins is a molecular signature of cognitive decline.
  • m9 ion transport: Early decrease was observed in some asymptomatic carriers (C9orf72 and MAPT mutations), suggesting that changes in this molecular pathway may reflect neurodevelopmental imbalance in a predictive manner.

3. Stratified Analysis and Key Molecular Features of Each Genetic Subtype

  • C9orf72 and GRN mutation carriers exhibited the most prominent changes in m26 (RNA splicing) and m22 (autophagy).
  • MAPT mutation carriers had elevated m29 (ECM, rich in microglial markers) and m4 (complement/coagulation), reflecting the unique metabolic impact of Tau protein.
  • Key module proteins such as NPTX2, CNTNAP2, NLNG1/2, and TMEM106B serve as molecular hub proteins for cognitive and structural damage.

4. Large-Scale Differential Protein Expression and Discriminative Performance

  • In the m26 module, members such as TRA2B, TMPO, and HNRNPAB were significantly differentially expressed (especially in C9orf72 and GRN).
  • ROC analysis showed that multi-marker protein panels dramatically improved discrimination of FTLD subtypes (AUC 0.88–0.97), e.g., GRN group — GRN protein itself, MAPT group — NPTX2, C9orf72 group — TMPO, etc.

5. Relationship Between Cognitive Decline Trajectories and Protein Modules/Hub Proteins

  • m29 (ECM) and m26 (RNA splicing) enriched modules negatively correlated with cognitive decline rates, whereas changes in synaptic modules (m2, m3, m28) related closely to cognitive preservation.
  • Hub proteins such as NPTX2 (m2), HNRNPA1 (m26), and TMEM106B (m3) occupy central positions in the modular network and hold potential value for early warning and intervention.

6. Generalization and Specificity Validation Across Cohorts, Platforms, and Diseases

  • The PSP-RS cohort’s WGCNA network and the familial FTLD network’s 31 modules were highly consistent, all with Zsummary >10 and key modules showing the same directional changes. This suggests that tau-mediated familial and sporadic FTLD share similar molecular signature clusters.
  • In the BioFINDER 2 cohort, using the Olink platform to reconstruct protein modules, FTLD vs. AD and control separation was markedly more effective than that between AD and controls, and certain modules (e.g., m26, m2, m28, m22) remained available across platforms.
  • Correlation analysis with AD and PD datasets revealed that the overall protein changes in FTLD and PD (r=0.46–0.56) were higher than with AD (r=0.14–0.21), both mainly showing decreases in neuron injury/autophagy modules. m26 was found to be the most FTLD-specific module.

7. Structural Overlap of Protein Networks Across Diseases and Platforms

  • FTLD modules and the AD proteome network were highly overlapping, mainly in neuron, oligodendrocyte, ECM, and immune modules.
  • m26 spliceosome only partially overlapped with the translation-related modules in the AD network, suggesting that its specific expression is more marked in FTLD.

V. Conclusions and Significance of the Study

This study systematically unraveled the molecular networks of the FTLD-related CSF proteome, clarifying multiple molecular pathways—RNA splicing disorders, synaptic protein reduction, ECM remodeling, and autophagy dysfunction—and their links to FTLD progression and cognitive impairment. It was the first to demonstrate that RNA splicing proteins can be dynamically detected in FTLD body fluids and may predict cognitive decline and structural loss in advance, with particular early-warning value in GRN mutation carriers. The study also shows that sporadic tauopathy (PSP) and familial MAPT mutant FTLD share highly homologous molecular signatures, and that cross-platform and cross-disease network structures are broadly conserved in pathways such as immunity and the synapse.

Scientific and applied value includes: - Providing a more systematic set of fluid molecular biomarkers for early diagnosis of FTLD, enriching the toolkit for molecular classification and disease monitoring. - Revealing both shared and unique molecular pathways across diseases (FTLD/AD/PD), paving the way for precision diagnostics and differential interventions. - Integrating network science with proteomics offers a new paradigm for understanding the complex biological processes of proteinopathies. - Core proteins such as NPTX2, TMEM106B, etc., provide a basis for future drug discovery and targeted therapies.

VI. Highlights and Innovations of the Study

  • Profiling the largest-scale FTLD CSF proteome to date (>4,000 proteins), comprehensively characterizing molecular signature clusters.
  • Multi-cohort, multi-center, multi-platform, and multi-disease model cross-validation ensures the robustness and broad applicability of the protein network.
  • Innovatively reconstructing the fluid biomarker system using protein co-expression network/module analysis, significantly surpassing traditional single-molecule analyses.
  • First verification that RNA splicing abnormalities are detectable at the fluid (CSF) level, providing a new tool for FTLD molecular subtyping and early detection.
  • The deep integration of molecular networks with multidimensional indicators such as cognitive decline and structural damage enhances clinical translational value.

Through this study, the research framework for FTLD and other neurodegenerative diseases shifts from a “single-point probe” to a “network interpretation,” greatly expanding the potential for understanding disease molecular underpinnings and translational diagnostics and therapeutics. This achievement suggests that CSF proteomic networks will become a key foundation for future precision neurological disease diagnosis and treatment.