Using machine intelligence to uncover Alzheimers disease progression heterogeneity

Bessi Qorri; Mike Tsay; Abhishek Agrawal; Rhoda Au; Joseph Geraci

doi:10.37349/emed.2020.00026

Open Access

Original Article

Using machine intelligence to uncover Alzheimer’s disease progression heterogeneity

Affiliation:

¹Department of Biomedical and Molecular Sciences, Queen’s University, Kingston, ON K7L 3N6, Canada

^†These authors contributed equally to this work.

ORCID: https://orcid.org/0000-0003-4984-7299

Bessi Qorri ^1†

Affiliation:

²NetraMark Corp, Toronto, ON M4E 1G8, Canada

Mike Tsay ²,

Affiliation:

³GSK, Philadelphia, PA 19112, USA

Abhishek Agrawal ³,

Affiliation:

⁴Department of Anatomy & Neurobiology, Neurology and Epidemiology, Boston University Schools of Medicine and Public Health, Boston, MA 02218, USA

ORCID: https://orcid.org/0000-0001-7742-4491

Rhoda Au ⁴

Affiliation:

²NetraMark Corp, Toronto, ON M4E 1G8, Canada

⁵Department of Pathology and Molecular Medicine, Queen’s University, Kingston, ON K7L 3N6, Canada

^†These authors contributed equally to this work.

Email: geracij@queensu.ca

ORCID: https://orcid.org/0000-0003-0967-2164

Joseph Geraci ^2,5†*

Explor Med. 2020;1:377–395 DOI: https://doi.org/10.37349/emed.2020.00026

Received: August 02, 2020 Accepted: November 12, 2020 Published: December 31, 2020

Academic Editor: Derek M. Dykxhoorn, University of Miami Miller School of Medicine, USA

The article belongs to the special issue Digital Biomarkers: The New Frontier for Medicine and Research

Abstract

Aim: Research suggests that Alzheimer’s disease (AD) is heterogeneous with numerous subtypes. Through a proprietary interactive ML system, several underlying biological mechanisms associated with AD pathology were uncovered. This paper is an introduction to emerging analytic efforts that can more precisely elucidate the heterogeneity of AD.

Methods: A public AD data set (GSE84422) consisting of transcriptomic data of postmortem brain samples from healthy controls (n = 121) and AD (n = 380) subjects was analyzed. Data were processed by an artificial intelligence platform designed to discover potential drug repurposing candidates, followed by an interactive augmented intelligence program.

Results: Using perspective analytics, six perspective classes were identified: Class I is defined by TUBB1, ASB4, and PDE5A; Class II by NRG2 and ZNF3; Class III by IGF1, ASB4, and GTSE1; Class IV is defined by cDNA FLJ39269, ITGA1, and CPM; Class V is defined by PDE5A, PSEN1, and NDUFS8; and Class VI is defined by DCAF17, cDNA FLJ75819, and SLC33A1. It is hypothesized that these classes represent biological mechanisms that may act alone or in any combination to manifest an Alzheimer’s pathology.

Conclusions: Using a limited transcriptomic public database, six different classes that drive AD were uncovered, supporting the premise that AD is a heterogeneously complex disorder. The perspective classes highlighted genetic pathways associated with vasculogenesis, cellular signaling and differentiation, metabolic function, mitochondrial function, nitric oxide, and metal ion metabolism. The interplay among these genetic factors reveals a more profound underlying complexity of AD that may be responsible for the confluence of several biological factors. These results are not exhaustive; instead, they demonstrate that even within a relatively small study sample, next-generation machine intelligence can uncover multiple genetically driven subtypes. The models and the underlying hypotheses generated using novel analytic methods may translate into potential treatment pathways.

Keywords

Machine learning, genetic subtypes, disease heterogeneity, drug repurposing, augmented intelligence, machine intelligence, artificial intelligence, target discovery

Introduction

Alzheimer’s disease (AD) is the most common form of dementia, contributing to 60–70% of dementia cases [1]. This neurodegenerative disease is characterized by neuronal cell damage and concomitant cognitive and functional decline, predominantly affecting older individuals, with two-thirds being women, and prevalence is expected to continue to rise as the population ages [2–4]. There is currently no definitive cure to prevent or attenuate the progression of this debilitating disease. Research efforts aimed at disease modification have focused on the amyloid and tau pathways as significant contributors of AD pathology due to excessive deposition of β-amyloid (Aβ) peptides and hyperphosphorylated tau proteins contributing to DNA and RNA damage [5–7]. However, none of the currently clinically approved AD drugs are disease-modifying therapies (DMTs) and instead broadly target AD symptoms [8]. Despite over 100 agents in the current AD treatment pipeline, the last AD drug approved by the U.S. Food and Drug Administration (FDA) was memantine, an N-methyl-D-aspartate (NMDA) receptor AD antagonist, in 2003 [9, 10]. While the Chinese FDA recently approved the clinical use of oligomannate (GV-971), international drug trials are underway to confirm results and validate use outside of China (NCT03715114, NCT02986529, NCT02293915) [11]. Due to gaps in our understanding of AD etiology and the complex interactions between genomic and environmental factors that lead to disease heterogeneity, a multimodal approach towards precision medicine is necessary.

There are currently very few consistently reported susceptible risk loci associated with AD. Early-onset Alzheimer’s disease (EOAD), which follows a Mendelian inheritance pattern, is primarily associated with mutations in one of three genes–amyloid precursor protein (APP), presenilin-1 (PSEN1), and presenilin-2 (PSEN2) [12]. However, late-onset Alzheimer’s disease (LOAD), which accounts for over 95% of AD cases, is associated with a more complex genomic makeup. To date, apolipoprotein E (APOE), a lipid carrier involved in cholesterol metabolism, is the strongest genetic risk factor for LOAD. Specifically, the APOE ε4 allele has been reported to have a lower affinity for lipoproteins and poorly binds Aβ [13]. Genome-wide association studies (GWAS) have identified several other susceptibility loci that confer AD risk to varying degrees that can be broadly categorized into those involved in immunity, lipid homeostasis, cytoskeletal interactions, endocytosis, and apoptosis [8, 14–16].

Machine learning (ML) efforts allow for a more systems-level approach that considers complex genetic interactions to reveal critical insights into disease etiology and identifying new drug targets [17]. While there has been extensive research using ML models to classify AD risk, discriminate between AD and mild cognitive impairment (MCI), and predict MCI-to-AD conversion based on structural and functional magnetic resonance imaging (MRI), positron emission tomography (PET) scans, and cerebrospinal fluid (CSF), there is less known about genetic subtypes within the AD patient population [18–21]. A recent study revealed sex- and age-based AD subpopulations. There was only a moderate genetic correlation between younger (60–79 years old) and older (> 80 years old) age-at-onset AD subjects, suggesting that the polygenic architecture of AD is heterogeneous across age. However, stratified GWAS and polygenic variation analyses highlighted BIN1, OR2S2, and PICALM as having significant effects at a younger age [22]. Relative expression ordering (REO)-based gene expression profiling analyses revealed two distinct subtypes within AD patients–one in which differentially expressed genes overlapped with age-related genes and one related to neuroinflammation [23]. Since AD primarily affects older individuals, it is not surprising that memory-spared individuals were often younger and APOE ε4 negative compared to memory-impaired individuals [24]. Furthermore, in-depth latent class analysis (LCA) of subjects with AD dementia revealed eight cognitive subtypes associated with distinct demographical and neurobiological characteristics. For example, the memory spared moderate-visuospatial cluster was associated with younger age, APOE ε4 negative genotype, and prominent atrophy of the posterior cortex [25].

APOE ε4 allele frequency is consistently associated with more extensive AD-associated neuropathology and cognitive deficits [26]. It is evident that specific genetic variants, such as APOE ε4, significantly contribute to disease heterogeneity compared to other genetic variants. The polygenic risk score (PRS) determines the cumulative genetic risk for an individual. Adopting a single nucleotide polymorphism (SNP) and transcriptomic approach when considering the PRS more accurately captures the contribution of individual SNPs and differential gene expression [12, 27]. Incorporating these strategies will contribute to the shift towards accurate patient stratification and classification, bringing precision medicine closer to reality. Rather than developing therapies for population averages of a biologically heterogeneous disease such as AD, artificial intelligence (AI)-based algorithms can be utilized for more individually-tailored therapies [28].

Here, we utilized a suite of ML tools designed to learn from subject datasets to analyze gene expression data derived from postmortem controls vs. AD subjects. Importantly, these next-generation methods can learn from smaller datasets than is typically assumed as necessary with many ML approaches and can explain the driving variables, as will be explained below. The novelty of this work lies in the machine’s ability to discover unknown subpopulations that are defined by several genes at a time. These genes may be related to each other and the dependent variable, e.g., AD status, in non-linear ways. The ability of some of these methods to extract non-linear relationships from small data is an exceptional trait, which in combination with explainability and the ability to learn from small datasets, uncovers a new avenue of exploring patient populations. Collectively, these properties will equip researchers to redefine our understanding of disease heterogeneity and significantly move the needle forward on the precision treatment of disease.

Materials and methods

Dataset assembly and analysis

A public AD dataset (GSE84422) consisting of transcriptomic data of postmortem brain samples from 121 healthy controls and 380 AD subjects was assembled. The analysis carried out in this paper was based on only AD subjects. This was done intentionally to extract a refined vantage into AD beyond APOE findings, which are well established in the field.

A unique suite of ML methods was assembled due to their ability to extract subpopulations from high-dimensional data and their ability to provide explanations for the driving mechanisms behind the subpopulations [29, 30]. These methods include statistical measures of feature importance, ensemble methods, neural networks, and a novel system designed to work with patient population data [31]. We also describe in detail methods that were used, which are freely available to researchers.

Perspective analytics

A significant feature of these machine intelligence methods is their ability to see a patient population in numerous ways. To be more precise, there are various ways to model a group of AD subjects vs. control subjects. Different collections of genes will reveal different relationships amongst the samples and different subtypes of subjects. These different models are called ‘perspectives,’ and this approach is referred to as perspective analytics. Each perspective is learned by the machine and consists of a unique set of variables, with each variable having a different contribution. Different collections of variables are arrived at through a feature selection methodology that consists of univariate statistics and Random Forest cross-validation verifications [32, 33]. If an independent dataset is available, the perspective analytics algorithm uses it; otherwise, it must rely on leave out and cross-validation protocols to establish reliability and avoid overfitting. The machine is rewarded for finding groups of samples within the same perspective class with several variables simultaneously in common, making it semi-supervised [29]. The results provided in this paper were derived from models that must have at least a 75% cross-validation score. This machine intelligence utilizes geometric representation methods coupled with a fast learner. These methods were created specifically for use with smaller datasets; therefore, they are inherently designed to find statistically significant pure subpopulations of a given label rather than trying to find perfect models.

Validation

Since the perspective analytics methods [29] used cannot be revealed due to intellectual property concerns, we utilized the following methodology using techniques that are available to the public to help validate our results [34]. Our analysis only utilized gene expression data, i.e., counts, and passed the data through the following process:

1) Each sample had gene expression levels associated with them derived by Affymetrix technology (GPL96: Affymetrix Human Genome U133A Array, GPL97: Affymetrix Human Genome U133B Array, GPL570: Affymetrix Human Genome U133 Plus 2.0 Array).

2) Different normalizations were performed so that each gene expression was ranked, broken into quantiles, or relative log expression was used.

3) Using Random Forest and univariate variable reduction methods [32, 33] left us with 9, 060 variables. The dependent variable for cross-validation was based on a binary variable we created, which distinguished low dementia vs. high dementia (See the supplemental materials; 0 = low and 1= high).

4) We then conducted a principal components analysis as a linear unsupervised clustering method to reveal different subclasses.

5) The loadings from the principal components were utilized to reduce the variables of focus to 16 variables.

6) Using the t-SNE and UMAP algorithms, we were able to extract subpopulations.

7) We then collected the sample IDs from the clusters formed from these two clustering models, systematically compared each group with the other, and looked for statistically significant genes. Several statistics were explored, but in order to deal with non-normality, the Wilcoxon signed-rank test was used [35].

8) The resulting statistically significant genes revealed by this process became associated with the sample clusters, and we called these the cluster-associated genes.

9) A study of the protein interaction networks formed from the cluster-associated genes helped us interpret their physiological role in AD. These are what is later referred to as the six progression mechanisms.

For transparency, this process was used to verify the more efficient method that we also used and previously outlined [29]. The perspective analytics platform known as NetraAI utilizes a similar process to extract subpopulations; however, it allows for human interaction via a user interface, and the subpopulation discovery is based on mathematics that allows for a very refined set of sub-populations to be discovered and explored.

Results

We were able to derive six progression mechanisms (i.e., perspective classes) that represent the various ways that an individual may manifest an AD pathology (Figure 1). An individual can progress via a single class or any combination of the six classes, highlighting the complexity of the AD population and resulting in 63 possible combinations. However, there are likely even more mechanisms at play, including immune system function, which plays an important orchestration role, further contributing to the complex AD etiology.

Display full size

Figure 1. Perspective analytics for AD. Perspective analytics discovered a unique set of variables for each of the six different perspectives learned from an Alzheimer’s dataset. Within each set, there is a subgroup of subjects that are driven by the corresponding etiology

While it is evident that small datasets are not representative of the overall disease state, the significant occurrence of variables binding together subjects of the same Class can provide valuable insights with respect to precision medicine. The characteristics of each perspective class, each of which represents a novel avenue of the complex etiologies that drive neurodegeneration and cognitive aging, are highlighted in Table 1. Subjects that belong to more than one perspective class may be due to the overlapping components across some of the pathways implicated in each perspective class.

Table 1. Perspective classes, characteristic genes, and defining traits of AD patients

Perspective class	Number of subjects	Gene name	Gene symbol	Defining trait
I	156	Tubulin beta 1 class VI	TUBB1	Vasculogenesis
		Ankyrin repeat and SOCS box-containing protein 4	ASB4
		Phosphodiesterase 5A	PDE5A
II	164	Neuregulin-2	NRG2	Cell signaling and differentiation
II	164	Zinc-finger 3	ZNF3	Cell signaling and differentiation
III	84	Insulin-like growth factor 1	IGF1	Metabolism
		Ankyrin repeat and SOCS box-containing protein 4	ASB4
		G2 and S-phase expressed protein 1	GTSE1
IV	134		cDNA FLJ39269	Nitric oxide
		Integrin subunit alpha	ITGA1
		Carboxypeptidase M	CPM
V	76	Phosphodiesterase 5A	PDE5A	Mitochondrial
		Presenilin 1	PSEN1
		NADH-coenzyme Q reductase	NDUFS8
VI	228	DDB1 and CUL4 associated factor 17	DCAF17	Metal ion transport
			cDNA FLJ75819
		Solute carrier family 33 member 1	SLC33AI

Display full size

SOCS: suppressor of cytokine signaling; NADH: nicotinamide adenine dinucleotide; DDB1: DNA damage-binding protein 1; CUL4: cullin-4

Due to reports of AD having a greater prevalence and severity in women, we investigated whether certain classes were more prevalent in females than males (Table 2). Interestingly the metal ion transport class was the most common, with 65% of the female AD subjects and 61% of the male AD subjects progressing via this Class. Of the 266 female AD subjects, 49.6% progressed via the cell signaling class, compared to only 35.5% of the male subjects. Of the 90 male AD subjects, 58.9% fell under the vasculogenesis perspective class, compared to only 38.7% of the female subjects.

Table 2. Sex-based differences in perspective classes of AD progression

Sex	Perspective class
Sex	Vasculogenesis	Cell signaling	Metabolism	Nitric oxide	Mitochondrial	Metal ion transport
Female	103	132	60	104	44	173
Male	53	32	24	30	32	55
Total	156	164	84	134	76	228

Display full size

In the remainder of this study, we utilized protein interaction/expression and gene pathway networks to assist in the interpretation of the physiological components behind our findings. We input the resulting statistically significant genes for each subclass we discovered into GeneMania [36], which allowed us to extract what we refer to as the perspective classes summarized in Table 2.

Class I is identified by TUBB1, ASB4, and PDE5A. As the primary identifier of Class I, TUBB1 mutations are associated with enlarged rounded platelets and result in thrombocytopenia [37]. The strong link of AD to vascular diseases such as stroke and atherosclerosis suggests a crucial role for vascularization in this subpopulation. Interestingly, there appears to be a network between TUBB1, ASB4, and PDE5A, suggesting Class I represents a subpopulation defined by the regulation of vasculogenesis [38–40] (Figure 2).

Display full size

Figure 2. Gene interactions for Class I identifiers. Red represents physical interactions, purple represents co-expression, and green represents genetic interactions. Created using GeneMania [36].

NRG2 and ZNF3 define Class II. Neuregulins (NRGs) stimulate ErbB-receptor tyrosine phosphorylation that elicits different downstream signaling pathways such as MAPK, PI3, PKC, and the Janus kinase signal transducer and activator of transcription (JAK-STAT) pathways and are associated with synaptic plasticity [41, 42]. ZNF3 is a zinc-finger protein that is differentially expressed in AD and is involved in cell differentiation and proliferation [43]. Thus, we define Class II by cell signaling and differentiation.

Class III is defined by IGF1, ASB4, and GTSE1. Impairments in insulin/IGF1 signaling have been associated with AD [44, 45]. IGF1 is connected to ASB4 by IRS4 (insulin receptor substrate 4) and SOCS2 (Figure 3). In contrast, GTSE1 is a microtubule-associated protein that regulates G1/S cycle transition and microtubule stability [46]. Given the role of GTSE1 and TUBB1 on microtubule stability and formation and the shared presence of ASB4 as a descriptor in both Class I and Class III, these two classes may represent one larger overarching AD subpopulation that can be further stratified into microtubule formation and IGF1 pathway signaling. This is supported by the fact that the subjects the machine clustered together in the metabolism class have lower expression levels of IGF1 than those we classified as being members of the vasculogenesis class.

Display full size

Figure 3. Gene interactions for Class III identifiers. Red represents physical interactions, purple represents co-expression, orange represents predicted interactions, blue represents co-localization, aqua represents a shared pathway, green represents genetic interactions, and yellow represents shared protein domains. Created using GeneMania

Class IV is defined by cDNA FLJ39269, ITGA1, and CPM. cDNA FLJ39269 is most closely associated with GUCY1A3, which is dysregulated in AD [47, 48]. CPM is known to enhance nitric oxide (NO) output, playing a role in NO signaling under inflammatory conditions [49]. Due to the recurrent role of NO, we define Class IV with NO, despite ITGA1 not being involved in this signaling.

Class V is defined by PDE5A, PSEN1, and NDUFS8. Despite PSEN1 mutations being one of the most common causes of familial Alzheimer’s disease (FAD), PSEN1 defined Class V to a lesser extent than PDE5A. However, within this population, SYK (spleen tyrosine kinase) was found to be driving a difference within this group (Figure 4). The system is seeing the disease at multiple scales. NDUFS8 [NADH dehydrogenase (ubiquinone) Fe-S protein 8; NADH-coenzyme Q reductase] along with other genes involved in oxidative phosphorylation are decreased in AD [50]. Given the role of these genes in mitochondrial function and redox, Class V was defined as mitochondrial.

Display full size

Figure 4. Sub-map of the AD population of Class V. This sub-map (zoomed in) provides another facet of the Alzheimer’s population within this dataset. From one perspective, PDE5A and PSEN1 are driving the relationships between the subjects illustrated. However, SYK is driving the slight separation between loop 1 and 2

A map of samples in terms of how they compare to each other according to some non-linear metric via a set of genes is shown in Figure 4. Each point in this figure is a subject, and if they are near each other, it means that they are similar according to a set of genes. In this way, one can see non-linear relationships via a simple 2-dimensional representation, like in principle components analysis, except principle components only reveal linear relationships. Further, by using a looping feature with a mouse, one can query the machine in terms of what is driving the separation between groups of subjects. Looping triggers a statistical process to provide confidence in whatever variables are implicated. Thus, there are no axes, but instead, relative distances according to what is driving the heterogeneity of the subjects. This process is explained in detail elsewhere [29].

Class VI is defined by DCAF17, cDNA FLJ75819, and SLC33A1. DCAF17 (DDB1 and CUL4 associated factor 17) is a nuclear transmembrane protein associated with damaged DNA binding protein 1 ubiquitin ligase complex and is involved in iron accumulation [51]. Given the mitochondrial role of the primary identifier of Class VI, there may be some overlap in subjects in Class V and VI. Interestingly, 44 subjects fell under both Class V and Class VI (Figure 5). cDNA FLJ75819 is most similar to ZNF652, which is associated with metal protein. With these two genes in mind, Class VI appears to be defined by metal ion metabolism.

Display full size

Figure 5. Overlap of subjects in the metal ion transport and mitochondrial perspective classes. A complete list of the AD samples from the data we used, in addition to which classes they fall in, is available in the supplementary materials

Discussion

ML efforts in the field of Alzheimer’s genomic research have been primarily focused on discovering subjects at-risk for AD or with high MCI-to-AD conversion. This work has been increasingly focused on identifying genetic subtypes within the presumption of a heterogeneous AD population. The need to expand biomarker-based stratification within the AD population has been highlighted with as many as 30 altered transcriptional signatures found to distinguish AD samples from non-demented brain samples [52]. However, there are currently few predictions for AD-associated genes based on brain gene expression data alone. This study sought to develop a brain-specific gene interaction network to predict the potential AD association for every gene in the genome by integrating the relationship between each pair of AD-associated genes and the correlation coefficient of known AD-associated and -unassociated genes [53]. This genome-wide complement of AD candidate genes provides a precision medicine approach that can be used to explore AD mechanisms further and pave the path towards individualized novel treatments similar to what is already being done in cancer genomics.

Within the Class I identifiers, TUBB1 encodes part of one of the core protein families that heterodimerize and assemble to form microtubules [54]. The tubulin β-1 chain is the major β-tubulin isotype expressed in megakaryocytes and platelets. Mutations or absence of TUBB1 is associated with enlarged rounded platelets and result in thrombocytopenia [37]. TUBB1 has been reported to be downregulated in AD; thus, it is not surprising that taxanes and other microtubule-targeting drugs restore lost nerve signals in AD and other neurodegenerative diseases [55]. The second Class I identifier, ASB4, encodes a protein that degrades filamin B proteins and plays a role in vascular differentiation and insulin signaling [56]. Asb-4 is co-localized and interacts with IRS4 in hypothalamic neurons [57]. The Asb-4 and IRS4 interaction mediates the degradation of IRS4, which in turn decreases insulin signaling, implicating ASB4 in energy homeostasis [58]. Asb-4 has also been associated with the regulation of inflammation, angiogenesis, and apoptosis via interactions with factor inhibiting HIF-1α (FIH) and TNF-α [59, 60]. Phosphodiesterases (PDEs) are responsible for the hydrolysis of cyclic adenosine monophosphate (cAMP) and cyclic guanosine monophosphate (cGMP). PDE inhibition is involved in neurodegenerative processes due to the regulation of cAMP and cGMP [61]. PDE5 is a cGMP-specific PDE and is upregulated in AD subjects compared to age-matched healthy controls [62]. PDE5 inhibitors, such as Sildenafil, have been suggested as Alzheimer’s drugs, leading to vascular smooth muscle relaxation, vasodilation, improved cognition, and restoring memory function [63–65]. Collectively, the Class I identifiers support the proposition of AD as a vascular disorder [38–40].

We described Class II by cellular signaling and differentiation due to being identified by NRG2 and ZNF3. NRGs are a member of epidermal growth factor (EGF)-related proteins, which stimulate ErbB-receptor tyrosine phosphorylation that elicits different downstream signaling pathways such as MAPK, PI3K, PKC, and JAK-STAT pathways and are associated with synaptic plasticity [41, 42]. Neuregulin-2 (Nrg2) dysregulation has been associated with cancer, schizophrenia, and AD [41]. Neuregulin-1 (Nrg1) is the primary substrate for Beta-secretase 1 (BACE-1), which is the only β-secretase that generates Aβ peptides. Although Nrg1 and Nrg2 are highly homologous, it remains unclear whether Nrg2 is also a BACE-1 substrate [66]. However, ADAM10 and BACE-2 cleave Nrg2 to generate a C-terminal fragment that serves as a substrate for γ-secretase [67]. Little remains known about NRG2; however, other members of the NRG family, including the more widely reported NRG1 and less known NRG3, have both been speculated to be involved in AD and cognitive impairment [68, 69]. In line with the overlying cell differentiation theme of Class II, ZNF3 is a transcription factor involved in cell differentiation and proliferation. In a recent GWAS, ZNF3 has been associated with AD along with NDUFS3 and MTCH2 [70]. ZNF3 interacts with BAG3, which is involved in ubiquitin/proteasomal functions in protein degradation and is regulated by the upstream binding of BACH1, whose target genes have roles in the oxidative stress response and control of the cell cycle [71]. AD-associated tau has been identified as a BACH1 target, making it a potential AD target [72]. However, a clear link explaining this subpopulation remains to be identified and warrants further investigations.

We propose that Class III, which is defined by IGF1, ASB4, and GTSE1 to be classified by metabolism. Several studies have reported impaired insulin receptor/IGF1 receptor signaling in AD subjects with decreased receptor expression, suggesting that AD is brain-type diabetes [73]. However, the association between IGF1 and AD remains controversial [74, 75]. Low IGF1 serum levels are associated with aging, one of the significant risk factors for AD. This suggests that high IGF1 may protect against neurodegeneration [60]. Some studies report that IGF1 enhances the transport of Aβ-carrier proteins into the brain and promotes transport across the blood-brain barrier [76].

In contrast, other studies have shown that long-term suppression of IGF1R signaling alleviates AD progression, providing protection from neuroinflammation and memory impairments induced by Aβ oligomers [77]. A recent study identified that within APOE ε4 carriers, there is a threshold at which IGF1R stimulating activity becomes associated with dementia [78]. Thus, IGF1 expression and response to IGF1 signaling may present as a way to stratify AD subjects into different subtypes. One study suggests that IRS4 may be a negative regulator of IGF1 signaling by suppressing other IRS proteins [79]. Given this link, the IGF1 signaling pathway presents an interesting way to classify AD subpopulations. IRS4 is reported to be the most downregulated gene in the insulin signaling pathway, with IRS genes implicated in tau phosphorylation [80]. Asb-4 co-localization with IRS4 mediates IRS4 degradation, which in turn decreases insulin signaling [57]. Given this link, the IGF1 signaling pathway represents a unique classification of AD subpopulations, as increased ASB4 would promote decreased insulin signaling, as would IRS4 downregulation. Considering GTSE1, which encodes a microtubule-associated protein, as the third prevalent identifier for Class III, there is overlap between Class I and Class III. Both TUBB1 and GTSE1 are involved in microtubule stability and formation, and ASB4 defines both classes. Thus, these two classes may actually represent one larger subpopulation that can further be defined or stratified on the basis of insulin/insulin-like growth factor signaling.

Class IV was defined by cDNA FLJ39269, ITGA1, and CPM. As mentioned, GUCY1A3 is the most closely associated gene to cDNA FLJ39269. GUCY1A3 encodes for a subunit of the guanylyl cyclase, a key enzyme in the NO signaling pathway, which catalyzes the conversion of GTP to cGMP, which in turn regulates the activity of protein kinases, PDEs, and ion channels [81]. Furthermore, GUCY1A3 has been associated with vascular dementia [82]. GUCY1A3 mutations are associated with NO signaling disruption that leads to hypertension [83]. ITGA1 encodes the α1 subunit of integrin receptors, which heterodimerizes with the β1 subunit to form a cell-surface receptor for collagen and laminin [84]. More specifically, the α1β1 complex has been associated with mediating the Aβ neurotoxic effect, playing an essential role in initiating events that lead to neurite degeneration in the presence of Aβ [85]. ITGA1 is downregulated in neuroplastin 65 (NP65) knockout mice, which exhibit abnormal cognition and emotional disorders that resemble AD characteristics [86]. CPM is a carboxypeptidase for peptides and proteins involved in inflammation and neuropeptide processing and has been found to be downregulated in the lymphocytes of AD subjects [87]. CPM is known to enhance NO output, playing a role in NO signaling under inflammatory conditions [49]. The AD patient population is characterized by chronic inflammation in the brain and are increasingly susceptible to infections, suggesting a possible link between CPM and AD [88]. NO has been implicated in AD neurotoxicity as NO-dependent pathways have been reported to contribute to cognitive decline and neurodegeneration [89]. As an inflammatory disease, NO synthesis is increased in the AD brain, which is thought to contribute to oxidative stress-associated neurodegeneration. However, there are reports of an early neuroprotective role of NO in AD that may be harnessed as a therapeutic strategy [90]. NO has been reported to impair autophagy by several mechanisms, with nitric oxide synthase (NOS) inhibition enhancing clearance of autophagic substrates and reducing neurodegeneration [91]. However, autophagy impairment has been reported in individuals with neurodegenerative diseases, and the causal mechanistic links between NO, autophagy and AD remain to be elucidated [92, 93]. Furthermore, there have been links with other carboxypeptidases to AD. Specifically, a new human mutation in the carboxypeptidase E (CPE)/neurotrophic factor-α1 (NF-α1) gene from an AD patient was found to cause memory deficit and depressive-like behavior in transgenic mice [94]. Thus, this AD subpopulation appears to be linked to NO, which has been implicated in AD neurodegeneration.

PDE5A, PSEN1, and NDUFS8 identify Class V, which we described as a mitochondrial subpopulation, that may suggest a familial role. PDE5 is upregulated in AD subjects compared to age-matched healthy controls, underscoring the use of PDE5 inhibitors to restore memory function and cognition [62, 63]. Even further, PDE5 inhibition has been shown to decrease Aβ load in models of AD [95]. Although PDE5A was the third most prominent identifier for Class I, it was the primary identifier for Class V. PDE inhibition is involved in neurodegenerative processes by regulating cAMP and cGMP concentrations [61]. cGMP-specific PDE5 is reported to be upregulated in AD subjects compared to age-matched healthy controls [62]. What stood out the most for this Class was that although PSEN1 mutations are the most common cause of autosomal dominant FAD [96], PSEN1 was not the primary identifier. Two hypotheses describe the role of PSEN1 on AD pathogenesis–the amyloid hypothesis and the presenilin hypothesis. The amyloid hypothesis proposes that PSEN1 mutations initiate AD pathogenesis by increasing the production of Aβ42, which contributes to amyloid plaque deposition. In contrast, the presenilin hypothesis proposes that PSEN1 mutations cause loss of function of presenilin in the brain, which triggers neurodegeneration and dementia [97]. Looking even further into this subpopulation, we noticed that SYK drives an additional difference within this group, also highlighting the complexity of the disease. SYK regulates Aβ production and tau hyperphosphorylation [98, 99]. The Aβ and the NO/cGMP pathway can stimulate synaptic plasticity and memory at low doses and inhibit them at high doses. With aging, the body’s ability to regulate the balance between oxidant and antioxidant systems decreases, resulting in an increased production of reactive oxygen and nitrogen species that result in tissue damage. This oxidative stress also promotes the accumulation of Aβ [95]. Furthermore, NDUFS8 being one of the identifiers for Class V, highlights the mitochondrial role of AD. Complex I has essential bioenergetic and metabolic functions and is a known source of reactive oxygen species, linking it to many hereditary and degenerative diseases [100].

Class VI was defined by DCAF17, cDNA FLJ75819, and SLC33A1. DCAF17 encodes a nuclear transmembrane protein associated with damaged DNA binding protein 1 ubiquitin ligase complex and is involved in iron accumulation in Globus pallidus and in white matter [51]. Similar to Class V, this highlights the role of mitochondrial dysfunction in AD pathogenesis. Within Class VI specifically, this highlights the pathological role of iron overload in the mitochondria to cause mitochondrial dysfunction [101]. It appears that iron overload-induced mitochondrial dysfunction is the driving difference between Class V and Class VI. This idea is reinforced with ZNF652, which, although not the identifier for Class VI, is the most closely associated gene to cDNA FLJ75819. ZNF652 is associated with metal protein and has been reported to be upregulated in severe AD [102, 103]. The third identifier of this subpopulation, SLC33A1, and its associated protein AT-1, are associated with the import of acetyl-CoA by regulating Nε-lysine acetylation of ER-resident and -transiting proteins, which causes a progeria-like phenotype that mimics an accelerated form of aging [104]. Mutations and increased expression of AT-1/SLC33A1 have been associated with several diseases, including neurologic, intellectual, and dysmorphic conditions, and have also been reported in LOAD subjects [105]. Interestingly, SLC33A1 mutations have also been associated with low serum copper [106]. Homeostasis of metal ions, including iron, copper, zinc, and calcium, in the brain is crucial for maintaining normal physiological functions–and an imbalance is closely related to the onset and progression of AD. This is due to metal ion dysregulation contributing to oxidative stress and the induction of tau and Aβ pathologies [107]. Although there appears to be an underlying role of metals or metal metabolism, this represents a subpopulation that warrants additional investigation to understand how they collectively contribute to AD pathology.

Interestingly, PSEN1 was the only one of the genes primarily associated with AD and increased AD risk (APOE ε4, APP, PSEN1, and PSEN2) to be a primary identifier for a perspective class. Although we only utilized AD subjects in the study, this highlights the heterogeneous nature of AD pathology. This explains why APOE does not show up in the analysis as a driving variable, and it was our intention to extract a refined vantage into AD outside of APOE findings. The heterogeneity of the AD population used, in terms of beta-amyloid status, also contributed to APOE mRNA counts to be insignificant compared to other gene expression products. Furthermore, there have been reports on the molecular differences in AD between males and females [108]. Thus, identifying whether certain classes or combinations of classes are more prevalent in males or females will continue to shed light on disease etiology. Females are at a greater risk of developing AD dementia, while males are at a greater risk of developing vascular dementia [109]. Analysis of our dataset revealed that 58.9% of males fell under the vasculogenesis perspective class, compared to only 38.7% of females. Surprisingly, only 16.5% of females fell under the mitochondrial perspective class. Gender has been reported to not only influence AD evolution directly but also through other comorbidity factors [110]. Note that the perspective classes discovered by the machine intelligence we are using offers a view into how AD progresses for different people and how different people evolve towards this phenotype through potentially different combinations of factors. The six progression mechanisms discussed here appear to be an essential part of this story. By providing an increased granularity into the mechanisms at play, the advent of AI and ML algorithms provide a means of expediting the drug repurposing and development process, particularly with respect to heterogeneous neurodegenerative diseases. This is because ML approaches permit the mining of different kinds of data that shed light on disease etiology through precise subpopulations, which can, in turn, assist in the discovery and development of effective anti-AD drugs. Future work will explore statistical evaluations of several subpopulations.

It is widely known that AD is a heterogeneous disease, yet AD drug trials often have broad inclusion criteria, not accounting for disease heterogeneity in trial design [111–113]. Stratifying treatment trial designs to account for disease heterogeneity using algorithms and omics data will lead to personalized medicine in AD drug development. Hypothesis-generating AI technologies like the one described in [28] are able to help usher disease definitions that precisely relate to the molecular machinery at play. The improvement to clinical trial outcomes can be substantial as we will be better able to select patients and match them with drug candidates.

Perspective analytics allowed us to understand an AD patient population in various ways with the goal of being able to precisely define the various mechanisms at play behind this complex disease and how these perspectives can improve clinical trial efforts in this space. It is possible that certain drugs that have been designed for AD are actually effective at improving the health of specific subpopulations, and even more possible that several drug candidates can be repositioned for specific subtypes. Here, we have identified six perspective classes corresponding to disease progression mechanisms that contribute to AD heterogeneity. The six perspective classes highlight the critical roles of vasculogenesis, cellular signaling and differentiation, metabolic function, mitochondrial function, NO, and metal ion metabolism. Although these specific AD patient subpopulations have not explicitly been identified previously, the genetic identifiers for each perspective have been implicated in AD. The ability to utilize a small dataset to extract such precise insights opens up the possibility to boil away much of the noise that exists within the AD field, redefining the way we think about AD as a set of diseases that emerge through various molecular pathways.

Many remarkable advances using machine intelligence have been made over the last several years. Computer vision applications have been given particular attention as the advent of convolutional neural networks are beautifully suited for these tasks. Similarly, other types of deep neural networks are currently being used for drug discovery. There is great potential that comes with creating a new taxonomy of disease for complex disorders. These efforts will allow researchers and pharmaceutical companies to derive precise and novel ways to attack these disorders through the drug paradigm or genetic engineering. It should be noted that oligomannate has been approved for the treatment of Alzheimer’s in China, and that aducanumab is currently seeking approval. It will be interesting to see how our efforts to understand patient subpopulations will influence the utilization of existing and future therapies.

Although results from this study are not exhaustive, they demonstrate that even within a relatively small study sample, next-generation machine intelligence is capable of uncovering multiple genetically driven subtypes. We hope to continue this work with a larger Alzheimer’s transcriptomic dataset so that we can continue to unravel the etiology behind dementia. In future analyses, we are considering the combinatorial aspects of the patient population within this dataset and from others. Are there certain combinations of the six perspective classes that are statistically more likely to occur? Machine intelligence has opened up a door that is allowing us to pursue therapies for neurodegeneration with a much finer granularity of understanding.

Abbreviations

AD: Alzheimer’s disease

AI: artificial intelligence

Aβ: β-amyloid

BACE-1: Beta-secretase 1

cAMP: cyclic adenosine monophosphate

cGMP: cyclic guanosine monophosphate

CUL4: cullin-4

DDB1: DNA damage-binding protein 1

FAD: familial Alzheimer’s disease

GWAS: genome-wide association study

IRS4: insulin receptor substrate 4

JAK-STAT: Janus kinase-signal transducer and activator of transcription

LOAD: late onset Alzheimer’s disease

MCI: mild cognitive impairment

ML: machine learning

NADH: nicotinamide adenine dinucleotide

NO: nitric oxide

NRG: neuregulin

PDE: phosphodiesterase

PRS: polygenic risk score

REO: relative expression ordering

SNP: single nucleotide polymorphism

SOCS: suppressor of cytokine signaling

SYK: spleen tyrosine kinase

Supplementary materials

The supplementary materials for this article are available at:

https://www.explorationpub.com/uploads/Article/file/100126_sup_1.pdf

Declarations

Author contributions

JG created the mathematics from which the machine intelligence techniques utilized were derived, curated the data, led the vision, and carried out a good portion of the research along with MT, who contributed essential bioinformatics. BQ wrote the majority of the manuscript and carried out critical protein interaction work that allowed us to interpret the results provided by the machine intelligence. RA and AA were both primary readers of the manuscript, provided direction, criticism, and helped shape the overall flow of the research during the course of this work. All authors contributed to manuscript revision, read, and approved the submitted version.

Conflicts of interest

The author Joseph Geraci declares that he owns substantial shares in NetraMark Corp, which funded a major portion of this study.

Ethical approval

Not applicable.

Consent to participate

Not applicable.

Consent to publication

Not applicable.

Availability of data and materials

The data for this work was extracted from data available at https://www.ebi.ac.uk/arrayexpress/experiments/E-GEOD-84422/?query=GSE84422.

Funding

This project was supported by NetraMark Corp., an AI company focused on advanced machine intelligence methods, and by graduate student support from Queen’s University for Bessi Qorri. The funders had no role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Copyright

References

WHO. Global action plan on the public health response to dementia 2017–2025. Geneva: World Health Organization; 2017.

2020 Alzheimer’s disease facts and figures. Alzheimer’s Dement. 2020;16:391–460. [DOI]

Prince M, Ali GC, Guerchet M, Prina AM, Albanese E, Wu YT. Recent global trends in the prevalence and incidence of dementia, and survival with dementia. Alzheimers Res Ther. 2016;8:23. [DOI] [PubMed] [PMC]

Toro CA, Zhang L, Cao J, Cai D. Sex differences in Alzheimer’s disease: understanding the molecular impact. Brain Res. 2019;1719:194–207. [DOI] [PubMed] [PMC]

Quan M, Zhao T, Tang Y, Luo P, Wang W, Qin Q, et al. Effects of gene mutation and disease progression on representative neural circuits in familial Alzheimer’s disease. Alzheimers Res Ther. 2020;12:14. [DOI] [PubMed] [PMC]

Dorszewska J, Prendecki M, Oczkowska A, Dezor M, Kozubski W. Molecular basis of familial and sporadic Alzheimer’s disease. Curr Alzheimer Res. 2016;13:952–63. [DOI] [PubMed]

Anand R, Gill KD, Mahdi AA. Therapeutics of Alzheimer’s disease: past, present and future. Neuropharmacology. 2014;76:27–50. [DOI] [PubMed]

Eid A, Mhatre I, Richardson JR. Gene-environment interactions in Alzheimer’s disease: a potential path to precision medicine. Pharmacol Ther. 2019;199:173–87. [DOI] [PubMed] [PMC]

Cummings J, Lee G, Ritter A, Zhong K. Alzheimer’s disease drug development pipeline: 2018. Alzheimers Dement (N Y). 2018;4:195–214. [DOI] [PubMed] [PMC]

10.

Cummings JL, Morstorf T, Zhong K. Alzheimer’s disease drug-development pipeline: few candidates, frequent failures. Alzheimers Res Ther. 2014;6:37. [DOI] [PubMed] [PMC]

11.

Wang X, Sun G, Feng T, Zhang J, Huang X, Wang T, et al. Sodium oligomannate therapeutically remodels gut microbiota and suppresses gut bacterial amino acids-shaped neuroinflammation to inhibit Alzheimer’s disease progression. Cell Res. 2019;29:787–803. [DOI] [PubMed] [PMC]

12.

Hampel H, O’Bryant SE, Castrillo JI, Ritchie C, Rojkova K, Broich K, et al. Precision medicine-the golden gate for detection, treatment and prevention of Alzheimer’s disease. J Prev Alzheimers Dis. 2016;3:243–59. [DOI] [PubMed] [PMC]

13.

Freudenberg-Hua Y, Li W, Davies P. The role of genetics in advancing precision medicine for Alzheimer’s disease-a narrative review. Front Med (Lausanne). 2018;5:108. [DOI] [PubMed] [PMC]

14.

Van Cauwenberghe C, Van Broeckhoven C, Sleegers K. The genetic landscape of Alzheimer disease: clinical implications and perspectives. Genet Med. 2016;18:421–30. [DOI] [PubMed] [PMC]

15.

Lambert JC, Ibrahim-Verbaas CA, Harold D, Naj AC, Sims R, Bellenguez C, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat Genet. 2013;45:1452–8. [DOI] [PubMed] [PMC]

16.

Tábuas-Pereira M, Santana I, Guerreiro R, Brás J. Alzheimer’s disease genetics: review of novel loci associated with disease. Curr Genet Med Rep. 2020;8:1–16. [DOI]

17.

Castrillo JI, Lista S, Hampel H, Ritchie CW. Systems biology methods for Alzheimer’s disease research toward molecular signatures, subtypes, and stages and precision medicine: application in cohort studies and trials. In: Perneczky R, editor. Biomarkers for Alzheimer’s Disease Drug Development. New York: Humana Press; 2018. p. 31–66. [DOI]

18.

Zhang D, Wang Y, Zhou L, Yuan H, Shen D; Alzheimer’s Disease Neuroimaging Initiative. Multimodal classification of Alzheimer’s disease and mild cognitive impairment. Neuroimage. 2011;55:856–67. [DOI] [PubMed] [PMC]

19.

Wei R, Li C, Fogelson N, Li L. Prediction of conversion from mild cognitive impairment to alzheimer’s disease using MRI and structural network features. Front Aging Neurosci. 2016;8:76. [DOI] [PubMed] [PMC]

20.

Zhang D, Shen D; Alzheimer’s Disease Neuroimaging Initiative. Multimodal multi-task learning for joint prediction of multiple regression and classification variables in Alzheimer’s disease. Neuroimage. 2012;59:895–907. [DOI] [PubMed] [PMC]

21.

Zhang D, Shen D; Alzheimer’s Disease Neuroimaging Initiative. Predicting future clinical changes of MCI patients using longitudinal and multimodal biomarkers. PLoS One. 2012;7:e33182. [DOI] [PubMed] [PMC]

22.

Lo MT, Kauppi K, Fan CC, Sanyal N, Reas ET, Sundar VS, et al. Identification of genetic heterogeneity of Alzheimer’s disease across age. Neurobiol Aging. 2019;84:243.e1–9. [DOI] [PubMed] [PMC]

23.

Hong G, Zeng P, Li N, Cai H, Guo Y, Li X, et al. A qualitative analysis based on relative expression orderings identifies transcriptional subgroups for Alzheimer’s disease. Curr Alzheimer Res. 2019;16:1175–82. [DOI] [PubMed]

24.

Scheltens NM, Tijms BM, Koene T, Barkhof F, Teunissen CE, Wolfsgruber S, et al.; Alzheimer’s Disease Neuroimaging Initiative; German Dementia Competence Network; University of California San Francisco Memory and Aging Center; Amsterdam Dementia Cohort. Cognitive subtypes of probable Alzheimer’s disease robustly identified in four cohorts. Alzheimers Dement. 2017;13:1226–36. [DOI] [PubMed] [PMC]

25.

Scheltens NM, Galindo-Garre F, Pijnenburg YA, van der Vlies AE, Smits LL, Koene T, et al. The identification of cognitive subtypes in Alzheimer’s disease dementia using latent class analysis. J Neurol Neurosurg Psychiatry. 2016;87:235–43. [DOI] [PubMed]

26.

Crane PK, Trittschuh E, Mukherjee S, Saykin AJ, Sanders RE, Larson EB, et al.; Executive Prominent Alzheimer’s Disease: Genetics and Risk Factors (EPAD:GRF) Investigators. Incidence of cognitively defined late-onset Alzheimer’s dementia subgroups from a prospective cohort study. Alzheimers Dement. 2017;13:1307–16. [DOI] [PubMed] [PMC]

27.

Chasioti D, Yan J, Nho K, Saykin AJ. Progress in polygenic composite scores in Alzheimer’s and other complex diseases. Trends Genet. 2019;35:371–82. [DOI] [PubMed] [PMC]

28.

Gong CX, Liu F, Iqbal K. Multifactorial hypothesis and multi-targets for Alzheimer’s disease. J Alzheimers Dis 2018;64:S107–17. [DOI] [PubMed]

29.

Tsay M, Geraci J, Agrawal A. Next-gen AI for disease definition, patient stratification, and placebo effect. Version: 1. OSF Preprints [Preprint]. [posted 2020 Apr 6; revised 2020 Jul 22; cited 2020 Jun 18]: [9 p.]. Available from: https://osf.io/pc7ak

30.

Silva GA. The effect of signaling latencies and node refractory states on the dynamics of networks. Neural Comput. 2019;31:2492–522. [DOI] [PubMed]

31.

Rokach L. Pattern classification using ensemble methods. Singapore: World Scientific; 2019.

32.

Asaithambi S, editor. Why, how and when to apply feature selection [Internet]. Ontario: Towards Data Science; 2018. [cited 2020 Oct 3]. Available from: https://towardsdatascience.com/why-how-and-when-to-apply-feature-selection-e9c69adfabf2

33.

Chandrashekar G, Sahin F. A survey on feature selection methods. Comput Electr Eng. 2014;40:16–28. [DOI]

34.

Diaz-Papkovich A, Anderson-Trocmé L, Ben-Eghan C, Gravel S. UMAP reveals cryptic population structure and phenotype heterogeneity in large genomic cohorts. PLoS Genet. 2019;15:e1008432. [DOI] [PubMed] [PMC]

35.

Scheff SW. Nonparametric statistics. Fundamental statistical principles for the neurobiologist. Cambridge: Academic Press; 2016. p. 157–82.

36.

Warde-Farley D, Donaldson SL, Comes O, Zuberi K, Badrawi R, Chao P, et al. The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 2010;38:W214–20. [DOI] [PubMed] [PMC]

37.

Burley K, Westbury SK, Mumford AD. TUBB1 variants and human platelet traits. Platelets. 2018;29:209–11. [DOI] [PubMed]

38.

de la Torre JC. Vascular basis of Alzheimer’s pathogenesis. Ann N Y Acad Sci. 2002;977:196–215. [DOI] [PubMed]

39.

de la Torre JC. Is Alzheimer’s disease a neurodegenerative or a vascular disorder? Data, dogma, and dialectics. Lancet Neurol. 2004;3:184–90. [DOI] [PubMed]

40.

Altman R, Rutledge JC. The vascular contribution to Alzheimer’s disease. Clin Sci. 2010;119:407–21. [DOI]

41.

Kim JA, Jayabalan AK, Kothandan VK, Mariappan R, Kee Y, Ohn T. Identification of Neuregulin-2 as a novel stress granule component. BMB Rep. 2016;49:449–54. [DOI] [PubMed] [PMC]

42.

Ledonne A, Mercuri NB. On the modulatory roles of Neuregulins/ErbB signaling on synaptic plasticity. Int J Mol Sci. 2019;21:275. [DOI]

43.

Islam T, Rahman R, Shahjaman, Zaman ST, Karim R, Quinn JMW, et al. Blood-based molecular biomarker signatures in Alzheimer’s disease: insights from systems biomedicine perspective. BioRxiv 481879 [Preprint]. 2018 [cited 2020 Oct 3]. Available from: https://www.biorxiv.org/content/10.1101/481879v4

44.

Freude S, Hettich MM, Schumann C, Stöhr O, Koch L, Köhler C, et al. Neuronal IGF-1 resistance reduces Aβ accumulation and protects against premature death in a model of Alzheimer’s disease. FASEB J. 2009;23:3315–24. [DOI] [PubMed]

45.

Wu M, Fang K, Wang W, Lin W, Guo L, Wang J. Identification of key genes and pathways for Alzheimer’S disease via combined analysis of genome-wide expression profiling in the hippocampus. Biophys Rep. 2019;5:98–109. [DOI]

46.

Tipton AR, Wren JD, Daum JR, Siefert JC, Gorbsky GJ. GTSE1 regulates spindle microtubule dynamics to control Aurora B kinase and Kif4A chromokinesin on chromosome arms. J Cell Biol. 2017;216:3117–32. [DOI] [PubMed] [PMC]

47.

Puthiyedth N, Riveros C, Berretta R, Moscato P. Identification of differentially expressed genes through integrated study of Alzheimer’s disease affected brain regions. PLoS One. 2016;11:e0152342. [DOI] [PubMed] [PMC]

48.

Kessler T, Wobst J, Wolf B, Eckhold J, Vilne B, Hollstein R, et al. Functional characterization of the GUCY1A3 coronary artery disease risk locus. Circulation. 2017;136:476–89. [DOI] [PubMed] [PMC]

49.

Zhang X, Tan F, Brovkovych V, Zhang Y, Lowry JL, Skidgel RA. Carboxypeptidase M augments kinin B1 receptor signaling by conformational crosstalk and enhances endothelial nitric oxide output. Biol Chem. 2013;394:335–45. [DOI] [PubMed] [PMC]

50.

Lunnon K, Keohane A, Pidsley R, Newhouse S, Riddoch-Contreras J, Thubron EB, et al. Mitochondrial genes are altered in blood early in Alzheimer’s disease. Neurobiol Aging. 2017;53:36–47. [DOI] [PubMed]

51.

Arber CE, Li A, Houlden H, Wray S. Review: Insights into molecular mechanisms of disease in neurodegeneration with brain iron accumulation: unifying theories. Neuropathol Appl Neurobiol. 2016;42:220–41. [DOI] [PubMed] [PMC]

52.

Dagan H, Flashner-Abramson E, Vasudevan S, Jubran MR, Cohen E, Kravchenko-Balasha N. Exploring Alzheimer’s disease molecular variability via calculation of personalized transcriptional signatures. Biomolecules. 2020;10:503. [DOI]

53.

Huang X, Liu H, Li X, Guan L, Li J, Tellier LCAM, et al. Revealing Alzheimer’s disease genes spectrum in the whole-genome by machine learning. BMC Neurol. 2018;18:5. [DOI] [PubMed] [PMC]

54.

Kim JH, Franck J, Kang T, Heinsen H, Ravid R, Ferrer I, et al. Proteome-wide characterization of signalling interactions in the hippocampal CA4/DG subfield of patients with Alzheimer’s disease. Sci Rep. 2015;5:11138. [DOI] [PubMed] [PMC]

55.

Morello G, Cavallaro S. Transcriptional analysis reveals distinct subtypes in amyotrophic lateral sclerosis: implications for personalized therapy. Future Med Chem. 2015;7:1335–59. [DOI] [PubMed]

56.

Upadhyay A, Joshi V, Amanullah A, Mishra R, Arora N, Prasad A, et al. E3 ubiquitin ligases neurobiological mechanisms: development to degeneration. Front Mol Neurosci. 2017;10:151. [DOI] [PubMed] [PMC]

57.

Li JY, Chai B, Zhang W, Wu X, Zhang C, Fritze D, et al. Ankyrin repeat and SOCS box containing protein 4 (Asb-4) colocalizes with insulin receptor substrate 4 (IRS4) in the hypothalamic neurons and mediates IRS4 degradation. BMC Neurosci. 2011;12:95. [DOI] [PubMed] [PMC]

58.

Li JY, Kuick R, Thompson RC, Misek DE, Lai YM, Liu YQ, et al. Arcuate nucleus transcriptome profiling identifies ankyrin repeat and suppressor of cytokine signalling box-containing protein 4 as a gene regulated by fasting in central nervous system feeding circuits. J Neuroendocrinol. 2005;17:394–404. [DOI] [PubMed]

59.

Anasa VV, Ravanan P, Talwar P. Multifaceted roles of ASB proteins and its pathological significance. Front Biol. 2018;13:376–88. [DOI]

60.

Westwood AJ, Beiser A, DeCarli C, Harris TB, Chen TC, He XM, et al. Insulin-like growth factor-1 and risk of Alzheimer dementia and brain atrophy. Neurology. 2014;82:1613–9. [DOI] [PubMed] [PMC]

61.

Wu Y, Li Z, Huang YY, Wu D, Luo HB. Novel phosphodiesterase inhibitors for cognitive improvement in Alzheimer’s disease: miniperspective. J Med Chem. 2018;61:5467–83. [DOI] [PubMed]

62.

Zhou LY, Zhu Y, Jiang YR, Zhao XJ, Guo D. Design, synthesis and biological evaluation of dual acetylcholinesterase and phosphodiesterase 5A inhibitors in treatment for Alzheimer’s disease. Bioorg Med Chem Lett. 2017;27:4180–4. [DOI] [PubMed]

63.

García-Osta A, Cuadrado-Tejedor M, García-Barroso C, Oyarzabal J, Franco R. Phosphodiesterases as therapeutic targets for Alzheimer’s disease. ACS Chem Neurosci. 2012;3:832–44. [DOI] [PubMed] [PMC]

64.

Sanders O. Sildenafil for the treatment of Alzheimer’s disease: a systematic review. J Alzheimers Dis Rep. 2020;4:91–106. [DOI] [PubMed] [PMC]

65.

Liu L, Xu H, Ding S, Wang D, Song G, Huang X. Phosphodiesterase 5 inhibitors as novel agents for the treatment of Alzheimer’s disease. Brain Res Bull. 2019;153:223–31. [DOI] [PubMed]

66.

Hu X, Fan Q, Hou H, Yan R. Neurological dysfunctions associated with altered BACE 1-dependent Neuregulin-1 signaling. J Neurochem. 2016;136:234–49. [DOI] [PubMed] [PMC]

67.

Czarnek M, Bereta J. Proteolytic processing of Neuregulin 2. Mol Neurobiol. 2019;57:1799–813. [DOI] [PubMed] [PMC]

68.

Cespedes JC, Liu M, Harbuzariu A, Nti A, Onyekaba J, Cespedes HW, et al. Neuregulin in health and disease. Int J Brain Disord Treat. 2018;4:024. [DOI] [PubMed] [PMC]

69.

Wang KS, Xu N, Wang L, Aragon L, Ciubuc R, Arana TB, et al. NRG3 gene is associated with the risk and age at onset of Alzheimer disease. J Neural Transm (Vienna). 2014;121:183–92. [DOI] [PubMed]

70.

Naj AC, Schellenberg GD; Alzheimer’s Disease Genetics Consortium (ADGC). Genomic variants, genes, and pathways of Alzheimer’s disease: an overview. Am J Med Genet B Neuropsychiatr Genet. 2017;174:5–26. [DOI] [PubMed] [PMC]

71.

Escott-Price V, Bellenguez C, Wang LS, Choi SH, Harold D, Jones L, et al.; Cardiovascular Health Study (CHS). Gene-wide analysis detects two new susceptibility genes for Alzheimer’s disease. 2014;9:e94661. [DOI]

72.

Warnatz HJ, Schmidt D, Manke T, Piccini I, Sultan M, Borodina T, et al. The BTB and CNC homology 1 (BACH1) target genes are involved in the oxidative stress response and in control of the cell cycle. J Biol Chem. 2011;286:23521–32. [DOI] [PubMed] [PMC]

73.

Moll L, Schubert M. The role of insulin and insulin-like growth factor-1/FoxO-mediated transcription for the pathogenesis of obesity-associated dementia. Curr Gerontol Geriatr Res. 2012;2012:384094. [DOI] [PubMed] [PMC]

74.

Ostrowski PP, Barszczyk A, Forstenpointner J, Zheng W, Feng ZP. Meta-analysis of serum insulin-like growth factor 1 in Alzheimer’s disease. PLoS One. 2016;11:e0155733. [DOI] [PubMed] [PMC]

75.

Gubbi S, Quipildor GF, Barzilai N, Huffman DM, Milman S. 40 years of IGF1: IGF1: the Jekyll and Hyde of the aging brain. J Mol Endocrinol. 2018;61:T171–85. [DOI] [PubMed] [PMC]

76.

Freude S, Schilbach K, Schubert M. The role of IGF-1 receptor and insulin receptor signaling for the pathogenesis of Alzheimer’s disease: from model organisms to human disease. Curr Alzheimer Res. 2009;6:213–23. [DOI] [PubMed]

77.

George C, Gontier G, Lacube P, François JC, Holzenberger M, Aïd S. The Alzheimer’s disease transcriptome mimics the neuroprotective signature of IGF-1 receptor-deficient neurons. Brain. 2017;140:2012–27. [DOI] [PubMed]

78.

Galle SA, van der Spek A, Drent ML, Brugts MP, Scherder EJA, Janssen JAMJL, et al. Revisiting the role of insulin-like growth factor-I receptor stimulating activity and the apolipoprotein E in Alzheimer’s disease. Front Aging Neurosci. 2019;11:20. [DOI] [PubMed] [PMC]

79.

Tsuruzoe K, Emkey R, Kriauciunas KM, Ueki K, Kahn CR. Insulin receptor substrate 3 (IRS-3) and IRS-4 impair IRS-1-and IRS-2-mediated signaling. Mol Cell Biol. 2001;21:26–38. [DOI] [PubMed] [PMC]

80.

Jackson HM, Soto I, Graham LC, Carter GW, Howell GR. Clustering of transcriptional profiles identifies changes to insulin signaling as an early event in a mouse model of Alzheimer’s disease. BMC Genomics. 2013;14:831. [DOI] [PubMed] [PMC]

81.

Cesarini V, Martini M, Vitiani LR, Gravina GL, Di Agostino S, Graziani G, et al. Type 5 phosphodiesterase regulates glioblastoma multiforme aggressiveness and clinical outcome. Oncotarget. 2017;8:13223–39. [DOI] [PubMed] [PMC]

82.

Manso-Calderón R. Genetics in vascular dementia. Future Neurol. 2019;14:FNL5. [DOI]

83.

Wallace S, Guo DC, Regalado E, Mellor-Crummey L, Bamshad M, Nickerson DA, et al. Disrupted nitric oxide signaling due to GUCY1A3 mutations increases risk for moyamoya disease, achalasia and hypertension. Clin Genet. 2016;90:351–60. [DOI] [PubMed] [PMC]

84.

Palmieri O, Mazza T, Merla A, Fusilli C, Cuttitta A, Martino G, et al. Gene expression of muscular and neuronal pathways is cooperatively dysregulated in patients with idiopathic achalasia. Sci Rep. 2016;6:31549. [DOI] [PubMed] [PMC]

85.

Anderson KL, Ferreira A. α1 integrin activation: a link between β-amyloid deposition and neuronal death in aging hippocampal neurons. J Neurosci Res. 2004;75:688–97. [DOI] [PubMed]

86.

Li H, Zeng J, Huang L, Wu D, Liu L, Liu Y, et al. Microarray analysis of gene expression changes in neuroplastin 65-Knockout mice: implications for abnormal cognition and emotional disorders. Neurosci Bull. 2018;34:779–88. [DOI] [PubMed] [PMC]

87.

Kalman J, Kitajka K, Pákáski M, Zvara A, Juhász A, Vincze G, et al. Gene expression profile analysis of lymphocytes from Alzheimer’s patients. Psychiatr Genet. 2005;15:1–6. [DOI] [PubMed]

88.

Deiteren K, Hendriks D, Scharpé S, Lambeir AM. Carboxypeptidase M: multiple alliances and unknown partners. Clin Chim Acta. 2009;399:24–39. [DOI] [PubMed]

89.

Tang X, Li Z, Zhang W, Yao Z. Nitric oxide might be an inducing factor in cognitive impairment in Alzheimer’s disease via downregulating the monocarboxylate transporter 1. Nitric Oxide. 2019;91:35–41. [DOI] [PubMed]

90.

Balez R, Ooi L. Getting to NO Alzheimer’s disease: neuroprotection versus neurotoxicity mediated by nitric oxide. Oxid Med Cell Longev. 2016;2016:3806157. [DOI] [PubMed] [PMC]

91.

Sarkar S, Korolchuk VI, Renna M, Imarisio S, Fleming A, Williams A, et al. Complex inhibitory effects of nitric oxide on autophagy. Mol Cell. 2011;43:19–32. [DOI] [PubMed] [PMC]

92.

Morris G, Berk M, Maes M, Puri BK. Could Alzheimer’s disease originate in the periphery and if so how so? Mol Neurobiol. 2019;56:406–34. [DOI] [PubMed] [PMC]

93.

Uddin MS, Stachowiak A, Mamun AA, Tzvetkov NT, Takeda S, Atanasov AG, et al. Autophagy and Alzheimer’s disease: from molecular mechanisms to therapeutic implications. Front Aging Neurosci. 2018;10:04. [DOI] [PubMed] [PMC]

94.

Cheng Y, Cawley NX, Yanik T, Murthy SR, Liu C, Kasikci F, et al. A human carboxypeptidase E/NF-α1 gene mutation in an Alzheimer’s disease patient leads to dementia and depression in mice. Transl Psychiatry. 2016;6:e973. [DOI] [PubMed] [PMC]

95.

Puzzo D, Loreto C, Giunta S, Musumeci G, Frasca G, Podda MV, et al. Effect of phosphodiesterase-5 inhibition on apoptosis and beta amyloid load in aged mice. Neurobiol Aging. 2014;35:520–31. [DOI] [PubMed]

96.

Bekris LM, Yu CE, Bird TD, Tsuang DW. Genetics of Alzheimer disease. J Geriatr Psychiatry Neurol. 2010;23:213–27. [DOI] [PubMed] [PMC]

97.

Kelleher RJ 3rd, Shen J. Presenilin-1 mutations and Alzheimer’s disease. Proc Natl Acad Sci U S A. 2017;114:629–31. [DOI] [PubMed] [PMC]

98.

Paris D, Ait-Ghezala G, Bachmeier C, Laco G, Beaulieu-Abdelahad D, Lin Y, et al. The spleen tyrosine kinase (Syk) regulates Alzheimer amyloid-β production and Tau hyperphosphorylation. J Biol Chem. 2014;289:33927–44. [DOI] [PubMed] [PMC]

99.

Schweig JE, Yao H, Beaulieu-Abdelahad D, Ait-Ghezala G, Mouzon B, Crawford F, et al. Alzheimer’s disease pathological lesions activate the spleen tyrosine kinase. Acta Neuropathol Commun. 2017;5:69. [DOI] [PubMed] [PMC]

100.

Lemire BD. Evolution, structure and membrane association of NDUFAF6, an assembly factor for NADH: ubiquinone oxidoreductase (Complex I). Mitochondrion. 2017;35:13–22. [DOI] [PubMed]

101.

Bagwe-Parab S, Kaur G. Molecular targets and therapeutic interventions for Iron induced neurodegeneration. Brain Res Bull. 2019;156:1–9. [DOI] [PubMed]

102.

Kong W, Mou X, Liu Q, Chen Z, Vanderburg CR, Rogers JT, et al. Independent component analysis of Alzheimer’s DNA microarray gene expression data. Mol Neurodegener. 2009;4:5. [DOI] [PubMed] [PMC]

103.

Kong W, Mou X, Hu X. Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data. BMC Bioinformatics. 2011;12 Suppl 5:S7. [DOI]

104.

Peng Y, Shapiro SL, Banduseela VC, Dieterich IA, Hewitt KJ, Bresnick EH, et al. Increased transport of acetyl-CoA into the endoplasmic reticulum causes a progeria-like phenotype. Aging cell. 2018;17:e12820. [DOI] [PubMed] [PMC]

105.

Peng Y, Shapiro S, Hewitt K, Kong G, Bresnick E, Zhang J, et al. Systemic overexpression of AT-1/SLC33A1 causes a progeria-like phenotype. Innov Aging. 2017;1:426–7. [DOI]

106.

Huppke P, Brendel C, Kalscheuer V, Korenke GC, Marquardt I, Freisinger P, et al. Mutations in SLC33A1 cause a lethal autosomal-recessive disorder with congenital cataracts, hearing loss, and low serum copper and ceruloplasmin. The Am J Hum Genet. 2012;90:61–8. [DOI] [PubMed] [PMC]

107.

Wang L, Yin YL, Liu XZ, Shen P, Zheng YG, Lan XR, et al. Current understanding of metal ions in the pathogenesis of Alzheimer’s disease. Transl Neurodegener. 2020;9:10. [DOI] [PubMed] [PMC]

108.

Sun LL, Yang SL, Sun H, Li WD, Duan SR. Molecular differences in Alzheimer’s disease between male and female patients determined by integrative network analysis. J Cell Mol Med. 2019;23:47–58. [DOI] [PubMed] [PMC]

109.

Podcasy JL, Epperson CN. Considering sex and gender in Alzheimer disease and other dementias. Dialogues Clin Neurosci. 2016;18:437–46. [DOI] [PubMed] [PMC]

110.

Sinforiani E, Citterio A, Zucchella C, Bono G, Corbetta S, Merlo P, et al. Impact of gender differences on the outcome of Alzheimer’s disease. Dement Geriatr Cogn Disord. 2010;30:147–54. [DOI] [PubMed]

111.

Devi G, Scheltens P. Heterogeneity of Alzheimer’s disease: consequence for drug trials? Alzheimers Res Ther. 201810:122. [DOI] [PubMed] [PMC]

112.

Au R, Piers RJ, Lancashire L. Back to the future: Alzheimer’s disease heterogeneity revisited. Alzheimers Dement (Amst). 2015;1:368–70. [DOI] [PubMed] [PMC]

113.

Ferreira D, Wahlund LO, Westman E. The heterogeneity within Alzheimer’s disease. Aging (Albany NY). 2018;10:3058–60. [DOI] [PubMed] [PMC]

Copyright: © The Author(s) 2020. This is an Open Access article licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, sharing, adaptation, distribution and reproduction in any medium or format, for any purpose, even commercially, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Abstract

Keywords

Introduction

Materials and methods

Dataset assembly and analysis

Perspective analytics

Validation

Results

Discussion

Abbreviations

Supplementary materials

Declarations

Author contributions

Conflicts of interest

Ethical approval

Consent to participate

Consent to publication

Availability of data and materials

Funding

Copyright

References

Going against the norm: validation of a novel alternative to brain SPECT normative datasets

The need for a harmonized speech dataset for Alzheimer’s disease biomarker development

Detection of mild traumatic brain injury in pediatric populations using BrainCheck, a tablet-based cognitive testing software: a preliminary study

Identification of digital voice biomarkers for cognitive health

Assessing the capacity for mental manipulation in patients with statically-determined mild cognitive impairment using digital technology

Proof of concept: digital clock drawing behaviors prior to transcatheter aortic valve replacement may predict length of hospital stay and cost of care

Neuropsychological test validation of speech markers of cognitive impairment in the Framingham Cognitive Aging Cohort

Digital sleep measures and white matter health in the Framingham Heart Study

Objective measurement of sleep by smartphone application: comparison with actigraphy and relation to self-reported sleep