<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "JATS-journalpublishing1.dtd">
<article xml:lang="en" article-type="research-article" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Exploration of Medicine</journal-id>
<journal-title-group>
<journal-title>Exploration of Medicine</journal-title>
</journal-title-group>
<issn pub-type="epub">2692-3106</issn>
<publisher>
<publisher-name>Open Exploration</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">100194</article-id>
<article-id pub-id-type="doi">10.37349/emed.2022.00094</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Original Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Construction and validation of gastric cancer diagnosis model based on machine learning</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-2029-8014</contrib-id>
<name>
<surname>Kong</surname>
<given-names>Fei</given-names>
</name>
<xref ref-type="aff" rid="AFF1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Yan</surname>
<given-names>Ziqin</given-names>
</name>
<xref ref-type="aff" rid="AFF2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Lan</surname>
<given-names>Ning</given-names>
</name>
<xref ref-type="aff" rid="AFF1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wang</surname>
<given-names>Pinxiu</given-names>
</name>
<xref ref-type="aff" rid="AFF1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fan</surname>
<given-names>Shanlin</given-names>
</name>
<xref ref-type="aff" rid="AFF1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Yuan</surname>
<given-names>Wenzhen</given-names>
</name>
<xref ref-type="aff" rid="AFF3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="C1"><sup>&#x0002A;</sup></xref>
</contrib>
<contrib contrib-type="academic-editor">
<name>
<surname>Dykxhoorn</surname>
<given-names>Derek M.</given-names>
</name>
</contrib>
<aff id="AFF1"><label>1</label>The First Clinical Medical College of Lanzhou University, Lanzhou 730030, Gansu, China</aff>
<aff id="AFF2"><label>2</label>The Silk Road Infoport Co., Ltd., Lanzhou 730030, Gansu, China</aff>
<aff id="AFF3"><label>3</label>Department of Oncology, The First Hospital of Lanzhou University, Lanzhou 730030, Gansu, China</aff>
<aff id="AFF4">University of Miami Miller School of Medicine, USA</aff>
</contrib-group>
<author-notes>
<corresp id="C1"><label>&#x0002A;</label><bold>Correspondence:</bold> Wenzhen Yuan, Department of Oncology, The First Hospital of Lanzhou University, Lanzhou 730030, Gansu, China. <email>yuanwzh@lzu.edu.cn</email></corresp>
</author-notes>
<pub-date pub-type="ppub">
<year>2022</year>
</pub-date>
<pub-date pub-type="epub">
<day>29</day>
<month>06</month>
<year>2022</year>
</pub-date>
<volume>3</volume>
<fpage>300</fpage>
<lpage>313</lpage>
<history>
<date date-type="received">
<day>14</day>
<month>04</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>18</day>
<month>05</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>&#x00A9; The Author(s) 2022.</copyright-statement>
<copyright-year>2022</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This is an Open Access article licensed under a Creative Commons Attribution 4.0 International License (<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link>), which permits unrestricted use, sharing, adaptation, distribution and reproduction in any medium or format, for any purpose, even commercially, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.</license-p></license>
</permissions>
<abstract>
<sec><title>Aim: </title><p>To screen differentially expressed genes related to gastric cancer based on The Cancer Genome Atlas (TCGA) database and construct a gastric cancer diagnosis model by machine learning.</p></sec>
<sec><title>Methods: </title><p>Transcriptional data, genomic data, and clinical information of gastric cancer tissues and non-gastric cancer tissues were downloaded from the TCGA database, and differentially expressed genes of gastric cancer messenger RNA (mRNA) and long non-coding RNA (lncRNA) were screened out. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyzed the differentially expressed genes, and the protein-protein interaction (PPI) of differentially expressed genes was constructed. Core differentially expressed genes were screened by Cytoscape software&#x02019;s molecular complex detection (MCODE) plug-in. The differential genes of lncRNA were analyzed by univariate Cox regression analysis and lasso regression for further dimension reduction to obtain the core genes. The core genes were screened by machine learning to construct the gastric cancer diagnosis model. The efficiency of the gastric cancer diagnosis model was verified externally by the Gene Expression Omnibus (GEO) database.</p></sec>
<sec><title>Results: </title><p>Finally, 10 genes including long intergenic non-protein coding RNA 1821 (<italic>LINC01821</italic>), <italic>AL138826.1</italic>, <italic>AC022164.1</italic>, adhesion G protein-coupled receptor D1-antisense RNA 1 (<italic>ADGRD1-AS1</italic>), cyclin B1 (<italic>CCNB1</italic>), kinesin family member 11 (<italic>KIF11</italic>), Aurora kinase B (<italic>AURKB</italic>), cyclin dependent kinase 1 (<italic>CDK1</italic>), nucleolar and spindle associated protein 1 (<italic>NUSAP1</italic>), and TTK protein kinase (<italic>TTK</italic>) were screened as gastric cancer diagnostic model genes. After efficiency analysis, it was found that the random forest algorithm model had the best comprehensive evaluation, with an accuracy of 92&#x00025; and an area under the curve (AUC) of 0.9722, which was more suitable for building a gastric cancer diagnosis model. The GSE54129 data set was used to verify the gastric cancer diagnosis model with an AUC of 0.904, indicating that the gastric cancer diagnosis model had high accuracy.</p></sec>
<sec><title>Conclusions: </title><p>Machine learning can simplify the bioinformatics analysis process and improve efficiency. The core gene discovered in this study is expected to become a gene chip for the diagnosis of gastric cancer.</p></sec>
</abstract>
<kwd-group>
<kwd>Machine learning</kwd>
<kwd>gastric cancer</kwd>
<kwd>diagnostic model</kwd>
<kwd>bioinformatics</kwd>
</kwd-group></article-meta>
</front>
<body>
<sec id="s1"><title>Introduction</title>
<p>Gastric cancer is the fifth most common cancer globally and ranks third in the world in terms of cancer mortality &#x0005B;<xref ref-type="bibr" rid="B1">1</xref>&#x0005D;, and is one of the most common malignant tumors in China &#x0005B;<xref ref-type="bibr" rid="B2">2</xref>&#x0005D;. Gastric cancer is characterized by high malignancy, susceptibility to distant metastasis, poor prognosis, and heavy disease burden &#x0005B;<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B4">4</xref>&#x0005D;. Currently, the diagnosis of gastric cancer still relies on upper gastrointestinal endoscopy &#x0005B;<xref ref-type="bibr" rid="B4">4</xref>&#x02013;<xref ref-type="bibr" rid="B6">6</xref>&#x0005D;. The invasive examination increases the difficulty of early screening &#x0005B;<xref ref-type="bibr" rid="B7">7</xref>&#x0005D;. For gastric cancer diagnosis, the specificity and sensitivity of traditional serum markers, such as carcinoembryonic antigen (CEA), are low &#x0005B;<xref ref-type="bibr" rid="B8">8</xref>&#x0005D;. Therefore, finding more accurate predictive markers for molecular diagnosis of gastric cancer is crucial in screening, early diagnosis and treatment.</p>
<p>With the development of high-throughput technology, RNA sequencing (RNA-seq)-based diagnostic markers for gastric cancer have been widely studied. Non-coding RNA (ncRNA) affects the expression of oncogenes or oncogenes and is expected to be a molecular marker for the early diagnosis of gastric cancer &#x0005B;<xref ref-type="bibr" rid="B9">9</xref>, <xref ref-type="bibr" rid="B10">10</xref>&#x0005D;. Long ncRNA (lncRNA) has been widely studied among ncRNAs in gastric cancer. LncRNA is an RNA transcript that has more than 200 nucleotides in length and is usually found in the nucleus &#x0005B;<xref ref-type="bibr" rid="B11">11</xref>&#x0005D;. LncRNA plays an important role in epigenetic regulation, cell cycle, genomic imprinting, chromatin modification, transcriptional interference, protein activation, etc. &#x0005B;<xref ref-type="bibr" rid="B12">12</xref>, <xref ref-type="bibr" rid="B13">13</xref>&#x0005D;. It was found that lncRNA is dysregulated in gastric, liver, breast, and cervical cancers and other tumors &#x0005B;<xref ref-type="bibr" rid="B14">14</xref>, <xref ref-type="bibr" rid="B15">15</xref>&#x0005D;, suggesting that lncRNA may be a potential biological marker for early diagnosis, efficacy chemoresistance and other assessments.</p>
<p>The volume of sequencing data is too large to be adequately analyzed by traditional means. Machine learning is algorithms that use statistical data analysis to build models for making predictions about the outcome of unknown data. Compared with existing statistical methods, machine learning has higher evaluation accuracy and personalized prediction ability when big data is used to analyze medical problems &#x0005B;<xref ref-type="bibr" rid="B16">16</xref>&#x0005D;. <italic>The New England Journal of Medicine</italic> believes that machine learning will bring a significant breakthrough in medicine &#x0005B;<xref ref-type="bibr" rid="B17">17</xref>&#x0005D;. For example, machine learning can predict the structures and functions of proteins based on the arrangement of genetic factors. Therefore, this study obtained core genes and built a diagnostic model for gastric cancer based on bioinformatics analysis of The Cancer Genome Atlas (TCGA) database, and then verified the model by machine learning and further validated the accuracy of the model using an external dataset (GEO dataset, Gene Expression Omnibus&#x02013;NCBI), to provide a new method for early diagnosis of gastric cancer.</p>
</sec>
<sec id="s2"><title>Materials and methods</title>
<sec><title>Data acquisition and processing</title>
<p>The transcriptomic data, genomic data, and clinical information on gastric cancer from the TCGA database were downloaded from the Genomic Data Commons (GDC) Data Portal (<ext-link ext-link-type="uri" xlink:href="https://portal.gdc.cancer.gov/">https://portal.gdc.cancer.gov/</ext-link>) on June 28, 2021, including 376 gastric cancer patients and 31 non-gastric cancer tissue samples. On July 8, 2021, this study downloaded transcriptomic and genomic data and clinical information of lung cancer patients from the TCGA database as a control group, including 543 lung cancer patients with cancer tissues and 51 normal tissue samples. The number of gastric cancer patients reached 376, and the number of non-gastric cancer patients reached 625. After that, the GEO public database (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo">https://www.ncbi.nlm.nih.gov/geo</ext-link>) was searched with &#x0201C;gastric cancer&#x0201D; as the keyword, and the GSE54129 dataset was selected as the external validation dataset. The GSE54129 dataset is based on the GPL570 platform, which contains the gene expression information of 111 gastric cancer patients and 21 normal controls. The RNA-seq data expression matrix was merged with R language (version 4.0.4) to obtain the complete RNA-seq expression profile, and the count data were normalized and ID transformed. The messenger RNA (mRNA) and lncRNA were extracted separately to generate gene expression matrices.</p>
</sec>
<sec><title>mRNA data processing</title>
<sec><title>Screening for differential genes</title>
<p>The &#x0201C;DESeq2&#x0201D; package was called in R language to screen the differential genes with the criteria of &#x0007C;log2 fold change (FC)&#x0007C; &#x02265; 3 and false discovery rate (FDR) &#x0003C; 0.05, and the &#x0201C;heatmap&#x0201D; package was used to plot the differential gene heat map and the &#x0201C;ggplot&#x0201D; package to plot the differential gene scatter map.</p>
</sec>
<sec><title>Functional enrichment analysis of differential genes and protein interaction network construction</title>
<p>Using R language, this research first performs gene ontology (GO) analysis &#x0005B;<xref ref-type="bibr" rid="B18">18</xref>&#x0005D; and gene function annotation of differentially expressed genes, including biological process (BP), cell composition (CC), and molecular function (MF). Subsequently, a Kyoto Encyclopedia of Genes and Genomes (KEGG) &#x0005B;<xref ref-type="bibr" rid="B19">19</xref>&#x0005D; analysis was performed to obtain signaling pathways that may have a role. This study used the search tool for the retrieval of interacting genes (STRING) (an online biological database that provides gene analysis and constructs networks of gene interactions at the protein level &#x0005B;<xref ref-type="bibr" rid="B20">20</xref>&#x0005D;). Then, the program downloaded and installed the molecular complex detection (MCODE) plug-in in Cytoscape 3.6.1 and imported the above the protein-protein interaction (PPI) data in tab-separated values (TSV) format with a degree cutoff &#x0003D; 2, Node score cutoff &#x0003D; 0.2, and &#x003BA;-core &#x0003D; 0.2. The PPI network&#x02019;s most densely associated regions were obtained using degree cutoff &#x0003D; 2, Node score cutoff &#x0003D; 0.2, &#x003BA;-core &#x0003D; 2, max. depth &#x0003D; 100 as the screening criteria. The most densely associated regions in the PPI network were obtained, which are the core genes related to gastric cancer screening in this study &#x0005B;<xref ref-type="bibr" rid="B21">21</xref>&#x0005D;.</p>
</sec>
</sec>
<sec><title>LncRNA data processing</title>
<sec><title>Screening for differential genes</title>
<p>The &#x0201C;DESeq2&#x0201D; package was called in R to screen the differential genes with the criteria of &#x0007C;log2 FC&#x0007C; &#x02265; 3 and FDR &#x0003C; 0.05. The differential gene heat map is plotted by the &#x0201C;heatmap&#x0201D; package. The differential gene volcano was shown through the &#x0201C;ggplot&#x0201D; package in R language.</p>
</sec>
<sec><title>Screening for key genes</title>
<p>Since lncRNAs are ncRNAs, the univariate Cox regression model was first used to investigate the relationship between lncRNA expression levels and overall patient survival for the screened differentially expressed lncRNAs, and the identification criterion was <italic>P</italic> &#x0003C; 0.05. To avoid overfitting of the model, lasso regression was used to process the data. Lasso regression is based on linear regression, and the addition of a penalty term in the model estimation can compress extremely small regression coefficients to 0, at the cost of some estimation bias to obtain higher model prediction accuracy and model generalization ability. This study performed lasso regression analysis by &#x0201C;glmnet&#x0201D; package in R language to further screen lncRNAs associated with survival prognosis by increasing the penalty strength and narrowing the regression coefficients.</p>
</sec>
</sec>
<sec><title>Construction of the model</title>
<p>This study used three machine learning algorithms (MLAs) &#x0005B;random forest (RF) &#x0005B;<xref ref-type="bibr" rid="B22">22</xref>, <xref ref-type="bibr" rid="B23">23</xref>&#x0005D;, naive Bayesian classification (NBC) &#x0005B;<xref ref-type="bibr" rid="B24">24</xref>&#x0005D;, and <italic>k</italic>-nearest neighbor (KNN) &#x0005B;<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B26">26</xref>&#x0005D;&#x0005D; to construct and compare diagnostic models for gastric cancer (see the <xref ref-type="sec" rid="s5">supplementary file</xref> for details of the algorithm).</p>
</sec>
<sec><title>Model optimization and validation</title>
<p>This study used accuracy, sensitivity, specificity, and area under the curve (AUC) to assess the performance of the gastric cancer diagnostic model. Accuracy refers to the proportion of samples correctly predicted by the model to all samples, and sensitivity refers to the proportion of correct predictions where the true value is a positive case. The AUC is the area under the receiver operating characteristic (ROC) curve. The ROC curve shows the sensitivity and specificity of the model prediction, and the larger the value, the better the prediction. To further verify the model&#x02019;s efficacy, data from the GEO dataset GSE54129 were applied to further validate the accuracy of the model.</p>
</sec>
</sec>
<sec id="s3"><title>Results</title>
<sec><title>Acquisition of key mRNA genes for gastric cancer</title>
<sec><title>Screening for gastric cancer differential genes</title>
<p>Expression data were downloaded from the TCGA database for 407 patients, including 376 gastric cancer tissues and 31 control tissues. By differential comparison, this study screened a total of 947 differential mRNAs, of which 419 were upregulated and 526 downregulated, and plotted a heat map and volcano map (<xref ref-type="fig" rid="F1">Figure 1</xref>).</p>
<fig id="F1" position="float"><label>Figure 1.</label><caption><p>Heat map and volcano map of mRNA differential genes in gastric cancer. (A) Heat map of 947 mRNA differential genes in gastric cancer, and the top 50 most representative genes were selected to draw the heat map; (B) volcano map of 947 genes obtained at a cutoff value of 3, of which 419 were upregulated and 526 downregulated</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g001.tif"/></fig>
</sec>
<sec><title>Biological process analysis of differential genes in gastric cancer</title>
<p>GO enrichment analysis showed that gastric cancer upregulated differential genes (UDEGs) were mainly distributed in the extracellular region part, proteinaceous extracellular matrix (ECM), the ECM, and other tissues. The UDEGs were involved in biological processes such as cell adhesion, biological adhesion, response to wounding, and mainly had molecular functions such as ECM structural constituent and glycosaminoglycan binding. Gastric cancer downregulated differential genes (DDEGs) were mainly distributed in the apical part of cells, the extracellular region and other tissues, involved in biological processes such as digestion, lipid catabolic process and response to the metal ion. The DDEGs mainly had molecular functions such as steroid binding and coenzyme binding (<xref ref-type="fig" rid="F2">Figure 2</xref>).</p>
<fig id="F2" position="float"><label>Figure 2.</label><caption><p>Biological process analysis of differential genes in gastric cancer. (A&#x02013;C) The results of GO analysis of UDEGs in gastric cancer; (D&#x02013;F) the results of GO analysis of DDEGs in gastric cancer. CXCR: CXC chemokines; NAD<sup>&#x0002B;</sup>: Dihydrouracil Dehydrogenase; p. adjust: adjust <italic>P</italic>-values for multiple comparisons; P granule: germ cell ribonucleoprotein granules</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g002.tif"/></fig>
</sec>
<sec><title>Analysis of the signaling pathways involved in differential genes of gastric cancer</title>
<p>KEGG enrichment analysis showed that gastric cancer UDEGs were highly expressed in signaling pathways such as focal adhesion, ECM-receptor interactions, and leukocyte transendothelial migration. In contrast, gastric cancer DDEGs were enriched in expression in pathways such as metabolism of xenobiotics by cytochrome P450, drug metabolism-cytochrome P450, and retinol metabolism (<xref ref-type="fig" rid="F3">Figure 3</xref>).</p>
<fig id="F3" position="float"><label>Figure 3.</label><caption><p>KEGG pathway analysis of differential genes in gastric cancer. Red represents UDEGs, and blue represents DDEGs. PPAR: peroxisome proliferator-activated receptor; cAMP: cyclic adenosine monophosphate</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g003.tif"/></fig>
</sec>
<sec><title>PPI network construction and core gene identification</title>
<p>Interactions between differential genes were predicted using the STRING database, and the information of 947 differential genes was imported into Cytoscape software for visualization study. Nine hundred and eighteen nodes and 1,209 edges were involved in the PPI network (<xref ref-type="fig" rid="F4">Figure 4A</xref>). Ten gastric cancer-associated core differentially expressed genes were screened, and they were cyclin dependent kinase 1 (<italic>CDK1</italic>), non-SMC condensin I complex subunit G (<italic>NCAPG</italic>), cyclin B1 (<italic>CCNB1</italic>), kinesin family member 11 (<italic>KIF11</italic>), Aurora kinase B (<italic>AURKB</italic>), cell division cycle associated 8 (<italic>CDCA8</italic>), threonine kinase B (<italic>BUB1B</italic>), nucleolar and spindle associated protein 1 (<italic>NUSAP1</italic>), TTK protein kinase (<italic>TTK</italic>), and mitotic arrest deficient 2 like 1 (<italic>MAD2L1</italic>) according to the degree of node association from highest to lowest (<xref ref-type="fig" rid="F4">Figure 4B</xref>).</p>
<fig id="F4" position="float"><label>Figure 4.</label><caption><p>PPI network of differential genes and top 10 core genes in gastric cancer. (A) PPI map of differentially expressed genes in gastric cancer; (B) top 10 core genes in gastric cancer</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g004.tif"/></fig>
</sec>
</sec>
<sec><title>Acquisition of key lncRNA genes in gastric cancer</title>
<p>The expression data of patients were downloaded from the TCGA database, including 31 control tissues and 376 cancer tissues, and 66 differentially expressed lncRNAs were screened using the &#x0201C;DESeq2&#x0201D; package, of which 29 lncRNAs were upregulated, and 37 lncRNAs were downregulated. Among the 66 differentially expressed lncRNAs, 19 lncRNAs were further analyzed by univariate Cox regression with <italic>P</italic> &#x0003C; 0.05. Subsequently, lasso regression was used to further downscale the model, and the results showed that the model error was minimized when the number of variables was 5, which corresponded to &#x003BB; &#x0003D; 0. 075. The five screened lncRNAs were long intergenic non-protein coding RNA 1821 (<italic>LINC01821</italic>), <italic>AL138826.1</italic>, gastric cancer associated transcript 3 (<italic>GACAT3</italic>), <italic>AC022164.1</italic>, and adhesion G protein-coupled receptor D1-antisense RNA 1 (<italic>ADGRD1-AS1</italic>), which were used as gastric cancer lncRNA key genes (<xref ref-type="fig" rid="F5">Figure 5</xref>).</p>
<fig id="F5" position="float"><label>Figure 5.</label><caption><p>Lasso regression process. (A) Lasso diagram shows the dynamic process of screening variables; (B) the selection process of the cross-validation parameter Log(&#x003BB;)</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g005.tif"/></fig>
</sec>
<sec><title>Construction of data tables for gastric cancer diagnosis model</title>
<p>The 10 key genes of mRNA and 5 key genes of lncRNA were used to construct a gastric cancer diagnosis model, with column names as gene names, row names as the number of each sample, and content as the expression of each gene in each sample, and the genetic data of 543 cases of lung cancer tissues and 51 cases of paired paracancerous normal tissues were added as non-gastric cancer patients introduced into the model for the validation of the gastric cancer prediction model. As a result, the number of gastric cancer patients reached 376 cases, and the number of non-gastric cancer patients reached 628 cases.</p>
</sec>
<sec><title>Model performance analysis</title>
<p>The above 15 genes were further screened by the &#x0201C;Feature Importance&#x0201D; algorithm to identify 10 key genes, namely <italic>LINC01821</italic>, <italic>AL138826.1</italic>, <italic>AC022164.1</italic>, <italic>ADGRD1-AS1</italic>, <italic>CCNB1</italic>, <italic>KIF11</italic>, <italic>AURKB</italic>, <italic>CDK1</italic>, <italic>NUSAP1</italic>, <italic>TTK</italic>. The 10 genes were modeled using the best feature subset, by three MLAs: RF, NBC, and KNN. As for the performance of RF, NBC, and KNN, algorithm boosting was measured with 6 main metrics, namely AUC, ROC, correctness, sensitivity, specificity, and precision. Among the three models, RF has the highest AUC and ROC of 0.9722, higher than the NBC of 0.9088 and the KNN algorithm of 0.8656. RF has the highest accuracy of 0.920 among all models, slightly higher than the NBC of 0.824 and the KNN algorithm of 0.797 (<xref ref-type="fig" rid="F6">Figures 6</xref> and <xref ref-type="fig" rid="F7">7</xref>). Therefore, the RF algorithm was finally chosen to construct the model.</p>
<fig id="F6" position="float"><label>Figure 6.</label><caption><p>Feature selection and machine learning. (A) The initial screening of 15 genes by the &#x0201C;Feature Importance&#x0201D; algorithm and further screening of 10 key genes; (B) the NBC algorithm model; (C) the KNN algorithm model; (D) the RF algorithm model</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g006.tif"/></fig>
<fig id="F7" position="float"><label>Figure 7.</label><caption><p>Internal test set ROC curves. Red dashed line is the reference line, red solid is the NBC algorithm model, green is the KNN algorithm model, and blue is the RF algorithm model</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g007.tif"/></fig>
</sec>
<sec><title>External validation of the gastric cancer diagnostic model</title>
<p>The GSE54129 dataset was selected as an external validation dataset in the GEO database. The GSE54129 dataset was based on the GPL570 platform, which contained the gene expression information of 111 gastric cancer patients and 21 non-gastric cancer patients. As is shown in <xref ref-type="fig" rid="F8">Figure 8</xref>, the area of the ROC curve (AUC) for the validation sets was 0.9144. As a result, 0.9144 greater than 0.70 indicates that the model has good predictability.</p>
<fig id="F8" position="float"><label>Figure 8.</label><caption><p>External validation set ROC curve. The red dashed line is the reference line, and the blue is the ROC curve</p></caption><graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="100194-g008.tif"/></fig>
</sec>
</sec>
<sec id="s4"><title>Discussion</title>
<p>This study downloaded the transcriptomic and genomic data of 372 gastric cancer patients and 625 paired non-gastric cancer patient samples. All patients&#x02019; clinical information was included in the study through the TCGA database. This study first analyzed the data quality using R software, and the results showed that the data quality was good, indicating that the data of this study were reliable. Subsequently, the mRNA and lncRNA differentially expressed gastric cancer genes were screened out, and the core differentially expressed genes were initially screened out after the related pathway analysis. By using the &#x0201C;Feature Importance&#x0201D; algorithm, 10 key genes related to gastric cancer were finally screened: <italic>LINC01821</italic>, <italic>AL138826.1</italic>, <italic>AC022164.1</italic>, <italic>ADGRD1-AS1</italic>, <italic>CCNB1</italic>, <italic>KIF11</italic>, <italic>AURKB</italic>, <italic>CDK1</italic>, <italic>NUSAP1</italic>, and <italic>TTK</italic>. It was found that the accuracy of the model built by the RF algorithm was 92&#x00025; with an AUC of 0.897, which was more suitable for building a diagnostic model of gastric cancer. Then external validation was performed by data in the database, and the AUC of the model was found to be 0.9144 through validation, indicating that the gastric cancer diagnostic model has high accuracy and is expected to be an early diagnosis model for gastric cancer.</p>
<p>The research screened 10 key genes respectively from 947 differentially mRNAs and 66 differentially expressed lncRNAs. <italic>CCNB1</italic> is one of the major members of the cell cycle protein B family and has an important role in the G2/M transition phase of the eukaryotic cells &#x0005B;<xref ref-type="bibr" rid="B27">27</xref>&#x0005D;. Yasuda et al. &#x0005B;<xref ref-type="bibr" rid="B28">28</xref>&#x0005D; showed that overexpression of <italic>CCNB1</italic> mainly occurred in the early stages of gastric malignancies; Gao et al. &#x0005B;<xref ref-type="bibr" rid="B29">29</xref>&#x0005D; found that down-regulation of <italic>CCNB1</italic> expression could help cordycepin-induced cell arrest at the G2/M phase, and high expression of <italic>CCNB1</italic> could be one of the diagnostic genes for gastric cancer. <italic>KIF11</italic> is a member of the kinesin family that affects the formation of spindle bipolarity &#x0005B;<xref ref-type="bibr" rid="B29">29</xref>, <xref ref-type="bibr" rid="B30">30</xref>&#x0005D;, causes chromosomal instability, leads to abnormal cell division and proliferation, promotes tumor formation, and plays a role in gastric cancer progression &#x0005B;<xref ref-type="bibr" rid="B31">31</xref>, <xref ref-type="bibr" rid="B32">32</xref>&#x0005D;. <italic>AURKB</italic> is a member of the aurora kinase family, which plays a role in the assembly of the two-level spindle, maintains normal mitosis, and regulates stem cell self-renewal, reprogramming, and differentiation &#x0005B;<xref ref-type="bibr" rid="B33">33</xref>, <xref ref-type="bibr" rid="B34">34</xref>&#x0005D;. In recent years, <italic>AURKB</italic> has been highly expressed in breast cancer, bladder cancer, gastric cancer, and other tumors &#x0005B;<xref ref-type="bibr" rid="B35">35</xref>&#x02013;<xref ref-type="bibr" rid="B37">37</xref>&#x0005D;. Nie et al. &#x0005B;<xref ref-type="bibr" rid="B38">38</xref>&#x0005D; found that <italic>AURKB</italic> may promote gastric tumorigenesis through epigenetic activation of <italic>CCND1</italic> expression. <italic>NUSAP1</italic> is a microtubule-associated protein that plays an important role in cell division and chromosome segregation &#x0005B;<xref ref-type="bibr" rid="B39">39</xref>&#x02013;<xref ref-type="bibr" rid="B41">41</xref>&#x0005D;. <italic>NUSAP1</italic> can also be used as a molecular marker for prostate cancer, promoting cell proliferation and migration &#x0005B;<xref ref-type="bibr" rid="B42">42</xref>, <xref ref-type="bibr" rid="B43">43</xref>&#x0005D;. Recently, it has been shown that <italic>NUSAP1</italic> is highly expressed in gastric cancer cell lines and tissues and promotes malignant proliferation and invasion of gastric cancer cells &#x0005B;<xref ref-type="bibr" rid="B44">44</xref>&#x0005D;. Furthermore, the genes of <italic>LINC01821</italic>, <italic>AL138826.1</italic>, <italic>AC022164.1</italic>, <italic>ADGRD1-AS1</italic>, <italic>CDK1</italic>, and <italic>TTK</italic> in gastric cancer need to be further investigated.</p>
<p>Machine learning can summarize the patterns in a large amount of data information, explain the inner connection, and efficiently explore the value of data &#x0005B;<xref ref-type="bibr" rid="B45">45</xref>, <xref ref-type="bibr" rid="B46">46</xref>&#x0005D;. The biggest advantage of this study is using an MLA, which reduces manual repetitive work, makes the efficiency of detection greatly improved, and accomplishes complex computational work that is impossible to be done manually by computer. It has been used to predict tumors by learning from gene expression data, such as Leng et al. &#x0005B;<xref ref-type="bibr" rid="B47">47</xref>&#x0005D; collected a large amount of data, including 474 lung adenocarcinoma samples and 491 lung squamous carcinoma samples, and learned 1,099 differentially expressed mRNA data by the extreme gradient boosting (XGBoost) algorithm to predict lung cancer subtypes, lung squamous cell carcinoma and lung adenocarcinoma. The XGBoost algorithm showed high predictive power in this study, outperforming the logistic regression algorithm and supporting vector machine algorithm for lung early diagnosis and treatment of squamous and lung adenocarcinoma. Yang et al. &#x0005B;<xref ref-type="bibr" rid="B48">48</xref>&#x0005D; selected the most important DNA methylation features as a model using RF, and the authors construct a support vector machine classifier for hepatocellular carcinoma diagnosis. Tian et al. &#x0005B;<xref ref-type="bibr" rid="B49">49</xref>&#x0005D; proposed that the normal gastric cell and its cancer counterpart can be distinguished by multiple cellular mechanical phenotypes (CMPs) based on MLA. More accurate prognostic biomarkers can be obtained through MLA, which provides a new method to verify the prognosis of cancer.</p>
<p>The advantages of the model created in this study: 1. the AUC of the model is greater than 0.9 in both the internal validation group and the external validation group, indicating that the model has high accuracy; 2. the model includes not only coding RNA but also ncRNA (lncRNA), which further improves the accuracy of the model and avoids the bias caused by a single gene; 3. the potential for clinical translation application: for those who are financially well-off and unwilling to receive invasive examinations, after obtaining the whole genome sequencing through blood samples, the core gene model can be used to assist in the diagnosis of gastric cancer, which is expected to avoid invasive examinations such as gastroscopic biopsies for the negative population. However, the model still has shortcomings: 1. since the original purpose of our study was to establish a model for diagnosis of gastric cancer, this study did not select data from gastric cancer tissues and paracancerous tissues for validation but used gastric mucosal tissues from non-gastric cancer patients as controls, considering that the sampling of the paracancerous tissue specimens is not standardized in the clinical practice. The definition of paracancerous tissues (length from cancerous tissues, etc.) is not uniform. This avoided the bias caused by sampling and increased our difficulty in finding external validation data sets, resulting in a sample size that was not ideal. In addition, in the external validation, the final selection of the GSE54129 dataset by the microarray resulted in the lack of 2 lncRNAs (<italic>AL138826.1</italic> and <italic>AC022164.1</italic>) among the 10 core genes due to the insufficient number of lncRNA annotations, but this did not have a significant impact on this model, and the AUC of the model remained at 0.9144. 2. In the study, the number of available samples was limited. It would be better to collect more samples to strengthen the machine learning ability and improve the model&#x02019;s accuracy. 3. The machine learning process used sample tissues of lung cancer as a control group because there were not enough normal sample tissues, which may also bias the study results. 4. Since the TCGA database did not provide us with the population source, there are differences between databases in terms of ethnicity, region, nationality, and disease characteristics, which pose a challenge to the generalizability of this finding. 5. This study has not been designed to validate the fresh specimens from clinical patients, and there is no answer to whether the finding can be applied to the Chinese population. The next step will be to improve the design of the study so that the future research can better answer the above questions.</p>
</sec>
</body>
<back>
<glossary><title>Abbreviations</title>
<def-list>
<def-item><term><italic>ADGRD1-AS1</italic>:</term><def><p>adhesion G protein-coupled receptor D1-antisense RNA 1</p></def></def-item>
<def-item><term>AUC:</term><def><p>area under the curve</p></def></def-item>
<def-item><term><italic>AURKB</italic>:</term><def><p>Aurora kinase B</p></def></def-item>
<def-item><term><italic>CCNB1</italic>:</term><def><p>cyclin B1</p></def></def-item>
<def-item><term><italic>CDK1</italic>:</term><def><p>cyclin dependent kinase 1</p></def></def-item>
<def-item><term>DDEGs:</term><def><p>downregulated differential genes</p></def></def-item>
<def-item><term>ECM:</term><def><p>extracellular matrix</p></def></def-item>
<def-item><term>FC:</term><def><p>fold change</p></def></def-item>
<def-item><term>GEO:</term><def><p>Gene Expression Omnibus</p></def></def-item>
<def-item><term>GO:</term><def><p>gene ontology</p></def></def-item>
<def-item><term>KEGG:</term><def><p>Kyoto Encyclopedia of Genes and Genomes</p></def></def-item>
<def-item><term><italic>KIF11</italic>:</term><def><p>kinesin family member 11</p></def></def-item>
<def-item><term>KNN:</term><def><p><italic>k</italic>-nearest neighbor</p></def></def-item>
<def-item><term><italic>LINC01821</italic>:</term><def><p>long intergenic non-protein coding RNA 1821</p></def></def-item>
<def-item><term>lncRNA:</term><def><p>long non-coding RNA</p></def></def-item>
<def-item><term>MLAs:</term><def><p>machine learning algorithms</p></def></def-item>
<def-item><term>mRNA:</term><def><p>messenger RNA</p></def></def-item>
<def-item><term>NBC:</term><def><p>naive Bayesian classification</p></def></def-item>
<def-item><term>ncRNA:</term><def><p>non-coding RNA</p></def></def-item>
<def-item><term><italic>NUSAP1</italic>:</term><def><p>nucleolar and spindle associated protein 1</p></def></def-item>
<def-item><term>PPI:</term><def><p>protein-protein interaction</p></def></def-item>
<def-item><term>RF:</term><def><p>random forest</p></def></def-item>
<def-item><term>RNA-seq:</term><def><p>RNA sequencing</p></def></def-item>
<def-item><term>ROC:</term><def><p>receiver operating characteristic</p></def></def-item>
<def-item><term>TCGA:</term><def><p>The Cancer Genome Atlas</p></def></def-item>
<def-item><term><italic>TTK</italic>:</term><def><p>TTK protein kinase</p></def></def-item>
<def-item><term>UDEGs:</term><def><p>upregulated differential genes</p></def></def-item>
</def-list>
</glossary>
<sec id="s5"><title>Supplementary materials</title>
<p>The supplementary material for this article is available at: <ext-link ext-link-type="uri" xlink:href="https://www.explorationpub.com/uploads/Article/file/100194_sup_1.pdf">https://www.explorationpub.com/uploads/Article/file/100194_sup_1.pdf</ext-link>.</p>
</sec>
<sec id="s6"><title>Declarations</title>
<sec><title>Acknowledgments</title>
<p>We acknowledge TCGA and GEO databases for providing their platforms and contributors for uploading their meaningful datasets.</p>
</sec>
<sec><title>Author contributions</title>
<p>FK and WY conceived and designed this study; ZY constructed the machine learning and IT support. FK, NL, PW, and SF designed the statistical analysis and analyzed the data. FK, NL, and WY interpreted the results. FK wrote the first draft of the manuscript. FK, PW, WY, and NL contributed to the writing of the manuscript. All authors have read and confirmed the data in the manuscript. They can take responsibility for the data&#x02019;s integrity and the data analysis&#x02019;s accuracy and read and approved the submitted version.</p>
</sec>
<sec><title>Conflicts of interest</title>
<p>The authors declare that they have no conflict of interest.</p>
</sec>
<sec><title>Ethical approval</title>
<p>Not applicable.</p>
</sec>
<sec><title>Consent to participate</title>
<p>Not applicable.</p>
</sec>
<sec><title>Consent to publication</title>
<p>Not applicable.</p>
</sec>
<sec><title>Availability of data and materials</title>
<p>The transcriptomic data and clinical information of gastric cancer and lung cancer for this study can be found in the TCGA database and downloaded from the Genomic Data Commons (GDC) Data Portal (<ext-link ext-link-type="uri" xlink:href="https://portal.gdc.cancer.gov/">https://portal.gdc.cancer.gov/</ext-link>); the GSE54129 dataset for this study can be found in the GEO public database (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo">https://www.ncbi.nlm.nih.gov/geo</ext-link>).</p>
</sec>
<sec><title>Funding</title>
<p>Not applicable.</p>
</sec>
<sec><title>Copyright</title>
<p>&#x000A9; The Author(s) 2022.</p>
</sec>
</sec>
<ref-list><title>References</title>
<ref id="B1"><label>1.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Bray</surname><given-names>F</given-names></name><name><surname>Ferlay</surname><given-names>J</given-names></name><name><surname>Soerjomataram</surname><given-names>I</given-names></name><name><surname>Siegel</surname><given-names>RL</given-names></name><name><surname>Torre</surname><given-names>LA</given-names></name><name><surname>Jemal</surname><given-names>A.</given-names></name></person-group> <article-title>Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries</article-title>. <source>CA Cancer J Clin</source>. <year>2018</year>;<volume>68</volume>:<fpage>394</fpage>&#x02013;<lpage>424</lpage>. <pub-id pub-id-type="doi">10.3322/caac.21492</pub-id> <pub-id pub-id-type="pmid">30207593</pub-id></mixed-citation></ref>
<ref id="B2"><label>2.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname><given-names>W</given-names></name><name><surname>Zheng</surname><given-names>R</given-names></name><name><surname>Zhang</surname><given-names>S</given-names></name><name><surname>Zhao</surname><given-names>P</given-names></name><name><surname>Zeng</surname><given-names>H</given-names></name><name><surname>Zou</surname><given-names>X.</given-names></name></person-group> <article-title>Report of cancer incidence and mortality in China, 2010</article-title>. <source>Ann Transl Med</source>. <year>2014</year>;<volume>2</volume>:<fpage>61</fpage>. <pub-id pub-id-type="doi">10.3978/j.issn.2305-5839.2014.04.05</pub-id> <pub-id pub-id-type="pmid">25333036</pub-id> <pub-id pub-id-type="pmcid">PMC4202458</pub-id></mixed-citation></ref>
<ref id="B3"><label>3.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Machlowska</surname><given-names>J</given-names></name><name><surname>Baj</surname><given-names>J</given-names></name><name><surname>Sitarz</surname><given-names>M</given-names></name><name><surname>Maciejewski</surname><given-names>R</given-names></name><name><surname>Sitarz</surname><given-names>R.</given-names></name></person-group> <article-title>Gastric cancer: epidemiology, risk factors, classification, genomic characteristics and treatment strategies</article-title>. <source>Int J Mol Sci</source>. <year>2020</year>;<volume>21</volume>:<fpage>4012</fpage>. <pub-id pub-id-type="doi">10.3390/ijms21114012</pub-id> <pub-id pub-id-type="pmid">32512697</pub-id> <pub-id pub-id-type="pmcid">PMC7312039</pub-id></mixed-citation></ref>
<ref id="B4"><label>4.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Karimi</surname><given-names>P</given-names></name><name><surname>Islami</surname><given-names>F</given-names></name><name><surname>Anandasabapathy</surname><given-names>S</given-names></name><name><surname>Freedman</surname><given-names>ND</given-names></name><name><surname>Kamangar</surname><given-names>F.</given-names></name></person-group> <article-title>Gastric cancer: descriptive epidemiology, risk factors, screening, and prevention</article-title>. <source>Cancer Epidemiol Biomarkers Prev</source>. <year>2014</year>;<volume>23</volume>:<fpage>700</fpage>&#x02013;<lpage>13</lpage>. <pub-id pub-id-type="doi">10.1158/1055-9965.EPI-13-1057</pub-id> <pub-id pub-id-type="pmid">24618998</pub-id> <pub-id pub-id-type="pmcid">PMC4019373</pub-id></mixed-citation></ref>
<ref id="B5"><label>5.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kinami</surname><given-names>S</given-names></name><name><surname>Funaki</surname><given-names>H</given-names></name><name><surname>Fujita</surname><given-names>H</given-names></name><name><surname>Nakano</surname><given-names>Y</given-names></name><name><surname>Ueda</surname><given-names>N</given-names></name><name><surname>Kosaka</surname><given-names>T.</given-names></name></person-group> <article-title>Local resection of the stomach for gastric cancer</article-title>. <source>Surg Today</source>. <year>2017</year>;<volume>47</volume>:<fpage>651</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1007/s00595-016-1371-z</pub-id> <pub-id pub-id-type="pmid">27342746</pub-id> <pub-id pub-id-type="pmcid">PMC5406487</pub-id></mixed-citation></ref>
<ref id="B6"><label>6.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sun</surname><given-names>C</given-names></name><name><surname>Yuan</surname><given-names>Q</given-names></name><name><surname>Wu</surname><given-names>D</given-names></name><name><surname>Meng</surname><given-names>X</given-names></name><name><surname>Wang</surname><given-names>B.</given-names></name></person-group> <article-title>Identification of core genes and outcome in gastric cancer using bioinformatics analysis</article-title>. <source>Oncotarget</source>. <year>2017</year>;<volume>8</volume>:<fpage>70271</fpage>&#x02013;<lpage>80</lpage>. <pub-id pub-id-type="doi">10.18632/oncotarget.20082</pub-id> <pub-id pub-id-type="pmid">29050278</pub-id> <pub-id pub-id-type="pmcid">PMC5642553</pub-id></mixed-citation></ref>
<ref id="B7"><label>7.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Orditura</surname><given-names>M</given-names></name><name><surname>Galizia</surname><given-names>G</given-names></name><name><surname>Sforza</surname><given-names>V</given-names></name><name><surname>Gambardella</surname><given-names>V</given-names></name><name><surname>Fabozzi</surname><given-names>A</given-names></name><name><surname>Laterza</surname><given-names>MM</given-names></name><etal/></person-group> <article-title>Treatment of gastric cancer</article-title>. <source>World J Gastroenterol</source>. <year>2014</year>;<volume>20</volume>:<fpage>1635</fpage>&#x02013;<lpage>49</lpage>. <pub-id pub-id-type="doi">10.3748/wjg.v20.i7.1635</pub-id> <pub-id pub-id-type="pmid">24587643</pub-id> <pub-id pub-id-type="pmcid">PMC3930964</pub-id></mixed-citation></ref>
<ref id="B8"><label>8.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shimada</surname><given-names>H</given-names></name><name><surname>Noie</surname><given-names>T</given-names></name><name><surname>Ohashi</surname><given-names>M</given-names></name><name><surname>Oba</surname><given-names>K</given-names></name><name><surname>Takahashi</surname><given-names>Y.</given-names></name></person-group> <article-title>Clinical significance of serum tumor markers for gastric cancer: a systematic review of literature by the Task Force of the Japanese Gastric Cancer Association</article-title>. <source>Gastric Cancer</source>. <year>2014</year>;<volume>17</volume>:<fpage>26</fpage>&#x02013;<lpage>33</lpage>. <pub-id pub-id-type="doi">10.1007/s10120-013-0259-5</pub-id> <pub-id pub-id-type="pmid">23572188</pub-id></mixed-citation></ref>
<ref id="B9"><label>9.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Fitzgerald</surname><given-names>KA</given-names></name><name><surname>Caffrey</surname><given-names>DR.</given-names></name></person-group> <article-title>Long noncoding RNAs in innate and adaptive immunity</article-title>. <source>Curr Opin Immunol</source>. <year>2014</year>;<volume>26</volume>:<fpage>140</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1016/j.coi.2013.12.001</pub-id> <pub-id pub-id-type="pmid">24556411</pub-id> <pub-id pub-id-type="pmcid">PMC3932021</pub-id></mixed-citation></ref>
<ref id="B10"><label>10.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Slaby</surname><given-names>O</given-names></name><name><surname>Laga</surname><given-names>R</given-names></name><name><surname>Sedlacek</surname><given-names>O.</given-names></name></person-group> <article-title>Therapeutic targeting of non-coding RNAs in cancer</article-title>. <source>Biochem J</source>. <year>2017</year>;<volume>474</volume>:<fpage>4219</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1042/BCJ20170079</pub-id> <pub-id pub-id-type="pmid">29242381</pub-id></mixed-citation></ref>
<ref id="B11"><label>11.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Xie</surname><given-names>J</given-names></name><name><surname>Tan</surname><given-names>ZH</given-names></name><name><surname>Tang</surname><given-names>X</given-names></name><name><surname>Mo</surname><given-names>MS</given-names></name><name><surname>Liu</surname><given-names>YP</given-names></name><name><surname>Gan</surname><given-names>RL</given-names></name><etal/></person-group> <article-title>MiR-374b-5p suppresses RECK expression and promotes gastric cancer cell invasion and metastasis</article-title>. <source>World J Gastroenterol</source>. <year>2014</year>;<volume>20</volume>:<fpage>17439</fpage>&#x02013;<lpage>47</lpage>. <pub-id pub-id-type="doi">10.3748/wjg.v20.i46.17439</pub-id> <pub-id pub-id-type="pmid">25516656</pub-id> <pub-id pub-id-type="pmcid">PMC4265603</pub-id></mixed-citation></ref>
<ref id="B12"><label>12.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sigova</surname><given-names>AA</given-names></name><name><surname>Mullen</surname><given-names>AC</given-names></name><name><surname>Molinie</surname><given-names>B</given-names></name><name><surname>Gupta</surname><given-names>S</given-names></name><name><surname>Orlando</surname><given-names>DA</given-names></name><name><surname>Guenther</surname><given-names>MG</given-names></name><etal/></person-group> <article-title>Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells</article-title>. <source>Proc Natl Acad Sci U S A</source>. <year>2013</year>;<volume>110</volume>:<fpage>2876</fpage>&#x02013;<lpage>81</lpage>. <pub-id pub-id-type="doi">10.1073/pnas.1221904110</pub-id> <pub-id pub-id-type="pmid">23382218</pub-id> <pub-id pub-id-type="pmcid">PMC3581948</pub-id></mixed-citation></ref>
<ref id="B13"><label>13.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>St Laurent</surname><given-names>G</given-names></name><name><surname>Wahlestedt</surname><given-names>C</given-names></name><name><surname>Kapranov</surname><given-names>P.</given-names></name></person-group> <article-title>The landscape of long noncoding RNA classification</article-title>. <source>Trends Genet</source>. <year>2015</year>;<volume>31</volume>:<fpage>239</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1016/j.tig.2015.03.007</pub-id> <pub-id pub-id-type="pmid">25869999</pub-id> <pub-id pub-id-type="pmcid">PMC4417002</pub-id></mixed-citation></ref>
<ref id="B14"><label>14.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname><given-names>WX</given-names></name><name><surname>Koirala</surname><given-names>P</given-names></name><name><surname>Mo</surname><given-names>YY.</given-names></name></person-group> <article-title>LncRNA-mediated regulation of cell signaling in cancer</article-title>. <source>Oncogene</source>. <year>2017</year>;<volume>36</volume>:<fpage>5661</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1038/onc.2017.184</pub-id> <pub-id pub-id-type="pmid">28604750</pub-id> <pub-id pub-id-type="pmcid">PMC6450570</pub-id></mixed-citation></ref>
<ref id="B15"><label>15.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>He</surname><given-names>RZ</given-names></name><name><surname>Luo</surname><given-names>DX</given-names></name><name><surname>Mo</surname><given-names>YY.</given-names></name></person-group> <article-title>Emerging roles of lncRNAs in the post-transcriptional regulation in cancer</article-title>. <source>Genes Dis</source>. <year>2019</year>;<volume>6</volume>:<fpage>6</fpage>&#x02013;<lpage>15</lpage>. <pub-id pub-id-type="doi">10.1016/j.gendis.2019.01.003</pub-id> <pub-id pub-id-type="pmid">30906827</pub-id> <pub-id pub-id-type="pmcid">PMC6411652</pub-id></mixed-citation></ref>
<ref id="B16"><label>16.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Jordan</surname><given-names>MI</given-names></name><name><surname>Mitchell</surname><given-names>TM.</given-names></name></person-group> <article-title>Machine learning: trends, perspectives, and prospects</article-title>. <source>Science</source>. <year>2015</year>;<volume>349</volume>:<fpage>255</fpage>&#x02013;<lpage>60</lpage>. <pub-id pub-id-type="doi">10.1126/science.aaa8415</pub-id> <pub-id pub-id-type="pmid">26185243</pub-id></mixed-citation></ref>
<ref id="B17"><label>17.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Obermeyer</surname><given-names>Z</given-names></name><name><surname>Emanuel</surname><given-names>EJ.</given-names></name></person-group> <article-title>Predicting the future&#x02014;big data, machine learning, and clinical medicine</article-title>. <source>N Engl J Med</source>. <year>2016</year>;<volume>375</volume>:<fpage>1216</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1056/NEJMp1606181</pub-id> <pub-id pub-id-type="pmid">27682033</pub-id> <pub-id pub-id-type="pmcid">PMC5070532</pub-id></mixed-citation></ref>
<ref id="B18"><label>18.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Kulasingam</surname><given-names>V</given-names></name><name><surname>Diamandis</surname><given-names>EP.</given-names></name></person-group> <article-title>Strategies for discovering novel cancer biomarkers through utilization of emerging technologies</article-title>. <source>Nat Clin Pract Oncol</source>. <year>2008</year>;<volume>5</volume>:<fpage>588</fpage>&#x02013;<lpage>99</lpage>. <pub-id pub-id-type="doi">10.1038/ncponc1187</pub-id> <pub-id pub-id-type="pmid">18695711</pub-id></mixed-citation></ref>
<ref id="B19"><label>19.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Huang da</surname><given-names>W</given-names></name><name><surname>Sherman</surname><given-names>BT</given-names></name><name><surname>Lempicki</surname><given-names>RA.</given-names></name></person-group> <article-title>Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources</article-title>. <source>Nat Protoc</source>. <year>2009</year>;<volume>4</volume>:<fpage>44</fpage>&#x02013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2008.211</pub-id> <pub-id pub-id-type="pmid">19131956</pub-id></mixed-citation></ref>
<ref id="B20"><label>20.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Szklarczyk</surname><given-names>D</given-names></name><name><surname>Franceschini</surname><given-names>A</given-names></name><name><surname>Wyder</surname><given-names>S</given-names></name><name><surname>Forslund</surname><given-names>K</given-names></name><name><surname>Heller</surname><given-names>D</given-names></name><name><surname>Huerta-Cepas</surname><given-names>J</given-names></name><etal/></person-group> <article-title>STRING v10: protein-protein interaction networks, integrated over the tree of life</article-title>. <source>Nucleic Acids Res</source>. <year>2015</year>;<volume>43</volume>:<fpage>D447</fpage>&#x02013;<lpage>52</lpage>. <pub-id pub-id-type="doi">10.1093/nar/gku1003</pub-id> <pub-id pub-id-type="pmid">25352553</pub-id> <pub-id pub-id-type="pmcid">PMC4383874</pub-id></mixed-citation></ref>
<ref id="B21"><label>21.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Shannon</surname><given-names>P</given-names></name><name><surname>Markiel</surname><given-names>A</given-names></name><name><surname>Ozier</surname><given-names>O</given-names></name><name><surname>Baliga</surname><given-names>NS</given-names></name><name><surname>Wang</surname><given-names>JT</given-names></name><name><surname>Ramage</surname><given-names>D</given-names></name><etal/></person-group> <article-title>Cytoscape: a software environment for integrated models of biomolecular interaction networks</article-title>. <source>Genome Res</source>. <year>2003</year>;<volume>13</volume>:<fpage>2498</fpage>&#x02013;<lpage>504</lpage>. <pub-id pub-id-type="doi">10.1101/gr.1239303</pub-id> <pub-id pub-id-type="pmid">14597658</pub-id> <pub-id pub-id-type="pmcid">PMC403769</pub-id></mixed-citation></ref>
<ref id="B22"><label>22.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Liaw</surname><given-names>A</given-names></name><name><surname>Wiener</surname><given-names>M.</given-names></name></person-group> <article-title>Classification and regression by randomForest</article-title>. <source>R News</source>. <year>2002</year>;<volume>2</volume>:<fpage>18</fpage>&#x02013;<lpage>22</lpage>.</mixed-citation></ref>
<ref id="B23"><label>23.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Biau</surname><given-names>G</given-names></name><name><surname>Scornet</surname><given-names>E.</given-names></name></person-group> <article-title>A random forest guided tour</article-title>. <source>TEST</source>. <year>2016</year>;<volume>25</volume>:<fpage>197</fpage>&#x02013;<lpage>227</lpage>. <pub-id pub-id-type="doi">10.1007/s11749-016-0481-7</pub-id></mixed-citation></ref>
<ref id="B24"><label>24.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Leung</surname><given-names>KM.</given-names></name></person-group> <article-title>Naive bayesian classifier</article-title>. <source>Polytechnic University Department of Computer Science/Finance and Risk Engineering</source>. <year>2007 Nov</year> [cited 2022 Apr 14]. Available from: <ext-link ext-link-type="uri" xlink:href="https://cse.engineering.nyu.edu/~mleung/FRE7851/f07/naiveBayesianClassifier.pdf">https://cse.engineering.nyu.edu/~mleung/FRE7851/f07/naiveBayesianClassifier.pdf</ext-link></mixed-citation></ref>
<ref id="B25"><label>25.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Peterson</surname><given-names>LE.</given-names></name></person-group> <article-title><italic>K</italic>-nearest neighbor</article-title>. <source>Scholarpedia</source>. <year>2009</year>;<volume>4</volume>:<fpage>1883</fpage>. <pub-id pub-id-type="doi">10.4249/scholarpedia.1883</pub-id></mixed-citation></ref>
<ref id="B26"><label>26.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Laaksonen</surname><given-names>J</given-names></name><name><surname>Oja</surname><given-names>E.</given-names></name></person-group> <article-title>Classification with learning <italic>k</italic>-nearest neighbors</article-title>. <source>Proceedings of International Conference on Neural Networks (ICNN&#x02019;96)</source>. <year>1996</year>;<volume>3</volume>:<fpage>1480</fpage>&#x02013;<lpage>3</lpage>. <pub-id pub-id-type="doi">10.1109/ICNN.1996.549118</pub-id></mixed-citation></ref>
<ref id="B27"><label>27.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Wiman</surname><given-names>KG</given-names></name><name><surname>Zhivotovsky</surname><given-names>B.</given-names></name></person-group> <article-title>Understanding cell cycle and cell death regulation provides novel weapons against human diseases</article-title>. <source>J Intern Med</source>. <year>2017</year>;<volume>281</volume>:<fpage>483</fpage>&#x02013;<lpage>95</lpage>. <pub-id pub-id-type="doi">10.1111/joim.12609</pub-id> <pub-id pub-id-type="pmid">28374555</pub-id></mixed-citation></ref>
<ref id="B28"><label>28.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yasuda</surname><given-names>M</given-names></name><name><surname>Takesue</surname><given-names>F</given-names></name><name><surname>Inutsuka</surname><given-names>S</given-names></name><name><surname>Honda</surname><given-names>M</given-names></name><name><surname>Nozoe</surname><given-names>T</given-names></name><name><surname>Korenaga</surname><given-names>D.</given-names></name></person-group> <article-title>Overexpression of cyclin B1 in gastric cancer and its clinicopathological significance: an immunohistological study</article-title>. <source>J Cancer Res Clin Oncol</source>. <year>2002</year>;<volume>128</volume>:<fpage>412</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1007/s00432-002-0359-9</pub-id> <pub-id pub-id-type="pmid">12200597</pub-id></mixed-citation></ref>
<ref id="B29"><label>29.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gao</surname><given-names>SY</given-names></name><name><surname>Li</surname><given-names>J</given-names></name><name><surname>Qu</surname><given-names>XY</given-names></name><name><surname>Zhu</surname><given-names>N</given-names></name><name><surname>Ji</surname><given-names>YB.</given-names></name></person-group> <article-title>Downregulation of Cdk1 and cyclinB1 expression contributes to oridonin-induced cell cycle arrest at G2/M phase and growth inhibition in SGC-7901 gastric cancer cells</article-title>. <source>Asian Pac J Cancer Prev</source>. <year>2014</year>;<volume>15</volume>:<fpage>6437</fpage>&#x02013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.7314/APJCP.2014.15.15.6437</pub-id> <pub-id pub-id-type="pmid">25124639</pub-id></mixed-citation></ref>
<ref id="B30"><label>30.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Hata</surname><given-names>S</given-names></name><name><surname>Pastor Peidro</surname><given-names>A</given-names></name><name><surname>Panic</surname><given-names>M</given-names></name><name><surname>Liu</surname><given-names>P</given-names></name><name><surname>Atorino</surname><given-names>E</given-names></name><name><surname>Funaya</surname><given-names>C</given-names></name><etal/></person-group> <article-title>The balance between KIFC3 and EG5 tetrameric kinesins controls the onset of mitotic spindle assembly</article-title>. <source>Nat Cell Biol</source>. <year>2019</year>;<volume>21</volume>:<fpage>1138</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1038/s41556-019-0382-6</pub-id> <pub-id pub-id-type="pmid">31481795</pub-id></mixed-citation></ref>
<ref id="B31"><label>31.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Oue</surname><given-names>N</given-names></name><name><surname>Sentani</surname><given-names>K</given-names></name><name><surname>Sakamoto</surname><given-names>N</given-names></name><name><surname>Uraoka</surname><given-names>N</given-names></name><name><surname>Yasui</surname><given-names>W.</given-names></name></person-group> <article-title>Molecular carcinogenesis of gastric cancer: Lauren classification, mucin phenotype expression, and cancer stem cells</article-title>. <source>Int J Clin Oncol</source>. <year>2019</year>;<volume>24</volume>:<fpage>771</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1007/s10147-019-01443-9</pub-id> <pub-id pub-id-type="pmid">30980196</pub-id></mixed-citation></ref>
<ref id="B32"><label>32.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Imai</surname><given-names>T</given-names></name><name><surname>Oue</surname><given-names>N</given-names></name><name><surname>Sentani</surname><given-names>K</given-names></name><name><surname>Sakamoto</surname><given-names>N</given-names></name><name><surname>Uraoka</surname><given-names>N</given-names></name><name><surname>Egi</surname><given-names>H</given-names></name><etal/></person-group> <article-title>KIF11 is required for spheroid formation by oesophageal and colorectal cancer cells</article-title>. <source>Anticancer Res</source>. <year>2017</year>;<volume>37</volume>:<fpage>47</fpage>&#x02013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.21873/anticanres.11287</pub-id> <pub-id pub-id-type="pmid">28011472</pub-id></mixed-citation></ref>
<ref id="B33"><label>33.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Dar</surname><given-names>AA</given-names></name><name><surname>Belkhiri</surname><given-names>A</given-names></name><name><surname>Ecsedy</surname><given-names>J</given-names></name><name><surname>Zaika</surname><given-names>A</given-names></name><name><surname>El-Rifai</surname><given-names>W.</given-names></name></person-group> <article-title>Aurora kinase A inhibition leads to p73-dependent apoptosis in p53-deficient cancer cells</article-title>. <source>Cancer Res</source>. <year>2008</year>;<volume>68</volume>:<fpage>8998</fpage>&#x02013;<lpage>9004</lpage>. <pub-id pub-id-type="doi">10.1158/0008-5472.CAN-08-2658</pub-id> <pub-id pub-id-type="pmid">18974145</pub-id> <pub-id pub-id-type="pmcid">PMC2587495</pub-id></mixed-citation></ref>
<ref id="B34"><label>34.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sehdev</surname><given-names>V</given-names></name><name><surname>Katsha</surname><given-names>A</given-names></name><name><surname>Ecsedy</surname><given-names>J</given-names></name><name><surname>Zaika</surname><given-names>A</given-names></name><name><surname>Belkhiri</surname><given-names>A</given-names></name><name><surname>El-Rifai</surname><given-names>W.</given-names></name></person-group> <article-title>The combination of alisertib, an investigational Aurora kinase A inhibitor, and docetaxel promotes cell death and reduces tumor growth in preclinical cell models of upper gastrointestinal adenocarcinomas</article-title>. <source>Cancer</source>. <year>2013</year>;<volume>119</volume>:<fpage>904</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1002/cncr.27801</pub-id> <pub-id pub-id-type="pmid">22972611</pub-id> <pub-id pub-id-type="pmcid">PMC3524359</pub-id></mixed-citation></ref>
<ref id="B35"><label>35.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Katsha</surname><given-names>A</given-names></name><name><surname>Arras</surname><given-names>J</given-names></name><name><surname>Soutto</surname><given-names>M</given-names></name><name><surname>Belkhiri</surname><given-names>A</given-names></name><name><surname>El-Rifai</surname><given-names>W.</given-names></name></person-group> <article-title>AURKA regulates JAK2-STAT3 activity in human gastric and esophageal cancers</article-title>. <source>Mol Oncol</source>. <year>2014</year>;<volume>8</volume>:<fpage>1419</fpage>&#x02013;<lpage>28</lpage>. <pub-id pub-id-type="doi">10.1016/j.molonc.2014.05.012</pub-id> <pub-id pub-id-type="pmid">24953013</pub-id> <pub-id pub-id-type="pmcid">PMC4254172</pub-id></mixed-citation></ref>
<ref id="B36"><label>36.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Katayama</surname><given-names>H</given-names></name><name><surname>Wang</surname><given-names>J</given-names></name><name><surname>Treekitkarnmongkol</surname><given-names>W</given-names></name><name><surname>Kawai</surname><given-names>H</given-names></name><name><surname>Sasai</surname><given-names>K</given-names></name><name><surname>Zhang</surname><given-names>H</given-names></name><etal/></person-group> <article-title>Aurora kinase-A inactivates DNA damage-induced apoptosis and spindle assembly checkpoint response functions of p73</article-title>. <source>Cancer Cell</source>. <year>2012</year>;<volume>21</volume>:<fpage>196</fpage>&#x02013;<lpage>211</lpage>. <pub-id pub-id-type="doi">10.1016/j.ccr.2011.12.025</pub-id> <pub-id pub-id-type="pmid">22340593</pub-id> <pub-id pub-id-type="pmcid">PMC3760020</pub-id></mixed-citation></ref>
<ref id="B37"><label>37.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Sehdev</surname><given-names>V</given-names></name><name><surname>Peng</surname><given-names>D</given-names></name><name><surname>Soutto</surname><given-names>M</given-names></name><name><surname>Washington</surname><given-names>MK</given-names></name><name><surname>Revetta</surname><given-names>F</given-names></name><name><surname>Ecsedy</surname><given-names>J</given-names></name><etal/></person-group> <article-title>The aurora kinase A inhibitor MLN8237 enhances cisplatin-induced cell death in esophageal adenocarcinoma cells</article-title>. <source>Mol Cancer Ther</source>. <year>2012</year>;<volume>11</volume>:<fpage>763</fpage>&#x02013;<lpage>74</lpage>. <pub-id pub-id-type="doi">10.1158/1535-7163.MCT-11-0623</pub-id> <pub-id pub-id-type="pmid">22302096</pub-id> <pub-id pub-id-type="pmcid">PMC3297687</pub-id></mixed-citation></ref>
<ref id="B38"><label>38.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Nie</surname><given-names>M</given-names></name><name><surname>Wang</surname><given-names>Y</given-names></name><name><surname>Yu</surname><given-names>Z</given-names></name><name><surname>Li</surname><given-names>X</given-names></name><name><surname>Deng</surname><given-names>Y</given-names></name><name><surname>Wang</surname><given-names>Y</given-names></name><etal/></person-group> <article-title>AURKB promotes gastric cancer progression via activation of <italic>CCND1</italic> expression</article-title>. <source>Aging (Albany NY)</source>. <year>2020</year>;<volume>12</volume>:<fpage>1304</fpage>&#x02013;<lpage>21</lpage>. <pub-id pub-id-type="doi">10.18632/aging.102684</pub-id> <pub-id pub-id-type="pmid">31982864</pub-id> <pub-id pub-id-type="pmcid">PMC7053608</pub-id></mixed-citation></ref>
<ref id="B39"><label>39.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Raemaekers</surname><given-names>T</given-names></name><name><surname>Ribbeck</surname><given-names>K</given-names></name><name><surname>Beaudouin</surname><given-names>J</given-names></name><name><surname>Annaert</surname><given-names>W</given-names></name><name><surname>Van Camp</surname><given-names>M</given-names></name><name><surname>Stockmans</surname><given-names>I</given-names></name><etal/></person-group> <article-title>NuSAP, a novel microtubule-associated protein involved in mitotic spindle organization</article-title>. <source>J Cell Biol</source>. <year>2003</year>;<volume>162</volume>:<fpage>1017</fpage>&#x02013;<lpage>29</lpage>. <pub-id pub-id-type="doi">10.1083/jcb.200302129</pub-id> <pub-id pub-id-type="pmid">12963707</pub-id> <pub-id pub-id-type="pmcid">PMC2172854</pub-id></mixed-citation></ref>
<ref id="B40"><label>40.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ribbeck</surname><given-names>K</given-names></name><name><surname>Raemaekers</surname><given-names>T</given-names></name><name><surname>Carmeliet</surname><given-names>G</given-names></name><name><surname>Mattaj</surname><given-names>IW.</given-names></name></person-group> <article-title>A role for NuSAP in linking microtubules to mitotic chromosomes</article-title>. <source>Curr Biol</source>. <year>2007</year>;<volume>17</volume>:<fpage>230</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1016/j.cub.2006.11.050</pub-id> <pub-id pub-id-type="pmid">17276916</pub-id></mixed-citation></ref>
<ref id="B41"><label>41.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Vanden Bosch</surname><given-names>A</given-names></name><name><surname>Raemaekers</surname><given-names>T</given-names></name><name><surname>Denayer</surname><given-names>S</given-names></name><name><surname>Torrekens</surname><given-names>S</given-names></name><name><surname>Smets</surname><given-names>N</given-names></name><name><surname>Moermans</surname><given-names>K</given-names></name><etal/></person-group> <article-title>NuSAP is essential for chromatin-induced spindle formation during early embryogenesis</article-title>. <source>J Cell Sci</source>. <year>2010</year>;<volume>123</volume>:<fpage>3244</fpage>&#x02013;<lpage>55</lpage>. <pub-id pub-id-type="doi">10.1242/jcs.063875</pub-id> <pub-id pub-id-type="pmid">20807801</pub-id></mixed-citation></ref>
<ref id="B42"><label>42.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gordon</surname><given-names>CA</given-names></name><name><surname>Gulzar</surname><given-names>ZG</given-names></name><name><surname>Brooks</surname><given-names>JD.</given-names></name></person-group> <article-title><italic>NUSAP1</italic> expression is upregulated by loss of RB1 in prostate cancer cells</article-title>. <source>Prostate</source>. <year>2015</year>;<volume>75</volume>:<fpage>517</fpage>&#x02013;<lpage>26</lpage>. <pub-id pub-id-type="doi">10.1002/pros.22938</pub-id> <pub-id pub-id-type="pmid">25585568</pub-id></mixed-citation></ref>
<ref id="B43"><label>43.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Gulzar</surname><given-names>ZG</given-names></name><name><surname>McKenney</surname><given-names>JK</given-names></name><name><surname>Brooks</surname><given-names>JD.</given-names></name></person-group> <article-title>Increased expression of <italic>NuSAP</italic> in recurrent prostate cancer is mediated by <italic>E2F1</italic></article-title>. <source>Oncogene</source>. <year>2013</year>;<volume>32</volume>:<fpage>70</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1038/onc.2012.27</pub-id> <pub-id pub-id-type="pmid">22349817</pub-id> <pub-id pub-id-type="pmcid">PMC3360134</pub-id></mixed-citation></ref>
<ref id="B44"><label>44.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Ge</surname><given-names>Y</given-names></name><name><surname>Li</surname><given-names>Q</given-names></name><name><surname>Lin</surname><given-names>L</given-names></name><name><surname>Jiang</surname><given-names>M</given-names></name><name><surname>Shi</surname><given-names>L</given-names></name><name><surname>Wang</surname><given-names>B</given-names></name><etal/></person-group> <article-title>Downregulation of NUSAP1 suppresses cell proliferation, migration, and invasion via inhibiting mTORC1 signalling pathway in gastric cancer</article-title>. <source>Cell Biochem Funct</source>. <year>2020</year>;<volume>38</volume>:<fpage>28</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1002/cbf.3444</pub-id> <pub-id pub-id-type="pmid">31710389</pub-id></mixed-citation></ref>
<ref id="B45"><label>45.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Deo</surname><given-names>RC</given-names></name></person-group>. <article-title>Machine learning in medicine</article-title>. <source>Circulation</source>. <year>2015</year>;<volume>132</volume>:<fpage>1920</fpage>&#x02013;<lpage>30</lpage>. <pub-id pub-id-type="doi">10.1161/CIRCULATIONAHA.115.001593</pub-id> <pub-id pub-id-type="pmid">26572668</pub-id> <pub-id pub-id-type="pmcid">PMC5831252</pub-id></mixed-citation></ref>
<ref id="B46"><label>46.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Keane</surname><given-names>PA</given-names></name><name><surname>Topol</surname><given-names>EJ.</given-names></name></person-group> <article-title>With an eye to AI and autonomous diagnosis</article-title>. <source>NPJ Digit Med</source>. <year>2018</year>;<volume>1</volume>:<fpage>40</fpage>. <pub-id pub-id-type="doi">10.1038/s41746-018-0048-y</pub-id> <pub-id pub-id-type="pmid">31304321</pub-id> <pub-id pub-id-type="pmcid">PMC6550235</pub-id></mixed-citation></ref>
<ref id="B47"><label>47.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Leng</surname><given-names>F</given-names></name><name><surname>Li</surname><given-names>W.</given-names></name></person-group> <article-title>Classification prediction of lung squamous cell carcinoma and lung adenocarcinoma based on XGBoost</article-title>. <source>J Cap Med Univ</source>. <year>2019</year>;<volume>40</volume>:<fpage>889</fpage>&#x02013;<lpage>93</lpage>.</mixed-citation></ref>
<ref id="B48"><label>48.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname><given-names>Z</given-names></name><name><surname>Jin</surname><given-names>M</given-names></name><name><surname>Zhang</surname><given-names>Z</given-names></name><name><surname>Lu</surname><given-names>J</given-names></name><name><surname>Hao</surname><given-names>K.</given-names></name></person-group> <article-title>Classification based on feature extraction for hepatocellular carcinoma diagnosis using high-throughput DNA methylation sequencing data</article-title>. <source>Procedia Comput Sci</source>. <year>2017</year>;<volume>107</volume>:<fpage>412</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1016/j.procs.2017.03.130</pub-id></mixed-citation></ref>
<ref id="B49"><label>49.</label><mixed-citation publication-type="journal"><person-group person-group-type="author"><name><surname>Tian</surname><given-names>Y</given-names></name><name><surname>Lin</surname><given-names>W</given-names></name><name><surname>Qu</surname><given-names>K</given-names></name><name><surname>Wang</surname><given-names>Z</given-names></name><name><surname>Zhu</surname><given-names>X.</given-names></name></person-group> <article-title>Insights into cell classification based on combination of multiple cellular mechanical phenotypes by using machine learning algorithm</article-title>. <source>J Mech Behav Biomed Mater</source>. <year>2022</year>;<volume>128</volume>:<fpage>105097</fpage>. <pub-id pub-id-type="doi">10.1016/j.jmbbm.2022.105097</pub-id> <pub-id pub-id-type="pmid">35151180</pub-id></mixed-citation></ref>
</ref-list>
</back>
</article>