﻿<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Explor Immunol</journal-id>
<journal-id journal-id-type="publisher-id">EI</journal-id>
<journal-title-group>
<journal-title>Exploration of Immunology</journal-title>
</journal-title-group>
<issn pub-type="epub">2768-6655</issn>
<publisher>
<publisher-name>Open Exploration Publishing</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.37349/ei.2025.1003215</article-id>
<article-id pub-id-type="manuscript">1003215</article-id>
<article-categories>
<subj-group>
<subject>Original Article</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>Forecast of cytotoxic T lymphocyte epitope using sequence weighting and artificial neural network based on EasyPred modeler</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-3759-6528</contrib-id>
<name>
<surname>Singh</surname>
<given-names>Satarudra Prakash</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/">Conceptualization</role>
<role content-type="https://credit.niso.org/contributor-roles/methodology/">Methodology</role>
<role content-type="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing—review &amp; editing</role>
<xref ref-type="aff" rid="I1">
<sup>1</sup>
</xref>
<xref ref-type="corresp" rid="cor1">
<sup>*</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Singh</surname>
<given-names>Garima</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/data-curation/">Data curation</role>
<role content-type="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
<xref ref-type="aff" rid="I2">
<sup>2</sup>
</xref>
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-3191-406X</contrib-id>
<name>
<surname>Mishra</surname>
<given-names>Bhartendu Nath</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/supervision/">Supervision</role>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing—review &amp; editing</role>
<xref ref-type="aff" rid="I3">
<sup>3</sup>
</xref>
</contrib>
<contrib contrib-type="editor">
<name>
<surname>Gong</surname>
<given-names>Wenping</given-names>
</name>
<role>Academic Editor</role>
<aff>The Eighth Medical Center of PLA General Hospital, China</aff>
</contrib>
</contrib-group>
<aff id="I1">
<sup>1</sup>Department of Biotechnology, Mahatma Gandhi Central University, Motihari 845401, Bihar, India</aff>
<aff id="I2">
<sup>2</sup>Amity Institute of Biotechnology, Lucknow Campus, Amity University, Lucknow 226028, Uttar Pradesh, India</aff>
<aff id="I3">
<sup>3</sup>Institute of Engineering and Technology, Dr. A.P.J. Abdul Kalam Technical University, Lucknow 226021, Uttar Pradesh, India</aff>
<author-notes>
<corresp id="cor1">
<bold>
<sup>*</sup>Correspondence:</bold> Satarudra Prakash Singh, Department of Biotechnology, Mahatma Gandhi Central University, Motihari 845401, Bihar, India. <email>sprakashsingh@mgcub.ac.in</email></corresp>
</author-notes>
<pub-date pub-type="collection">
<year>2025</year>
</pub-date>
<pub-date pub-type="epub">
<day>09</day>
<month>09</month>
<year>2025</year>
</pub-date>
<volume>5</volume>
<elocation-id>1003215</elocation-id>
<history>
<date date-type="received">
<day>27</day>
<month>03</month>
<year>2025</year>
</date>
<date date-type="accepted">
<day>13</day>
<month>07</month>
<year>2025</year>
</date>
</history>
<permissions>
<copyright-statement>© The Author(s) 2025.</copyright-statement>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This is an Open Access article licensed under a Creative Commons Attribution 4.0 International License (<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link>), which permits unrestricted use, sharing, adaptation, distribution and reproduction in any medium or format, for any purpose, even commercially, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.</license-p>
</license>
</permissions>
<abstract>
<sec>
<title>Aim:</title>
<p id="absp-1">Cytotoxic T lymphocytes (CTL) examine the major histocompatibility complex (MHC) class I ligands on nucleated cells to detect antigens derived from pathogens and cancer cells. Accurate prediction of T-cell epitopes is therefore crucial for the development of a wide range of biopharmaceuticals, including vaccines.</p>
</sec>
<sec>
<title>Methods:</title>
<p id="absp-2">The present study involved the development of position-specific scoring matrices (PSSM) and artificial neural networks (ANN) based models for 22 MHC class I molecules, including the integrated forecast of CTL epitopes using the EasyPred modeler. Similarity-reduced peptides dataset was used to train and evaluate models with performance assessed using the area under the receiver operating characteristic curve (Aroc) as the primary metric.</p>
</sec>
<sec>
<title>Results:</title>
<p id="absp-3">Comparative analysis revealed that the ANN-based predictor achieved superior performance for the HLA-A*0202 molecule by achieving the maximum Aroc value of 0.97 as compared to the PSSM predictor, having a value of 0.93. Furthermore, most natural MHC binders were identified within the top 5% with an average relative rank (%) of 2.23 and 3.13 for predictors PSSM and ANN, respectively, on the NetCTLpan dataset. Likewise, evaluation on the SARS-CoV-2 dataset of HLA-A*0201 revealed that the PSSM predictor (2.46%) performed better than the other contemporary CTL epitope forecast methods like naturally eluted ligands (EL) of NetMHCpan 4.0 (2.66%), NetCTLpan 1.1 (2.69%), and binding affinity (BA) of NetMHCpan 4.0 (3.33%), respectively.</p>
</sec>
<sec>
<title>Conclusions:</title>
<p id="absp-4">The application of these predictive models offers a significant reduction of approximately 97% in the resources typically required for epitope identification, including costs related to materials, labor, and time. As such, these models represent a valuable advancement in the rational design of more efficient, cost-effective, and innovative biotherapeutics.</p>
</sec>
</abstract>
<kwd-group>
<kwd>epitope</kwd>
<kwd>MHC</kwd>
<kwd>machine learning</kwd>
<kwd>vaccine</kwd>
<kwd>forecast</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p id="p-1">In the current trend, epitope-based bio-therapeutic products such as prophylactic vaccines and antibodies are critical in health care [<xref ref-type="bibr" rid="B1">1</xref>]. Specifically, vaccines based on the T-cell epitope are the most important method of triggering cellular immune response and clearing intracellular infections [<xref ref-type="bibr" rid="B2">2</xref>]. The responsibility of carrying out this task is specifically assigned to cytotoxic T lymphocytes (CTL) [<xref ref-type="bibr" rid="B3">3</xref>]. A crucial role of CTL is their ability to induce apoptosis of infected/altered cells, facilitated by helper T lymphocytes (HTL) that produce cytokines to modulate the activity of immune response cells [<xref ref-type="bibr" rid="B4">4</xref>]. The process of antigen processing and presentation, which involves the recognition of CTL epitopes, encompasses three essential stages: the proteasomal degradation of an antigen, the translocation of the antigenic fragment via the transporters associated with antigen processing (TAP) transport system, and the binding of the fragment to major histocompatibility complex (MHC) class I molecules [<xref ref-type="bibr" rid="B5">5</xref>]. The proteasome is responsible for producing the C-terminus of peptides [<xref ref-type="bibr" rid="B6">6</xref>]. A portion of these peptides is capable of being transported into the endoplasmic reticulum (ER) via the TAP, where subsequent N-terminal trimming takes place [<xref ref-type="bibr" rid="B7">7</xref>, <xref ref-type="bibr" rid="B8">8</xref>], resulting in peptides of suitable length (approximately 8 to 10 residues) which can eventually bind to MHC class I molecules [<xref ref-type="bibr" rid="B9">9</xref>]. The resulting MHC class I-peptide complexes are subsequently translocated to the cell surface, which can be recognized by epitope-specific CTL receptors [<xref ref-type="bibr" rid="B10">10</xref>]. The immunodominance of a peptide is largely influenced by its capability to interact with an MHC class I molecule [<xref ref-type="bibr" rid="B11">11</xref>]. Therefore, the development of an assay to evaluate peptides binding to MHC class I molecules is essential for accurately identifying CTL epitopes [<xref ref-type="bibr" rid="B12">12</xref>]. However, CTL epitope mapping for the prominent proteome pathogen, for instance, <italic>Plasmodium falciparum</italic> (~5,300 proteins), is expensive and arduous [<xref ref-type="bibr" rid="B13">13</xref>]. Consequently, researchers are presently employing contemporary computational models to forecast effective epitopes that can trigger specific immune responses [<xref ref-type="bibr" rid="B14">14</xref>, <xref ref-type="bibr" rid="B15">15</xref>].</p>
<p id="p-2">Formerly, numerous in silico tools and methods have been developed for predicting binding peptides to MHC class I and/or CTL epitopes based upon data retrieved from several immunological databases such as IEDB [<xref ref-type="bibr" rid="B16">16</xref>–<xref ref-type="bibr" rid="B18">18</xref>] and other repositories [<xref ref-type="bibr" rid="B19">19</xref>, <xref ref-type="bibr" rid="B20">20</xref>]. Several research groups have created systematic and quantitative benchmarks to assess the forecasting performance of binding peptides to MHC class I molecules [<xref ref-type="bibr" rid="B21">21</xref>, <xref ref-type="bibr" rid="B22">22</xref>]. Despite many available computational methods, forecasting CTL epitopes remains challenging to date [<xref ref-type="bibr" rid="B23">23</xref>]. Given that the majority of existing methods rely on training data, we can anticipate remarkable models solely by acquiring exceptional and current general data [<xref ref-type="bibr" rid="B24">24</xref>]. Therefore, a crucial step in creating a more precise and dependable forecasting model is the collection of experimentally reliable training and validation datasets.</p>
<p id="p-3">The present study involved an extensive collection of MHC class I binding and non-binding peptides available at the Repository for Epitope Datasets (RED) and subsequent generation of position-specific scoring matrices (PSSM) [<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B26">26</xref>] and artificial neural networks (ANN) [<xref ref-type="bibr" rid="B27">27</xref>] models for predicting MHC class I binding peptides. Subsequently, a conclusive model for integrated prediction of CTL epitopes was developed utilizing experimentally validated weight matrices for TAP binding affinity (BA) and/or forecasts of constitutive and immuno-proteasomal cleavage as a filter [<xref ref-type="bibr" rid="B28">28</xref>]. The CTL epitopes forecasted by the present model are likely to have a C-terminus cleavage generated by the proteasome (constitutive and immunoproteasome), a moderate BA for TAP, and a high BA for a specific MHC class I molecule, which can be used in designing a universal vaccine that ultimately regulates the efficient HLA cross-presentation [<xref ref-type="bibr" rid="B29">29</xref>, <xref ref-type="bibr" rid="B30">30</xref>].</p>
</sec>
<sec id="s2">
<title>Materials and methods</title>
<sec id="t2-1">
<title>Data retrieval of MHC class I binding and nonbinding peptides</title>
<p id="p-4">A dataset of MHC class I binding and nonbinding peptides, with reduced similarity, was compiled for 22 molecules from the supplementary data provided by the MHCIPREDS web server, which is accessible at RED (<uri xlink:href="https://web.archive.org/web/20130828192234/http://ailab.cs.iastate.edu/red/">https://web.archive.org/web/20130828192234/http://ailab.cs.iastate.edu/red/</uri>). The log-transformed MHC BA (LTMBA) values between 0 (no affinity) and 1 (very high affinity) were obtained using the relation, 1 – log (aff)/log (50,000), where aff is the experimentally measured BA in terms of half-maximal inhibitory concentration (IC<sub>50</sub>) with nM (nmol/L) unit. An LTMBA threshold value of 0.426, equivalent to an IC<sub>50</sub> value of 500 nM, was used to classify binders and non-binders, which means peptides with LTMBA values ≥ 0.426 (IC<sub>50</sub> ≤ 500 nM) were classified as binders (<xref ref-type="table" rid="t1">Table 1</xref>).</p>
<table-wrap id="t1">
<label>Table 1</label>
<caption>
<p id="t1-p-1">
<bold>MHC class I binding and nonbinding peptide dataset used in the study for training and evaluation of algorithms.</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th rowspan="2">
<bold>S. No.</bold>
</th>
<th rowspan="2">
<bold>MHC molecule</bold> </th>
<th colspan="2">
<bold>Training set</bold>
</th>
<th colspan="3">
<bold>Evaluation set</bold>
</th>
</tr>
<tr>
<th>
<bold>Total no. of data</bold>
</th>
<th>
<bold>No. of binding data<sup>#</sup></bold>
</th>
<th>
<bold>Total no. of data</bold>
</th>
<th>
<bold>No. of binding data<sup>#</sup></bold>
</th>
<th>
<bold>No. of nonbinding data</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>HLA-A*0201</td>
<td>2,280</td>
<td>1,155</td>
<td>560</td>
<td>280</td>
<td>280</td>
</tr>
<tr>
<td>2</td>
<td>HLA-A*0203</td>
<td>1,080</td>
<td>555</td>
<td>283</td>
<td>141</td>
<td>142</td>
</tr>
<tr>
<td>3</td>
<td>HLA-A*0206</td>
<td>976</td>
<td>478</td>
<td>244</td>
<td>122</td>
<td>122</td>
</tr>
<tr>
<td>4</td>
<td>HLA-A*2902</td>
<td>266</td>
<td>134</td>
<td>66</td>
<td>33</td>
<td>33</td>
</tr>
<tr>
<td>5</td>
<td>HLA-A*6801</td>
<td>919</td>
<td>458</td>
<td>229</td>
<td>114</td>
<td>115</td>
</tr>
<tr>
<td>6</td>
<td>HLA-A*6802</td>
<td>678</td>
<td>334</td>
<td>168</td>
<td>84</td>
<td>84</td>
</tr>
<tr>
<td>7</td>
<td>HLA-A*3301</td>
<td>324</td>
<td>162</td>
<td>81</td>
<td>40</td>
<td>41</td>
</tr>
<tr>
<td>8</td>
<td>HLA-A*0101</td>
<td>258</td>
<td>128</td>
<td>65</td>
<td>32</td>
<td>33</td>
</tr>
<tr>
<td>9</td>
<td>HLA-A*0202</td>
<td>1,082</td>
<td>550</td>
<td>281</td>
<td>140</td>
<td>141</td>
</tr>
<tr>
<td>10</td>
<td>HLA-A*3002</td>
<td>232</td>
<td>115</td>
<td>58</td>
<td>29</td>
<td>29</td>
</tr>
<tr>
<td>11</td>
<td>HLA-A*3101</td>
<td>811</td>
<td>405</td>
<td>203</td>
<td>101</td>
<td>102</td>
</tr>
<tr>
<td>12</td>
<td>HLA-A*0301</td>
<td>1,006</td>
<td>503</td>
<td>250</td>
<td>125</td>
<td>125</td>
</tr>
<tr>
<td>13</td>
<td>HLA-A*1101</td>
<td>1,293</td>
<td>643</td>
<td>324</td>
<td>162</td>
<td>162</td>
</tr>
<tr>
<td>14</td>
<td>HLA-A*2402</td>
<td>317</td>
<td>157</td>
<td>80</td>
<td>40</td>
<td>40</td>
</tr>
<tr>
<td>15</td>
<td>HLA-B*0702</td>
<td>368</td>
<td>182</td>
<td>92</td>
<td>46</td>
<td>46</td>
</tr>
<tr>
<td>16</td>
<td>HLA-B*1501</td>
<td>296</td>
<td>148</td>
<td>72</td>
<td>36</td>
<td>36</td>
</tr>
<tr>
<td>17</td>
<td>HLA-B*3501</td>
<td>351</td>
<td>176</td>
<td>87</td>
<td>43</td>
<td>44</td>
</tr>
<tr>
<td>18</td>
<td>HLA-B*4002</td>
<td>111</td>
<td>56</td>
<td>28</td>
<td>14</td>
<td>14</td>
</tr>
<tr>
<td>19</td>
<td>HLA-B*4501</td>
<td>108</td>
<td>55</td>
<td>26</td>
<td>13</td>
<td>13</td>
</tr>
<tr>
<td>20</td>
<td>HLA-B*5101</td>
<td>161</td>
<td>78</td>
<td>40</td>
<td>20</td>
<td>20</td>
</tr>
<tr>
<td>21</td>
<td>HLA-B*5301</td>
<td>174</td>
<td>87</td>
<td>42</td>
<td>21</td>
<td>21</td>
</tr>
<tr>
<td>22</td>
<td>HLA-B*5401</td>
<td>147</td>
<td>73</td>
<td>36</td>
<td>18</td>
<td>18</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t1-fn-1">MHC: major histocompatibility complex; #: no. of peptides with an IC<sub>50</sub> ≤ 500 nM.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p id="p-5">Similar peptides were removed to generate a similarity-reduced cross-validation dataset for each MHC allele, and the data were randomly partitioned. The present study used a 5-fold cross-validation approach, i.e., the dataset was split into five subsets. Initially, the dataset was scanned for equal binders and non-binders based on LTMBA value ≥ 0.426 for each 22 MHC class I allele. Then, four out of five (4/5) of each dataset were used for the training, and 1/5 data points (peptides) were used for the blind evaluation (because none of the peptides in the evaluation set were included in the training set at any stage) as described by Nielsen et al. [<xref ref-type="bibr" rid="B31">31</xref>] (Supplementary file 1 in <uri xlink:href="https://data.mendeley.com/datasets/dxz3dk3tcm/1">https://data.mendeley.com/datasets/dxz3dk3tcm/1</uri>).</p>
</sec>
<sec id="t2-2">
<title>Construction of MHC class I binding motifs (PSSM)</title>
<p id="p-6">Initially, based on a training dataset of fixed length (9-mer) MHC class I binding peptide sequences (positive) and using a statistical model of sequence weighting methods available at EasyPred modeler (<uri xlink:href="https://services.healthtech.dtu.dk/services/EasyPred-1.0/">https://services.healthtech.dtu.dk/services/EasyPred-1.0/</uri>), 22 PSSM (9 × 20) were constructed, which represents the frequencies of residues observed for a position in a multiple sequence alignment (MSA) assuming no correlations exist between the different peptide positions [<xref ref-type="bibr" rid="B32">32</xref>]. The score <italic>S</italic> of a peptide to a motif is usually calculated as the sum of the log-odds ratio.</p>
<p id="p-7">
<disp-formula id="eq1">
<label>(1)</label>
<mml:math id="m1859c">
<mml:mi>S</mml:mi>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">log</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>⁡</mml:mo>
<mml:mrow>
<mml:mfenced separators="|">
<mml:mrow>
<mml:mrow>
<mml:munder>
<mml:mo stretchy="false">∏</mml:mo>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>a</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>a</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
<mml:mo>=</mml:mo>
<mml:mrow>
<mml:munder>
<mml:mo stretchy="false">∑</mml:mo>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">log</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>k</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>⁡</mml:mo>
<mml:mrow>
<mml:mfenced separators="|">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>a</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>a</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p id="p-8">Where <italic>p<sub>pa</sub></italic> is the probability of finding amino acid <italic>a</italic> (<italic>a</italic> can be any of the 20 amino acids) at position <italic>p</italic> (<italic>p</italic> can be 1 to 9) in the motif, and <italic>q<sub>a</sub></italic> is the background frequency of amino acid <italic>a</italic>, log<italic><sub>k</sub> </italic>is the logarithm with base <italic>k</italic>. The scores are often normalized to half-bit by multiplying all scores by 2/log<italic><sub>k</sub> </italic>(<xref ref-type="disp-formula" rid="eq2">equation 2</xref>). In half-bit units, the log-odds score <italic>S</italic> is calculated as:</p>
<p id="p-9">
<disp-formula id="eq2">
<label>(2)</label>
<mml:math id="m9efae">
<mml:mi>S</mml:mi>
<mml:mo>=</mml:mo>
<mml:mn>2</mml:mn>
<mml:mrow>
<mml:munder>
<mml:mo stretchy="false">∑</mml:mo>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi mathvariant="normal">log</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mo>⁡</mml:mo>
<mml:mrow>
<mml:mfenced separators="|">
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>P</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>p</mml:mi>
<mml:mi>a</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>q</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>a</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:mfrac>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:mrow>
</mml:math>
</disp-formula>
</p>
<p id="p-10">The three sequence weighting methods, namely Henikoff &amp; Henikoff 1/nr [<xref ref-type="bibr" rid="B26">26</xref>], Hobohm [<xref ref-type="bibr" rid="B25">25</xref>] clustering at 62% identity, and no clustering, were used to compensate for the over-representation among MSA, along with four different weights on pseudo counts (50, 100, 150, and 200) [<xref ref-type="bibr" rid="B33">33</xref>–<xref ref-type="bibr" rid="B35">35</xref>]. In the Henikoff &amp; Henikoff 1/nr method [<xref ref-type="bibr" rid="B26">26</xref>], an amino acid <italic>a</italic> on position <italic>p</italic> in sequence k contributes a weight w<sub>kp</sub> = 1/nr, where n is the number of different amino acids at a given position (column) in the alignment and r is the number of occurrences of amino acid <italic>a</italic> in that column. The weight of a sequence is then assigned as the sum of the weights over all positions in the alignment. However, in the Hobohm 62% clustering method [<xref ref-type="bibr" rid="B25">25</xref>], each peptide k in a cluster is assigned a weight w<sub>k</sub> = 1/n<sub>c</sub>, where n<sub>c</sub> is the number of sequences in the cluster containing peptide k. When the amino acid frequencies are calculated, each amino acid in sequence k is weighted by w<sub>k.</sub> The Henikoff &amp; Henikoff method is as fast as the computation time increases linearly with the number of sequences. In contrast, in the Hobohm clustering algorithm, computation time increases as the square of the number of sequences. Additionally, the binding potential (score) of any peptide sequence (query) to a specific MHC allele was determined by aligning the corresponding PSSM with the peptide sequence and summing the scores that correspond to the residue type and position in the PSSM. To narrow down the probable binders from the list of scored and ranked peptides, a binding cut-off score was established that encompasses 85% of the peptide sequences in the training dataset (positive data) [<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B36">36</xref>].</p>
</sec>
<sec id="t2-3">
<title>Training of ANN for the forecast of MHC class I binding peptides</title>
<p id="p-11">An ANN is a computer simulation of a system of interconnected processing units that can be trained to extract and remember a pattern present within a data set. It can subsequently recognize that pattern when presented with new data. In binding a peptide to the MHC molecule, the amino acids might compete for the space available in the binding groove. Therefore, the mutual information in the binding motif will allow the identification of higher-order sequence correlations. Neural networks with hidden layers are used to describe sequence patterns with such higher-order correlations (<uri xlink:href="https://teaching.healthtech.dtu.dk/mpmbioinformatics/22801_04.pdf">https://teaching.healthtech.dtu.dk/mpmbioinformatics/22801_04.pdf</uri>). Thus, the ANN simulations were performed for all 22 MHC class I molecules using the neural network options available in the EasyPred modeler [<xref ref-type="bibr" rid="B27">27</xref>]. Based on the other studies of MHC-binding peptides forecast, two ANN architectures, namely 180-1-1 and 180-2-1, were considered in the current study, which represent respective numbers of neurons in the input, hidden, and output layers, respectively [<xref ref-type="bibr" rid="B37">37</xref>, <xref ref-type="bibr" rid="B38">38</xref>]. In these architectures, the input 9-mer peptide sequences were presented with 180 values (where individual amino acid was encoded as a binary string of length 20 with an exclusive position set to 0.9 and other positions set to 0.05). At the same time, the output from each neuron was transformed using the standard sigmoid function [<xref ref-type="bibr" rid="B39">39</xref>]. The training algorithm employed to generate the final network was the steepest descent method that learns from a training set of input-output pairs by modifying the network weight parameters such that the network generates a numerical value for each input close to the desired target output. Throughout the training process, the LTMBA value of a peptide served as a target value and was deliberated as a strong binder and a potent epitope for MHC class I alleles [<xref ref-type="bibr" rid="B40">40</xref>]. Nevertheless, the learning process utilized was error backpropagation [<xref ref-type="bibr" rid="B41">41</xref>]. In addition, the following parameters were used for ANN simulations: i) The top 80% of the training set was used to train the neural network, and the bottom 20% was used to stop the training to avoid fitting; ii) The learning rate was 0.005; iii) The maximum number of iterations was set to 300.</p>
</sec>
<sec id="t2-4">
<title>Evaluation parameters</title>
<p id="p-12">The nonparametric performance measure, the area under the receiver operating characteristic curve (Aroc) and correlation coefficient (CC) values, was used to evaluate the predictive performance of the EasyPred modeler-based PSSM and ANN predictors. The receiver operator characteristic (ROC) curve is a plot of the true positive rate [TP/(TP + FN)] on the Y-axis versus the false positive rate [FP/(TN + FP)] on the X-axis for the entire range of the decision thresholds where, true positive (TP) is an experimentally proven binding peptide forecasted as a binder, false positive (FP) is an experimentally proven nonbinding peptide forecasted as a binder, true negative (TN) is an experimentally proven nonbinding peptide forecasted as a non-binder and false negative (FN) is an experimentally proven binding peptide forecasted as nonbinder [<xref ref-type="bibr" rid="B42">42</xref>, <xref ref-type="bibr" rid="B43">43</xref>]. An Aroc value can be interpreted as the probability of distinguishing a true positive from a false positive. For the calculation of Aroc, peptides were classified into binders and non-binders at a cutoff value of 500 nM. This affinity threshold is associated with the most well-known T-cell epitopes. The values Aroc ≥ 0.90 indicate excellent, 0.90 &gt; Aroc ≥ 0.80 good, 0.80 &gt; Aroc ≥ 0.70 marginal, and Aroc &lt; 0.70 poor predictions [<xref ref-type="bibr" rid="B42">42</xref>]. The CC is another widely used measure of the association between pairs of values (predicted versus experimental). It is calculated as:</p>
<p id="p-13">
<disp-formula id="eq3">
<label>(3)</label>
<mml:math id="m4d719">
<mml:mi mathvariant="normal">C</mml:mi>
<mml:mi mathvariant="normal">C</mml:mi>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:munder>
<mml:mo stretchy="false">∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:munder>
<mml:mrow>
<mml:mfenced separators="|">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>a</mml:mi>
</mml:mrow>
<mml:mo>-</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mfenced>
<mml:mfenced separators="|">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mo>-</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:msqrt>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mo stretchy="false">∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced separators="|">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>a</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>a</mml:mi>
</mml:mrow>
<mml:mo>-</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:msqrt>
<mml:msqrt>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mo stretchy="false">∑</mml:mo>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mrow>
<mml:mfenced separators="|">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mrow>
<mml:mi>i</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>-</mml:mo>
<mml:mover accent="true">
<mml:mrow>
<mml:mi>p</mml:mi>
</mml:mrow>
<mml:mo>-</mml:mo>
</mml:mover>
</mml:mrow>
</mml:mfenced>
</mml:mrow>
</mml:mrow>
</mml:mrow>
<mml:mrow>
<mml:mn>2</mml:mn>
</mml:mrow>
</mml:msup>
</mml:msqrt>
</mml:mrow>
</mml:mfrac>
</mml:math>
</disp-formula>
</p>
<p id="p-14">where the value <italic>p<sub>i</sub></italic> is found using a prediction method of choice, and the <italic>a<sub>i</sub></italic> is the known corresponding target value. However, the overlined letters denote average values. The value of 1 corresponds to a perfect correlation and –1 to a perfect anti-correlation, and 0 value corresponds to a random prediction [<xref ref-type="bibr" rid="B42">42</xref>].</p>
</sec>
<sec id="t2-5">
<title>Generalization test and Epstein-Barr virus case study</title>
<p id="p-15">The generalizability test was used to assess the ability of the PSSM and ANN models, trained for the allele HLA-A*0201, to generalize to the other 21 alleles. Training was performed on the same dataset, followed by testing on evaluation datasets of all 22 alleles. In addition to the above, the performance of the PSSM and ANN predictors was validated through observations revealed in the very recent study conducted by Wohlwend et al. [<xref ref-type="bibr" rid="B44">44</xref>] called the Epstein-Barr virus (EBV) dataset. This research indicates that the dataset comprises 11 distinct immunogenic epitopes that are restricted by 5 HLA class I alleles derived from EBV, as identified through IFNγ ELISpot assays. The study employs PSSM and ANN models to effectively identify both confirmed and new HLA class I epitopes from EBV, which have been experimentally validated through in vitro and ex vivo studies.</p>
</sec>
<sec id="t2-6">
<title>Prediction and relative rank measurement of PSSM and ANN predictors on NetCTLpan and SARS-CoV-2 dataset</title>
<p id="p-16">The additional performance evaluation of the present EasyPred modeler-based PSSM and ANN predictors, along with recently published NetMHCpan 4.0 methods [eluted ligands (EL) and BA] (<uri xlink:href="https://services.healthtech.dtu.dk/services/NetMHCpan-4.0/">https://services.healthtech.dtu.dk/services/NetMHCpan-4.0/</uri>), was estimated on NetCTLpan and SARS-CoV-2 dataset in terms of comparative ranking of the reported ligands among all nonamers included in the source protein as described by Larsen et al. [<xref ref-type="bibr" rid="B45">45</xref>, <xref ref-type="bibr" rid="B46">46</xref>]. This NetCTLpan dataset was gathered from the supplementary material of the NetCTLpan1.0 method (<uri xlink:href="https://services.healthtech.dtu.dk/suppl/immunology/NetCTLpan.php">https://services.healthtech.dtu.dk/suppl/immunology/NetCTLpan.php</uri>) reported as SYFPEITHI 9-mer training and evaluation dataset [<xref ref-type="bibr" rid="B47">47</xref>, <xref ref-type="bibr" rid="B48">48</xref>]. The dataset involved 413 sequences of naturally processed T cell epitopes restricted by 7 HLA class I molecules that were common between the present study and the NetCTLpan1.0 tool (Supplementary file 2 in <uri xlink:href="https://data.mendeley.com/datasets/dxz3dk3tcm/1">https://data.mendeley.com/datasets/dxz3dk3tcm/1</uri>). At the time of conducting the present study, initially, no T cell epitope data were available for SARS-CoV-2, but there was a significant amount of information available on T cell epitopes for betacoronaviruses that cause similar diseases in humans, like SARS-CoV. Thus, we have compiled the SARS-CoV-2 dataset from the study of Grifoni et al. [<xref ref-type="bibr" rid="B49">49</xref>] that contains 11 T cell epitopes (9-mer) derived from surface glycoprotein (NCBI ID: QHD43416) and nucleocapsid phosphoprotein (NCBI ID: QHD43423) that showed 100% identity with SARS-CoV epitopes. Additionally, evaluation of PSSM and ANN predictors on the recent SARS-CoV-2 dataset of Gfeller et al. [<xref ref-type="bibr" rid="B50">50</xref>] was performed for prediction and % rank analysis of CD8+ T cell epitope in their source protein using allele-specific cutoff score.</p>
</sec>
<sec id="t2-7">
<title>Integrated forecast of CTL epitope processing</title>
<p id="p-17">The CTL epitope processing forecast integrated the three significant forecast steps in the filtering approach, including MHC class I BA, TAP transport efficiency, and C-terminal proteasomal cleavage (<xref ref-type="fig" rid="fig1">Figure 1</xref>). The details are described below:</p>
<fig id="fig1" position="float">
<label>Figure 1</label>
<caption>
<p id="fig1-p-1">
<bold>Flow chart indicating the integrated forecast of cytotoxic T lymphocytes (CTL) epitopes based on the EasyPred modeler.</bold> MHC: major histocompatibility complex; TAP: transporters associated with antigen processing.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="ei-05-1003215-g001.tif" />
</fig>
<p id="p-18">
<list list-type="simple">
<list-item>
<label>1.</label>
<p>Amino acid sequences of a given target protein were parsed into 9-mer overlapping peptides. Peptides were then scored using EasyPred-based PSSM and ANN predictors, and values above the defined threshold (<xref ref-type="table" rid="t2">Table 2</xref>) for each 22 MHC class I allele were forecasted as binders. In the case of PSSM predictors, we calculated the threshold scores in terms of predicting peptides that bind with 85% of all epitopes in the training set because an established threshold associated with immunogenicity (i.e., IC<sub>50</sub> ≤ 500 nM) covers 80–90% of all immunogenic epitopes [<xref ref-type="bibr" rid="B51">51</xref>]. However, in the case of ANN predictors, the threshold value of an IC<sub>50</sub> less than 500 nM was considered for binder forecast.</p>
</list-item>
<list-item>
<label>2.</label>
<p>Forecasted MHC class I binders were again scored by the quantitative weight matrix (9 × 20) for TAP BA in terms of –log IC<sub>50</sub> (pIC<sub>50</sub>) described by Peters et al. [<xref ref-type="bibr" rid="B52">52</xref>] (<xref ref-type="sec" rid="s-suppl">Table S1</xref>) and/or Doytchinova et al. [<xref ref-type="bibr" rid="B53">53</xref>] (<xref ref-type="sec" rid="s-suppl">Table S2</xref>).</p>
</list-item>
<list-item>
<label>3.</label>
<p>Peptides with scores less than the selected threshold value (default, pIC<sub>50</sub> &lt; 4) were considered as TAP binders. The TAP transports peptides into the ER that are potentially N-terminally extended from the ligand that ends up in MHC. This means that the peptide that binds to MHC does not necessarily need to be a suitable substrate for TAP [<xref ref-type="bibr" rid="B52">52</xref>].</p>
</list-item>
<list-item>
<label>4.</label>
<p>Subsequently, the proteasomal (constitutive or immuno) cleavage score of 12-mer overlapping peptide fragments generated from the target protein sequence was calculated along with recording of C-terminal position at the cleavage site (i.e., position of the sixth amino acid in the target protein) by using a quantitative cleavage weight matrix (12 × 20) published by Toes et al. [<xref ref-type="bibr" rid="B54">54</xref>] (<xref ref-type="sec" rid="s-suppl">Table S3</xref> and <xref ref-type="sec" rid="s-suppl">S4</xref>).</p>
</list-item>
<list-item>
<label>5.</label>
<p>The query 12-mer peptide with a score above a chosen threshold value (corresponding to 1–10%) was forecasted to be cleaved between the sixth and seventh amino acids [<xref ref-type="bibr" rid="B55">55</xref>].</p>
</list-item>
<list-item>
<label>6.</label>
<p>Finally, MHC class I and TAP binding peptides identified through the above steps involving proteasomal (constitutive or immuno) cleavage site position at their C-terminal (steps 1–5) were considered CTL epitopes.</p>
</list-item>
</list>
</p>
<table-wrap id="t2">
<label>Table 2</label>
<caption>
<p id="t2-p-1">
<bold>MHC class I allele-specific binding affinity cutoff of PSSM predictor, along with worldwide frequency in the human population sourced from IEDB and Paul et al. [<xref ref-type="bibr" rid="B64">64</xref>].</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>S. No.</bold>
</th>
<th>
<bold>MHC class I allele</bold>
</th>
<th>
<bold>Worldwide population frequency of allele (%)</bold>
</th>
<th>
<bold>PSSM predictor binding cutoff score</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>HLA-A*0201</td>
<td>25.2</td>
<td>–0.007</td>
</tr>
<tr>
<td>2</td>
<td>HLA-A*0203</td>
<td>3.3</td>
<td>1.379</td>
</tr>
<tr>
<td>3</td>
<td>HLA-A*0206</td>
<td>4.9</td>
<td>1.625</td>
</tr>
<tr>
<td>4</td>
<td>HLA-A*2902</td>
<td>2.9</td>
<td>3.839</td>
</tr>
<tr>
<td>5</td>
<td>HLA-A*6801</td>
<td>4.6</td>
<td>3.362</td>
</tr>
<tr>
<td>6</td>
<td>HLA-A*6802</td>
<td>3.3</td>
<td>2.305</td>
</tr>
<tr>
<td>7</td>
<td>HLA-A*3301</td>
<td>3.2</td>
<td>4.660</td>
</tr>
<tr>
<td>8</td>
<td>HLA-A*0101</td>
<td>16.2</td>
<td>3.933</td>
</tr>
<tr>
<td>9</td>
<td>HLA-A*0202</td>
<td>0.28</td>
<td>3.984</td>
</tr>
<tr>
<td>10</td>
<td>HLA-A*3002</td>
<td>5.0</td>
<td>4.77</td>
</tr>
<tr>
<td>11</td>
<td>HLA-A*3101</td>
<td>4.7</td>
<td>2.651</td>
</tr>
<tr>
<td>12</td>
<td>HLA-A*0301</td>
<td>15.4</td>
<td>3.800</td>
</tr>
<tr>
<td>13</td>
<td>HLA-A*1101</td>
<td>12.9</td>
<td>2.832</td>
</tr>
<tr>
<td>14</td>
<td>HLA-A*2402</td>
<td>16.8</td>
<td>2.465</td>
</tr>
<tr>
<td>15</td>
<td>HLA-B*0702</td>
<td>13.3</td>
<td>2.835</td>
</tr>
<tr>
<td>16</td>
<td>HLA-B*1501</td>
<td>5.2</td>
<td>1.829</td>
</tr>
<tr>
<td>17</td>
<td>HLA-B*3501</td>
<td>6.5</td>
<td>2.363</td>
</tr>
<tr>
<td>18</td>
<td>HLA-B*4002</td>
<td>3.5</td>
<td>5.415</td>
</tr>
<tr>
<td>19</td>
<td>HLA-B*4501</td>
<td>0.63</td>
<td>5.736</td>
</tr>
<tr>
<td>20</td>
<td>HLA-B*5101</td>
<td>5.5</td>
<td>8.074</td>
</tr>
<tr>
<td>21</td>
<td>HLA-B*5301</td>
<td>5.4</td>
<td>7.923</td>
</tr>
<tr>
<td>22</td>
<td>HLA-B*5401</td>
<td>0.56</td>
<td>7.050</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t2-fn-1">MHC: major histocompatibility complex; PSSM: position-specific scoring matrices.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec id="s3">
<title>Results</title>
<sec id="t3-1">
<title>Construction and evaluation of PSSM and ANN predictors</title>
<p id="p-19">Sequence weighting methods have been employed in creating a PSSM from frequencies of residues surveyed for a position in an MSA of MHC binders [<xref ref-type="bibr" rid="B32">32</xref>]. The present study utilizes three sequence weighting methods (Henikoff &amp; Henikoff 1/nr, clustering at 62% identity, and no clustering) available at the EasyPred modeler (<uri xlink:href="https://services.healthtech.dtu.dk/services/EasyPred-1.0/">https://services.healthtech.dtu.dk/services/EasyPred-1.0/</uri>) for the construction of MHC allele-specific PSSM, including four corrections for low counts (weight on pseudo counts) 50, 100, 150, and 200 [<xref ref-type="bibr" rid="B34">34</xref>, <xref ref-type="bibr" rid="B35">35</xref>]. As peptides in the evaluation dataset are not included in the training dataset, it is equivalent to a blind test [<xref ref-type="bibr" rid="B27">27</xref>]. For each MHC class I allele, only those PSSM were selected as predictors that gave maximum predictive performance evaluated in terms of Aroc and CC values (<xref ref-type="sec" rid="s-suppl">Table S5</xref>). The Aroc value operates independently of the anticipated scale, as it assesses the ranking of predictors and remains unaffected by the dataset’s composition, including varying ratios of binders and non-binders. The Aroc value provides a crucial metric for evaluating forecast quality, with a score of 0.5 indicating random forecasts and 1.0 representing perfect forecasts. Conversely, the CC value of one corresponds to perfect correlation, a value of zero indicates a random estimate, and a value of minus one signifies perfect anti-correlation. The Aroc value principally captures the probability that, given two peptides, one a binder and the other a non-binder, the forecasted score will be higher for the binder than the non-binder [<xref ref-type="bibr" rid="B43">43</xref>]. Furthermore, the MSA of each HLA class I allele binding motif was visualized by using the graphical depiction (sequence logo) method [<xref ref-type="bibr" rid="B56">56</xref>], where the height of a column of letters represents the information content (I) at that position. The height of each letter within a column is proportional to the frequency of the corresponding amino acid at that position and colored according to their physicochemical properties, such as acidic (DE)-red, basic (HKR)-blue, hydrophobic (ACFILMPVW)-black, and neutral (GNQSTY)-green (<xref ref-type="sec" rid="s-suppl">Figure S1</xref>). The similarity in peptide binding preferences is observed when comparing the MHC class I binding motif logo developed in the presented study with the logo stored in the MHC motif viewer. In the MHC motif viewer database, pairwise assessments of MHC binding motifs facilitate immediate analysis of epitope selection data in patient cohorts with HLA diversity [<xref ref-type="bibr" rid="B57">57</xref>]. For example, when comparing the binding motif of human (HLA-A*2402) to the chimpanzee (Patr-A*0701) allele, an unmistakable resemblance between the binding motifs of the two alleles was spotted, as noted by Sidney et al. [<xref ref-type="bibr" rid="B58">58</xref>]. A similar conserved motif for MHC class I peptide binding has also been shown between humans and rhesus macaques, as demonstrated by Dzuris et al. [<xref ref-type="bibr" rid="B59">59</xref>]. Consequently, the ability to differentiate various MHC binding specificities (motifs) has applications that range from the design of experiments for peptide binding assays to personalized medicine for a significant population, which includes the selection of peptides that are immunogenic in both humans and model organisms [<xref ref-type="bibr" rid="B60">60</xref>, <xref ref-type="bibr" rid="B61">61</xref>].</p>
<p id="p-20">Moreover, in the case of ANN simulations, two architectures (180-2-1 and 180-1-1) were initially used, and the input peptide sequences (binders with IC<sub>50</sub> ≤ 500 nM and rest non-binders) were offered in a conventional encoding, as described by Nielsen et al. [<xref ref-type="bibr" rid="B27">27</xref>]. Further, the network weights were renewed using gradient descent backpropagation algorithms. For a given peptide sequence of 9-mer, the ANN weights were renewed to lower the sum of squared errors between the forecasted and experimentally measured BA (target value). The training of the neural networks was performed using a five-fold cross-validation (Supplementary file 1 in <uri xlink:href="https://data.mendeley.com/datasets/dxz3dk3tcm/1">https://data.mendeley.com/datasets/dxz3dk3tcm/1</uri>). For each of the five training and test subsets, a series of network training sessions is conducted, each utilizing two distinct hidden neurons (1 and 2) along with a single bin for balanced training. For every series, a single network exhibiting the highest test performance (measured by the highest CC and the lowest square error) was ultimately chosen as an ANN predictor (refer to <xref ref-type="sec" rid="s-suppl">Table S6</xref>). Subsequently, the predictive performance of the corresponding ANN network for each MHC allele was assessed based on Aroc (see <xref ref-type="sec" rid="s-suppl">Table S6</xref>). From these findings, it can also be deduced that increasing the number of hidden neurons within the ANN architecture does not have a significant impact on performance. Consequently, the Aroc value for the ANN predictor featuring a single hidden neuron was utilized for comparison with the PSSM predictor. It is crucial to note that, initially, a BA (IC<sub>50</sub>) threshold for the MHC class I allele was established, where IC<sub>50</sub> values exceeding 500 nM were identified in competitive assays (involving an MHC and isolated peptides) and were compared with immunogenicity across various marker peptides and nonimmunogenic peptides [<xref ref-type="bibr" rid="B62">62</xref>]. This affinity threshold has been identified as being associated with the majority of known T-cell epitopes [<xref ref-type="bibr" rid="B63">63</xref>]. Nevertheless, this threshold value has also been demonstrated to be specific to MHC class I alleles [<xref ref-type="bibr" rid="B64">64</xref>]. An alternative way to assess the efficacy of peptide binding to MHC class I involves measuring the stability of peptide-MHC complexes over time, but affinity and stability do not rank the peptides in the same order within their source proteins and remain debatable [<xref ref-type="bibr" rid="B65">65</xref>, <xref ref-type="bibr" rid="B66">66</xref>]. To compare the performance of ANN and PSSM predictors, the identical evaluation dataset from MHCIPREDS-IEDB was utilized. Consequently, upon comparison, the predictive performance of the ANN models was determined to be superior to that of the PSSM models for most of the MHC class I molecules (<xref ref-type="fig" rid="fig2">Figure 2</xref>). The ANN performance in terms of Aroc is maximum (0.97) for the alleles HLA-A*0202 and HLA-A*0203, as well as the minimum (0.56) for allele HLA-A*2902, whereas the maximum PSSM predictor performance in terms of Aroc is (0.93) for the allele HLA-A*0203 and the minimum (0.49) for allele HLA-B*4501 (<xref ref-type="fig" rid="fig2">Figure 2</xref>).</p>
<fig id="fig2" position="float">
<label>Figure 2</label>
<caption>
<p id="fig2-p-1">
<bold>Histogram of the predictive performance measured in terms of the Aroc value of PSSM and ANN predictors for 22 MHC class I alleles trained and evaluated on the MHCIPREDS-IEDB dataset.</bold> Aroc: area under the receiver operating characteristic curve; PSSM: position-specific scoring matrices; ANN: artificial neural networks; MHC: major histocompatibility complex.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="ei-05-1003215-g002.tif" />
</fig>
<p id="p-21">From the above results, it is clearly stated that PSSM predictors, to a high degree, describe the binding motif of the corresponding HLA class I alleles, though they demonstrate lower Aroc than the ANN predictors. This is to be expected since the ANN can take higher-order sequence associations into account for a fixed length of MHC binding peptides [<xref ref-type="bibr" rid="B27">27</xref>]. Moreover, not only the size of the training and validation dataset but also the choice of specific algorithms can influence the efficiency of the forecast. To integrate PSSM-based MHC class I predictor in CTL epitope processing identification, we calculated the binding thresholds in terms of predicting peptides that bind with an IC<sub>50</sub> value less than 500 nM, an established threshold associated with immunogenicity for 85% of all epitopes in the training set [<xref ref-type="bibr" rid="B51">51</xref>]. These binding threshold scores for each of the 22 MHC class I molecules were determined for PSSM predictors to demarcate the range of putative binders among the top-scoring peptides (<xref ref-type="table" rid="t2">Table 2</xref>). Paul et al. [<xref ref-type="bibr" rid="B64">64</xref>] also revealed similar observations that different MHC molecules bind ligands at different (forecasted) binding thresholds (scores). If a single criterion for MHC class I alleles has to be deliberated, then it is preferable to select absolute BA, as in the case of the ANN predictor, where IC<sub>50</sub> ≤ 500 nM could be regarded as a reasonably good “universal” binding threshold. In a similar study, Bonsack et al. [<xref ref-type="bibr" rid="B67">67</xref>] confirmed the decisive threshold of IC<sub>50</sub> ≤ 500 nM for MHC class I binding peptides through in vitro validation. However, predictive efficacy is increased using allele-specific affinity thresholds.</p>
<p id="p-22">Thus, based on these observations, the PSSM-based MHC class I binding forecasts were applied without rescaling, thereby preserving prospective fundamental biological differences between MHC class molecules. However, ANN-based MHC class I binding peptide forecasts were performed with IC<sub>50</sub> ≤ 500 nM (LTMBA value ≥ 0.426).</p>
</sec>
<sec id="t3-2">
<title>Evaluation of the EBV epitope dataset</title>
<p id="p-23">The PSSM and ANN predictors successfully identified the EBV epitopes as revealed in the study conducted by Wohlwend et al. [<xref ref-type="bibr" rid="B44">44</xref>], above the chosen threshold (<xref ref-type="table" rid="t2">Table 2</xref>), except the epitope IACPIVMRY restricted by the HLA-B*1501 allele with a comparable BA score of 0.411 compared to the ANN predictor threshold (LTMBA value) of 0.426 (<xref ref-type="table" rid="t3">Table 3</xref>). This clearly indicates the real-life application of models in the identification of both established and novel HLA class I epitopes from EBV.</p>
<table-wrap id="t3">
<label>Table 3</label>
<caption>
<p id="t3-p-1">
<bold>List of EBV epitopes used in the present study derived from Wohlwend et al. [<xref ref-type="bibr" rid="B44">44</xref>].</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>S. No.</bold>
</th>
<th>
<bold>CD8+ T cell epitope</bold>
</th>
<th>
<bold>HLA binding allele</bold>
</th>
<th>
<bold>PSSM</bold>
<break />
<bold>predictor score</bold>
</th>
<th>
<bold>ANN</bold>
<break />
<bold>predictor score (binding affinity)</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>AFDQATRVY</td>
<td>HLA-A*0101 </td>
<td>8.505</td>
<td>0.438</td>
</tr>
<tr>
<td>2</td>
<td>HLSQAAFGL</td>
<td rowspan="3">HLA-A*0201</td>
<td>6.252</td>
<td>0.633</td>
</tr>
<tr>
<td>3</td>
<td>SIIPRTPDV </td>
<td>6.581</td>
<td>0.46</td>
</tr>
<tr>
<td>4</td>
<td>YVLDHLIVV</td>
<td>9.036</td>
<td>0.769</td>
</tr>
<tr>
<td>5</td>
<td>RYSIFFDYM</td>
<td rowspan="2">HLA-A*2402</td>
<td>8.583</td>
<td>0.45</td>
</tr>
<tr>
<td>6</td>
<td>TYPVLEEMF</td>
<td>7.784</td>
<td>0.455</td>
</tr>
<tr>
<td>7</td>
<td>IPQCRLTPL</td>
<td rowspan="2">HLA-B*0702</td>
<td>12.551</td>
<td>0.454</td>
</tr>
<tr>
<td>8</td>
<td>RPPIFIRRL</td>
<td>10.057</td>
<td>0.495</td>
</tr>
<tr>
<td>9</td>
<td>IACPIVMRY</td>
<td rowspan="3">HLA-B*1501</td>
<td>6.259</td>
<td>0.411</td>
</tr>
<tr>
<td>10</td>
<td>SQISNTEMY</td>
<td>10.109</td>
<td>0.486</td>
</tr>
<tr>
<td>11</td>
<td>VQTAAAVVF</td>
<td>6.888</td>
<td>0.428</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t3-fn-1">EBV: Epstein-Barr virus; PSSM: position-specific scoring matrices; ANN: artificial neural networks.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="t3-3">
<title>Generalizability evaluation</title>
<p id="p-24">The generalization ability of the MHC class I binding prediction model is essential for epitope prediction, as there are many HLA alleles with inadequate data for training an allele-specific model [<xref ref-type="bibr" rid="B68">68</xref>]. Therefore, we have performed a detailed analysis of the performance of PSSM/ANN predictors trained on one allele and their ability to accurately predict other alleles in their evaluation datasets. Pan-specific algorithms can predict peptide binding to HLA alleles for which limited or even no experimental data are available [<xref ref-type="bibr" rid="B69">69</xref>]. In <xref ref-type="fig" rid="fig3">Figure 3</xref>, the Aroc values for the models trained on the HLA-A*0201 dataset are given for 22 different alleles. The ANN model excellently performs over the PSSM model for alleles of the HLA-A*02 type, but for the other alleles, the performance of both models is poor, except for HLA-B*4002, 4501, 5101. The prediction capabilities are good to marginal for some alleles, suggesting that cross-allele prediction is feasible in some cases. This may be due to MHC supertype classification systems that make clustered sets of HLA molecules with largely overlapping peptide repertoires. These classification systems normally depend on descriptions such as published motifs and/or analysed shared repertoires of binding peptides, etc. [<xref ref-type="bibr" rid="B70">70</xref>–<xref ref-type="bibr" rid="B73">73</xref>]. Generally, HLA-A and -B alleles are not clustered in the same supertype, but our PSSM/ANN predictor trained on HLA-A*0201 allele was able to make reasonable predictions for HLA-B*4002, 4501, and 5101 alleles.</p>
<fig id="fig3" position="float">
<label>Figure 3</label>
<caption>
<p id="fig3-p-1">
<bold>The cross-allele performance of the PSSM and ANN prediction models, trained on the HLA-A*0201 dataset and tested on the evaluation dataset of all 22 alleles.</bold> Aroc: area under the receiver operating characteristic curve; PSSM: position-specific scoring matrices; ANN: artificial neural networks.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="ei-05-1003215-g003.tif" />
</fig>
</sec>
<sec id="t3-4">
<title>Rank measure analysis of PSSM and ANN predictors with NetMHCpan 4.0 and NetCTLpan 1.1</title>
<sec id="t3-4-1">
<title>NetCTLpan dataset</title>
<p id="p-25">Although the existence of homologous peptides between training and testing datasets has been avoided to provide real-world estimates of forecast performance metrics, the relative ranking of diverse predictors is principally unaffected by the existence of homologous peptides [<xref ref-type="bibr" rid="B63">63</xref>]. Thus, the additional performance evaluation of the present PSSM/ANN predictors was estimated in terms of the relative ranking of the reported ligands between all nonamers included in the source protein as described by Larsen et al. [<xref ref-type="bibr" rid="B45">45</xref>, <xref ref-type="bibr" rid="B46">46</xref>]. This evaluation indicates how large a portion of the peptides for a provided protein needs to be verified to identify the new epitopes [<xref ref-type="bibr" rid="B74">74</xref>]. For each of the stated ligands in the NetCTLpan dataset (Supplementary file 2 in <uri xlink:href="https://data.mendeley.com/datasets/dxz3dk3tcm/1">https://data.mendeley.com/datasets/dxz3dk3tcm/1</uri>), the source protein was identified, and the affinity of all nonamers contained in the source protein was forecasted (assuming that all nonamers, except for the reported ligand, are non-binders). Here, we define the term reliability of a forecast method as the probability of identifying an epitope in each protein within a certain top percentage of the peptides [<xref ref-type="bibr" rid="B45">45</xref>]. The rank measure performance of the NetMHCpan 4.0 method based on EL (1.11%) and BA (3.19%) showed similar results (comparatively higher average values) as compared to PSSM (2.23%) and ANN (3.13%) predictors. The results from these evaluations are encouraging because the most natural MHC binders compiled in the NetCTLpan dataset were identified within the top 5% (<xref ref-type="sec" rid="s-suppl">Table S7</xref>; <xref ref-type="table" rid="t4">Table 4</xref>). Therefore, in terms of wet laboratory work, nearly ~97% less expenses are consumed on materials, labor, and time for the peptides that require experimental verification to detect new epitopes in an antigen. However, the NetCTLpan method reports a rank measure of 3.7% for the peptides that need to be experimentally verified to detect new epitopes with 90% likelihood. Thus, for a hypothetical protein of 300 peptides, this means that, on average, 7 and 9 peptides need to be tested to identify the epitope using PSSM and ANN predictors, respectively. The corresponding numbers reported for NetCTL, NetMHCpan, and NetCTLpan were 17, 13, and 11 peptides, respectively [<xref ref-type="bibr" rid="B46">46</xref>, <xref ref-type="bibr" rid="B47">47</xref>]. Using the NetCTLpan tool, the experimental effort to discover 90% of new epitopes can be minimized by 15% and 40%, respectively, compared to the NetMHCpan and NetCTL tools [<xref ref-type="bibr" rid="B47">47</xref>]. However, the corresponding peptide numbers are 3 and 10 to identify new CTL epitopes using NetMHCpan 4.0 (EL) and NetMHCpan 4.0 (BA), respectively. Thus, the overall performance of EasyPred modeler-based predictors (PSSM and ANN) was found to be similar to NetMHCpan 4.0 (BA) but lower than NetMHCpan 4.0 (EL).</p>
<table-wrap id="t4">
<label>Table 4</label>
<caption>
<p id="t4-p-1">
<bold>Rank measure analysis of MHC class I predictors (PSSM and ANN) based on the NetCTLpan dataset.</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th rowspan="2">
<bold>S. No.</bold>
</th>
<th rowspan="2">
<bold>MHC class I allele</bold>
</th>
<th rowspan="2">
<bold>No. of antigen</bold>
</th>
<th colspan="4">
<bold>Average relative rank of epitopes in their source antigen (%)</bold>
</th>
</tr>
<tr>
<th>
<bold>PSSM</bold>
</th>
<th>
<bold>ANN</bold>
</th>
<th>
<bold>NetMHCpan 4.0 (EL)</bold>
</th>
<th>
<bold>NetMHCpan 4.0 (BA)</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>HLA-A*0101</td>
<td>29</td>
<td>0.36</td>
<td>1.84</td>
<td>0.3246</td>
<td>0.3159</td>
</tr>
<tr>
<td>2</td>
<td>HLA-A*0201</td>
<td>254</td>
<td>3.30</td>
<td>4.17</td>
<td>3.1785</td>
<td>17.405</td>
</tr>
<tr>
<td>3</td>
<td>HLA-A*1101</td>
<td>14</td>
<td>1.72</td>
<td>0.80</td>
<td>0.399</td>
<td>0.4765</td>
</tr>
<tr>
<td>4</td>
<td>HLA-A*6801</td>
<td>12</td>
<td>3.05</td>
<td>3.11</td>
<td>0.9291</td>
<td>0.907</td>
</tr>
<tr>
<td>5</td>
<td>HLA-A*0301</td>
<td>65</td>
<td>2.77</td>
<td>3.68</td>
<td>1.5704</td>
<td>1.4966</td>
</tr>
<tr>
<td>6</td>
<td>HLA-B*0702</td>
<td>25</td>
<td>1.59</td>
<td>4.71</td>
<td>0.6781</td>
<td>0.9225</td>
</tr>
<tr>
<td>7</td>
<td>HLA-B*4501</td>
<td>14</td>
<td>2.83</td>
<td>3.66</td>
<td>0.7079</td>
<td>0.8363</td>
</tr>
<tr>
<td colspan="3">Average</td>
<td>2.23</td>
<td>3.13</td>
<td>1.11</td>
<td>3.19</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t4-fn-1">MHC: major histocompatibility complex; PSSM: position-specific scoring matrices; ANN: artificial neural networks; EL: eluted ligands; BA: binding affinity.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="t3-4-2">
<title>SARS-CoV-2 dataset</title>
<p id="p-26">In a comparative evaluation (average rank measure analysis) of EasyPred modeler-based predictors (PSSM and ANN) on the SARS-CoV-2 dataset of Grifoni et al. [<xref ref-type="bibr" rid="B49">49</xref>], the PSSM predictor (2.46%) performed better than other T-cell epitope forecast methods, NetMHCpan 4.0 (EL) (2.66%), NetCTLpan 1.1 (2.69%), NetMHCpan 4.0 (BA) (3.33%), as well as its own ANN predictor (4.67%), respectively (<xref ref-type="table" rid="t5">Table 5</xref>). Similar results were also observed for evaluation on the SARS-CoV-2 dataset of Gfeller et al. [<xref ref-type="bibr" rid="B50">50</xref>]. In which both the predictors (PSSM and ANN) identified all the CD8+ T cell epitopes above their cutoff score, except for a few: LYLYALVYF (A*2402: 0.418), LWLLWPVTL (A*2402: 0.402), FTSDYYQLY (A*2402: 0.326), and YFPLQSYGF (A*2402: 0.38) in the case of ANN. Moreover, % rank measure analysis in their source protein revealed that the PSSM predictor identified the epitopes within 3% (average 2.4%) (<xref ref-type="table" rid="t6">Table 6</xref>). However, Nosrati et al. [<xref ref-type="bibr" rid="B75">75</xref>] recently established that ANN was the most accurate algorithm for distinguishing epitopes and non-epitopes of the Crimean-Congo hemorrhagic fever virus with an accuracy of 90%. Such evaluation knowledge is of urgent significance that would assist COVID-19 vaccine developers in facilitating the evaluation of vaccine candidate immunogenicity against human populations [<xref ref-type="bibr" rid="B76">76</xref>].</p>
<table-wrap id="t5">
<label>Table 5</label>
<caption>
<p id="t5-p-1">
<bold>Comparative using rank measure evaluation of PSSM and ANN predictors with up-to-date tools NetMHCpan 4.0 (EL/BA) and NetCTLpan 1.1 on SARS-CoV-2 dataset from the study of Grifoni et al. [<xref ref-type="bibr" rid="B49">49</xref>].</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th rowspan="2">
<bold>S. No.</bold>
</th>
<th rowspan="2">
<bold>T cell epitope</bold>
</th>
<th rowspan="2">
<bold>Source protein</bold>
</th>
<th colspan="5">
<bold>Average relative rank of epitopes in their source antigen (%)</bold>
</th>
</tr>
<tr>
<th>
<bold>PSSM</bold>
<break />
<bold>predictor</bold>
</th>
<th>
<bold>ANN</bold>
<break />
<bold>predictor</bold>
</th>
<th>
<bold>NetMHCpan 4.0 (EL)</bold>
</th>
<th>
<bold>NetMHCpan 4.0 (BA)</bold>
</th>
<th>
<bold>NetCTLpan 1.1</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>ALNTLVKQL </td>
<td>Spike</td>
<td>2.77</td>
<td>6.80</td>
<td>1.42</td>
<td>3.87</td>
<td>2.06</td>
</tr>
<tr>
<td>2</td>
<td>VLNDILSRL </td>
<td>Spike</td>
<td>0.79</td>
<td>0.95</td>
<td>0.16</td>
<td>0.47</td>
<td>0.32</td>
</tr>
<tr>
<td>3</td>
<td>LITGRLQSL</td>
<td>Spike</td>
<td>2.29</td>
<td>5.06</td>
<td>8.06</td>
<td>9.64</td>
<td>5.06</td>
</tr>
<tr>
<td>4</td>
<td>RLNEVAKNL </td>
<td>Spike</td>
<td>4.98</td>
<td>9.33</td>
<td>1.11</td>
<td>7.06</td>
<td>1.42</td>
</tr>
<tr>
<td>5</td>
<td>NLNESLIDL </td>
<td>Spike</td>
<td>1.58</td>
<td>4.35</td>
<td>0.79</td>
<td>1.58</td>
<td>1.11</td>
</tr>
<tr>
<td>6</td>
<td>FIAGLIAIV </td>
<td>Spike</td>
<td>0.40</td>
<td>0.28</td>
<td>0.40</td>
<td>0.16</td>
<td>0.79</td>
</tr>
<tr>
<td>7</td>
<td>ALNTPKDHI </td>
<td>Nucleocapsid</td>
<td>3.16</td>
<td>8.03</td>
<td>2.43</td>
<td>4.62</td>
<td>6.08</td>
</tr>
<tr>
<td>8</td>
<td>LQLPQGTTL </td>
<td>Nucleocapsid</td>
<td>2.92</td>
<td>5.12</td>
<td>1.70</td>
<td>2.91</td>
<td>1.12</td>
</tr>
<tr>
<td>9</td>
<td>LALLLLDRL </td>
<td>Nucleocapsid</td>
<td>7.06</td>
<td>10.71</td>
<td>12.41</td>
<td>5.56</td>
<td>10.94</td>
</tr>
<tr>
<td>10</td>
<td>LLLDRLNQL </td>
<td>Nucleocapsid</td>
<td>0.24</td>
<td>0.24</td>
<td>0.24</td>
<td>0.24</td>
<td>0.24</td>
</tr>
<tr>
<td>11</td>
<td>GMSRIGMEV </td>
<td>Nucleocapsid</td>
<td>0.71</td>
<td>0.49</td>
<td>0.49</td>
<td>0.49</td>
<td>0.49</td>
</tr>
<tr>
<td colspan="3">Average</td>
<td>2.46</td>
<td>4.67</td>
<td>2.66</td>
<td>3.33</td>
<td>2.69</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t5-fn-1">PSSM: position-specific scoring matrices; ANN: artificial neural networks; EL: eluted ligands; BA: binding affinity.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t6">
<label>Table 6</label>
<caption>
<p id="t6-p-1">
<bold>Evaluation of PSSM and ANN predictors on the SARS-CoV-2 dataset of Gfeller et al. [<xref ref-type="bibr" rid="B50">50</xref>] using threshold (cutoff score) and % rank measure analysis in the source protein.</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th rowspan="2">
<bold>S. No.</bold>
</th>
<th rowspan="2">
<bold>CD8+ T cell epitope</bold>
</th>
<th rowspan="2">
<bold>Source protein (SWISS-PROT accession no.)</bold>
</th>
<th rowspan="2">
<bold>HLA binding allele</bold>
</th>
<th colspan="2">
<bold>PSSM predictor</bold>
</th>
<th colspan="2">
<bold>ANN predictor</bold>
</th>
</tr>
<tr>
<th>
<bold>Score</bold>
</th>
<th>
<bold>% rank</bold>
</th>
<th>
<bold>Score</bold>
</th>
<th>
<bold>% rank</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>LYLYALVYF</td>
<td>AP3A (P0DTC3)</td>
<td rowspan="2">A*2402</td>
<td>9.218</td>
<td>0.75</td>
<td>0.418</td>
<td>13.48</td>
</tr>
<tr>
<td>2</td>
<td>LWLLWPVTL</td>
<td>VME1 (P0DTC5)</td>
<td>8.366</td>
<td>1.4</td>
<td>0.402</td>
<td>22.43</td>
</tr>
<tr>
<td>3</td>
<td>LPPAYTNSF</td>
<td>SPIKE (P0DTC2)</td>
<td>B*0702, B*3501, B*5301</td>
<td>9.718, 10.505, 9.574</td>
<td>0.32, 0.24, 0.87</td>
<td>0.457, 0.447, 0.495</td>
<td>2.21, 19.37, 6.56</td>
</tr>
<tr>
<td>4</td>
<td>FTSDYYQLY</td>
<td>AP3A (P0DTC3)</td>
<td>A*0101, A*2402, A*2902</td>
<td>14.473, 2.999, 11.658</td>
<td>0.37, 16.48, 0.37</td>
<td>0.491, 0.326, 0.540</td>
<td>34.08, 46.81, 22.47</td>
</tr>
<tr>
<td>5</td>
<td>YFPLQSYGF</td>
<td>SPIKE (P0DTC2)</td>
<td>A*2402</td>
<td>7.166</td>
<td>1.42</td>
<td>0.38</td>
<td>34.78</td>
</tr>
<tr>
<td>6</td>
<td>SASKIITLK</td>
<td>AP3A (P0DTC3)</td>
<td>A*0301, A*1101</td>
<td>6.574, 5.973</td>
<td>2.25, 1.9</td>
<td>0.672, 0.790</td>
<td>0.37, 0.37</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t6-fn-1">PSSM: position-specific scoring matrices; ANN: artificial neural networks.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="t3-4-3">
<title>CTL epitope processing forecast</title>
<p id="p-27">The PSSM and ANN predictors of the EasyPred modeler described in the present study have been shown to perform best when high-sensitivity forecasts for CTL epitope identification are focused (<xref ref-type="fig" rid="fig4">Figure 4</xref>). Most of the MHC molecules achieve a sensitivity of 80% at the threshold determined by the score of the top 85% in the training set, while some alleles (HLA-A*0101, A*1101, and A*6801) approached the sensitivity of 100%. If focusing on optimal sensitivity, it was shown that the forecast algorithm should exclude both proteasomal cleavage and TAP forecasts, reducing the method to the MHC binding forecast alone (<xref ref-type="fig" rid="fig4">Figure 4</xref>). Whether this observation reflects actual biological aspects of the specificity overlap between the three pathway players or it simply occurs because the forecast of MHC class I affinity has gained accuracy during the recent years, the predictors for TAP transport efficiency and proteasomal cleavage have not much changed or been renewed [<xref ref-type="bibr" rid="B77">77</xref>].</p>
<fig id="fig4" position="float">
<label>Figure 4</label>
<caption>
<p id="fig4-p-1">
<bold>Sensitivity analysis of PSSM and ANN predictors, TAP-2003 [<xref ref-type="bibr" rid="B52">52</xref>] and TAP-2004 [<xref ref-type="bibr" rid="B53">53</xref>] matrices, as well as constitutive- and immuno-proteasomal cleavage matrices in identifying CTL epitopes.</bold> MHC: major histocompatibility complex; PSSM: position-specific scoring matrices; ANN: artificial neural networks; TAP: transporters associated with antigen processing; CTL: cytotoxic T lymphocytes.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="ei-05-1003215-g004.tif" />
</fig>
</sec>
</sec>
</sec>
<sec id="s4">
<title>Discussion</title>
<p id="p-28">Antigen processing happens before the MHC binding, revealing the pool of peptides that can become T-cell epitopes. MHC class I-restricted CTL epitopes, in general, are obtained from protein pieces produced by the protease activity of the constitutive and/or immunoproteasome. The activity of several amino peptidases with the resulting loss of information shapes the N-terminus of every MHC class I-restricted peptide. In contrast, the C-terminus results from the proteasomal cleavage [<xref ref-type="bibr" rid="B78">78</xref>]. Moreover, it is believed that the immuno-proteasome is more responsible for producing CTL epitopes [<xref ref-type="bibr" rid="B54">54</xref>, <xref ref-type="bibr" rid="B79">79</xref>, <xref ref-type="bibr" rid="B80">80</xref>]. As a first approximation, proposed by Assarsson et al. [<xref ref-type="bibr" rid="B81">81</xref>], about 15% of all peptides that can be made from a protein are TAP transported into the ER, and about 2.5% of peptides that are made will bind to an MHC molecule. Further, about 50% of MHC binding peptides presented on the cell surface will be recognized by a CD8+ T cell receptor (TCR) and considered CTL epitopes. Using TAP binding as a filter, Doytchinova et al. [<xref ref-type="bibr" rid="B53">53</xref>] have also shown that the forecast of the peptide unable to bind with TAP decreases the number of peptides binding to MHC by 10–30% (depending on MHC allele). Although there are several bioinformatics programs available for the forecast of proteasomal cleavage sites in antigens [<xref ref-type="bibr" rid="B82">82</xref>, <xref ref-type="bibr" rid="B83">83</xref>], substantial success has been realized in blending these into an integrated CTL epitope forecast system. RankPep [<xref ref-type="bibr" rid="B36">36</xref>], ProPred1 [<xref ref-type="bibr" rid="B29">29</xref>], MAP [<xref ref-type="bibr" rid="B84">84</xref>], and PEPVAC [<xref ref-type="bibr" rid="B85">85</xref>] programs provide a platform for a concurrent forecast of proteasomal cleavage and MHC binding. Other virtual models of the endogenous antigen processing pathway have also been developed, incorporating proteasomal cleavage, TAP transport, and MHC class I binding forecasts [<xref ref-type="bibr" rid="B86">86</xref>, <xref ref-type="bibr" rid="B87">87</xref>] such as MAPPP [<xref ref-type="bibr" rid="B84">84</xref>], WAPP [<xref ref-type="bibr" rid="B88">88</xref>], EpiJen [<xref ref-type="bibr" rid="B28">28</xref>], MHC-pathway [<xref ref-type="bibr" rid="B89">89</xref>], and NetCTL [<xref ref-type="bibr" rid="B45">45</xref>]. Many of these methods have also proved their efficiency in screening new epitopes and designing poly-epitope vaccines for cancer [<xref ref-type="bibr" rid="B90">90</xref>, <xref ref-type="bibr" rid="B91">91</xref>], tuberculosis [<xref ref-type="bibr" rid="B92">92</xref>], Ebola [<xref ref-type="bibr" rid="B93">93</xref>], dengue [<xref ref-type="bibr" rid="B94">94</xref>], novel coronavirus [<xref ref-type="bibr" rid="B95">95</xref>], etc. In a large-scale benchmark evaluation of a publicly available MHC class I pathway presentation forecast server, Larsen et al. [<xref ref-type="bibr" rid="B46">46</xref>] showed that the NetCTL method substantially outperformed all these methods, closely followed by the MHC pathway. Further, the NetCTL method has also proven successful in the identification of CTL epitopes from Influenza [<xref ref-type="bibr" rid="B96">96</xref>], HIV [<xref ref-type="bibr" rid="B97">97</xref>], and Orthopoxvirus [<xref ref-type="bibr" rid="B98">98</xref>]. NetCTLpan [<xref ref-type="bibr" rid="B47">47</xref>] is also available for integrating many MHC class I allele binding, TAP transport efficiency, and proteasomal cleavage forecasts to an overall forecast of CTL epitopes. However, all these methods are limited because they allow for the forecast of peptide binding to pre-calculated parameters of MHC molecules and have one or more limitations. However, the current algorithms, PSSM and ANN predictors of the EasyPred modeler, are independent of using pre-calculated MHC binding parameters, and the user can make their own PSSM and ANN parameters for the forecast. Therefore, an integrated model was developed in the present study to predict CTL epitope primarily based on forecasting MHC class I binding and constitutive, as well as immuno-proteasomal cleavage and TAP binding as filters. This type of filtering algorithm has been shown to improve the forecast of CTL epitopes by decreasing the false positives and improving the process of drug and vaccine design strategies [<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B99">99</xref>]. It is worth noting that proteasomal cleavage, TAP transport, and MHC binding have largely undergone co-evolution so that MHC molecules have evolved to bind peptides in the ER. As a result, their combination does not provide vastly improved forecasts [<xref ref-type="bibr" rid="B89">89</xref>]. Thus, we recommend using the MHC class I binding forecasts as a primary tool to select candidate peptides for wet lab validations. We also recommend CTL processing forecasts to reduce further candidates to test in vitro/in vivo [<xref ref-type="bibr" rid="B51">51</xref>].</p>
<p id="p-29">An increasing body of literature substantiates the prediction of direct T cell immunogenicity, which refers to the relative capacity of a specific set of peptides that are bound within an MHC complex and recognized by the TCR. In the MHC-peptide-TCR complex, the residues P1, P2, and P9 of the peptide are most likely to interact directly with MHC binding, whereas the other residues, P3 to P8, are more likely to engage with the TCR. These investigations have shown that certain amino acids, including tryptophan, phenylalanine, and isoleucine, are prevalent in immunogenic peptides, while other residues, such as serine, methionine, and lysine, are less common. This phenomenon may be attributed to the longer side chains of tryptophan and phenylalanine, which have a higher likelihood of interacting with the TCR [<xref ref-type="bibr" rid="B100">100</xref>]. Consequently, it is possible to generate more immunogenic peptides through amino acid substitutions in naturally occurring MHC class I-restricted epitopes, thereby enhancing the stability and affinity of the MHC-peptide-TCR complex. This approach enhances the immunogenicity of MHC class I transitional affinity binders [<xref ref-type="bibr" rid="B101">101</xref>]. Furthermore, Rasmussen et al. [<xref ref-type="bibr" rid="B102">102</xref>] indicated that the stability of the peptide-MHC class I complex is crucial for eliciting T-cell responses. Therefore, parameters related to BA and stability are vital for characterizing and predicting peptide immunogenicity [<xref ref-type="bibr" rid="B103">103</xref>]. Given these more dependable structural models, peptide-based vaccines are likely the most favored and extensively researched due to their ease of synthesis and biocompatibility [<xref ref-type="bibr" rid="B104">104</xref>].</p>
<p id="p-30">However, the efficacy of peptide vaccines tends to be compromised by various factors such as rapid clearance, extracellular and enzymatic degradation, poor solubility (due to the presence of hydrophobic peptides), reduced immunogenicity, and reduced uptake of the peptide by antigen-presenting cells (APC) [<xref ref-type="bibr" rid="B105">105</xref>]. With current advancements in nanobiotechnology-based delivery systems, such as self-assembled peptide nanoparticles (SANP) using spacers for separation of epitopes for proteasomal cleavage, which can efficiently deliver peptide vaccines to APC, could help to surmount the limitations and elicit T cell responses in humans [<xref ref-type="bibr" rid="B106">106</xref>–<xref ref-type="bibr" rid="B109">109</xref>]. Successful production and efficiency of such vaccine delivery systems require deliberation of other variables, e.g., material toxicity, particle shape and size, surface charge, and stiffness or rigidity, which are crucial in various biological processes, such as bioavailability, biodistribution, and cytotoxicity, including activation of adaptive immune responses [<xref ref-type="bibr" rid="B110">110</xref>, <xref ref-type="bibr" rid="B111">111</xref>] and specific other biomedical applications [<xref ref-type="bibr" rid="B112">112</xref>].</p>
<p id="p-31">In conclusion, this study emphasizes the creation of an innovative forecasting model aimed at identifying peptide interactions with MHC class I molecules through the application of sequence weighting and ANN methodologies. As a result of this research, we have successfully established new forecasting parameters, specifically PSSM and ANN weights, for predicting MHC binding peptides. The models developed can be utilized alongside the EasyPred modeler to forecast MHC class I binding peptides, which includes the processing of CTL epitopes. Overall, the findings discussed highlight the importance of predicting MHC peptide binding for epitope identification, while also acknowledging the challenges that persist, indicating significant opportunities for enhancement and integration with other structure-based approaches. These forecasting techniques will relieve vaccinologists and immunologists from the burdens of uninformed experimentation, enabling them to devise improved, quicker, and more innovative methods for discovering new reagents, diagnostics, and vaccines.</p>
</sec>
</body>
<back>
<glossary>
<title>Abbreviations</title>
<def-list>
<def-item>
<term>ANN</term>
<def>
<p>artificial neural networks</p>
</def>
</def-item>
<def-item>
<term>APC</term>
<def>
<p>antigen-presenting cells</p>
</def>
</def-item>
<def-item>
<term>Aroc</term>
<def>
<p>area under the receiver operating characteristic curve</p>
</def>
</def-item>
<def-item>
<term>BA</term>
<def>
<p>binding affinity</p>
</def>
</def-item>
<def-item>
<term>CC</term>
<def>
<p>correlation coefficient</p>
</def>
</def-item>
<def-item>
<term>CTL</term>
<def>
<p>cytotoxic T lymphocytes</p>
</def>
</def-item>
<def-item>
<term>EBV</term>
<def>
<p>Epstein-Barr virus</p>
</def>
</def-item>
<def-item>
<term>EL</term>
<def>
<p>eluted ligands</p>
</def>
</def-item>
<def-item>
<term>ER</term>
<def>
<p>endoplasmic reticulum</p>
</def>
</def-item>
<def-item>
<term>IC<sub>50</sub></term>
<def>
<p>half-maximal inhibitory concentration</p>
</def>
</def-item>
<def-item>
<term>LTMBA</term>
<def>
<p>log-transformed major histocompatibility complex binding affinity</p>
</def>
</def-item>
<def-item>
<term>MHC</term>
<def>
<p>major histocompatibility complex</p>
</def>
</def-item>
<def-item>
<term>MSA</term>
<def>
<p>multiple sequence alignment</p>
</def>
</def-item>
<def-item>
<term>PSSM</term>
<def>
<p>position-specific scoring matrices</p>
</def>
</def-item>
<def-item>
<term>RED</term>
<def>
<p>Repository for Epitope Datasets</p>
</def>
</def-item>
<def-item>
<term>TAP</term>
<def>
<p>transporters associated with antigen processing</p>
</def>
</def-item>
<def-item>
<term>TCR</term>
<def>
<p>T cell receptor</p>
</def>
</def-item>
</def-list>
</glossary>
<sec id="s-suppl" sec-type="supplementary-material">
<title>Supplementary materials</title>
<p>The supplementary materials for this article are available at: <uri xlink:href="https://www.explorationpub.com/uploads/Article/file/1003215_sup_1.pdf">https://www.explorationpub.com/uploads/Article/file/1003215_sup_1.pdf</uri>.</p>
<supplementary-material id="SD1" content-type="local-data">
<media xlink:href="1003215_sup_1.pdf" mimetype="application" mime-subtype="pdf"></media>
</supplementary-material>
</sec>
<sec id="s6">
<title>Declarations</title>
<sec id="t-6-1">
<title>Acknowledgments</title>
<p>The authors are thankful to Prof. Brijesh Pandey, Mahatma Gandhi Central University, Motihari, for the valuable suggestions in revising the manuscript.</p>
</sec>
<sec id="t-6-2">
<title>Author contributions</title>
<p>SPS: Conceptualization, Methodology, Investigation, Writing—review &amp; editing. GS: Data curation, Investigation. BNM: Supervision, Writing—review &amp; editing. All authors read and approved the submitted version.</p>
</sec>
<sec id="t-6-3" sec-type="COI-statement">
<title>Conflicts of interest</title>
<p>The authors declare that they have no conflicts of interest.</p>
</sec>
<sec id="t-6-4">
<title>Ethical approval</title>
<p>Not applicable.</p>
</sec>
<sec id="t-6-5">
<title>Consent to participate</title>
<p>Not applicable.</p>
</sec>
<sec id="t-6-6">
<title>Consent to publication</title>
<p>Not applicable.</p>
</sec>
<sec id="t-6-7" sec-type="data-availability">
<title>Availability of data and materials</title>
<p>Supplemental information can be found online at <uri xlink:href="https://data.mendeley.com/datasets/dxz3dk3tcm/1">https://data.mendeley.com/datasets/dxz3dk3tcm/1</uri> at Mendeley data (DOI: 10.17632/dxz3dk3tcm.1). Data are licensed under an Attribution-NonCommercial 3.0 Unported licence. All figures in the study were created using the MS-Excel program.</p>
</sec>
<sec id="t-6-8">
<title>Funding</title>
<p>Not applicable.</p>
</sec>
<sec id="t-6-9">
<title>Copyright</title>
<p>© The Author(s) 2025.</p>
</sec>
</sec>
<sec id="s7">
<title>Publisher’s note</title>
<p>Open Exploration maintains a neutral stance on jurisdictional claims in published institutional affiliations and maps. All opinions expressed in this article are the personal views of the author(s) and do not represent the stance of the editorial team or the publisher.</p>
</sec>
<ref-list>
<ref id="B1">
<label>1</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kazi</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Chuah</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Majeed</surname>
<given-names>ABA</given-names>
</name>
<name>
<surname>Leow</surname>
<given-names>CH</given-names>
</name>
<name>
<surname>Lim</surname>
<given-names>BH</given-names>
</name>
<name>
<surname>Leow</surname>
<given-names>CY</given-names>
</name>
</person-group>
<article-title>Current progress of immunoinformatics approach harnessed for cellular- and antibody-dependent vaccine design</article-title>
<source>Pathog Glob Health</source>
<year iso-8601-date="2018">2018</year>
<volume>112</volume>
<fpage>123</fpage>
<lpage>31</lpage>
<pub-id pub-id-type="doi">10.1080/20477724.2018.1446773</pub-id>
<pub-id pub-id-type="pmid">29528265</pub-id>
<pub-id pub-id-type="pmcid">PMC6056828</pub-id>
</element-citation>
</ref>
<ref id="B2">
<label>2</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gilbert</surname>
<given-names>SC</given-names>
</name>
</person-group>
<article-title>T-cell-inducing vaccines - what’s the future</article-title>
<source>Immunology</source>
<year iso-8601-date="2012">2012</year>
<volume>135</volume>
<fpage>19</fpage>
<lpage>26</lpage>
<pub-id pub-id-type="doi">10.1111/j.1365-2567.2011.03517.x</pub-id>
<pub-id pub-id-type="pmid">22044118</pub-id>
<pub-id pub-id-type="pmcid">PMC3246649</pub-id>
</element-citation>
</ref>
<ref id="B3">
<label>3</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rudolph</surname>
<given-names>MG</given-names>
</name>
<name>
<surname>Stanfield</surname>
<given-names>RL</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>IA</given-names>
</name>
</person-group>
<article-title>How TCRs bind MHCs, peptides, and coreceptors</article-title>
<source>Annu Rev Immunol</source>
<year iso-8601-date="2006">2006</year>
<volume>24</volume>
<fpage>419</fpage>
<lpage>66</lpage>
<pub-id pub-id-type="doi">10.1146/annurev.immunol.23.021704.115658</pub-id>
<pub-id pub-id-type="pmid">16551255</pub-id>
</element-citation>
</ref>
<ref id="B4">
<label>4</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kägi</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Ledermann</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bürki</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Seiler</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Odermatt</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Olsen</surname>
<given-names>KJ</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Cytotoxicity mediated by T cells and natural killer cells is greatly impaired in perforin-deficient mice</article-title>
<source>Nature</source>
<year iso-8601-date="1994">1994</year>
<volume>369</volume>
<fpage>31</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="doi">10.1038/369031a0</pub-id>
<pub-id pub-id-type="pmid">8164737</pub-id>
</element-citation>
</ref>
<ref id="B5">
<label>5</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lankat-Buttgereit</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Tampé</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>The transporter associated with antigen processing: function and implications in human diseases</article-title>
<source>Physiol Rev</source>
<year iso-8601-date="2002">2002</year>
<volume>82</volume>
<fpage>187</fpage>
<lpage>204</lpage>
<pub-id pub-id-type="doi">10.1152/physrev.00025.2001</pub-id>
<pub-id pub-id-type="pmid">11773612</pub-id>
</element-citation>
</ref>
<ref id="B6">
<label>6</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Craiu</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Akopian</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Goldberg</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Rock</surname>
<given-names>KL</given-names>
</name>
</person-group>
<article-title>Two distinct proteolytic processes in the generation of a major histocompatibility complex class I-presented peptide</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year iso-8601-date="1997">1997</year>
<volume>94</volume>
<fpage>10850</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.94.20.10850</pub-id>
<pub-id pub-id-type="pmid">9380723</pub-id>
<pub-id pub-id-type="pmcid">PMC23505</pub-id>
</element-citation>
</ref>
<ref id="B7">
<label>7</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>van Endert</surname>
<given-names>PM</given-names>
</name>
<name>
<surname>Riganelli</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Greco</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Fleischhauer</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>The peptide-binding motif for the human transporter associated with antigen processing</article-title>
<source>J Exp Med</source>
<year iso-8601-date="1995">1995</year>
<volume>182</volume>
<fpage>1883</fpage>
<lpage>95</lpage>
<pub-id pub-id-type="doi">10.1084/jem.182.6.1883</pub-id>
<pub-id pub-id-type="pmid">7500034</pub-id>
<pub-id pub-id-type="pmcid">PMC2192244</pub-id>
</element-citation>
</ref>
<ref id="B8">
<label>8</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schatz</surname>
<given-names>MM</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Akkad</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Ullrich</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Martinez</surname>
<given-names>AN</given-names>
</name>
<name>
<surname>Carroll</surname>
<given-names>O</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Characterizing the N-terminal processing motif of MHC class I ligands</article-title>
<source>J Immunol</source>
<year iso-8601-date="2008">2008</year>
<volume>180</volume>
<fpage>3210</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.180.5.3210</pub-id>
<pub-id pub-id-type="pmid">18292545</pub-id>
</element-citation>
</ref>
<ref id="B9">
<label>9</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Matsumura</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Fremont</surname>
<given-names>DH</given-names>
</name>
<name>
<surname>Peterson</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Wilson</surname>
<given-names>IA</given-names>
</name>
</person-group>
<article-title>Emerging principles for the recognition of peptide antigens by MHC class I molecules</article-title>
<source>Science</source>
<year iso-8601-date="1992">1992</year>
<volume>257</volume>
<fpage>927</fpage>
<lpage>34</lpage>
<pub-id pub-id-type="doi">10.1126/science.1323878</pub-id>
<pub-id pub-id-type="pmid">1323878</pub-id>
</element-citation>
</ref>
<ref id="B10">
<label>10</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pamer</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Cresswell</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>Mechanisms of MHC class I--restricted antigen processing</article-title>
<source>Annu Rev Immunol</source>
<year iso-8601-date="1998">1998</year>
<volume>16</volume>
<fpage>323</fpage>
<lpage>58</lpage>
<pub-id pub-id-type="doi">10.1146/annurev.immunol.16.1.323</pub-id>
<pub-id pub-id-type="pmid">9597133</pub-id>
</element-citation>
</ref>
<ref id="B11">
<label>11</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yewdell</surname>
<given-names>JW</given-names>
</name>
<name>
<surname>Bennink</surname>
<given-names>JR</given-names>
</name>
</person-group>
<article-title>Immunodominance in major histocompatibility complex class I-restricted T lymphocyte responses</article-title>
<source>Annu Rev Immunol</source>
<year iso-8601-date="1999">1999</year>
<volume>17</volume>
<fpage>51</fpage>
<lpage>88</lpage>
<pub-id pub-id-type="doi">10.1146/annurev.immunol.17.1.51</pub-id>
<pub-id pub-id-type="pmid">10358753</pub-id>
</element-citation>
</ref>
<ref id="B12">
<label>12</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhong</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Reche</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Lai</surname>
<given-names>CC</given-names>
</name>
<name>
<surname>Reinhold</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Reinherz</surname>
<given-names>EL</given-names>
</name>
</person-group>
<article-title>Genome-wide characterization of a viral cytotoxic T lymphocyte epitope repertoire</article-title>
<source>J Biol Chem</source>
<year iso-8601-date="2003">2003</year>
<volume>278</volume>
<fpage>45135</fpage>
<lpage>44</lpage>
<pub-id pub-id-type="doi">10.1074/jbc.M307417200</pub-id>
<pub-id pub-id-type="pmid">12960169</pub-id>
</element-citation>
</ref>
<ref id="B13">
<label>13</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Construction and immunogenicity prediction of Plasmodium falciparum CTL epitope minigene vaccine</article-title>
<source>Sci China C Life Sci</source>
<year iso-8601-date="2001">2001</year>
<volume>44</volume>
<fpage>207</fpage>
<lpage>15</lpage>
<pub-id pub-id-type="doi">10.1007/BF02879327</pub-id>
<pub-id pub-id-type="pmid">18726439</pub-id>
</element-citation>
</ref>
<ref id="B14">
<label>14</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>GL</given-names>
</name>
<name>
<surname>Khan</surname>
<given-names>AM</given-names>
</name>
<name>
<surname>Srinivasan</surname>
<given-names>KN</given-names>
</name>
<name>
<surname>August</surname>
<given-names>JT</given-names>
</name>
<name>
<surname>Brusic</surname>
<given-names>V</given-names>
</name>
</person-group>
<article-title>MULTIPRED: a computational system for prediction of promiscuous HLA binding peptides</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2005">2005</year>
<volume>33</volume>
<fpage>W172</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gki452</pub-id>
<pub-id pub-id-type="pmid">15980449</pub-id>
<pub-id pub-id-type="pmcid">PMC1160213</pub-id>
</element-citation>
</ref>
<ref id="B15">
<label>15</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kar</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Ruiz-Perez</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Arooj</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Mancera</surname>
<given-names>RL</given-names>
</name>
</person-group>
<article-title>Current methods for the prediction of T‐cell epitopes</article-title>
<source>Pept Sci</source>
<year iso-8601-date="2018">2018</year>
<volume>110</volume>
<elocation-id>e24046</elocation-id>
<pub-id pub-id-type="doi">10.1002/pep2.24046</pub-id>
</element-citation>
</ref>
<ref id="B16">
<label>16</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gowthaman</surname>
<given-names>U</given-names>
</name>
<name>
<surname>Agrewala</surname>
<given-names>JN</given-names>
</name>
</person-group>
<article-title>In silico methods for predicting T-cell epitopes: Dr Jekyll or Mr Hyde?</article-title>
<source>Expert Rev Proteomics</source>
<year iso-8601-date="2009">2009</year>
<volume>6</volume>
<fpage>527</fpage>
<lpage>37</lpage>
<pub-id pub-id-type="doi">10.1586/epr.09.71</pub-id>
<pub-id pub-id-type="pmid">19811074</pub-id>
</element-citation>
</ref>
<ref id="B17">
<label>17</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>X</given-names>
</name>
</person-group>
<article-title>An introduction to epitope prediction methods and software</article-title>
<source>Rev Med Virol</source>
<year iso-8601-date="2009">2009</year>
<volume>19</volume>
<fpage>77</fpage>
<lpage>96</lpage>
<pub-id pub-id-type="doi">10.1002/rmv.602</pub-id>
<pub-id pub-id-type="pmid">19101924</pub-id>
</element-citation>
</ref>
<ref id="B18">
<label>18</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Bourne</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Bui</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Buus</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Doh</surname>
<given-names>G</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>The immune epitope database and analysis resource: from vision to blueprint</article-title>
<source>PLoS Biol</source>
<year iso-8601-date="2005">2005</year>
<volume>3</volume>
<elocation-id>e91</elocation-id>
<pub-id pub-id-type="doi">10.1371/journal.pbio.0030091</pub-id>
<pub-id pub-id-type="pmid">15760272</pub-id>
<pub-id pub-id-type="pmcid">PMC1065705</pub-id>
</element-citation>
</ref>
<ref id="B19">
<label>19</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Udaka</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Mamitsuka</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Toward more accurate pan-specific MHC-peptide binding prediction: a review of current methods and tools</article-title>
<source>Brief Bioinform</source>
<year iso-8601-date="2012">2012</year>
<volume>13</volume>
<fpage>350</fpage>
<lpage>64</lpage>
<pub-id pub-id-type="doi">10.1093/bib/bbr060</pub-id>
<pub-id pub-id-type="pmid">21949215</pub-id>
</element-citation>
</ref>
<ref id="B20">
<label>20</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhasin</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Raghava</surname>
<given-names>GPS</given-names>
</name>
</person-group>
<article-title>Prediction of CTL epitopes using QM, SVM and ANN techniques</article-title>
<source>Vaccine</source>
<year iso-8601-date="2004">2004</year>
<volume>22</volume>
<fpage>3195</fpage>
<lpage>204</lpage>
<pub-id pub-id-type="doi">10.1016/j.vaccine.2004.02.005</pub-id>
<pub-id pub-id-type="pmid">15297074</pub-id>
</element-citation>
</ref>
<ref id="B21">
<label>21</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Trolle</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Metushi</surname>
<given-names>IG</given-names>
</name>
<name>
<surname>Greenbaum</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Automated benchmarking of peptide-MHC class I binding predictions</article-title>
<source>Bioinformatics</source>
<year iso-8601-date="2015">2015</year>
<volume>31</volume>
<fpage>2174</fpage>
<lpage>81</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btv123</pub-id>
<pub-id pub-id-type="pmid">25717196</pub-id>
<pub-id pub-id-type="pmcid">PMC4481849</pub-id>
</element-citation>
</ref>
<ref id="B22">
<label>22</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>HH</given-names>
</name>
<name>
<surname>Ray</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Tongchusak</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Reinherz</surname>
<given-names>EL</given-names>
</name>
<name>
<surname>Brusic</surname>
<given-names>V</given-names>
</name>
</person-group>
<article-title>Evaluation of MHC class I peptide binding prediction servers: applications for vaccine research</article-title>
<source>BMC Immunol</source>
<year iso-8601-date="2008">2008</year>
<volume>9</volume>
<elocation-id>8</elocation-id>
<pub-id pub-id-type="doi">10.1186/1471-2172-9-8</pub-id>
<pub-id pub-id-type="pmid">18366636</pub-id>
<pub-id pub-id-type="pmcid">PMC2323361</pub-id>
</element-citation>
</ref>
<ref id="B23">
<label>23</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Paul</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>TepiTool: A Pipeline for Computational Prediction of T Cell Epitope Candidates</article-title>
<source>Curr Protoc Immunol</source>
<year iso-8601-date="2016">2016</year>
<volume>114</volume>
<fpage>18.19.1</fpage>
<lpage>24</lpage>
<pub-id pub-id-type="doi">10.1002/cpim.12</pub-id>
<pub-id pub-id-type="pmid">27479659</pub-id>
<pub-id pub-id-type="pmcid">PMC4981331</pub-id>
</element-citation>
</ref>
<ref id="B24">
<label>24</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dimitrov</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Garnev</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Flower</surname>
<given-names>DR</given-names>
</name>
<name>
<surname>Doytchinova</surname>
<given-names>I</given-names>
</name>
</person-group>
<article-title>MHC Class II Binding Prediction-A Little Help from a Friend</article-title>
<source>J Biomed Biotechnol</source>
<year iso-8601-date="2010">2010</year>
<volume>2010</volume>
<elocation-id>705821</elocation-id>
<pub-id pub-id-type="doi">10.1155/2010/705821</pub-id>
<pub-id pub-id-type="pmid">20508817</pub-id>
<pub-id pub-id-type="pmcid">PMC2875769</pub-id>
</element-citation>
</ref>
<ref id="B25">
<label>25</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hobohm</surname>
<given-names>U</given-names>
</name>
<name>
<surname>Scharf</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Schneider</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Sander</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Selection of representative protein data sets</article-title>
<source>Protein Sci</source>
<year iso-8601-date="1992">1992</year>
<volume>1</volume>
<fpage>409</fpage>
<lpage>17</lpage>
<pub-id pub-id-type="doi">10.1002/pro.5560010313</pub-id>
<pub-id pub-id-type="pmid">1304348</pub-id>
<pub-id pub-id-type="pmcid">PMC2142204</pub-id>
</element-citation>
</ref>
<ref id="B26">
<label>26</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Henikoff</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Henikoff</surname>
<given-names>JG</given-names>
</name>
</person-group>
<article-title>Position-based sequence weights</article-title>
<source>J Mol Biol</source>
<year iso-8601-date="1994">1994</year>
<volume>243</volume>
<fpage>574</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1016/0022-2836(94)90032-9</pub-id>
<pub-id pub-id-type="pmid">7966282</pub-id>
</element-citation>
</ref>
<ref id="B27">
<label>27</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Worning</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Lauemøller</surname>
<given-names>SL</given-names>
</name>
<name>
<surname>Lamberth</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Buus</surname>
<given-names>S</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Reliable prediction of T-cell epitopes using neural networks with novel sequence representations</article-title>
<source>Protein Sci</source>
<year iso-8601-date="2003">2003</year>
<volume>12</volume>
<fpage>1007</fpage>
<lpage>17</lpage>
<pub-id pub-id-type="doi">10.1110/ps.0239403</pub-id>
<pub-id pub-id-type="pmid">12717023</pub-id>
<pub-id pub-id-type="pmcid">PMC2323871</pub-id>
</element-citation>
</ref>
<ref id="B28">
<label>28</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Doytchinova</surname>
<given-names>IA</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Flower</surname>
<given-names>DR</given-names>
</name>
</person-group>
<article-title>EpiJen: a server for multistep T cell epitope prediction</article-title>
<source>BMC Bioinformatics</source>
<year iso-8601-date="2006">2006</year>
<volume>7</volume>
<elocation-id>131</elocation-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-7-131</pub-id>
<pub-id pub-id-type="pmid">16533401</pub-id>
<pub-id pub-id-type="pmcid">PMC1421443</pub-id>
</element-citation>
</ref>
<ref id="B29">
<label>29</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Raghava</surname>
<given-names>GPS</given-names>
</name>
</person-group>
<article-title>ProPred1: prediction of promiscuous MHC Class-I binding sites</article-title>
<source>Bioinformatics</source>
<year iso-8601-date="2003">2003</year>
<volume>19</volume>
<fpage>1009</fpage>
<lpage>14</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btg108</pub-id>
<pub-id pub-id-type="pmid">12761064</pub-id>
</element-citation>
</ref>
<ref id="B30">
<label>30</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jaishwal</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Jha</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>SP</given-names>
</name>
</person-group>
<article-title>Revisiting the dimensions of universal vaccine with special focus on COVID-19: Efficacy versus methods of designing</article-title>
<source>Int J Biol Macromol</source>
<year iso-8601-date="2024">2024</year>
<volume>277</volume>
<elocation-id>134012</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.ijbiomac.2024.134012</pub-id>
<pub-id pub-id-type="pmid">39048013</pub-id>
</element-citation>
</ref>
<ref id="B31">
<label>31</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Blicher</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Lamberth</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Harndahl</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Justesen</surname>
<given-names>S</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>NetMHCpan, a method for quantitative predictions of peptide binding to any HLA-A and -B locus protein of known sequence</article-title>
<source>PLoS One</source>
<year iso-8601-date="2007">2007</year>
<volume>2</volume>
<elocation-id>e796</elocation-id>
<pub-id pub-id-type="doi">10.1371/journal.pone.0000796</pub-id>
<pub-id pub-id-type="pmid">17726526</pub-id>
<pub-id pub-id-type="pmcid">PMC1949492</pub-id>
</element-citation>
</ref>
<ref id="B32">
<label>32</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gribskov</surname>
<given-names>M</given-names>
</name>
<name>
<surname>McLachlan</surname>
<given-names>AD</given-names>
</name>
<name>
<surname>Eisenberg</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Profile analysis: detection of distantly related proteins</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year iso-8601-date="1987">1987</year>
<volume>84</volume>
<fpage>4355</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1073/pnas.84.13.4355</pub-id>
<pub-id pub-id-type="pmid">3474607</pub-id>
<pub-id pub-id-type="pmcid">PMC305087</pub-id>
</element-citation>
</ref>
<ref id="B33">
<label>33</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lüthy</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Xenarios</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Bucher</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>Improving the sensitivity of the sequence profile method</article-title>
<source>Protein Sci</source>
<year iso-8601-date="1994">1994</year>
<volume>3</volume>
<fpage>139</fpage>
<lpage>46</lpage>
<pub-id pub-id-type="doi">10.1002/pro.5560030118</pub-id>
<pub-id pub-id-type="pmid">7511453</pub-id>
<pub-id pub-id-type="pmcid">PMC2142471</pub-id>
</element-citation>
</ref>
<ref id="B34">
<label>34</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Altschul</surname>
<given-names>SF</given-names>
</name>
<name>
<surname>Madden</surname>
<given-names>TL</given-names>
</name>
<name>
<surname>Schäffer</surname>
<given-names>AA</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>W</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="1997">1997</year>
<volume>25</volume>
<fpage>3389</fpage>
<lpage>402</lpage>
<pub-id pub-id-type="doi">10.1093/nar/25.17.3389</pub-id>
<pub-id pub-id-type="pmid">9254694</pub-id>
<pub-id pub-id-type="pmcid">PMC146917</pub-id>
</element-citation>
</ref>
<ref id="B35">
<label>35</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Worning</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Hvid</surname>
<given-names>CS</given-names>
</name>
<name>
<surname>Lamberth</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Buus</surname>
<given-names>S</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach</article-title>
<source>Bioinformatics</source>
<year iso-8601-date="2004">2004</year>
<volume>20</volume>
<fpage>1388</fpage>
<lpage>97</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/bth100</pub-id>
<pub-id pub-id-type="pmid">14962912</pub-id>
</element-citation>
</ref>
<ref id="B36">
<label>36</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reche</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Reinherz</surname>
<given-names>EL</given-names>
</name>
</person-group>
<article-title>Definition of MHC supertypes through clustering of MHC peptide-binding repertoires</article-title>
<source>Methods Mol Biol</source>
<year iso-8601-date="2007">2007</year>
<volume>409</volume>
<fpage>163</fpage>
<lpage>73</lpage>
<pub-id pub-id-type="doi">10.1007/978-1-60327-118-9_11</pub-id>
<pub-id pub-id-type="pmid">18449999</pub-id>
</element-citation>
</ref>
<ref id="B37">
<label>37</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brusic</surname>
<given-names>V</given-names>
</name>
<name>
<surname>van Endert</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Zeleznikow</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Daniel</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Hammer</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Petrovsky</surname>
<given-names>N</given-names>
</name>
</person-group>
<article-title>A neural network model approach to the study of human TAP transporter</article-title>
<source>In Silico Biol</source>
<year iso-8601-date="1999">1999</year>
<volume>1</volume>
<fpage>109</fpage>
<lpage>21</lpage>
<pub-id pub-id-type="pmid">11471244</pub-id>
</element-citation>
</ref>
<ref id="B38">
<label>38</label>
<element-citation publication-type="confproc">
<person-group person-group-type="author">
<name>
<surname>Brusic</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Zeleznikow</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sturniolo</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Bono</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Hammer</surname>
<given-names>J</given-names>
</name>
</person-group>
<comment>Data cleaning for computer models: a case study from immunology. Proceedings of the 6th International Conference on Neural Information Processing (ICONIP’99/ANZIIS'99/ANNES’99/ACNN’99); 1999 Nov 22–24; Perth, Australia. Piscataway (NJ): IEEE; 1999. pp. 603–9.</comment>
<pub-id pub-id-type="doi">10.1109/ICONIP.1999.845663</pub-id>
</element-citation>
</ref>
<ref id="B39">
<label>39</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Kesmir</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Brunak</surname>
<given-names>S</given-names>
</name>
</person-group>
<source>Immunological Bioinformatics</source>
<publisher-loc>Cambridge</publisher-loc>
<publisher-name>The MIT Press</publisher-name>
<year iso-8601-date="2005">2005</year>
</element-citation>
</ref>
<ref id="B40">
<label>40</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhao</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Sher</surname>
<given-names>X</given-names>
</name>
</person-group>
<article-title>Systematically benchmarking peptide-MHC binding predictors: From synthetic to naturally processed epitopes</article-title>
<source>PLoS Comput Biol</source>
<year iso-8601-date="2018">2018</year>
<volume>14</volume>
<elocation-id>e1006457</elocation-id>
<pub-id pub-id-type="doi">10.1371/journal.pcbi.1006457</pub-id>
<pub-id pub-id-type="pmid">30408041</pub-id>
<pub-id pub-id-type="pmcid">PMC6224037</pub-id>
</element-citation>
</ref>
<ref id="B41">
<label>41</label>
<element-citation publication-type="book">
<person-group person-group-type="author">
<name>
<surname>Rumelhart</surname>
<given-names>DE</given-names>
</name>
<name>
<surname>Hinton</surname>
<given-names>GE</given-names>
</name>
<name>
<surname>Williams</surname>
<given-names>RJ</given-names>
</name>
</person-group>
<source>Parallel distributed processing: explorations in the microstructure of cognition</source>
<publisher-loc>Cambridge</publisher-loc>
<publisher-name>MIT Press</publisher-name>
<year iso-8601-date="1986">1986</year>
<comment>pp. 318–62.</comment>
</element-citation>
</ref>
<ref id="B42">
<label>42</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Swets</surname>
<given-names>JA</given-names>
</name>
</person-group>
<article-title>Measuring the accuracy of diagnostic systems</article-title>
<source>Science</source>
<year iso-8601-date="1988">1988</year>
<volume>240</volume>
<fpage>1285</fpage>
<lpage>93</lpage>
<pub-id pub-id-type="doi">10.1126/science.3287615</pub-id>
<pub-id pub-id-type="pmid">3287615</pub-id>
</element-citation>
</ref>
<ref id="B43">
<label>43</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lasko</surname>
<given-names>TA</given-names>
</name>
<name>
<surname>Bhagwat</surname>
<given-names>JG</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>KH</given-names>
</name>
<name>
<surname>Ohno-Machado</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>The use of receiver operating characteristic curves in biomedical informatics</article-title>
<source>J Biomed Inform</source>
<year iso-8601-date="2005">2005</year>
<volume>38</volume>
<fpage>404</fpage>
<lpage>15</lpage>
<pub-id pub-id-type="doi">10.1016/j.jbi.2005.02.008</pub-id>
<pub-id pub-id-type="pmid">16198999</pub-id>
</element-citation>
</ref>
<ref id="B44">
<label>44</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wohlwend</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Nathan</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Shalon</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Crain</surname>
<given-names>CR</given-names>
</name>
<name>
<surname>Tano-Menka</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Goldberg</surname>
<given-names>B</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Deep learning enhances the prediction of HLA class I-presented CD8<sup>+</sup> T cell epitopes in foreign pathogens</article-title>
<source>Nat Mach Intell</source>
<year iso-8601-date="2025">2025</year>
<volume>7</volume>
<fpage>232</fpage>
<lpage>43</lpage>
<pub-id pub-id-type="doi">10.1038/s42256-024-00971-y</pub-id>
<pub-id pub-id-type="pmid">40008296</pub-id>
<pub-id pub-id-type="pmcid">PMC11847706</pub-id>
</element-citation>
</ref>
<ref id="B45">
<label>45</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Larsen</surname>
<given-names>MV</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Lamberth</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Buus</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Brunak</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>An integrative approach to CTL epitope prediction: a combined algorithm integrating MHC class I binding, TAP transport efficiency, and proteasomal cleavage predictions</article-title>
<source>Eur J Immunol</source>
<year iso-8601-date="2005">2005</year>
<volume>35</volume>
<fpage>2295</fpage>
<lpage>303</lpage>
<pub-id pub-id-type="doi">10.1002/eji.200425811</pub-id>
<pub-id pub-id-type="pmid">15997466</pub-id>
</element-citation>
</ref>
<ref id="B46">
<label>46</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Larsen</surname>
<given-names>MV</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Lamberth</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Buus</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Large-scale validation of methods for cytotoxic T-lymphocyte epitope prediction</article-title>
<source>BMC Bioinformatics</source>
<year iso-8601-date="2007">2007</year>
<volume>8</volume>
<elocation-id>424</elocation-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-8-424</pub-id>
<pub-id pub-id-type="pmid">17973982</pub-id>
<pub-id pub-id-type="pmcid">PMC2194739</pub-id>
</element-citation>
</ref>
<ref id="B47">
<label>47</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Stranzl</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Larsen</surname>
<given-names>MV</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>NetCTLpan: pan-specific MHC class I pathway epitope predictions</article-title>
<source>Immunogenetics</source>
<year iso-8601-date="2010">2010</year>
<volume>62</volume>
<fpage>357</fpage>
<lpage>68</lpage>
<pub-id pub-id-type="doi">10.1007/s00251-010-0441-4</pub-id>
<pub-id pub-id-type="pmid">20379710</pub-id>
<pub-id pub-id-type="pmcid">PMC2875469</pub-id>
</element-citation>
</ref>
<ref id="B48">
<label>48</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rammensee</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Bachmann</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Emmerich</surname>
<given-names>NP</given-names>
</name>
<name>
<surname>Bachor</surname>
<given-names>OA</given-names>
</name>
<name>
<surname>Stevanović</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>SYFPEITHI: database for MHC ligands and peptide motifs</article-title>
<source>Immunogenetics</source>
<year iso-8601-date="1999">1999</year>
<volume>50</volume>
<fpage>213</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1007/s002510050595</pub-id>
<pub-id pub-id-type="pmid">10602881</pub-id>
</element-citation>
</ref>
<ref id="B49">
<label>49</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Grifoni</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Scheuermann</surname>
<given-names>RH</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>A Sequence Homology and Bioinformatic Approach Can Predict Candidate Targets for Immune Responses to SARS-CoV-2</article-title>
<source>Cell Host Microbe</source>
<year iso-8601-date="2020">2020</year>
<volume>27</volume>
<fpage>671</fpage>
<lpage>80.e2</lpage>
<pub-id pub-id-type="doi">10.1016/j.chom.2020.03.002</pub-id>
<pub-id pub-id-type="pmid">32183941</pub-id>
<pub-id pub-id-type="pmcid">PMC7142693</pub-id>
</element-citation>
</ref>
<ref id="B50">
<label>50</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gfeller</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Croce</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Guillaume</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Bobisse</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Genolet</surname>
<given-names>R</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Improved predictions of antigen presentation and TCR recognition with MixMHCpred2.2 and PRIME2.0 reveal potent SARS-CoV-2 CD8<sup>+</sup> T-cell epitopes</article-title>
<source>Cell Syst</source>
<year iso-8601-date="2023">2023</year>
<volume>14</volume>
<fpage>72</fpage>
<lpage>83.e5</lpage>
<pub-id pub-id-type="doi">10.1016/j.cels.2022.12.002</pub-id>
<pub-id pub-id-type="pmid">36603583</pub-id>
<pub-id pub-id-type="pmcid">PMC9811684</pub-id>
</element-citation>
</ref>
<ref id="B51">
<label>51</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fleri</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Paul</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Dhanda</surname>
<given-names>SK</given-names>
</name>
<name>
<surname>Mahajan</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>The Immune Epitope Database and Analysis Resource in Epitope Discovery and Synthetic Vaccine Design</article-title>
<source>Front Immunol</source>
<year iso-8601-date="2017">2017</year>
<volume>8</volume>
<elocation-id>278</elocation-id>
<pub-id pub-id-type="doi">10.3389/fimmu.2017.00278</pub-id>
<pub-id pub-id-type="pmid">28352270</pub-id>
<pub-id pub-id-type="pmcid">PMC5348633</pub-id>
</element-citation>
</ref>
<ref id="B52">
<label>52</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bulik</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Tampe</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Endert</surname>
<given-names>PMV</given-names>
</name>
<name>
<surname>Holzhütter</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Identifying MHC class I epitopes by predicting the TAP transport efficiency of epitope precursors</article-title>
<source>J Immunol</source>
<year iso-8601-date="2003">2003</year>
<volume>171</volume>
<fpage>1741</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.171.4.1741</pub-id>
<pub-id pub-id-type="pmid">12902473</pub-id>
</element-citation>
</ref>
<ref id="B53">
<label>53</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Doytchinova</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Hemsley</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Flower</surname>
<given-names>DR</given-names>
</name>
</person-group>
<article-title>Transporter associated with antigen processing preselection of peptides binding to the MHC: a bioinformatic evaluation</article-title>
<source>J Immunol</source>
<year iso-8601-date="2004">2004</year>
<volume>173</volume>
<fpage>6813</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.173.11.6813</pub-id>
<pub-id pub-id-type="pmid">15557175</pub-id>
</element-citation>
</ref>
<ref id="B54">
<label>54</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Toes</surname>
<given-names>RE</given-names>
</name>
<name>
<surname>Nussbaum</surname>
<given-names>AK</given-names>
</name>
<name>
<surname>Degermann</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Schirle</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Emmerich</surname>
<given-names>NP</given-names>
</name>
<name>
<surname>Kraft</surname>
<given-names>M</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Discrete cleavage motifs of constitutive and immunoproteasomes revealed by quantitative analysis of cleavage products</article-title>
<source>J Exp Med</source>
<year iso-8601-date="2001">2001</year>
<volume>194</volume>
<fpage>1</fpage>
<lpage>12</lpage>
<pub-id pub-id-type="doi">10.1084/jem.194.1.1</pub-id>
<pub-id pub-id-type="pmid">11435468</pub-id>
<pub-id pub-id-type="pmcid">PMC2193442</pub-id>
</element-citation>
</ref>
<ref id="B55">
<label>55</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kuttler</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Nussbaum</surname>
<given-names>AK</given-names>
</name>
<name>
<surname>Dick</surname>
<given-names>TP</given-names>
</name>
<name>
<surname>Rammensee</surname>
<given-names>HG</given-names>
</name>
<name>
<surname>Schild</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Hadeler</surname>
<given-names>KP</given-names>
</name>
</person-group>
<article-title>An algorithm for the prediction of proteasomal cleavages</article-title>
<source>J Mol Biol</source>
<year iso-8601-date="2000">2000</year>
<volume>298</volume>
<fpage>417</fpage>
<lpage>29</lpage>
<pub-id pub-id-type="doi">10.1006/jmbi.2000.3683</pub-id>
<pub-id pub-id-type="pmid">10772860</pub-id>
</element-citation>
</ref>
<ref id="B56">
<label>56</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schneider</surname>
<given-names>TD</given-names>
</name>
<name>
<surname>Stephens</surname>
<given-names>RM</given-names>
</name>
</person-group>
<article-title>Sequence logos: a new way to display consensus sequences</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="1990">1990</year>
<volume>18</volume>
<fpage>6097</fpage>
<lpage>100</lpage>
<pub-id pub-id-type="doi">10.1093/nar/18.20.6097</pub-id>
<pub-id pub-id-type="pmid">2172928</pub-id>
<pub-id pub-id-type="pmcid">PMC332411</pub-id>
</element-citation>
</ref>
<ref id="B57">
<label>57</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rapin</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Hoof</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>MHC motif viewer</article-title>
<source>Immunogenetics</source>
<year iso-8601-date="2008">2008</year>
<volume>60</volume>
<fpage>759</fpage>
<lpage>65</lpage>
<pub-id pub-id-type="doi">10.1007/s00251-008-0330-2</pub-id>
<pub-id pub-id-type="pmid">18766337</pub-id>
<pub-id pub-id-type="pmcid">PMC2613509</pub-id>
</element-citation>
</ref>
<ref id="B58">
<label>58</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Frahm</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Brander</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>HLA class I supertypes: a revised and updated classification</article-title>
<source>BMC Immunol</source>
<year iso-8601-date="2008">2008</year>
<volume>9</volume>
<elocation-id>1</elocation-id>
<pub-id pub-id-type="doi">10.1186/1471-2172-9-1</pub-id>
<pub-id pub-id-type="pmid">18211710</pub-id>
<pub-id pub-id-type="pmcid">PMC2245908</pub-id>
</element-citation>
</ref>
<ref id="B59">
<label>59</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dzuris</surname>
<given-names>JL</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Appella</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Chesnut</surname>
<given-names>RW</given-names>
</name>
<name>
<surname>Watkins</surname>
<given-names>DI</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>Conserved MHC class I peptide binding motif between humans and rhesus macaques</article-title>
<source>J Immunol</source>
<year iso-8601-date="2000">2000</year>
<volume>164</volume>
<fpage>283</fpage>
<lpage>91</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.164.1.283</pub-id>
<pub-id pub-id-type="pmid">10605022</pub-id>
</element-citation>
</ref>
<ref id="B60">
<label>60</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Frahm</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Yusim</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Suscovich</surname>
<given-names>TJ</given-names>
</name>
<name>
<surname>Adams</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Hraber</surname>
<given-names>P</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Extensive HLA class I allele promiscuity among viral CTL epitopes</article-title>
<source>Eur J Immunol</source>
<year iso-8601-date="2007">2007</year>
<volume>37</volume>
<fpage>2419</fpage>
<lpage>33</lpage>
<pub-id pub-id-type="doi">10.1002/eji.200737365</pub-id>
<pub-id pub-id-type="pmid">17705138</pub-id>
<pub-id pub-id-type="pmcid">PMC2628559</pub-id>
</element-citation>
</ref>
<ref id="B61">
<label>61</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Buus</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Major histocompatibility complex class I binding predictions as a tool in epitope discovery</article-title>
<source>Immunology</source>
<year iso-8601-date="2010">2010</year>
<volume>130</volume>
<fpage>309</fpage>
<lpage>18</lpage>
<pub-id pub-id-type="doi">10.1111/j.1365-2567.2010.03300.x</pub-id>
<pub-id pub-id-type="pmid">20518827</pub-id>
<pub-id pub-id-type="pmcid">PMC2913210</pub-id>
</element-citation>
</ref>
<ref id="B62">
<label>62</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Vitiello</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Reherman</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Fowler</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Nayersina</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Kast</surname>
<given-names>WM</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>The relationship between class I binding affinity and immunogenicity of potential cytotoxic T cell epitopes</article-title>
<source>J Immunol</source>
<year iso-8601-date="1994">1994</year>
<volume>153</volume>
<fpage>5586</fpage>
<lpage>92</lpage>
<pub-id pub-id-type="pmid">7527444</pub-id>
</element-citation>
</ref>
<ref id="B63">
<label>63</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Peptide binding predictions for HLA DR, DP and DQ molecules</article-title>
<source>BMC Bioinformatics</source>
<year iso-8601-date="2010">2010</year>
<volume>11</volume>
<elocation-id>568</elocation-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-11-568</pub-id>
<pub-id pub-id-type="pmid">21092157</pub-id>
<pub-id pub-id-type="pmcid">PMC2998531</pub-id>
</element-citation>
</ref>
<ref id="B64">
<label>64</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Paul</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Weiskopf</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Angelo</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>HLA class I alleles are associated with peptide-binding repertoires of different size, affinity, and immunogenicity</article-title>
<source>J Immunol</source>
<year iso-8601-date="2013">2013</year>
<volume>191</volume>
<fpage>5831</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.1302101</pub-id>
<pub-id pub-id-type="pmid">24190657</pub-id>
<pub-id pub-id-type="pmcid">PMC3872965</pub-id>
</element-citation>
</ref>
<ref id="B65">
<label>65</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>van der Burg</surname>
<given-names>SH</given-names>
</name>
<name>
<surname>Visseren</surname>
<given-names>MJ</given-names>
</name>
<name>
<surname>Brandt</surname>
<given-names>RM</given-names>
</name>
<name>
<surname>Kast</surname>
<given-names>WM</given-names>
</name>
<name>
<surname>Melief</surname>
<given-names>CJ</given-names>
</name>
</person-group>
<article-title>Immunogenicity of peptides bound to MHC class I molecules depends on the MHC-peptide complex stability</article-title>
<source>J Immunol</source>
<year iso-8601-date="1996">1996</year>
<volume>156</volume>
<fpage>3308</fpage>
<lpage>14</lpage>
<pub-id pub-id-type="pmid">8617954</pub-id>
</element-citation>
</ref>
<ref id="B66">
<label>66</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Harndahl</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Rasmussen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Roder</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Pedersen</surname>
<given-names>ID</given-names>
</name>
<name>
<surname>Sørensen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Peptide-MHC class I stability is a better predictor than peptide affinity of CTL immunogenicity</article-title>
<source>Eur J Immunol</source>
<year iso-8601-date="2012">2012</year>
<volume>42</volume>
<fpage>1405</fpage>
<lpage>16</lpage>
<pub-id pub-id-type="doi">10.1002/eji.201141774</pub-id>
<pub-id pub-id-type="pmid">22678897</pub-id>
</element-citation>
</ref>
<ref id="B67">
<label>67</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bonsack</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Hoppe</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Winter</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Tichy</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Zeller</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Küpper</surname>
<given-names>MD</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Performance Evaluation of MHC Class-I Binding Prediction Tools Based on an Experimentally Validated MHC-Peptide Binding Data Set</article-title>
<source>Cancer Immunol Res</source>
<year iso-8601-date="2019">2019</year>
<volume>7</volume>
<fpage>719</fpage>
<lpage>36</lpage>
<pub-id pub-id-type="doi">10.1158/2326-6066.CIR-18-0584</pub-id>
<pub-id pub-id-type="pmid">30902818</pub-id>
</element-citation>
</ref>
<ref id="B68">
<label>68</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Roomp</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Antes</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Lengauer</surname>
<given-names>T</given-names>
</name>
</person-group>
<article-title>Predicting MHC class I epitopes in large datasets</article-title>
<source>BMC Bioinformatics</source>
<year iso-8601-date="2010">2010</year>
<volume>11</volume>
<elocation-id>90</elocation-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-11-90</pub-id>
<pub-id pub-id-type="pmid">20163709</pub-id>
<pub-id pub-id-type="pmcid">PMC2836306</pub-id>
</element-citation>
</ref>
<ref id="B69">
<label>69</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Pan-specific MHC class I predictors: a benchmark of HLA class I pan-specific prediction methods</article-title>
<source>Bioinformatics</source>
<year iso-8601-date="2009">2009</year>
<volume>25</volume>
<fpage>83</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btn579</pub-id>
<pub-id pub-id-type="pmid">18996943</pub-id>
<pub-id pub-id-type="pmcid">PMC2638932</pub-id>
</element-citation>
</ref>
<ref id="B70">
<label>70</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>SP</given-names>
</name>
<name>
<surname>Mishra</surname>
<given-names>BN</given-names>
</name>
</person-group>
<article-title>Prediction of MHC binding peptide using Gibbs motif sampler, weight matrix and artificial neural network</article-title>
<source>Bioinformation</source>
<year iso-8601-date="2008">2008</year>
<volume>3</volume>
<fpage>150</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.6026/97320630003150</pub-id>
<pub-id pub-id-type="pmid">19238237</pub-id>
<pub-id pub-id-type="pmcid">PMC2639663</pub-id>
</element-citation>
</ref>
<ref id="B71">
<label>71</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Claesson</surname>
<given-names>MH</given-names>
</name>
</person-group>
<article-title>Classification of human leukocyte antigen (HLA) supertypes</article-title>
<source>Methods Mol Biol</source>
<year iso-8601-date="2014">2014</year>
<volume>1184</volume>
<fpage>309</fpage>
<lpage>17</lpage>
<pub-id pub-id-type="doi">10.1007/978-1-4939-1115-8_17</pub-id>
<pub-id pub-id-type="pmid">25048132</pub-id>
<pub-id pub-id-type="pmcid">PMC7121184</pub-id>
</element-citation>
</ref>
<ref id="B72">
<label>72</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shen</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Parks</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>JC</given-names>
</name>
</person-group>
<article-title>HLA Class I Supertype Classification Based on Structural Similarity</article-title>
<source>J Immunol</source>
<year iso-8601-date="2023">2023</year>
<volume>210</volume>
<fpage>103</fpage>
<lpage>14</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.2200685</pub-id>
<pub-id pub-id-type="pmid">36453976</pub-id>
</element-citation>
</ref>
<ref id="B73">
<label>73</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Singh</surname>
<given-names>SP</given-names>
</name>
<name>
<surname>Mishra</surname>
<given-names>BN</given-names>
</name>
</person-group>
<article-title>Major histocompatibility complex linked databases and prediction tools for designing vaccines</article-title>
<source>Hum Immunol</source>
<year iso-8601-date="2016">2016</year>
<volume>77</volume>
<fpage>295</fpage>
<lpage>306</lpage>
<pub-id pub-id-type="doi">10.1016/j.humimm.2015.11.012</pub-id>
<pub-id pub-id-type="pmid">26585361</pub-id>
</element-citation>
</ref>
<ref id="B74">
<label>74</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tscharke</surname>
<given-names>DC</given-names>
</name>
<name>
<surname>Croft</surname>
<given-names>NP</given-names>
</name>
<name>
<surname>Doherty</surname>
<given-names>PC</given-names>
</name>
<name>
<surname>Gruta</surname>
<given-names>NLL</given-names>
</name>
</person-group>
<article-title>Sizing up the key determinants of the CD8(+) T cell response</article-title>
<source>Nat Rev Immunol</source>
<year iso-8601-date="2015">2015</year>
<volume>15</volume>
<fpage>705</fpage>
<lpage>16</lpage>
<pub-id pub-id-type="doi">10.1038/nri3905</pub-id>
<pub-id pub-id-type="pmid">26449178</pub-id>
</element-citation>
</ref>
<ref id="B75">
<label>75</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nosrati</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Mohabatkar</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Behbahani</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>Introducing of an integrated artificial neural network and Chou’s pseudo amino acid composition approach for computational epitope-mapping of Crimean-Congo haemorrhagic fever virus antigens</article-title>
<source>Int Immunopharmacol</source>
<year iso-8601-date="2020">2020</year>
<volume>78</volume>
<elocation-id>106020</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.intimp.2019.106020</pub-id>
<pub-id pub-id-type="pmid">31776090</pub-id>
</element-citation>
</ref>
<ref id="B76">
<label>76</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Grifoni</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Weiskopf</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Ramirez</surname>
<given-names>SI</given-names>
</name>
<name>
<surname>Mateus</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Dan</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>Moderbacher</surname>
<given-names>CR</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Targets of T Cell Responses to SARS-CoV-2 Coronavirus in Humans with COVID-19 Disease and Unexposed Individuals</article-title>
<source>Cell</source>
<year iso-8601-date="2020">2020</year>
<volume>181</volume>
<fpage>1489</fpage>
<lpage>501.e15</lpage>
<pub-id pub-id-type="doi">10.1016/j.cell.2020.05.015</pub-id>
<pub-id pub-id-type="pmid">32473127</pub-id>
<pub-id pub-id-type="pmcid">PMC7237901</pub-id>
</element-citation>
</ref>
<ref id="B77">
<label>77</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lundegaard</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Lund</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Keşmir</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>The role of the proteasome in generating cytotoxic T-cell epitopes: insights obtained from improved predictions of proteasomal cleavage</article-title>
<source>Immunogenetics</source>
<year iso-8601-date="2005">2005</year>
<volume>57</volume>
<fpage>33</fpage>
<lpage>41</lpage>
<pub-id pub-id-type="doi">10.1007/s00251-005-0781-7</pub-id>
<pub-id pub-id-type="pmid">15744535</pub-id>
</element-citation>
</ref>
<ref id="B78">
<label>78</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brouwenstijn</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Serwold</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Shastri</surname>
<given-names>N</given-names>
</name>
</person-group>
<article-title>MHC class I molecules can direct proteolytic cleavage of antigenic precursors in the endoplasmic reticulum</article-title>
<source>Immunity</source>
<year iso-8601-date="2001">2001</year>
<volume>15</volume>
<fpage>95</fpage>
<lpage>104</lpage>
<pub-id pub-id-type="doi">10.1016/s1074-7613(01)00174-1</pub-id>
<pub-id pub-id-type="pmid">11485741</pub-id>
</element-citation>
</ref>
<ref id="B79">
<label>79</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>van Hall</surname>
<given-names>T</given-names>
</name>
<name>
<surname>van Bergen</surname>
<given-names>J</given-names>
</name>
<name>
<surname>van Veelen</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Kraakman</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Heukamp</surname>
<given-names>LC</given-names>
</name>
<name>
<surname>Koning</surname>
<given-names>F</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Identification of a novel tumor-specific CTL epitope presented by RMA, EL-4, and MBL-2 lymphomas reveals their common origin</article-title>
<source>J Immunol</source>
<year iso-8601-date="2000">2000</year>
<volume>165</volume>
<fpage>869</fpage>
<lpage>77</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.165.2.869</pub-id>
<pub-id pub-id-type="pmid">10878361</pub-id>
</element-citation>
</ref>
<ref id="B80">
<label>80</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Norbury</surname>
<given-names>CC</given-names>
</name>
<name>
<surname>Cho</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Yewdell</surname>
<given-names>JW</given-names>
</name>
<name>
<surname>Bennink</surname>
<given-names>JR</given-names>
</name>
</person-group>
<article-title>Immunoproteasomes shape immunodominance hierarchies of antiviral CD8(+) T cells at the levels of T cell repertoire and presentation of viral antigens</article-title>
<source>J Exp Med</source>
<year iso-8601-date="2001">2001</year>
<volume>193</volume>
<fpage>1319</fpage>
<lpage>26</lpage>
<pub-id pub-id-type="doi">10.1084/jem.193.11.1319</pub-id>
<pub-id pub-id-type="pmid">11390439</pub-id>
<pub-id pub-id-type="pmcid">PMC2193381</pub-id>
</element-citation>
</ref>
<ref id="B81">
<label>81</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Assarsson</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Sidney</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Oseroff</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Pasquetto</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Bui</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Frahm</surname>
<given-names>N</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>A quantitative analysis of the variables affecting the repertoire of T cell specificities recognized after vaccinia virus infection</article-title>
<source>J Immunol</source>
<year iso-8601-date="2007">2007</year>
<volume>178</volume>
<fpage>7890</fpage>
<lpage>901</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.178.12.7890</pub-id>
<pub-id pub-id-type="pmid">17548627</pub-id>
</element-citation>
</ref>
<ref id="B82">
<label>82</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Holzhütter</surname>
<given-names>HG</given-names>
</name>
<name>
<surname>Frömmel</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Kloetzel</surname>
<given-names>PM</given-names>
</name>
</person-group>
<article-title>A theoretical approach towards the identification of cleavage-determining amino acid motifs of the 20 S proteasome</article-title>
<source>J Mol Biol</source>
<year iso-8601-date="1999">1999</year>
<volume>286</volume>
<fpage>1251</fpage>
<lpage>65</lpage>
<pub-id pub-id-type="doi">10.1006/jmbi.1998.2530</pub-id>
<pub-id pub-id-type="pmid">10047495</pub-id>
</element-citation>
</ref>
<ref id="B83">
<label>83</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nussbaum</surname>
<given-names>AK</given-names>
</name>
<name>
<surname>Kuttler</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Hadeler</surname>
<given-names>KP</given-names>
</name>
<name>
<surname>Rammensee</surname>
<given-names>HG</given-names>
</name>
<name>
<surname>Schild</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>PAProC: a prediction algorithm for proteasomal cleavages available on the WWW</article-title>
<source>Immunogenetics</source>
<year iso-8601-date="2001">2001</year>
<volume>53</volume>
<fpage>87</fpage>
<lpage>94</lpage>
<pub-id pub-id-type="doi">10.1007/s002510100300</pub-id>
<pub-id pub-id-type="pmid">11345595</pub-id>
</element-citation>
</ref>
<ref id="B84">
<label>84</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hakenberg</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Nussbaum</surname>
<given-names>AK</given-names>
</name>
<name>
<surname>Schild</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Rammensee</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Kuttler</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Holzhütter</surname>
<given-names>H</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>MAPPP: MHC class I antigenic peptide processing prediction</article-title>
<source>Appl Bioinformatics</source>
<year iso-8601-date="2003">2003</year>
<volume>2</volume>
<fpage>155</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="pmid">15130801</pub-id>
</element-citation>
</ref>
<ref id="B85">
<label>85</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Reche</surname>
<given-names>PA</given-names>
</name>
<name>
<surname>Reinherz</surname>
<given-names>EL</given-names>
</name>
</person-group>
<article-title>PEPVAC: a web server for multi-epitope vaccine development based on the prediction of supertypic MHC ligands</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2005">2005</year>
<volume>33</volume>
<fpage>W138</fpage>
<lpage>42</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gki357</pub-id>
<pub-id pub-id-type="pmid">15980443</pub-id>
<pub-id pub-id-type="pmcid">PMC1160118</pub-id>
</element-citation>
</ref>
<ref id="B86">
<label>86</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brusic</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Bajic</surname>
<given-names>VB</given-names>
</name>
<name>
<surname>Petrovsky</surname>
<given-names>N</given-names>
</name>
</person-group>
<article-title>Computational methods for prediction of T-cell epitopes--a framework for modelling, testing, and applications</article-title>
<source>Methods</source>
<year iso-8601-date="2004">2004</year>
<volume>34</volume>
<fpage>436</fpage>
<lpage>43</lpage>
<pub-id pub-id-type="doi">10.1016/j.ymeth.2004.06.006</pub-id>
<pub-id pub-id-type="pmid">15542369</pub-id>
</element-citation>
</ref>
<ref id="B87">
<label>87</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Heemels</surname>
<given-names>MT</given-names>
</name>
<name>
<surname>Ploegh</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Generation, translocation, and presentation of MHC class I-restricted peptides</article-title>
<source>Annu Rev Biochem</source>
<year iso-8601-date="1995">1995</year>
<volume>64</volume>
<fpage>463</fpage>
<lpage>91</lpage>
<pub-id pub-id-type="doi">10.1146/annurev.bi.64.070195.002335</pub-id>
<pub-id pub-id-type="pmid">7574490</pub-id>
</element-citation>
</ref>
<ref id="B88">
<label>88</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dönnes</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Kohlbacher</surname>
<given-names>O</given-names>
</name>
</person-group>
<article-title>Integrated modeling of the major events in the MHC class I antigen processing pathway</article-title>
<source>Protein Sci</source>
<year iso-8601-date="2005">2005</year>
<volume>14</volume>
<fpage>2132</fpage>
<lpage>40</lpage>
<pub-id pub-id-type="doi">10.1110/ps.051352405</pub-id>
<pub-id pub-id-type="pmid">15987883</pub-id>
<pub-id pub-id-type="pmcid">PMC2279325</pub-id>
</element-citation>
</ref>
<ref id="B89">
<label>89</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tenzer</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Peters</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bulik</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Schoor</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Lemmel</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Schatz</surname>
<given-names>MM</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Modeling the MHC class I pathway by combining predictions of proteasomal cleavage, TAP transport and MHC class I binding</article-title>
<source>Cell Mol Life Sci</source>
<year iso-8601-date="2005">2005</year>
<volume>62</volume>
<fpage>1025</fpage>
<lpage>37</lpage>
<pub-id pub-id-type="doi">10.1007/s00018-005-4528-2</pub-id>
<pub-id pub-id-type="pmid">15868101</pub-id>
<pub-id pub-id-type="pmcid">PMC11924537</pub-id>
</element-citation>
</ref>
<ref id="B90">
<label>90</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adotévi</surname>
<given-names>O</given-names>
</name>
<name>
<surname>Mollier</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Neuveut</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Cardinaud</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Boulanger</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Mignen</surname>
<given-names>B</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Immunogenic HLA-B*0702-restricted epitopes derived from human telomerase reverse transcriptase that elicit antitumor cytotoxic T-cell responses</article-title>
<source>Clin Cancer Res</source>
<year iso-8601-date="2006">2006</year>
<volume>12</volume>
<fpage>3158</fpage>
<lpage>67</lpage>
<pub-id pub-id-type="doi">10.1158/1078-0432.CCR-05-2647</pub-id>
<pub-id pub-id-type="pmid">16707616</pub-id>
</element-citation>
</ref>
<ref id="B91">
<label>91</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Hundemer</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Condomines</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lupu</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Hose</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Moos</surname>
<given-names>M</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Identification of a new HLA-A2-restricted T-cell epitope within HM1.24 as immunotherapy target for multiple myeloma</article-title>
<source>Exp Hematol</source>
<year iso-8601-date="2006">2006</year>
<volume>34</volume>
<fpage>486</fpage>
<lpage>96</lpage>
<pub-id pub-id-type="doi">10.1016/j.exphem.2006.01.008</pub-id>
<pub-id pub-id-type="pmid">16569595</pub-id>
<pub-id pub-id-type="pmcid">PMC1913933</pub-id>
</element-citation>
</ref>
<ref id="B92">
<label>92</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mustafa</surname>
<given-names>AS</given-names>
</name>
<name>
<surname>Shaban</surname>
<given-names>FA</given-names>
</name>
</person-group>
<article-title>ProPred analysis and experimental evaluation of promiscuous T-cell epitopes of three major secreted antigens of Mycobacterium tuberculosis</article-title>
<source>Tuberculosis (Edinb)</source>
<year iso-8601-date="2006">2006</year>
<volume>86</volume>
<fpage>115</fpage>
<lpage>24</lpage>
<pub-id pub-id-type="doi">10.1016/j.tube.2005.05.001</pub-id>
<pub-id pub-id-type="pmid">16039905</pub-id>
</element-citation>
</ref>
<ref id="B93">
<label>93</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sundar</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Boesen</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Coico</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>Computational prediction and identification of HLA-A2.1-specific Ebola virus CTL epitopes</article-title>
<source>Virology</source>
<year iso-8601-date="2007">2007</year>
<volume>360</volume>
<fpage>257</fpage>
<lpage>63</lpage>
<pub-id pub-id-type="doi">10.1016/j.virol.2006.09.042</pub-id>
<pub-id pub-id-type="pmid">17123567</pub-id>
</element-citation>
</ref>
<ref id="B94">
<label>94</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wen</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Fang</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Computational prediction and identification of dengue virus-specific CD4(+) T-cell epitopes</article-title>
<source>Virus Res</source>
<year iso-8601-date="2008">2008</year>
<volume>132</volume>
<fpage>42</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1016/j.virusres.2007.10.010</pub-id>
<pub-id pub-id-type="pmid">18061300</pub-id>
<pub-id pub-id-type="pmcid">PMC7114202</pub-id>
</element-citation>
</ref>
<ref id="B95">
<label>95</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Baruah</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Bose</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Immunoinformatics-aided identification of T cell and B cell epitopes in the surface glycoprotein of 2019-nCoV</article-title>
<source>J Med Virol</source>
<year iso-8601-date="2020">2020</year>
<volume>92</volume>
<fpage>495</fpage>
<lpage>500</lpage>
<pub-id pub-id-type="doi">10.1002/jmv.25698</pub-id>
<pub-id pub-id-type="pmid">32022276</pub-id>
<pub-id pub-id-type="pmcid">PMC7166505</pub-id>
</element-citation>
</ref>
<ref id="B96">
<label>96</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lamberth</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Harndahl</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Røder</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Stryhn</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Larsen</surname>
<given-names>MV</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>CTL epitopes for influenza A including the H5N1 bird flu; genome-, pathogen-, and HLA-wide screening</article-title>
<source>Vaccine</source>
<year iso-8601-date="2007">2007</year>
<volume>25</volume>
<fpage>2823</fpage>
<lpage>31</lpage>
<pub-id pub-id-type="doi">10.1016/j.vaccine.2006.12.038</pub-id>
<pub-id pub-id-type="pmid">17254671</pub-id>
</element-citation>
</ref>
<ref id="B97">
<label>97</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perez</surname>
<given-names>EE</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>JC</given-names>
</name>
<name>
<surname>Jouvenot</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>KA</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>O</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Establishment of HIV-1 resistance in CD4+ T cells by genome editing using zinc-finger nucleases</article-title>
<source>Nat Biotechnol</source>
<year iso-8601-date="2008">2008</year>
<volume>26</volume>
<fpage>808</fpage>
<lpage>16</lpage>
<pub-id pub-id-type="doi">10.1038/nbt1410</pub-id>
<pub-id pub-id-type="pmid">18587387</pub-id>
<pub-id pub-id-type="pmcid">PMC3422503</pub-id>
</element-citation>
</ref>
<ref id="B98">
<label>98</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tang</surname>
<given-names>ST</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lamberth</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Harndahl</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Dziegiel</surname>
<given-names>MH</given-names>
</name>
<name>
<surname>Claesson</surname>
<given-names>MH</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>MHC-I-restricted epitopes conserved among variola and other related orthopoxviruses are recognized by T cells 30 years after vaccination</article-title>
<source>Arch Virol</source>
<year iso-8601-date="2008">2008</year>
<volume>153</volume>
<fpage>1833</fpage>
<lpage>44</lpage>
<pub-id pub-id-type="doi">10.1007/s00705-008-0194-7</pub-id>
<pub-id pub-id-type="pmid">18797815</pub-id>
<pub-id pub-id-type="pmcid">PMC2784598</pub-id>
</element-citation>
</ref>
<ref id="B99">
<label>99</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lehnert</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Tampé</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>Structure and Dynamics of Antigenic Peptides in Complex with TAP</article-title>
<source>Front Immunol</source>
<year iso-8601-date="2017">2017</year>
<volume>8</volume>
<elocation-id>10</elocation-id>
<pub-id pub-id-type="doi">10.3389/fimmu.2017.00010</pub-id>
<pub-id pub-id-type="pmid">28194151</pub-id>
<pub-id pub-id-type="pmcid">PMC5277011</pub-id>
</element-citation>
</ref>
<ref id="B100">
<label>100</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Calis</surname>
<given-names>JJA</given-names>
</name>
<name>
<surname>Maybeno</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Greenbaum</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Weiskopf</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Silva</surname>
<given-names>ADD</given-names>
</name>
<name>
<surname>Sette</surname>
<given-names>A</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Properties of MHC class I presented peptides that enhance immunogenicity</article-title>
<source>PLoS Comput Biol</source>
<year iso-8601-date="2013">2013</year>
<volume>9</volume>
<elocation-id>e1003266</elocation-id>
<pub-id pub-id-type="doi">10.1371/journal.pcbi.1003266</pub-id>
<pub-id pub-id-type="pmid">24204222</pub-id>
<pub-id pub-id-type="pmcid">PMC3808449</pub-id>
</element-citation>
</ref>
<ref id="B101">
<label>101</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lasso</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Cárdenas</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Guzmán</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Rosas</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>MC</given-names>
</name>
<name>
<surname>López</surname>
<given-names>MC</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Effect of secondary anchor amino acid substitutions on the immunogenic properties of an HLA-A*0201-restricted T cell epitope derived from the Trypanosoma cruzi KMP-11 protein</article-title>
<source>Peptides</source>
<year iso-8601-date="2016">2016</year>
<volume>78</volume>
<fpage>68</fpage>
<lpage>76</lpage>
<pub-id pub-id-type="doi">10.1016/j.peptides.2016.02.002</pub-id>
<pub-id pub-id-type="pmid">26854383</pub-id>
</element-citation>
</ref>
<ref id="B102">
<label>102</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rasmussen</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Fenoy</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Harndahl</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Kristensen</surname>
<given-names>AB</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>IK</given-names>
</name>
<name>
<surname>Nielsen</surname>
<given-names>M</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Pan-Specific Prediction of Peptide-MHC Class I Complex Stability, a Correlate of T Cell Immunogenicity</article-title>
<source>J Immunol</source>
<year iso-8601-date="2016">2016</year>
<volume>197</volume>
<fpage>1517</fpage>
<lpage>24</lpage>
<pub-id pub-id-type="doi">10.4049/jimmunol.1600582</pub-id>
<pub-id pub-id-type="pmid">27402703</pub-id>
<pub-id pub-id-type="pmcid">PMC4976001</pub-id>
</element-citation>
</ref>
<ref id="B103">
<label>103</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Antunes</surname>
<given-names>DA</given-names>
</name>
<name>
<surname>Devaurs</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Moll</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Lizée</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Kavraki</surname>
<given-names>LE</given-names>
</name>
</person-group>
<article-title>General Prediction of Peptide-MHC Binding Modes Using Incremental Docking: A Proof of Concept</article-title>
<source>Sci Rep</source>
<year iso-8601-date="2018">2018</year>
<volume>8</volume>
<elocation-id>4327</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41598-018-22173-4</pub-id>
<pub-id pub-id-type="pmid">29531253</pub-id>
<pub-id pub-id-type="pmcid">PMC5847594</pub-id>
</element-citation>
</ref>
<ref id="B104">
<label>104</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kuai</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Ochyl</surname>
<given-names>LJ</given-names>
</name>
<name>
<surname>Bahjat</surname>
<given-names>KS</given-names>
</name>
<name>
<surname>Schwendeman</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Moon</surname>
<given-names>JJ</given-names>
</name>
</person-group>
<article-title>Designer vaccine nanodiscs for personalized cancer immunotherapy</article-title>
<source>Nat Mater</source>
<year iso-8601-date="2017">2017</year>
<volume>16</volume>
<fpage>489</fpage>
<lpage>96</lpage>
<pub-id pub-id-type="doi">10.1038/nmat4822</pub-id>
<pub-id pub-id-type="pmid">28024156</pub-id>
<pub-id pub-id-type="pmcid">PMC5374005</pub-id>
</element-citation>
</ref>
<ref id="B105">
<label>105</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kapadia</surname>
<given-names>CH</given-names>
</name>
<name>
<surname>Perry</surname>
<given-names>JL</given-names>
</name>
<name>
<surname>Tian</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Luft</surname>
<given-names>JC</given-names>
</name>
<name>
<surname>DeSimone</surname>
<given-names>JM</given-names>
</name>
</person-group>
<article-title>Nanoparticulate immunotherapy for cancer</article-title>
<source>J Control Release</source>
<year iso-8601-date="2015">2015</year>
<volume>219</volume>
<fpage>167</fpage>
<lpage>80</lpage>
<pub-id pub-id-type="doi">10.1016/j.jconrel.2015.09.062</pub-id>
<pub-id pub-id-type="pmid">26432555</pub-id>
</element-citation>
</ref>
<ref id="B106">
<label>106</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bissati</surname>
<given-names>KE</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Paulillo</surname>
<given-names>SM</given-names>
</name>
<name>
<surname>Raman</surname>
<given-names>SK</given-names>
</name>
<name>
<surname>Karch</surname>
<given-names>CP</given-names>
</name>
<name>
<surname>Roberts</surname>
<given-names>CW</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Protein nanovaccine confers robust immunity against <italic>Toxoplasma</italic></article-title>
<source>NPJ Vaccines</source>
<year iso-8601-date="2017">2017</year>
<volume>2</volume>
<elocation-id>24</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41541-017-0024-6</pub-id>
<pub-id pub-id-type="pmid">29263879</pub-id>
<pub-id pub-id-type="pmcid">PMC5627305</pub-id>
</element-citation>
</ref>
<ref id="B107">
<label>107</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Vo</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Taylor</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Smith</surname>
<given-names>BR</given-names>
</name>
</person-group>
<article-title>Non-spherical micro-and nanoparticles in nanomedicine</article-title>
<source>Mater Horiz</source>
<year iso-8601-date="2019">2019</year>
<volume>6</volume>
<fpage>1094</fpage>
<lpage>121</lpage>
<pub-id pub-id-type="doi">10.1039/C8MH01527A</pub-id>
</element-citation>
</ref>
<ref id="B108">
<label>108</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Miao</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Sui</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>Nanoparticle cancer vaccines: Design considerations and recent advances</article-title>
<source>Asian J Pharm Sci</source>
<year iso-8601-date="2020">2020</year>
<volume>15</volume>
<fpage>576</fpage>
<lpage>90</lpage>
<pub-id pub-id-type="doi">10.1016/j.ajps.2019.10.006</pub-id>
<pub-id pub-id-type="pmid">33193861</pub-id>
<pub-id pub-id-type="pmcid">PMC7610208</pub-id>
</element-citation>
</ref>
<ref id="B109">
<label>109</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jha</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Jaishwal</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Yadav</surname>
<given-names>TP</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>SP</given-names>
</name>
</person-group>
<article-title>Self-assembling of coiled-coil peptides into virus-like particles: Basic principles, properties, design, and applications with special focus on vaccine design and delivery</article-title>
<source>Biophys Chem</source>
<year iso-8601-date="2025">2025</year>
<volume>318</volume>
<elocation-id>107375</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.bpc.2024.107375</pub-id>
<pub-id pub-id-type="pmid">39674128</pub-id>
</element-citation>
</ref>
<ref id="B110">
<label>110</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Saleh</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Shojaosadati</surname>
<given-names>SA</given-names>
</name>
</person-group>
<article-title>Multifunctional nanoparticles for cancer immunotherapy</article-title>
<source>Hum Vaccin Immunother</source>
<year iso-8601-date="2016">2016</year>
<volume>12</volume>
<fpage>1863</fpage>
<lpage>75</lpage>
<pub-id pub-id-type="doi">10.1080/21645515.2016.1147635</pub-id>
<pub-id pub-id-type="pmid">26901287</pub-id>
<pub-id pub-id-type="pmcid">PMC4964832</pub-id>
</element-citation>
</ref>
<ref id="B111">
<label>111</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schudel</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Francis</surname>
<given-names>DM</given-names>
</name>
<name>
<surname>Thomas</surname>
<given-names>SN</given-names>
</name>
</person-group>
<article-title>Material design for lymph node drug delivery</article-title>
<source>Nat Rev Mater</source>
<year iso-8601-date="2019">2019</year>
<volume>4</volume>
<fpage>415</fpage>
<lpage>28</lpage>
<pub-id pub-id-type="doi">10.1038/s41578-019-0110-7</pub-id>
<pub-id pub-id-type="pmid">32523780</pub-id>
<pub-id pub-id-type="pmcid">PMC7286627</pub-id>
</element-citation>
</ref>
<ref id="B112">
<label>112</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mu</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Abdulmalik</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wijekoon</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Kumbar</surname>
<given-names>SG</given-names>
</name>
</person-group>
<article-title>Stimuli-responsive peptide assemblies: Design, self-assembly, modulation, and biomedical applications</article-title>
<source>Bioact Mater</source>
<year iso-8601-date="2024">2024</year>
<volume>35</volume>
<fpage>181</fpage>
<lpage>207</lpage>
<pub-id pub-id-type="doi">10.1016/j.bioactmat.2024.01.023</pub-id>
<pub-id pub-id-type="pmid">38327824</pub-id>
<pub-id pub-id-type="pmcid">PMC10847779</pub-id>
</element-citation>
</ref>
</ref-list>
</back>
</article>