﻿<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1 20151215//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="review-article">
<front>
<journal-meta>
<journal-id journal-id-type="nlm-ta">Explor Drug Sci</journal-id>
<journal-id journal-id-type="publisher-id">EDS</journal-id>
<journal-title-group>
<journal-title>Exploration of Drug Science</journal-title>
</journal-title-group>
<issn pub-type="epub">2836-7677</issn>
<publisher>
<publisher-name>Open Exploration Publishing</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.37349/eds.2026.1008161</article-id>
<article-id pub-id-type="manuscript">1008161</article-id>
<article-categories>
<subj-group>
<subject>Review</subject>
</subj-group>
</article-categories>
<title-group>
<article-title>AI-driven discovery of antimicrobial peptides and derivatives: database and tools</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0009-0004-9191-2247</contrib-id>
<name>
<surname>Zhou</surname>
<given-names>Yunuo</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/conceptualization/">Conceptualization</role>
<role content-type="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing—original draft</role>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing—review &amp; editing</role>
<xref ref-type="aff" rid="I1" />
</contrib>
<contrib contrib-type="author">
<name>
<surname>Wang</surname>
<given-names>Zihan</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/investigation/">Investigation</role>
<role content-type="https://credit.niso.org/contributor-roles/writing-original-draft/">Writing—original draft</role>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing—review &amp; editing</role>
<xref ref-type="aff" rid="I1" />
</contrib>
<contrib contrib-type="author">
<contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-1810-5842</contrib-id>
<name>
<surname>Zheng</surname>
<given-names>Heng</given-names>
</name>
<role content-type="https://credit.niso.org/contributor-roles/validation/">Validation</role>
<role content-type="https://credit.niso.org/contributor-roles/writing-review-editing/">Writing—review &amp; editing</role>
<role content-type="https://credit.niso.org/contributor-roles/supervision/">Supervision</role>
<role content-type="https://credit.niso.org/contributor-roles/funding-acquisition/">Funding acquisition</role>
<xref ref-type="aff" rid="I1" />
<xref ref-type="corresp" rid="cor1">
<sup>*</sup>
</xref>
</contrib>
<contrib contrib-type="editor">
<name>
<surname>Kumar</surname>
<given-names>Kamal</given-names>
</name>
<role>Academic Editor</role>
<aff>Smartbax GmbH, Germany</aff>
</contrib>
</contrib-group>
<aff id="I1">School of Life Science and Technology, China Pharmaceutical University, Nanjing 211198, Jiangsu, People’s Republic of China</aff>
<author-notes>
<corresp id="cor1">
<sup>*</sup>
<bold>Correspondence:</bold> Heng Zheng, School of Life Science and Technology, China Pharmaceutical University, Nanjing 211198, Jiangsu, People’s Republic of China. <email>zhengh18@hotmail.com</email></corresp>
</author-notes>
<pub-date pub-type="collection">
<year>2026</year>
</pub-date>
<pub-date pub-type="epub">
<day>01</day>
<month>06</month>
<year>2026</year>
</pub-date>
<volume>4</volume>
<elocation-id>1008161</elocation-id>
<history>
<date date-type="received">
<day>08</day>
<month>02</month>
<year>2026</year>
</date>
<date date-type="accepted">
<day>01</day>
<month>04</month>
<year>2026</year>
</date>
</history>
<permissions>
<copyright-statement>© The Author(s) 2026.</copyright-statement>
<license xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>This is an Open Access article licensed under a Creative Commons Attribution 4.0 International License (<ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">https://creativecommons.org/licenses/by/4.0/</ext-link>), which permits unrestricted use, sharing, adaptation, distribution and reproduction in any medium or format, for any purpose, even commercially, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.</license-p>
</license>
</permissions>
<abstract>
<p id="absp-1">The global rise of antimicrobial resistance (AMR) has emerged as one of the most pressing threats to public health. This crisis calls for the urgent development of alternative therapeutic agents against antibiotic-resistant pathogens. Antimicrobial peptides (AMPs) are widely present in nature, with a broad range of effects and a low risk of causing drug resistance. Therefore, they are an ideal choice for the development of the next generation of antimicrobial drugs. To overcome the inefficiencies of traditional AMP discovery, artificial intelligence (AI) and machine learning (ML) technologies have been increasingly used to predict and design AMPs. Multiple AMP databases were used to train ML models for predicting the activity of AMPs or generating AMP sequences. This review briefly provides a comprehensive overview of AMP databases and computational tools, highlighting their capabilities and challenges. Future work should integrate larger datasets and experimental validation to accelerate clinical translation.</p>
</abstract>
<kwd-group>
<kwd>antimicrobial peptides (AMPs)</kwd>
<kwd>artificial intelligence</kwd>
<kwd>machine learning</kwd>
<kwd>peptide prediction</kwd>
<kwd>antibiotic resistance</kwd>
<kwd>genome mining</kwd>
</kwd-group>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p id="p-1">Antimicrobial resistance (AMR) has escalated to a global public health emergency. The World Health Organization reports that antibiotic resistance directly caused 1.27 million deaths in 2021 and indirectly contributed to another 4.95 million deaths. Predictions suggest that without decisive intervention measures, the number of deaths due to drug-resistant infections could reach 10 million annually by 2050, surpassing cancer as the leading cause of death worldwide [<xref ref-type="bibr" rid="B1">1</xref>].</p>
<p id="p-2">The development of traditional antibiotics has focused primarily on several major classes, such as β-lactams, macrolides, tetracyclines, aminoglycosides, and glycopeptides [<xref ref-type="bibr" rid="B2">2</xref>]. These constraints in structural diversity confine antibiotic mechanisms of action to a narrow spectrum, enabling bacteria to readily develop multidrug resistance through point mutations, enzymatic inactivation, or efflux pump activation [<xref ref-type="bibr" rid="B3">3</xref>].</p>
<p id="p-3">Antimicrobial peptides (AMPs) are a class of small-molecule proteins naturally produced by organisms. They are an important part of the innate immune system and play a crucial role in the host defense mechanisms of all living forms, including microorganisms, plants, and animals [<xref ref-type="bibr" rid="B4">4</xref>]. The natural AMPs, due to their diverse mechanisms of action, high sequence diversity, and relatively low tendency to induce drug resistance, have provided a new solution path for the structural homogeneity problem of traditional antibiotics [<xref ref-type="bibr" rid="B5">5</xref>]. As of now, more than 60 kinds of AMPs have entered the clinical trial stage worldwide, among which 10 such substances, such as daptomycin and polymyxin B, have been approved for market release [<xref ref-type="bibr" rid="B6">6</xref>]. However, it is extremely difficult to conduct exhaustive screening for AMPs. Theoretically, a vast array of peptide chains with different sequences and spatial structures can be formed using no more than 25 natural amino acids [<xref ref-type="bibr" rid="B7">7</xref>]. As of 2024, the Database of Antimicrobial Activity and Structure of Peptides (DBAASP) v3 database only contains approximately 3.2 × 10<sup>4</sup> experimentally verified AMP sequences [<xref ref-type="bibr" rid="B8">8</xref>]. This significant disparity in quantity makes comprehensive experimental verification nearly impossible.</p>
<p id="p-4">The development of artificial intelligence (AI) provides a potential way to solve this problem. Driven by breakthroughs in image recognition, natural-language processing, and computer vision, contemporary AI systems are uniquely positioned to resolve long-standing bottlenecks in pharmaceutical research, converting terabyte-scale datasets into actionable insight and compressing discovery timelines while elevating success rates across the entire drug-development pipeline [<xref ref-type="bibr" rid="B9">9</xref>]. AI has also delivered unprecedented efficiency in AMP discovery. For example, generative models jointly design peptide sequences and self-assembling water-rich hydrogel matrices under multi-objective optimization, yielding composites that rapidly eradicate drug-resistant pathogens while simultaneously orchestrating tissue regeneration [<xref ref-type="bibr" rid="B10">10</xref>]. By analyzing large amounts of natural AMP data, these models automatically learn design patterns and successfully guide the synthesis of novel AMPs that have been verified to be effective in experiments and animal models [<xref ref-type="bibr" rid="B11">11</xref>]. These breakthroughs demonstrate that AI can reliably design and optimize antibacterial molecules, thus providing a promising solution for combating drug-resistant bacteria.</p>
<p id="p-5">This review systematically summarizes the application of AI and machine learning (ML) technologies in the discovery and design of AMPs, covering the latest advancements in related databases and computational tools, as well as their current challenges. The aim is to address the increasingly severe AMR crisis through efficient and intelligent computational methods, providing technical paths and theoretical support for the development of new generations of antimicrobial drugs.</p>
</sec>
<sec id="s2">
<title>Antimicrobial mechanisms of AMPs</title>
<p id="p-6">The mechanisms underlying bacterial resistance to traditional antibiotics encompass multiple strategies, such as the production of inactivating enzymes, modification of drug targets, reduced membrane permeability, enhanced efflux pump activity, biofilm formation, and entry into a dormant state (<xref ref-type="fig" rid="fig1">Figure 1</xref>) [<xref ref-type="bibr" rid="B12">12</xref>]. The distinct mechanisms of action between AMPs and traditional antibiotics position AMPs as a promising solution for the therapy of multidrug-resistant (MDR) pathogens [<xref ref-type="bibr" rid="B13">13</xref>], especially in the treatment of MDR pathogens such as methicillin-resistant <italic>Staphylococcus aureus</italic> (MRSA), vancomycin-resistant <italic>Enterococcus</italic> (VRE), and MDR tuberculosis (MDR-TB). The potential mechanisms of action of AMPs include disrupting bacterial cell membranes, inhibiting critical cellular processes, and modulating the host’s immune response. These mechanisms can be categorized into two main types: direct killing (encompassing both membrane-targeted and non-membrane-targeted actions) and immune modulation [<xref ref-type="bibr" rid="B14">14</xref>–<xref ref-type="bibr" rid="B16">16</xref>]. Cationic amphiphilic peptides (such as magainin, AMP from Xenopus laevis [PGLa], and LL-37) adopt a random coiling conformation in aqueous solution. When they bind to negatively charged bacterial membranes, they transform into α-helices parallel to the membrane surface. This “interface parallel” conformation causes the peptide’s polarity to face the aqueous phase and its hydrophobic side to insert into the lipid layer, resulting in local loosening of lipid chains and a thinning of the membrane, eventually forming transient pores or complete rupture [<xref ref-type="bibr" rid="B17">17</xref>].</p>
<fig id="fig1" position="float">
<label>Figure 1</label>
<caption>
<p id="fig1-p-1">
<bold>Mechanisms of AMPs.</bold> AMPs: antimicrobial peptides.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="eds-04-1008161-g001.tif" />
</fig>
<sec id="t2-1">
<title>Direct mechanism of action</title>
<p id="p-7">AMPs exert their effects through multi-target mechanisms. They can disrupt bacterial cell wall synthesis (such as targeting lipid II) or fungal cell wall components, directly lysing microbial cell membranes through barrel-shaped pores, carpet models, or toroidal-pore model [<xref ref-type="bibr" rid="B18">18</xref>, <xref ref-type="bibr" rid="B19">19</xref>]. The barrel-stave model shows that AMPs alter the conformation of bacterial membranes through electrostatic interactions and binding to the cell membrane, and insert into the phospholipid bilayer of the cell membrane in the form of polymers from a direction perpendicular to the cell membrane. Their hydrophobic sides face the acyl chains of the membrane outward, while the hydrophilic sides face inward to form pores or channels, thereby creating ion channels in the cell membrane. The carpet model was used to describe the action of AMPs such as dermaseptin [<xref ref-type="bibr" rid="B20">20</xref>]. The mechanism of action of this model: First, the AMP monomers combine with the phospholipid groups. Then, the AMP monomers are arranged successively on the bacterial membrane surface. Due to the combination of phospholipid groups with the peptide and the action of water, the orientation of the hydrophilic surface of the membrane changes, and a hydrophobic core is formed on the membrane. Finally, the AMP breaks the bacterial membrane [<xref ref-type="bibr" rid="B21">21</xref>]. The toroidal-pore model was first proposed by Yoneyama et al. [<xref ref-type="bibr" rid="B22">22</xref>]. The positively charged AMP was electrostatically captured by the negatively charged bacterial membrane. Its hydrophobic end inserted into the lipid bilayer, causing a breach in the hydrophobic center of the cell membrane, resulting in the positive curvature and stretching of the bacterial membrane, and further stretching of the breach further disrupted the integrity of the bacterial membrane.</p>
<p id="p-8">For drug-resistant bacteria and biofilms, AMPs utilize positive charge adsorption and hydrophobic insertion to disrupt membrane structure [<xref ref-type="bibr" rid="B23">23</xref>], increase membrane permeability to restore antibiotic sensitivity [<xref ref-type="bibr" rid="B24">24</xref>], and target intracellular metabolic processes to achieve “internal and external synergy” killing [<xref ref-type="bibr" rid="B25">25</xref>, <xref ref-type="bibr" rid="B26">26</xref>]. In addition, AMPs can also enter the interior of cells, inhibiting nucleic acid replication, protein synthesis, or interfering with key enzyme functions [<xref ref-type="bibr" rid="B27">27</xref>].</p>
<p id="p-9">For example, LL-37, a cationic amphiphilic α-helical peptide, exerts its antimicrobial effect through multiple synergistic mechanisms, mainly including: membrane disruption and pore formation; its inhibitory effect on the palmitoyl transferase that maintains membrane integrity results in enhanced AMPs [<xref ref-type="bibr" rid="B28">28</xref>]; it has lipopolysaccharide (LPS) neutralization, biofilm inhibition by outer membrane penetration, and intracellular inhibition by non-membrane targeting [<xref ref-type="bibr" rid="B29">29</xref>]. Plectasin is a fungal peptide that achieves antimicrobial effects by inhibiting cell wall biosynthesis. Through in vitro experiments, it can be confirmed that plectasin exerts antimicrobial activity by directly binding to lipid II. Nuclear magnetic resonance spectroscopy and computational models further determined the key residues involved in this interaction [<xref ref-type="bibr" rid="B30">30</xref>]. Apidaecins I and II are derived from bees and are also a type of gram-negative bacterial AMP that does not damage cell membranes. Studies have shown that apidaecins competitively bind to the A site of the ribosome, specifically inhibiting translation termination related to the UAG stop codon and suppressing the release of newly synthesized proteins [<xref ref-type="bibr" rid="B31">31</xref>, <xref ref-type="bibr" rid="B32">32</xref>]; P5 exerts antimicrobial effects on carbapenem-resistant <italic>Pseudomonas aeruginosa</italic> (CRPA) by disrupting the biofilm and promoting their death [<xref ref-type="bibr" rid="B24">24</xref>].</p>
</sec>
<sec id="t2-2">
<title>Immunomodulatory and anti-virulence mechanisms of AMPs</title>
<p id="p-10">AMPs have significant immunomodulatory functions: They possess the functions of neutralizing endotoxins and regulating the local immune microenvironment [<xref ref-type="bibr" rid="B24">24</xref>]. They can also attract immune cells such as neutrophils and macrophages to the infection site, upregulate the expression of pro-inflammatory factors such as TNF-α and IL-6, and act as vaccine adjuvants to enhance adaptive immune responses [<xref ref-type="bibr" rid="B14">14</xref>–<xref ref-type="bibr" rid="B16">16</xref>]. Moreover, AMPs can effectively inhibit biofilm formation by interfering with the quorum-sensing system, inhibiting extracellular polymer synthesis, changing the morphology and adhesion ability of free bacteria, and significantly reducing the drug resistance of biofilms [<xref ref-type="bibr" rid="B33">33</xref>]. The multi-modal and low-resistance characteristics of these mechanisms make AMPs ideal candidate drugs for combating drug-resistant infections. For instance, cathelicidin BF has immunomodulatory activity against <italic>Pseudomonas aeruginosa</italic> and can significantly alleviate the symptoms of pneumonia caused by it [<xref ref-type="bibr" rid="B34">34</xref>]; human α-lactalbumin made lethal to tumor cells (HAMLET) causes MRSA to lose resistance to methicillin and VRSA to lose resistance to vancomycin by depolarizing the bacterial membrane and dissipating proton dynamics [<xref ref-type="bibr" rid="B35">35</xref>].</p>
</sec>
<sec id="t2-3">
<title>Structural and computational approaches to elucidate AMP mechanisms</title>
<p id="p-11">The core technologies used in the research on the AMP mechanism: Experimental methods such as ARGO assay [<xref ref-type="bibr" rid="B36">36</xref>] and NMR [<xref ref-type="bibr" rid="B37">37</xref>] provide high-resolution complex structure and affinity data, and reverse-verify the computational predictions, forming a closed research system from atomic simulation to in vivo validation; molecular dynamics (MD) simulations achieve the revelation of the interaction between AMPs and cellular components, especially the interaction with cell membranes, by simulating the movements of atoms and molecules. This enables the understanding of the transition states during pore formation and the mechanism of AMPs’ action [<xref ref-type="bibr" rid="B38">38</xref>]. MD simulations through software such as GROMACS track the dynamic trajectories of AMP insertion into lipids, binding to DNA, or filamenting temperature-sensitive mutant Z (FtsZ) at the atomic scale, enabling real-time observation of pore formation or enzyme inhibition details. For instance, in the 2025 study by Tian et al. [<xref ref-type="bibr" rid="B39">39</xref>], MD simulation was used as the core validation step in the screening of AMPs. Firstly, nine candidate peptides selected by AI reached an equilibrium state in the solution (MD1). Subsequently, they were placed outside the bacterial membrane model composed of DMPC/DMPG (3:1) and observed for their adsorption behavior through a 2-microsecond simulation (MD2). Additionally, a two-microsecond simulation of a four-fold system (MD3) revealed that only AMP-12 and AMP-48 could spontaneously insert into the membrane and induce the formation of stable water channels (with over 500 water molecules present in the membrane for more than 50% of the time), while the other seven peptides only adsorbed on the membrane surface. The reliability of the screening strategy was verified using positive control (PGLa) and negative control (nAMP1) [<xref ref-type="bibr" rid="B39">39</xref>]. DTBind, proposed by Li et al. [<xref ref-type="bibr" rid="B40">40</xref>] in 2025, is a successful example of the systematic integration of AI with computational chemistry methods. AI automatically learn the nonlinear relationships between features to capture the deep patterns in sequence and structure, while computational chemistry provides physical priors such as geometric invariance description, surface morphology analysis, and interaction type identification to achieve unified and accurate prediction of drug-target binding occurrence, site, and affinity [<xref ref-type="bibr" rid="B40">40</xref>]. Quantitative structure-activity relationship (QSAR) and deep learning [DL; random forest (RF), AntiBERTy, etc.] convert peptide sequences into thousands of dimensional descriptors such as hydrophobicity, charge, and secondary structure, establish activity-to-toxicity prediction models, and achieve millions of virtual screenings; bioinformatics tools and platforms such as APD3, AutoDock, etc., complete sequence alignment, homology modeling, and molecular docking, quickly identifying potential targets [<xref ref-type="bibr" rid="B16">16</xref>]; through homology modeling, the three-dimensional structure of AMP is constructed using tools such as SWISS-MODEL or MODELLER [<xref ref-type="bibr" rid="B41">41</xref>].</p>
</sec>
</sec>
<sec id="s3">
<title>AMP databases</title>
<p id="p-12">The global spread of bacterial resistance to conventional antibiotics has spurred the research and development (R&amp;D) of novel anti-infective agents, in turn accelerating the discovery of AMPs. Concurrently, the integration of ML and DL into pharmaceutical R&amp;D workflows has further expedited the expansion of curated information within AMP databases. The AMP database is divided into two main categories: the general database and the specific database. The general database includes all types of AMPs, without considering peptide families with specific properties. The specific database covers information related to a certain type of AMP (such as only anti-parasitic peptides or bacteriocins) or includes a certain supergroup of AMPs (such as only anti-parasitic peptides or only bacteriocin peptides) [<xref ref-type="bibr" rid="B42">42</xref>].</p>
<p id="p-13">Among these databases, tools for predicting the three-dimensional structure of peptides are often included (such as AlphaFold or ESMFold). They often play a role in accurately predicting the three-dimensional structures of various biomolecular complexes, such as proteins, nucleic acids, small-molecule ligands, ions, and modified residues, within a unified DL framework [<xref ref-type="bibr" rid="B43">43</xref>]. Protein structure prediction models based on DL use deep neural networks (DNN, especially architectures such as Transformer) to automatically learn complex evolutionary patterns and physicochemical constraints from the amino acid sequence of proteins, so as to achieve high-precision prediction of their three-dimensional spatial structure [<xref ref-type="bibr" rid="B44">44</xref>]. However, these tools for dealing with AMPs are limited. Due to the flexible C-terminal tail that can freely swing to insert and penetrate the cell membrane, the diffusion model of short peptide structures with antimicrobial activity lacks the necessary physical confinement mechanism when working with engineered alternately folded systems with intrinsic conformational ambiguity, resulting in frequent unphysical atomic overlap in the prediction results [<xref ref-type="bibr" rid="B45">45</xref>, <xref ref-type="bibr" rid="B46">46</xref>]. At the same time, it also has limitations in producing “illusion” structures in disordered regions, failing to capture the dynamic behavior of molecules, and requiring a large number of samples to obtain high-precision predictions for some complex targets [<xref ref-type="bibr" rid="B43">43</xref>]. Thus, dynamic simulation needs to be taken into consideration.</p>
<p id="p-14">In this article, we list several comprehensive AMP databases and specialized AMP databases (<xref ref-type="table" rid="t1">Tables 1</xref> and <xref ref-type="table" rid="t2">2</xref>).</p>
<table-wrap id="t1">
<label>Table 1</label>
<caption>
<p id="t1-p-1">
<bold>Comprehensive antimicrobial peptide databases [<xref ref-type="bibr" rid="B47">47</xref>, <xref ref-type="bibr" rid="B48">48</xref>].</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>Name</bold>
</th>
<th>
<bold>Sequence</bold>
</th>
<th>
<bold>Update the year of the current version</bold>
</th>
<th>
<bold>Remarks</bold>
</th>
<th>
<bold>Reference</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>AMPDB v1</td>
<td>59,122</td>
<td>2023</td>
<td>Basic information, physical and chemical properties, activity information</td>
<td>[<xref ref-type="bibr" rid="B49">49</xref>]</td>
</tr>
<tr>
<td>dbAMP 3.0</td>
<td>35,518</td>
<td>2025</td>
<td>Sequence information, functional activity data, physicochemical properties, and structural annotations</td>
<td>[<xref ref-type="bibr" rid="B50">50</xref>, <xref ref-type="bibr" rid="B51">51</xref>]</td>
</tr>
<tr>
<td>DRAMP 4.0</td>
<td>30,260</td>
<td>2025</td>
<td>Antimicrobial activity data, structure information, hemolytic and cytotoxic information</td>
<td>[<xref ref-type="bibr" rid="B52">52</xref>–<xref ref-type="bibr" rid="B55">55</xref>]</td>
</tr>
<tr>
<td>DBAASP v3</td>
<td>&gt; 15,700</td>
<td>2021</td>
<td>Sequences, chemical modifications, 3D structures, bioactivities, and toxicities of peptides</td>
<td>[<xref ref-type="bibr" rid="B8">8</xref>, <xref ref-type="bibr" rid="B56">56</xref>]</td>
</tr>
<tr>
<td>LAMP2</td>
<td>23,253</td>
<td>2020</td>
<td>Primary structure, collection, composition, source, and function</td>
<td>[<xref ref-type="bibr" rid="B57">57</xref>]</td>
</tr>
<tr>
<td>CAMP<sub>R4</sub></td>
<td>24,243</td>
<td>2023</td>
<td>Sequences, source organisms, target organisms, minimum inhibitory, hemolytic concentrations, N- and C-terminal modifications, and presence of unusual amino acids</td>
<td>[<xref ref-type="bibr" rid="B58">58</xref>, <xref ref-type="bibr" rid="B59">59</xref>]</td>
</tr>
<tr>
<td>APD</td>
<td>5,188</td>
<td>2026</td>
<td>Structure, activity, stability, synergistic effect, toxicity, animal models, and recombinant expression systems</td>
<td>[<xref ref-type="bibr" rid="B60">60</xref>–<xref ref-type="bibr" rid="B62">62</xref>]</td>
</tr>
<tr>
<td>YADAMP</td>
<td>2,133</td>
<td>2012</td>
<td>Explicit presence of antimicrobial activity against the most common bacterial strains</td>
<td>[<xref ref-type="bibr" rid="B63">63</xref>]</td>
</tr>
<tr>
<td>InverPep</td>
<td>702</td>
<td>2017</td>
<td>Including InverPep code, phylum and species source, peptide name, sequence, peptide length, secondary structure, molar mass, charge, isoelectric point, hydrophobicity, Boman index, aliphatic index, and percentage of hydrophobic amino acids</td>
<td>[<xref ref-type="bibr" rid="B64">64</xref>]</td>
</tr>
</tbody>
</table>
</table-wrap>
<table-wrap id="t2">
<label>Table 2</label>
<caption>
<p id="t2-p-1">
<bold>Specialized antimicrobial peptide (AMP) databases.</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>Name</bold>
</th>
<th>
<bold>Sequence</bold>
</th>
<th>
<bold>Description</bold>
</th>
<th>
<bold>Update year of the current version</bold>
</th>
<th>
<bold>Remarks</bold>
</th>
<th>
<bold>Reference</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>PlantPepDB</td>
<td>3,848</td>
<td>AMPs derived from plants</td>
<td>2020</td>
<td>Activity, hemolytic activity, cytotoxicity</td>
<td>[<xref ref-type="bibr" rid="B65">65</xref>]</td>
</tr>
<tr>
<td>ACovPepDB</td>
<td>518</td>
<td>Anti-coronavirus peptides</td>
<td>2022</td>
<td>Organism, activity, mechanism</td>
<td>[<xref ref-type="bibr" rid="B66">66</xref>]</td>
</tr>
<tr>
<td>AVPdb</td>
<td>2,683</td>
<td>Antiviral peptide</td>
<td>2014</td>
<td>Organism, activity, mechanism</td>
<td>[<xref ref-type="bibr" rid="B67">67</xref>]</td>
</tr>
<tr>
<td>HIPdb</td>
<td>981</td>
<td>HIV inhibiting peptides</td>
<td>2013</td>
<td>Organism, activity, mechanism</td>
<td>[<xref ref-type="bibr" rid="B68">68</xref>]</td>
</tr>
<tr>
<td>DRAVP</td>
<td>5,688</td>
<td>Antiviral peptide</td>
<td>2023</td>
<td>Antiviral activity, structure information, physicochemical information, and literature information</td>
<td>[<xref ref-type="bibr" rid="B69">69</xref>]</td>
</tr>
<tr>
<td>DCTPep</td>
<td>6,214</td>
<td>Cancer therapy peptides</td>
<td>2024</td>
<td>Tumor active peptide, cancer targeted peptides, targeted peptide conjugates, and collection of targeted peptides</td>
<td>[<xref ref-type="bibr" rid="B70">70</xref>]</td>
</tr>
</tbody>
</table>
</table-wrap>
<sec id="t3-1">
<title>Common databases</title>
<sec id="t3-1-1">
<title>DRAMP</title>
<p id="p-15">Our laboratory has constructed DRAMP, a comprehensive open-access database of AMPs dedicated to accelerating the clinical translation of AMPs. The version 4.0 contains 30,260 entries (8,001 new additions compared to the previous version), with core innovations and data integration including: the first introduction of a stability dataset, 110 newly added verified serum/protease stability and half-life data; the expansion of clinical entries to 96, integrating repetitive sequences and associating clinical trial numbers; 2,891 new hemolytic activity and 2,674 new cytotoxicity experimental data, systematically marking toxicity bottlenecks; the addition of an “Expanded AMPs” subset (179 entries) including experiments verified by Chinese journals not indexed in international databases; the number of experimentally verified structures (PDB) increased to 419, and the number of predicted structures reached 1,570. The database provides categorized downloads, covering 70.47% of unique sequences (18,580 entries not found in APD/DBAASP/CAMP). The database can be accessed at: <uri xlink:href="http://dramp.cpu-bioinfor.org/">http://dramp.cpu-bioinfor.org/</uri> [<xref ref-type="bibr" rid="B52">52</xref>–<xref ref-type="bibr" rid="B55">55</xref>].</p>
</sec>
<sec id="t3-1-2">
<title>dbAMP</title>
<p id="p-16">dbAMP is dedicated to addressing the increasingly severe AMR challenges in the post-pandemic era. This database has significantly expanded its data scale and functional depth through continuous updates. The latest version (dbAMP 3.0, 2025) integrates data from over 5,200 scientific papers, including 33,065 AMPs and 2,453 antimicrobial proteins, covering 3,534 source organisms, representing a nearly threefold increase from the initial version in 2019 (12,389 entries). The core breakthrough lies in the use of ESMFold to predict the three-dimensional structures of over 30,000 AMPs and to provide contact maps and secondary structure information, which greatly facilitates structural-function research. To accelerate clinical translation, the platform has added an AMP rational design pipeline, using multi-objective optimization algorithms (balancing antimicrobial activity, low toxicity, and stability) to assist in the development of new antibacterial peptides. At the same time, dbAMP integrates over 20 prediction tools and benchmark data set portals, becoming the core hub for AMP discovery, mechanism research, and clinical design. The database can be accessed at: <uri xlink:href="https://awi.cuhk.edu.cn/dbAMP/">https://awi.cuhk.edu.cn/dbAMP/</uri> [<xref ref-type="bibr" rid="B50">50</xref>, <xref ref-type="bibr" rid="B51">51</xref>].</p>
</sec>
<sec id="t3-1-3">
<title>DBAASP</title>
<p id="p-17">DBAASP is currently the most comprehensive resource library for antimicrobial and cytotoxic peptides, containing over 15,700 entries, including &gt; 14,500 monomers and nearly 400 homo- and hetero-multimers. Its core advantage lies in integrating multi-dimensional data, covering more than 2,700 ribosomal synthesis peptides, over 170 non-ribosomal peptides, and more than 12,000 synthetic peptides, including various modifications such as linear, cyclic, and disulfide bonds; providing experimental activity data for 4,200+ targets [such as minimum inhibitory concentration (MIC)/IC<sub>50</sub>], as well as hemolysis/cytotoxicity indicators; generating 3,200+ peptide conformational trajectories through an automated MD process and correlating with 400+ PDB experimental structures, supporting online visualization and analysis; the newly added PAASS tool can predict the activity of peptides against specific pathogens, and expands the calculation of physicochemical properties. As a key platform for antibacterial peptide design, DBAASP v3 promotes the rational development of new anti-infective peptides through high-precision experimental data, dynamic structure simulation, and ML tools. The access address is: <uri xlink:href="http://dbaasp.org">http://dbaasp.org</uri> [<xref ref-type="bibr" rid="B8">8</xref>, <xref ref-type="bibr" rid="B56">56</xref>].</p>
</sec>
<sec id="t3-1-4">
<title>AVPdb</title>
<p id="p-18">AVPdb is a specialized database dedicated to experimental verification of antiviral peptides (AVPs), containing 2,683 peptide sequences targeting over 60 medically significant viruses (such as influenza, HCV, HSV, etc.), including 624 chemically modified peptide segments. This database provides detailed experimental information for each peptide, including sequence, source, target virus, inhibitory activity, mechanism of action, cell line, detection method, and complete PubMed literature reference. Additionally, AVPdb integrates various analysis tools, such as BLAST alignment, physicochemical property calculation, and structure prediction, enabling users to browse, search, and download data. This resource aims to promote AVP research and drug development, and is publicly accessible at: <uri xlink:href="http://crdd.osdd.net/servers/avpdb">http://crdd.osdd.net/servers/avpdb</uri> [<xref ref-type="bibr" rid="B67">67</xref>].</p>
</sec>
<sec id="t3-1-5">
<title>PlantPepDB</title>
<p id="p-19">PlantPepDB is a comprehensive database focusing on plant-derived active peptides, which includes 3,848 peptide sequences from 443 plant species, covering 9 major functional categories, such as antimicrobial, anti-cancer, antiviral, anti-hypertension, immune regulation, and many other biological activities. This database integrates data from 11 existing databases and 835 research papers. Each peptide entry provides detailed sequence, source plant, functional description, experimental verification level (protein level, transcriptional level, prediction or homology inference), physicochemical properties, tertiary structure prediction, and dictionary of protein secondary structure (DSSP) secondary structure report. The database offers multiple search methods and integrates analysis tools such as BLAST, Smith-Waterman alignment, and peptide mapping, enabling users to perform function prediction, structure analysis, and target screening. The database also places particular emphasis on data deduplication and information integration, merging duplicate peptide entries from different sources into a single record with more comprehensive information. The database is publicly accessible at: <uri xlink:href="http://www.nipgr.ac.in/PlantPepDB/">http://www.nipgr.ac.in/PlantPepDB/</uri> [<xref ref-type="bibr" rid="B65">65</xref>].</p>
</sec>
<sec id="t3-1-6">
<title>DCTPep</title>
<p id="p-20">DCTPep is an open-source comprehensive database specifically designed for cancer therapeutic peptides. It contains a total of 6,214 entries, including 6,106 peptide sequences and 108 peptide drugs. This database not only encompasses traditional anti-cancer peptides (ACPs) but also integrates information on cell-penetrating peptides with targeting functions, cancer-targeting peptides, and peptide-drug conjugates. It provides detailed annotations such as activity, targets, physicochemical properties, and three-dimensional structures predicted by AlphaFold. The database covers data on over 60 targets and more than 400 cancer cell lines. Its purpose is to offer valuable resources for the design, screening, and mechanism-of-action research of ACPs. The database is publicly accessible at: <uri xlink:href="http://dctpep.cpu-bioinfor.org/">http://dctpep.cpu-bioinfor.org/</uri> [<xref ref-type="bibr" rid="B70">70</xref>].</p>
</sec>
</sec>
<sec id="t3-2">
<title>AMP database evaluation</title>
<p id="p-21">In evaluating AMP databases, it is typically necessary to establish comparative benchmarks across multiple dimensions. First, data scale and source diversity refer to the total number of peptides included in the database and whether it encompasses various types, such as natural and synthetic peptides. Second, data uniqueness and functional richness are crucial, as low-redundancy, independent datasets are essential for preventing model overfitting. Third, annotation depth and comprehensiveness extend beyond basic sequence information to include details such as source species, three-dimensional structures, post-translational modifications, MIC data, toxicity profiles, and mechanisms of action, which directly determine the research value of the database. Additionally, data update frequency and maintenance status are significant indicators, with databases that undergo continuous updates and data corrections, such as APD, being more reliable. Finally, the integration of functional tools and their utility—including retrieval, design capabilities, and support for emerging technologies such as ML predictions—forms a critical dimension for assessing the applicability of a database. Key aspects include whether it offers tools for BLAST alignment, sequence search, physicochemical property calculation, and even ML-based predictive capabilities [<xref ref-type="bibr" rid="B71">71</xref>].</p>
<p id="p-22">Take dbAMP and DBAASP mentioned above as example. At the basic data level, dbAMP leads significantly over DBAASP with its scale advantage of 35,518 peptide sequences (including 33,065 AMPs) and 3,534 source organisms, compared to DBAASP’s over 15,700 records. In terms of annotation depth, the former categorizes 58 types of functional activities and fully annotates MIC efficacy data, while the latter provides a unique negative example resource for ML by including experimentally verified “inactive” peptides. In the aspect of structural information processing, dbAMP adopts the cutting-edge protein language model (pLM) ESMFold to complete high-throughput 3D structure prediction for over 30,000 peptides, with a prediction speed approximately 50 times faster than AlphaFold2, while DBAASP uses a classic computational path to provide high-precision 400 ns MD simulation trajectories for over 3,200 peptides. In terms of technical adaptability: dbAMP comprehensively utilizes generative AI, not only introducing GPT-4 to assist in literature mining but also integrating AMPActiPred based on deep forests, HemoFinder for hemolytic toxicity prediction, and a multi-objective peptide optimization platform based on genetic algorithms; DBAASP, on the other hand, focuses on specific strain activity prediction models (PAASS) and DiSAAP statistical analysis tools, and ensures the biophysical authenticity of simulations through the CHARMM36m force field. In summary, dbAMP is more suitable for large-scale structure-function mining and AI-driven peptide design optimization, while DBAASP has advantages in targeted strain prediction and sequence pattern statistics [<xref ref-type="bibr" rid="B8">8</xref>, <xref ref-type="bibr" rid="B50">50</xref>].</p>
</sec>
<sec id="t3-3">
<title>Genome mining</title>
<p id="p-23">Genome mining refers to the analysis of sequenced microbial genomes to identify biosynthetic gene clusters (BGCs) that may encode novel AMPs or secondary metabolites. Most of the data in the database are derived from genome mining, and the main research subjects are actinomycetes and fungi. Mining strategies can be divided into two categories: one is to identify BGCs based on conserved biosynthetic enzymes such as non-ribosomal peptide synthetase (NRPS) and polyketide synthase (PKS); the second is to combine algorithms to predict precursor peptides and their post-translational modifications. To achieve these strategies, researchers have developed a variety of specialized tools, such as BAGEL, the earliest tool for mining bacteriocins; antiSMASH, the most comprehensive tool that supports the identification of 81 BGCs; ThioFinder, which specializes in identifying thiopeptides; RODEO, which combines ML to identify lasso peptides; Ripp-prism, which can predict 21 classes of ribosomally synthesized and post-translationally modified peptides (RiPP) structures and BGCs, and RiPPMiner, which can predict RiPP classes and cleavage sites based on ML. The successful application of these tools has led to the discovery of a large number of new AMPs, such as stambomycins from <italic>Streptomyces ambofaciens</italic> and medipeptin A from <italic>Pseudomonas mediterranea</italic>. The strategies of genome mining have evolved from the early reliance on BLAST homology search to the identification based on conserved modifying enzymes, and now to the intelligent development process that combines ML and DL to achieve automated and high-throughput predictions, enabling the discovery of high-throughput and culture-independent AMPs [<xref ref-type="bibr" rid="B72">72</xref>–<xref ref-type="bibr" rid="B75">75</xref>].</p>
</sec>
</sec>
<sec id="s4">
<title>Computational approaches for AMP prediction</title>
<p id="p-24">The establishment of AMP databases has significantly facilitated the development of computational prediction methods for AMPs. Certain AMP databases are equipped with predictive functionalities. For instance, the CAMP<sub>R3</sub> database provides a suite of prediction tools that allow users to submit any peptide sequence for evaluation. Using ML models such as support vector machine (SVM) and RF, the system predicts whether the peptide exhibits antimicrobial activity. It can also identify localized antimicrobial regions (e.g., functional domains) within longer peptide sequences, or predict potential AMPs based on family-specific characteristics using publicly available data [<xref ref-type="bibr" rid="B58">58</xref>]. Similarly, DBAASP supports predictive analysis based on its database content. Its PAASS tool enables the prediction of antimicrobial activity of user-submitted peptide sequences against specific microbial strains (including <italic>Escherichia coli</italic> ATCC 25922, <italic>Staphylococcus aureus</italic> ATCC 25923, and seven pathogenic organisms in total), allowing for strain-level targeting. Another tool, PGA, predicts whether a peptide sequence possesses broad-spectrum antimicrobial activity against a comprehensive set of microbial targets [<xref ref-type="bibr" rid="B8">8</xref>]</p>
<p id="p-25">The computer-based prediction of AMPs using established known peptides was first introduced by the APD database in 2004 [<xref ref-type="bibr" rid="B61">61</xref>]. Later, in 2007, methods such as artificial neural network (ANN), quantitative matrices (QM), and SVM were developed based on the APD dataset [<xref ref-type="bibr" rid="B76">76</xref>]. Recently, DL methods and large language models (LLMs) have also been applied to the discovery of AMPs.</p>
<sec id="t4-1">
<title>Prediction model based on ML algorithms</title>
<p id="p-26">Since 2007, a wide range of different ML algorithms have been applied in AMP classification and prediction models. The ML methods successfully applied to the modeling of AMPs include decision tree (DT) [<xref ref-type="bibr" rid="B77">77</xref>–<xref ref-type="bibr" rid="B79">79</xref>], K-nearest neighbor (K-NN) [<xref ref-type="bibr" rid="B79">79</xref>, <xref ref-type="bibr" rid="B80">80</xref>], naive bayes (NB) [<xref ref-type="bibr" rid="B81">81</xref>, <xref ref-type="bibr" rid="B82">82</xref>], eXtreme Gradient Boosting (XGBoost) [<xref ref-type="bibr" rid="B82">82</xref>], SVM [<xref ref-type="bibr" rid="B79">79</xref>, <xref ref-type="bibr" rid="B82">82</xref>], RF [<xref ref-type="bibr" rid="B79">79</xref>, <xref ref-type="bibr" rid="B82">82</xref>, <xref ref-type="bibr" rid="B83">83</xref>], neural network (including its DL variants) [<xref ref-type="bibr" rid="B84">84</xref>], and many other methods. Many AMP prediction models, such as CalcAMP [<xref ref-type="bibr" rid="B85">85</xref>], AMPpred-EL [<xref ref-type="bibr" rid="B86">86</xref>], PTPAMP [<xref ref-type="bibr" rid="B87">87</xref>], etc., have utilized these ML models. Next, we will introduce several of them.</p>
<p id="p-27">The utility of DTs in AMP discovery primarily stems from their high interpretability and efficient processing of structured features. Through the three-step learning rules of feature selection, tree generation, and pruning, they are used in the detection of AMPs for classifying active peptides and non-active peptides. Their advantages include strong interpretability, which can intuitively display the key discriminative features and corresponding hypotheses. For example, the ADTree model, by identifying key discriminative features such as specific dipeptide composition, cationic amino acid content, and alpha-helical structure, can accurately pinpoint potential bacteriocins. The clear rules it generates directly inform the rational design of novel AMPs with desired properties [<xref ref-type="bibr" rid="B77">77</xref>].</p>
<p id="p-28">K-NN is a distance-based classification method that relies on the similarity between the feature vectors of peptide sequences in AMP prediction. Since K-NN does not require explicit training, researchers can quickly assess which category of peptides a new sequence is closer to in the database. For example, iAMP-2L uses fuzzy K-NN (FKNN) for multi-label functional prediction, by calculating the similarity between the input sequence and the known AMPs in the training set to determine whether it has antimicrobial activity and other functions [<xref ref-type="bibr" rid="B80">80</xref>]. The advantage of K-NN is that it does not require a training process; however, when the feature dimension increases, the “dimension curse” causes the distance measurement to lose its discriminatory power, and the computational cost also grows linearly with the data size.</p>
</sec>
<sec id="t4-2">
<title>Prediction model based on DL</title>
<p id="p-29">DL in AMP prediction mainly falls into four categories: basic models, language models, graph neural networks (GNN), and mixed and multimodal models [<xref ref-type="bibr" rid="B88">88</xref>].</p>
<sec id="t4-2-1">
<title>Basic models</title>
<p id="p-30">The basic models mainly include DNN, convolutional neural networks (CNN), recurrent neural networks (RNN), their variants (LSTM, BiLSTM, GRU, etc.), and other hybrid models. CNN is adept at extracting local conserved motifs using multi-scale convolution and can effectively capture local sequence patterns, making it suitable for sequence encoding; DNN, which is a multi-layer fully connected network, is suitable for the fusion and classification of various manual features (such as amino acid composition, dipeptide composition, physical and chemical properties), and achieves this by globally optimizing the network parameters through minimizing the selected loss function. This is similar to CNN. Many works have implemented AMP prediction. For instance, Ma et al. [<xref ref-type="bibr" rid="B89">89</xref>] employed DL tools, including DNN, to mine and predict novel AMPs from human gut metagenomic data on a large scale. Subsequent experimental validation confirmed the antimicrobial activity of over a hundred of these predicted peptides, demonstrating the powerful application of DNN models in the field of AMP discovery [<xref ref-type="bibr" rid="B89">89</xref>]. RNN captures long-range dependencies between amino acids through a recurrent mechanism, but has a slower training speed and requires a large amount of data. It is often cascaded with CNN to form the CNN-BiLSTM architecture, and is supplemented by transfer learning, multi-task learning, or stacking integration to address the issues of insufficient data and model robustness [<xref ref-type="bibr" rid="B90">90</xref>]. DL variants (such as CNN, LSTM, DNN) simulate the processing of complex patterns by human brain neurons. In 2022, the research integrated LSTM, Attention, BERT, and other natural language processing (NLP) models. The prediction accuracy of AMP exceeded 91%, and the AUPRC reached 0.9244. Moreover, it does not require sequence homology or amino acid composition information and can discover new peptides with a homology lower than 40%, significantly reducing false positive (FP) [<xref ref-type="bibr" rid="B89">89</xref>]. TriNet further integrates sequence fingerprints, position-specific scoring matrix (PSSM) evolutionary information, and physicochemical properties with a three-layer network. It introduces the training-validation interaction (TVI) mechanism and supports hyperparameter tuning to enhance the generalization ability of ACP/AMP predictions [<xref ref-type="bibr" rid="B84">84</xref>].</p>
<p id="p-31">Currently, such prediction models generally encounter the following common bottlenecks: Firstly, the models are highly dependent on known databases for training, which limits their generalization ability and makes it difficult for them to identify sequences with low homology and novel categories. Moreover, the training data is mostly concentrated on short peptides, resulting in poor prediction performance for long sequences. Secondly, most models only learn based on amino acid sequences and lack the integration of key physical and chemical information, such as protein spatial conformation, dynamic behavior, and interactions. Finally, the selected candidate molecules often have toxicity issues, and the experimental verification cost is high, which limits their efficient transformation from computational prediction to practical application [<xref ref-type="bibr" rid="B84">84</xref>, <xref ref-type="bibr" rid="B89">89</xref>].</p>
</sec>
<sec id="t4-2-2">
<title>Language models</title>
<p id="p-32">The role of the language model is to use large-scale unsupervised pre-training to transform amino acid sequences into semantic-rich context vectors, and then perform lightweight fine-tuning to achieve performance even surpassing that of traditional feature engineering. This includes BERT and its variants, pLMs, and peptide-specific lightweight models, etc. BERT and its variants obtain context-sensitive sequence embeddings through large-scale masked language modeling and encode and extract features for AMPs [<xref ref-type="bibr" rid="B49">49</xref>, <xref ref-type="bibr" rid="B91">91</xref>], significantly reducing the need for labeled data and improving prediction performance, and can further fine-tune the input for other models to adapt to AMP classification, functional annotation, and even design tasks. pLMs are inspired by BERT and capture complex dependencies in protein sequences through self-supervised learning and multi-task pre-training, such as Diff-AMP [<xref ref-type="bibr" rid="B92">92</xref>] and AMP-BERT [<xref ref-type="bibr" rid="B93">93</xref>], etc. In 2023, Lobo et al. [<xref ref-type="bibr" rid="B94">94</xref>] used a concise and transferable “pre-training and fine-tuning” pipeline for rapid prediction of the antifungal activity of peptide segments, and ultimately found that the combination of SeqVec + k-Best feature selector + support vector classification (SVC) had the best effect. Peptide language models draw on the idea of pre-trained language models in NLP, are specifically designed for protein or peptide sequences, but need to be combined with structural biology and efficient algorithms to solve the key bottlenecks in the design of AMPs [<xref ref-type="bibr" rid="B88">88</xref>].</p>
<p id="p-33">In recent years, the Transformer neural network architecture has demonstrated significant application value in the field of bioinformatics. Originally proposed by Vaswani et al. [<xref ref-type="bibr" rid="B95">95</xref>], its core self-attention mechanism effectively captures long-range dependencies in sequences, providing new technical approaches for protein and peptide sequence analysis.</p>
<p id="p-34">In protein sequence representation learning, the evolutionary scale modeling (ESM)-2 model developed by Lin et al. [<xref ref-type="bibr" rid="B96">96</xref>] employs self-supervised pre-training strategies to learn semantically rich embeddings from large-scale protein sequence data. Similarly, the ProtT5 model proposed by Elnaggar et al. [<xref ref-type="bibr" rid="B97">97</xref>] utilizes advanced pre-training methods and has achieved excellent performance in various protein-related tasks.</p>
<p id="p-35">In the field of drug discovery, the Transformer architecture also plays an important role. The PepMLM model developed by Chen et al. [<xref ref-type="bibr" rid="B98">98</xref>] employs masked language modeling and fine-tuning techniques to enable effective prediction and design of peptide sequence functions. This model can adapt to specific biomedical application scenarios with minimal labeled data, demonstrating good practicality and generalization value. However, such models often encounter the problem of a limited experimental verification scope during the experiment. The selection process may still have a relatively high FP rate. Leveraging features from larger-scale sequence datasets can improve AMP classification performance, but when it comes to model size, bigger isn't always better. Moreover, incorporating structural information can greatly enhance screening efficiency [<xref ref-type="bibr" rid="B98">98</xref>, <xref ref-type="bibr" rid="B99">99</xref>].</p>
</sec>
<sec id="t4-2-3">
<title>GNN</title>
<p id="p-36">The core idea of GNN in the prediction of AMPs is: treating the atoms/residues/fragments of the peptide as nodes, and establishing edges using chemical bonds, field atomic information, or spatial adjacency relationships to construct a graph structure, converting the sequence into a structural graph. GNN aggregates neighbor information through message passing to simultaneously capture sequence order and three-dimensional/topological features, featuring direct peptide graph representation, hierarchical graph representation, and alternative representations leveraging DL. It can incorporate physical-chemical properties (charge, hydrophobicity) as node or edge features, but the structural prediction error will gradually increase with each level or due to reasons such as over-smoothing [<xref ref-type="bibr" rid="B88">88</xref>, <xref ref-type="bibr" rid="B100">100</xref>, <xref ref-type="bibr" rid="B101">101</xref>]. For example, graph neural network models such as PNAbind can be used for high-throughput annotation of nucleic acid binding functions of unknown proteins, which can complete local prediction at the level of a single residue and global prediction at the level of the whole protein complex. As a structural and functional characterization tool, PNAbind is suitable for functional analysis of various proteins [<xref ref-type="bibr" rid="B102">102</xref>].</p>
<p id="p-37">Although graph neural networks can effectively model the structural features and spatial relationships of peptide sequences in AMP prediction, their application still faces certain limitations: Firstly, the model performance is highly dependent on the prediction accuracy of the input structure, which requires a large number of high-quality experimental resolved structures. However, the AMP peptide sequences often provide enough information to construct complex graph structures, resulting in unstable performance and low accuracy. Secondly, it is difficult for graph models to integrate non-graphical information (such as evolutionary spectrum, chemical modification, etc.), which limits their ability to model multi-dimensional features. In addition, although its inference speed is fast, the training process has certain requirements on computational resources, which may limit its wide application in resource-limited environments. Therefore, the advantages of graph models in AMP prediction have not been fully exploited, and it is still necessary to combine multimodal information (such as language models) for optimization and fusion [<xref ref-type="bibr" rid="B88">88</xref>, <xref ref-type="bibr" rid="B102">102</xref>].</p>
</sec>
<sec id="t4-2-4">
<title>The mixed and multimodal models</title>
<p id="p-38">The mixed and multimodal model achieves information complementation within a unified framework by jointly employing sequence embeddings, predictive structural graphs, evolutionary spectra, and physical-chemical properties, and aligning multi-source features through attention or contrastive learning. For example, APEX screened out 37,176 broad-spectrum active peptides from extinct species (11,035 of which only appeared in extinct species), verified through 69 synthetic experiments, 59% of which were active, and the in vivo efficacy was comparable to polymyxin B, proving that multimodal DL combined with molecular de-extinction is an efficient path for discovering new antibiotics [<xref ref-type="bibr" rid="B103">103</xref>]; UniproLcad is an AMP predictor that integrates a multi-pLM. It concatenates the sequence representations of three pre-trained models, ESM-2, ProtBert, and UniRep, and then uses BiLSTM, 1D-CNN, and Attention to refine the features, and finally uses multi-layer perceptron (MLP) for classification. It significantly outperforms existing methods [<xref ref-type="bibr" rid="B104">104</xref>].</p>
<p id="p-39">The wide application of such models still faces a series of common problems. First of all, such models usually rely on large-scale labeled datasets for training. When the size of the dataset is insufficient and the source is single, it is difficult to truly reflect the robustness of the model in diverse scenarios. However, the heterogeneity, unbalanced length distribution, and potential redundancy of biological data may lead to the model learning the specific deviation of the dataset instead of the universal biological law. Secondly, multimodal fusion often introduces multiple deep subnetworks and complex interaction modules to form a highly nonlinear “black box” system. Although the prediction accuracy is improved, it is difficult to explain the decision basis from the biological perspective. Even with the introduction of attention mechanisms, most studies still stay at the visualization level and lack deep integration with experimental verification, which leads to doubts about the biological significance of attention weight. Thirdly, it is common practice to send the outputs of each modal encoder to the prediction layer after simple splicing or weighted summation. This late fusion method fails to fully model the complex intersection between modes, which limits the ability of the model to describe the internal mechanism of the biological system. In addition, most hybrid multimodal models do not take into account the dynamics and physicochemical details of biological systems, such as protein conformational changes, water molecule-mediated interactions, ligand flexibility, and other key factors, resulting in a gap between the prediction results and the real physiological environment [<xref ref-type="bibr" rid="B44">44</xref>, <xref ref-type="bibr" rid="B103">103</xref>, <xref ref-type="bibr" rid="B104">104</xref>].</p>
<p id="p-40">Overall, these four models have their own focuses: Basic models are lightweight and efficient, language models are semantically rich, graph models have precise structures, and multimodal fusion balances performance and generalization, collectively forming the current technical panorama of AMP prediction. DL has four main technical routes in AMP prediction: CNN/RNN basic models, language models, graph neural networks, and multimodal models.</p>
</sec>
</sec>
<sec id="t4-3">
<title>Prediction model based on LLMs</title>
<p id="p-41">LLMs, particularly those based on the Transformer architecture, have revolutionized NLP and are now making significant impacts in bioinformatics and cheminformatics. Their core advantage lies in the ability to learn complex, context-dependent representations of sequential data through self-supervised learning on vast unlabeled corpora. In the context of AMPs, amino acid sequences can be treated as a biological language, allowing LLMs to capture long-range dependencies, evolutionary constraints, and structural semantics without the need for manual feature engineering [<xref ref-type="bibr" rid="B95">95</xref>, <xref ref-type="bibr" rid="B105">105</xref>].</p>
<sec id="t4-3-1">
<title>The Transformer architecture and pLMs</title>
<p id="p-42">The foundation of modern LLMs is the Transformer architecture, introduced by Vaswani et al. [<xref ref-type="bibr" rid="B95">95</xref>], which utilizes a self-attention mechanism to weigh the importance of different parts of the input sequence, effectively capturing global dependencies. This architecture has been adapted for proteins, leading to the development of specialized “pLMs”. These models are pre-trained on massive datasets of protein sequences (e.g., from UniRef or PDB) to learn the underlying “grammar” of protein sequences.</p>
<p id="p-43">Several prominent pLMs have been developed and applied to AMP discovery. For instance, the ESM family of models, such as ESM-1b and ESM-2, developed by Lin et al. [<xref ref-type="bibr" rid="B96">96</xref>], uses self-supervised learning to generate rich embeddings that capture biological and structural properties from sequence alone [<xref ref-type="bibr" rid="B106">106</xref>]. Similarly, the ProtTrans family, including ProtT5 proposed by Elnaggar et al. [<xref ref-type="bibr" rid="B97">97</xref>], leverages Transformer architectures and has achieved state-of-the-art performance in various protein prediction tasks by providing powerful feature representations.</p>
</sec>
<sec id="t4-3-2">
<title>Application of LLMs in AMP prediction</title>
<p id="p-44">pLMs serve as powerful feature extractors for downstream tasks like AMP prediction. Their pre-trained knowledge can be leveraged through fine-tuning or by using their embeddings as input to simpler classifiers. This approach has proven highly effective, especially in data-sparse scenarios common in peptide research [<xref ref-type="bibr" rid="B105">105</xref>].</p>
<p id="p-45">For example, HMD-AMP, developed by Yu et al. [<xref ref-type="bibr" rid="B106">106</xref>], utilizes the ESM-1b pLM combined with a deep forest classifier to accurately identify AMP functions across 11 target types, demonstrating robustness even with small sample sizes. Another model, AMP-BERT, proposed by Lee et al. [<xref ref-type="bibr" rid="B93">93</xref>], fine-tunes a BERT-based model specifically for AMP function prediction, showcasing the adaptability of the Transformer architecture. PeptideBERT, introduced by Guntuboina et al. [<xref ref-type="bibr" rid="B107">107</xref>], is a lightweight Transformer-based language model specifically fine-tuned for peptide property prediction, including antimicrobial activity. It demonstrates that even smaller, specialized models can achieve high performance by leveraging the power of the Transformer architecture [<xref ref-type="bibr" rid="B107">107</xref>]. Furthermore, Lobo et al. [<xref ref-type="bibr" rid="B94">94</xref>] successfully used a “pre-training and fine-tuning” pipeline with the SeqVec pLMs to rapidly and accurately predict antifungal peptides, outperforming traditional methods.</p>
<p id="p-46">A more recent advancement is the integration of LLMs with other architectures to create hybrid models. PepGraphormer, developed by Lin et al. [<xref ref-type="bibr" rid="B108">108</xref>], fuses the semantic power of the ESM-2 language model with a graph attention network (GAT). This hybrid framework constructs a heterogeneous graph containing peptide and amino acid nodes, allowing it to capture both global semantic understanding and local structural relationships for high-precision AMP identification without requiring explicit 3D structures [<xref ref-type="bibr" rid="B108">108</xref>].</p>
</sec>
<sec id="t4-3-3">
<title>LLMs for peptide generation</title>
<p id="p-47">Beyond prediction, LLMs and deep generative models are increasingly used for the de novo design of novel AMPs. These models learn the distribution of functional peptide sequences and can generate new ones with desired properties. For instance, deepAMP, introduced by Li et al. [<xref ref-type="bibr" rid="B109">109</xref>], is a deep generation framework based on peptide language models. It uses a pre-training and fine-tuning strategy to optimize peptides from low-activity to high-activity and broad-spectrum candidates, with subsequent experimental validation confirming its efficiency [<xref ref-type="bibr" rid="B109">109</xref>]. Similarly, BroadAMP-GPT, proposed by Li et al. [<xref ref-type="bibr" rid="B110">110</xref>], combines CNN and LSTM architectures for AI-driven design and multi-model screening, achieving high experimental hit rates. In the same type, BatGPT-Chem proposed by Yang et al. [<xref ref-type="bibr" rid="B111">111</xref>], based on the learning of massive chemical data, establishes the mapping relationship between molecular properties and molecular structures represented by SMILES, and the SMILES code is used as a “special language” to carry out unified sequence modeling with natural language. The peptide molecular codes that meet the requirements of these properties are directly output, so as to complete the conversion from “property description” to “peptide structure generation” [<xref ref-type="bibr" rid="B111">111</xref>].</p>
<p id="p-48">Other generative models, while not strictly LLMs, also play a crucial role. HydrAMP, a conditional variational autoencoder (cVAE) developed by Szymczak et al. [<xref ref-type="bibr" rid="B112">112</xref>], creates a continuous latent space for peptide sequences, enabling both the optimization of existing peptides and the generation of entirely new, highly potent AMPs. Multi-CGAN, introduced by Yu et al. [<xref ref-type="bibr" rid="B113">113</xref>], uses a conditional generative adversarial network to design peptides with multiple desired attributes, such as high activity, low toxicity, and specific secondary structures.</p>
</sec>
<sec id="t4-3-4">
<title>Outlook and current limitation</title>
<p id="p-49">The application of LLMs represents a paradigm shift in AMP discovery, moving towards more intelligent and data-efficient design [<xref ref-type="bibr" rid="B5">5</xref>]. By treating sequences as a language, these models can uncover hidden patterns and design principles that are difficult to capture with traditional methods. However, challenges remain. As highlighted by several studies, the quality and scale of training data are paramount [<xref ref-type="bibr" rid="B47">47</xref>, <xref ref-type="bibr" rid="B114">114</xref>]. Models trained on biased or limited datasets may have poor generalization. Furthermore, while powerful, LLMs are often “black boxes”, and enhancing their interpretability to reveal the key structural motifs driving activity is a critical area of future research [<xref ref-type="bibr" rid="B115">115</xref>]. The integration of LLMs with structural modeling and experimental validation will be key to realizing their full potential in developing next-generation antimicrobial therapeutics [<xref ref-type="bibr" rid="B6">6</xref>].</p>
</sec>
</sec>
</sec>
<sec id="s5">
<title>Model evaluation</title>
<p id="p-50">In the model evaluation criteria, a series of performance indicators are calculated based on four basic parameters [true positive (TP), true negative (TN), FP, false negative (FN)], including sensitivity, specificity, accuracy, precision, F-measure, and Matthews correlation coefficient (MCC). Additionally, two evaluation methods that are independent of threshold values, namely ROC curve and AUC value, are also employed to comprehensively assess the classification performance of the AMP prediction model (<xref ref-type="table" rid="t3">Tables 3</xref> and <xref ref-type="table" rid="t4">4</xref>) [<xref ref-type="bibr" rid="B42">42</xref>, <xref ref-type="bibr" rid="B116">116</xref>].</p>
<table-wrap id="t3">
<label>Table 3</label>
<caption>
<p id="t3-p-1">
<bold>Definition of prediction model evaluation basic parameters [<xref ref-type="bibr" rid="B42">42</xref>].</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>Acronym</bold>
</th>
<th>
<bold>Definition description</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>TP</td>
<td>These are the experimentally validated AMPs that have been correctly predicted as AMPs by the prediction method.</td>
</tr>
<tr>
<td>TN</td>
<td>These are the non-AMP sites that have been correctly predicted as non-AMPs.</td>
</tr>
<tr>
<td>FP</td>
<td>These are the non-AMPs that have been incorrectly predicted as AMPs.</td>
</tr>
<tr>
<td>FN</td>
<td>These are the experimentally validated AMPs that have been incorrectly predicted as non-AMPs.</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t3-fn-1">AMPs: antimicrobial peptides; FN: false negative; FP: false positive; TN: true negative; TP: true positive.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t4">
<label>Table 4</label>
<caption>
<p id="t4-p-1">
<bold>Definition of prediction model evaluation indicators [<xref ref-type="bibr" rid="B42">42</xref>].</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>Name</bold>
</th>
<th>
<bold>Definition description</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>Sensitivity</td>
<td>The proportion of samples that are truly positive and are correctly predicted as positive by the model.<break /><disp-formula id="eq_in_t_1"><mml:math id="m72054"><mml:mi>S</mml:mi><mml:mi>e</mml:mi><mml:mi>n</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>i</mml:mi><mml:mi>v</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mi> </mml:mi><mml:mo>=</mml:mo><mml:mi> </mml:mi><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mn>100</mml:mn></mml:math></disp-formula></td>
</tr>
<tr>
<td>Specificity</td>
<td>Among all the samples that are truly negative cases, the proportion of those that were correctly predicted as negative by the model.<break /><disp-formula id="eq_in_t_2"><mml:math id="m72935"><mml:mi>S</mml:mi><mml:mi>p</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mi>i</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>t</mml:mi><mml:mi>y</mml:mi><mml:mi> </mml:mi><mml:mo>=</mml:mo><mml:mi> </mml:mi><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mfrac><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mn>100</mml:mn></mml:math></disp-formula></td>
</tr>
<tr>
<td>Accuracy</td>
<td>The proportion of samples that were correctly predicted (whether positive or negative) in all the samples.<break /><disp-formula id="eq_in_t_3"><mml:math id="m573ab"><mml:mi>A</mml:mi><mml:mi>c</mml:mi><mml:mi>c</mml:mi><mml:mi>u</mml:mi><mml:mi>r</mml:mi><mml:mi>a</mml:mi><mml:mi>c</mml:mi><mml:mi>y</mml:mi><mml:mi> </mml:mi><mml:mo>=</mml:mo><mml:mi> </mml:mi><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mn>100</mml:mn></mml:math></disp-formula></td>
</tr>
<tr>
<td>Precision</td>
<td>The proportion of samples that are truly positive among all those predicted as positive by the model.<break /><disp-formula id="eq_in_t_4"><mml:math id="m48a8e"><mml:mi>P</mml:mi><mml:mi>r</mml:mi><mml:mi>e</mml:mi><mml:mi>c</mml:mi><mml:mi>i</mml:mi><mml:mi>s</mml:mi><mml:mi>i</mml:mi><mml:mi>o</mml:mi><mml:mi>n</mml:mi><mml:mi> </mml:mi><mml:mo>=</mml:mo><mml:mi> </mml:mi><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mfrac><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mn>100</mml:mn></mml:math></disp-formula></td>
</tr>
<tr>
<td>F-measure</td>
<td>The harmonic mean of precision and recall.<break /><disp-formula id="eq_in_t_5"><mml:math id="mcbf9d"><mml:mtext>F-measure</mml:mtext><mml:mi> </mml:mi><mml:mo>=</mml:mo><mml:mi> </mml:mi><mml:mfrac><mml:mrow><mml:mn>2</mml:mn><mml:mi>T</mml:mi><mml:mi>P</mml:mi></mml:mrow><mml:mrow><mml:mn>2</mml:mn><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfrac><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mn>100</mml:mn></mml:math></disp-formula></td>
</tr>
<tr>
<td>MCC</td>
<td>A balanced metric used to measure the performance of binary classification, with its value ranging from –1 to +1. +1 indicates perfect prediction, 0 indicates random prediction, and –1 indicates completely incorrect prediction.<break /><disp-formula id="eq_in_t_6"><mml:math id="m7d55d"><mml:mi>M</mml:mi><mml:mi>C</mml:mi><mml:mi>C</mml:mi><mml:mi> </mml:mi><mml:mo>=</mml:mo><mml:mi> </mml:mi><mml:mfrac><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mi> </mml:mi><mml:mo>-</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow><mml:mrow><mml:msqrt><mml:mfenced separators="|"><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>T</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfenced><mml:mi> </mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mi>T</mml:mi><mml:mi>P</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfenced><mml:mi> </mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>P</mml:mi></mml:mrow></mml:mfenced><mml:mi> </mml:mi><mml:mfenced separators="|"><mml:mrow><mml:mi>T</mml:mi><mml:mi>N</mml:mi><mml:mi> </mml:mi><mml:mo>+</mml:mo><mml:mi> </mml:mi><mml:mi>F</mml:mi><mml:mi>N</mml:mi></mml:mrow></mml:mfenced></mml:msqrt></mml:mrow></mml:mfrac><mml:mi> </mml:mi><mml:mo>×</mml:mo><mml:mi> </mml:mi><mml:mn>100</mml:mn></mml:math></disp-formula></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t4-fn-1">FN: false negative; FP: false positive; MCC: Matthews correlation coefficient; TN: true negative; TP: true positive.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p id="p-51">The typical AMP prediction process consists of the following steps: a. data collection, obtaining positive and negative samples from the database; b. data cleaning and redundancy removal, using tools such as cluster database at high identity with tolerance (CD-HIT) to eliminate similar sequences and reduce bias; c. feature extraction and selection, converting peptide sequences into numerical feature vectors; d. model training and optimization, evaluating model performance using cross-validation or independent test sets; e. performing performance evaluation using indicators such as sensitivity, specificity, accuracy, F1-score, MCC, and AUC [<xref ref-type="bibr" rid="B42">42</xref>].</p>
<p id="p-52">CD-HIT is a tool used for clustering biological sequences. The new version has significantly improved its speed through parallelization and multiple optimizations, and supports multi-core acceleration, making it suitable for processing large-scale sequencing data [<xref ref-type="bibr" rid="B117">117</xref>]. Its tools are integrated into the CDSnake process for processing 16S sequencing data. It clusters paired-end read lengths separately, clustering read 1 (R1) and read 2 (R2) respectively to avoid information loss caused by merging errors or poor quality, thereby retaining more effective data. It is suitable for large-scale microbiota research and performs exceptionally well, especially in high-redundancy data [<xref ref-type="bibr" rid="B118">118</xref>].</p>
<p id="p-53">Feature extraction is a crucial step in ML, referring to the process of extracting numerical information from raw data that can effectively represent the characteristics of samples and facilitate model learning. For instance, in the Ensemble-AMPPred study, in order to predict AMPs, 517 features were extracted from the sequences, including amino acid composition, pseudo-amino acid composition (PseAAC), physicochemical properties, secondary structure propensity, antimicrobial index (IC<sub>50</sub>), etc., to convert variable-length peptide sequences into fixed-length numerical vectors. To further enhance the model performance, the researchers fused four key features through logistic regression and created a new “mixed feature”. This mixed feature ranked highly in terms of feature importance and effectively improved the sensitivity and overall performance of the prediction model [<xref ref-type="bibr" rid="B119">119</xref>]. ftrCOOL, developed by Amerifar et al. [<xref ref-type="bibr" rid="B120">120</xref>], is an R-language tool designed for biological sequence feature extraction. Its functions encompass seven categories of feature extraction methods: k-mer frequency calculation, grouped amino acid composition, physicochemical index substitution, mathematical transformations such as autocorrelation, K-NN similarity based on sequence alignment, position-specific k-mer frequency, and open reading frame feature analysis. This tool covers all the functions of iLearnPlus and adds 30 new features, such as the codon adaptation index and Fickett score [<xref ref-type="bibr" rid="B120">120</xref>].</p>
<p id="p-54">To enhance the efficiency of feature extraction and reduce storage space, representation learning has become the fundamental and crucial first step in the computational research of AMPs. Its core objective is to convert the symbolic data of amino acid sequences into numerical features that can retain their biological and functional information, so that subsequent ML or DL models can process them [<xref ref-type="bibr" rid="B121">121</xref>]. Among them, pLMs, such as ESM-2 and ProtBert, encode amino acid sequences into context-dependent dense vectors through self-supervised pre-training, directly replacing traditional one-hot or Word2Vec, significantly reducing the dimensionality problem and improving prediction accuracy [<xref ref-type="bibr" rid="B122">122</xref>].</p>
<p id="p-55">Graph contrastive learning (GCL) is a graph representation learning method based on self-supervised learning. The core idea is to first generate dual views of the original features using two different Laplacian smoothing techniques. Then, positive and negative sample pairs are constructed at the structural level. Specifically, the embeddings of the same node in the two views form a positive pair, while those of different nodes form a negative pair. Through the joint optimization of the structural alignment loss and the contrastive loss, the distance between positive pairs is reduced, and the distance between negative pairs is increased [<xref ref-type="bibr" rid="B123">123</xref>]. In 2024, Yang et al. [<xref ref-type="bibr" rid="B124">124</xref>] proposed a GCL framework called Local Structure-aware GCL (LS-GCL), which consists of three key steps: Firstly, multiple related views are generated from the original graph through data augmentation techniques such as node feature masking, edge dropout, or subgraph sampling; secondly, the node or graph embedding representations are extracted using graph neural network encoders (such as GCN, GAT); finally, the contrastive loss function (such as InfoNCE loss) is used to maximize the consistency of the representation of the same node in different views or contexts, and minimize the similarity between different nodes [<xref ref-type="bibr" rid="B124">124</xref>].</p>
<p id="p-56">Dynamic Network Contrastive representation Learning (DNCL) is a contrastive learning framework specifically designed for dynamic networks. Its core idea is to learn node representations through a dual contrast mechanism involving inter-snapshot and intra-snapshot within snapshots. This framework generates multi-view network sequences using enhancement techniques, extracts spatio-temporal features through a GCN + GRU encoder, and utilizes an improved contrastive loss (introducing time decay) to maximize the consistency of positive samples and minimize the similarity of negative samples, thereby enabling the learning of robust and discriminative dynamic node representations without the need for manual annotation [<xref ref-type="bibr" rid="B125">125</xref>].</p>
</sec>
<sec id="s6">
<title>AMP prediction and generation tool</title>
<p id="p-57">As of now, many online tools and software for predicting AMPs and their functions have been put into use, which has facilitated the discovery and design of more effective antimicrobial agents (<xref ref-type="table" rid="t5">Tables 5</xref> and <xref ref-type="table" rid="t6">6</xref>). With the development of computer science and computational methods, these tools have also been continuously optimized (<xref ref-type="fig" rid="fig2">Figure 2</xref>).</p>
<table-wrap id="t5">
<label>Table 5</label>
<caption>
<p id="t5-p-1">
<bold>AMP prediction tool.</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>Predictor</bold>
</th>
<th>
<bold>Core methodology</bold>
</th>
<th>
<bold>Key features</bold>
</th>
<th>
<bold>Reference</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>PyAMPA</td>
<td>random forest</td>
<td>Multifunctional platform for prediction, validation, and optimization</td>
<td>[<xref ref-type="bibr" rid="B126">126</xref>]</td>
</tr>
<tr>
<td>AMPlify</td>
<td>BiLSTM + attention</td>
<td>Deep learning model with balanced/imbalanced training modes</td>
<td>[<xref ref-type="bibr" rid="B127">127</xref>]</td>
</tr>
<tr>
<td>TriNet</td>
<td>Triple fusion network</td>
<td>Simultaneously predicts AMPs and anti-cancer peptides (ACPs)</td>
<td>[<xref ref-type="bibr" rid="B84">84</xref>, <xref ref-type="bibr" rid="B128">128</xref>]</td>
</tr>
<tr>
<td>Deep-AmPEP30</td>
<td>CNN</td>
<td>Using pseudo-amino acids combined with CNN for AMP prediction</td>
<td>[<xref ref-type="bibr" rid="B129">129</xref>]</td>
</tr>
<tr>
<td>AI4AMP</td>
<td>LSTM</td>
<td>An AMP prediction tool based on physical-chemical coding</td>
<td>[<xref ref-type="bibr" rid="B130">130</xref>]</td>
</tr>
<tr>
<td>PTPAMP</td>
<td>SVM</td>
<td>SVM performed the best and was specifically used for identifying and classifying plant-derived AMPs</td>
<td>[<xref ref-type="bibr" rid="B87">87</xref>]</td>
</tr>
<tr>
<td>PeptideBERT</td>
<td>Protein language model</td>
<td>A language model for fine-tuning peptides, for AMP prediction</td>
<td>[<xref ref-type="bibr" rid="B107">107</xref>]</td>
</tr>
<tr>
<td>AutoPeptideML</td>
<td>Automated modeling platform</td>
<td>Supports multiple machine learning algorithms and automatically builds peptide prediction models</td>
<td>[<xref ref-type="bibr" rid="B131">131</xref>]</td>
</tr>
<tr>
<td>AmPEP</td>
<td>RF</td>
<td>Predicting AMP based on amino acid distribution patterns</td>
<td>[<xref ref-type="bibr" rid="B132">132</xref>]</td>
</tr>
<tr>
<td>HMD-AMP</td>
<td>ESM-1b; DF</td>
<td>Accurately identify the functions of AMP and its 11 types of targets, with both robustness in small sample sizes and interpretability</td>
<td>[<xref ref-type="bibr" rid="B106">106</xref>]</td>
</tr>
<tr>
<td>iAMP-CA2L</td>
<td>Multi-scale convolution and attention mechanism</td>
<td>Extract key features of the sequence, accurately identify and conduct interpretable analysis</td>
<td>[<xref ref-type="bibr" rid="B26">26</xref>]</td>
</tr>
<tr>
<td>iAMPCN</td>
<td>CNN</td>
<td>A large-scale and high-quality training and testing dataset was constructed, and it demonstrated higher accuracy in most functional activity predictions</td>
<td>[<xref ref-type="bibr" rid="B133">133</xref>]</td>
</tr>
<tr>
<td>PAMPred</td>
<td>k-means</td>
<td>Hierarchical heterogeneous integration architecture, improved particle swarm optimization (PSO) weight optimization, superior experimental performance</td>
<td>[<xref ref-type="bibr" rid="B134">134</xref>]</td>
</tr>
<tr>
<td>PepGraphormer</td>
<td>ESM2; GAT</td>
<td>An AMP prediction framework that does not require a three-dimensional structure and integrates language models with graph neural networks</td>
<td>[<xref ref-type="bibr" rid="B108">108</xref>]</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t5-fn-1">AMP: antimicrobial peptide; BiLSTM: bidirectional long short-term memory; CNN: convolutional neural networks; DF: decision forest; ESM: evolutionary scale modeling; GAT: graph attention network; PSO: particle swarm optimization; RF: random forest; SVM: support vector machine.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<table-wrap id="t6">
<label>Table 6</label>
<caption>
<p id="t6-p-1">
<bold>Examples of peptide generation tools.</bold>
</p>
</caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th>
<bold>Predictor</bold>
</th>
<th>
<bold>Core methodology</bold>
</th>
<th>
<bold>Key features</bold>
</th>
<th>
<bold>Reference</bold>
</th>
</tr>
</thead>
<tbody>
<tr>
<td>HydrAMP</td>
<td>cVAE</td>
<td>By constructing a low-dimensional continuous latent space representation of peptide sequences, analogue generation and unconstrained generation can be achieved</td>
<td>[<xref ref-type="bibr" rid="B112">112</xref>]</td>
</tr>
<tr>
<td>deepAMP</td>
<td>Protein language model</td>
<td>Data augmentation, multi-round iterative optimization, and experimental verification are highly efficient</td>
<td>[<xref ref-type="bibr" rid="B109">109</xref>]</td>
</tr>
<tr>
<td>Multi-CGAN</td>
<td>CGAN</td>
<td>Adopt a three-stage generation strategy, and introduce the multi-label embedding mechanism and the automatic integration module</td>
<td>[<xref ref-type="bibr" rid="B113">113</xref>]</td>
</tr>
<tr>
<td>LSTM_Pep</td>
<td>Long short-term memory</td>
<td>After pre-training with large-scale general peptide data, then fine-tuning with specific active peptides</td>
<td>[<xref ref-type="bibr" rid="B135">135</xref>]</td>
</tr>
<tr>
<td>BroadAMP-GPT</td>
<td>CNN + LSTM</td>
<td>AI-driven design, multi-model fusion screening, high experimental hit rate</td>
<td>[<xref ref-type="bibr" rid="B110">110</xref>]</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p id="t6-fn-1">AI: artificial intelligence; CNN: convolutional neural network; cVAE: conditional variational autoencoder.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<fig id="fig2" position="float">
<label>Figure 2</label>
<caption>
<p id="fig2-p-1">
<bold>General workflow for developing a prediction model for AMPs.</bold> This figure outlines the basic steps for building an AMP prediction model. Starting from the collection of AMP sequences from the database, a dataset containing active and inactive peptides was created and then divided into a training set (80%) and a test set (20%). Feature extraction can be achieved by BLAST, motif search, and computational approaches. After building and training the model with algorithms, the resulting model can be deployed as a standalone tool or a web server. AMPs: antimicrobial peptides; CNN: convolutional neural networks; DT: decision tree; ET: extra trees; K-NN: K-nearest neighbor; RF: random forest; SVM: support vector machine; XGBoost: eXtreme Gradient Boosting.</p>
</caption>
<graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="eds-04-1008161-g002.tif" />
</fig>
<sec id="t6-1">
<title>AMP prediction tools</title>
<p id="p-58">PyAMPA is a comprehensive AMP discovery and optimization platform developed in Python. It uses RF classifiers for predicting antimicrobial activity, cell-penetrating capacity, hemolytic activity, and toxicity. The AMPScreen module uses a sliding window method based on a propensity scale to rapidly scan entire proteomes. It consists of five core modules: AMPScreen is used for high-throughput screening of potential antimicrobial sequences; AMPValidate uses ML models to verify antimicrobial activity; AMPSolve predicts the hemolysis, toxicity, half-life, and antimicrobial spectrum of peptides; AMPMutate assesses the impact of single-point mutations on function; and AMPOptimize uses genetic algorithms to optimize the comprehensive performance of peptides. PyAMPA integrates multiple databases and prediction models, featuring efficiency, accuracy, and scalability, and provides a complete solution from initial screening to optimization for the development of antibacterial peptide drugs [<xref ref-type="bibr" rid="B126">126</xref>].</p>
<p id="p-59">iAMPCN is a DL-based computing framework designed for efficient identification of AMPs and their 22 functional activities (such as antibacterial, antifungal, antiviral, anti-cancer, anti-parasitic, etc.). This model integrates four sequence encoding methods (one-hot, BLOSUM62, AAIndex, and PAAC) and employs CNN for feature extraction and prediction. The research team constructed a large-scale and high-quality training and testing dataset, covering over 49,000 experimentally verified AMP sequences, and systematically evaluated the impact of different sequence similarity thresholds on the model’s performance. Benchmark experiments demonstrated that iAMPCN significantly outperformed existing mainstream tools on independent test sets, showing higher accuracy, AUC, and MCC values in most functional activity predictions. Additionally, this study conducted model interpretability analysis, revealing that positively charged amino acids such as lysine (K) and arginine (R) play a crucial role in AMP activity. The code of iAMPCN is open-source: <uri xlink:href="https://github.com/joy50706/iAMPCN">https://github.com/joy50706/iAMPCN</uri>, providing powerful support for the discovery and functional annotation of AMPs and promoting their application in drug development [<xref ref-type="bibr" rid="B133">133</xref>].</p>
<p id="p-60">PepGraphormer is a DL framework that integrates the pLM ESM2 with the GAT, which is designed to directly predict AMPs from amino acid sequences. First, it utilizes the pre-trained ESM2 model to extract the deep semantic features of peptide sequences, which serve as the initial prediction and the initial embedding of peptide nodes in the graph. Meanwhile, it constructs a heterogeneous graph containing peptide nodes and amino acid nodes. In this graph, the term frequency-inverse document frequency (TF-IDF) and mutual information are used to quantify the peptide-amino acid composition relationship and the global co-occurrence pattern among amino acids respectively, thus avoiding the reliance on time-consuming and error-prone three-dimensional structure prediction. Subsequently, the GAT learns the complex association features between peptides and amino acids on this graph. Finally, the model linearly fuses the global semantic understanding of ESM2 with the graph structural relationships learned by the GAT to jointly make the final classification decision, achieving high-precision and interpretable AMP identification [<xref ref-type="bibr" rid="B108">108</xref>].</p>
</sec>
<sec id="t6-2">
<title>AMP generation tools</title>
<p id="p-61">HydrAMP is an AMP generation model based on cVAE. By constructing a low-dimensional continuous latent space representation of peptide sequences, it achieves prototype peptide-based optimization for analogue generation and unconstrained generation from scratch. The model introduces Jacobian decoupling regularization, decoupling the peptide’s latent representation from its antimicrobial activity conditions, thereby enhancing the generation control. HydrAMP supports parameterized adjustment of creativity (temperature τ), flexibly balancing diversity and conservation. Its core advantage lies in being able to generate highly active AMPs starting from both active and inactive peptides, significantly enhancing the diversity and activity of peptide classes; combined with consensus screening by an auxiliary classifier and pre-assessment through MD simulation, it improves the efficiency of experimental verification. Experimental results show that the peptides generated by HydrAMP exhibit excellent antimicrobial activity, low toxicity, and broad-spectrum properties, making it a powerful tool in combating AMR [<xref ref-type="bibr" rid="B112">112</xref>].</p>
<p id="p-62">deepAMP is a deep generation framework based on peptide language models. It achieves targeted optimization from low-activity peptides to high-activity and broad-spectrum AMPs through a combination of pre-training and fine-tuning. Its core methods include pre-training a general generation model (deepAMP-general) on a large-scale peptide sequence dataset, and then using sequence degradation strategies to construct “low-activity→high-activity” peptide pairs, and fine-tuning the AMPs optimization model (deepAMP-AOM) and the penetration peptide optimization model (deepAMP-POM), respectively. This model has features such as data augmentation, multi-round iterative optimization, and efficient experimental verification. The designed peptides exhibit significantly superior activity compared to the template peptide penetration peptide, with some reaching antibiotic levels. They are also less likely to induce drug resistance and have good biocompatibility and in vivo anti-infection effects, providing an efficient and intelligent new strategy for the discovery of AMPs [<xref ref-type="bibr" rid="B109">109</xref>].</p>
<p id="p-63">Multi-CGAN is a multi-attribute AMP generation model based on conditional generative adversarial networks. It adopts a three-stage generation strategy: single-label generation, multi-label integration, and multi-label generation, enabling the learning and generation of peptide sequences with multiple attributes such as antimicrobial properties, low toxicity, and specific secondary structures from single-attribute data. Its core method introduces a multi-label embedding mechanism and an automatic integration module, allowing the generator to generate peptide chains that meet the requirements of multiple attributes under specified conditions. The model has good attribute controllability and sequence diversity and can be used for data augmentation in downstream tasks, effectively improving the performance of the prediction model and supporting targeted attribute generation by regulating input noise, providing an efficient and interpretable generation tool for aAMP design [<xref ref-type="bibr" rid="B113">113</xref>].</p>
<p id="p-64">After being pre-trained with large-scale general peptide data and then fine-tuned with specific active peptides, the research-generated model LSTM_Pep is based on the Long Short-Term Memory network. Through pre-training with large-scale general peptide data and fine-tuning with specific active peptides, it generates peptide sequences with potential therapeutic activity. Combined with the DeepPep DL model for protein-peptide interaction prediction, it enables rapid screening. The core process includes peptide generation, modeling, docking, MD simulation, and iterative optimization, significantly improving the efficiency of peptide drug discovery. The model supports the generation of multiple types of active peptides, has high diversity, low redundancy, and controllability, and is suitable for the screening of various targets such as antiviral, antimicrobial, and anti-cancer [<xref ref-type="bibr" rid="B110">110</xref>, <xref ref-type="bibr" rid="B135">135</xref>].</p>
</sec>
</sec>
<sec id="s7">
<title>Challenges in clinical translation and optimization strategies for AMPs</title>
<p id="p-65">Although AMPs have broad prospects, their clinical translation still faces some problems, such as high toxicity, low activity, easy degradation, and short retention time. For example, Na-SNK2-2a (C<sup>18</sup>W) is a cationic AMP obtained by bioinformatics mining and rational design of plant snakin peptides. Through a single point mutation in the key site (replacing the 18th cysteine with tryptophan), the toxicity and allergenicity of the peptide were reduced, while its hydrophobicity and membrane targeting selectivity were enhanced, and the ability to penetrate cells was retained [<xref ref-type="bibr" rid="B136">136</xref>].</p>
<p id="p-66">PMAP-23 of the cathelicidin family exerts a broad spectrum of antimicrobial effects through membrane destruction mechanism, but its binding ability to LPS is limited, and it may face the challenge of balancing stability and toxicity in vivo. To this end, the researchers replaced the C-terminal or N-terminal of PMAP-23 with the LPS binding sequence GWKRKRFG (G8) with a β-boomerang structure, and used the PQKP itself as the linker region to construct a series of heterozygous peptides, which not only significantly enhanced the binding ability of PMAP-23 to LPS, but also significantly enhanced the binding ability of PMAP-23 to LPS. It further enhanced the antimicrobial activity and cell selectivity, and showed good salt ion and serum stability [<xref ref-type="bibr" rid="B137">137</xref>, <xref ref-type="bibr" rid="B138">138</xref>].</p>
<p id="p-67">Human host defense peptide LL-37 has broad-spectrum antibacterial and immunomodulatory functions, but its linear structure leads to prominent proteolytic degradability and poor stability, which limits its direct clinical application [<xref ref-type="bibr" rid="B139">139</xref>]. In order to break through this bottleneck, the minimal antibacterial core fragment KR-12 was used as a template, and the two KR-12 units were covalently linked to form a circular dimer through the main chain cyclizing and dimerization strategy. The best derivative, such as cd4(Q5K, D9K), had significantly increased antibacterial activity and could stably exist in serum for several hours. Simultaneous enhancement of activity and stability was achieved [<xref ref-type="bibr" rid="B140">140</xref>].</p>
<p id="p-68">Despite significant progress in the application of AI to the discovery and design of AMPs, current research still faces many challenges that hinder experimental validation and clinical translation. The first issue concerns data quality and consistency. Although tens of thousands of AMP sequences are available in public databases, there is a severe scarcity of data on specific pathogens, toxicity profiles, structural stability, and ADMET properties, which are severely lacking, leading to biased model training and insufficient generalization capability [<xref ref-type="bibr" rid="B114">114</xref>].</p>
<p id="p-69">Another critical barrier is the lack of model interpretability. Most mainstream DL approaches function as “black-box” models. While they often achieve high classification accuracy, they struggle to reveal the key residues or structural motifs in the peptide sequence that determine antimicrobial activity. This opacity restricts the practical utility of AI in guiding rational peptide design [<xref ref-type="bibr" rid="B115">115</xref>]. Additionally, the issue of insufficient utilization of structural information needs to be addressed. Although GNN and pLMs have been preliminarily applied to structure modeling, their performance in representing short peptides, cyclic peptides, and peptides with post-translational modifications remains limited and requires further improvement [<xref ref-type="bibr" rid="B89">89</xref>].</p>
<p id="p-70">Despite the great theoretical potential of AMPs, the clinical translation of AMPs is difficult. A major challenge is that clinical validation of efficacy is difficult, with many drug candidates, such as pexiganan and iseganan, performing poorly in phase III trials. In addition, some drugs may not be able to replicate the efficacy results in the earlier phase of the trial. This indicates that AMP not only needs to overcome its own stability and toxicity problems, but also must pass rigorous clinical trials to prove its clear and safe therapeutic advantages in the complex human environment [<xref ref-type="bibr" rid="B141">141</xref>].</p>
<p id="p-71">To address these challenges, several strategic improvements can be pursued. First, establishing a standardized, multimodal, and high-quality AMP database is essential. This can be initiated by standardizing experimental conditions and evaluation criteria, systematically collecting multi-dimensional data—including sequences, structures, functions, toxicity, MIC values, ADMET properties, etc.—and clearly defining negative samples to facilitate data sharing and open access [<xref ref-type="bibr" rid="B47">47</xref>].</p>
<p id="p-72">Second, developing highly interpretable and flexible AI models is crucial. Integrating explainability tools such as SHAP, LIME, and attention visualization, along with combining biological physical features with sequence embeddings, can help transition models from mere statistical fitting to biologically meaningful prediction, thereby enhancing their translational value [<xref ref-type="bibr" rid="B115">115</xref>].</p>
<p id="p-73">We can also promote multimodal fusion and structural perception modeling, strengthening the deep integration of multiple information sources such as sequences, structures, and functions, developing GNN and multimodal Transformer architectures suitable for short, cyclic, and modified peptides, and enhancing the model’s ability to understand and predict complex structural characteristics. Alternatively, we can utilize advanced generative technologies such as cVAE, cGAN, and diffusion models to achieve simultaneous optimization of multiple objectives, including antimicrobial activity, low toxicity, and target specificity, improving the synthesizability and functionality of generated peptides, and developing conditional generation models and multi-objective optimization strategies [<xref ref-type="bibr" rid="B6">6</xref>].</p>
<p id="p-74">AI-driven AMP discovery is currently at a critical stage of transitioning from “tool-assisted” to “intelligent design”. Systematic breakthroughs in data quality, model interpretability, structural modeling, and experimental validation are essential to fully realize the potential of AI in antibacterial peptide drug development and to provide sustainable solutions to the global AMR crisis.</p>
</sec>
<sec id="s8">
<title>Conclusions</title>
<p id="p-75">The escalating global crisis of AMR underscores the urgent need for innovative therapeutic alternatives. AMPs represent a promising class of agents with broad-spectrum activity, low resistance development, and diverse mechanisms of action. However, the vast and largely unexplored peptide sequence space makes traditional experimental screening methods impractical.</p>
<p id="p-76">The integration of AI and big data models has revolutionized AMP discovery and characterization. ML and DL approaches—including discriminative models like RFs and LSTMs, and generative models such as variational autoencoders—have demonstrated remarkable efficacy in predicting, classifying, and even designing novel AMPs with desired properties. These models leverage large-scale AMP databases (e.g., dbAMP, DRAMP, DBAASP, LAMP2, CAMP<sub>R3</sub>) and diverse feature sets to achieve high predictive accuracy, sensitivity, and specificity.</p>
<p id="p-77">Despite these advances, several challenges remain, including data scarcity, class imbalance, and inconsistent experimental annotations. The present study highlights the effectiveness of even simple models like linear kernel SVM in data-scarce settings, achieving high performance metrics and enabling the experimental validation of AI-predicted peptides.</p>
</sec>
</body>
<back>
<glossary>
<title>Abbreviations</title>
<def-list>
<def-item>
<term>ACP</term>
<def>
<p>anti-cancer peptide</p>
</def>
</def-item>
<def-item>
<term>AI</term>
<def>
<p>artificial intelligence</p>
</def>
</def-item>
<def-item>
<term>AMP</term>
<def>
<p>antimicrobial peptide</p>
</def>
</def-item>
<def-item>
<term>AMR</term>
<def>
<p>antimicrobial resistance</p>
</def>
</def-item>
<def-item>
<term>AVPs</term>
<def>
<p>antiviral peptides</p>
</def>
</def-item>
<def-item>
<term>BGCs</term>
<def>
<p>biosynthetic gene clusters</p>
</def>
</def-item>
<def-item>
<term>CD-HIT</term>
<def>
<p>cluster database at high identity with tolerance</p>
</def>
</def-item>
<def-item>
<term>CNN</term>
<def>
<p>convolutional neural networks</p>
</def>
</def-item>
<def-item>
<term>cVAE</term>
<def>
<p>conditional variational autoencoder</p>
</def>
</def-item>
<def-item>
<term>DBAASP</term>
<def>
<p>Database of Antimicrobial Activity and Structure of Peptides</p>
</def>
</def-item>
<def-item>
<term>DL</term>
<def>
<p>deep learning</p>
</def>
</def-item>
<def-item>
<term>DNN</term>
<def>
<p>deep neural networks</p>
</def>
</def-item>
<def-item>
<term>DT</term>
<def>
<p>decision tree</p>
</def>
</def-item>
<def-item>
<term>ESM</term>
<def>
<p>evolutionary scale modeling</p>
</def>
</def-item>
<def-item>
<term>FP</term>
<def>
<p>false positive</p>
</def>
</def-item>
<def-item>
<term>GAT</term>
<def>
<p>graph attention network</p>
</def>
</def-item>
<def-item>
<term>GCL</term>
<def>
<p>graph contrastive learning</p>
</def>
</def-item>
<def-item>
<term>GNN</term>
<def>
<p>graph neural networks</p>
</def>
</def-item>
<def-item>
<term>K-NN</term>
<def>
<p>K-nearest neighbor</p>
</def>
</def-item>
<def-item>
<term>LLMs</term>
<def>
<p>large language models</p>
</def>
</def-item>
<def-item>
<term>LPS</term>
<def>
<p>lipopolysaccharide</p>
</def>
</def-item>
<def-item>
<term>MCC</term>
<def>
<p>Matthews correlation coefficient</p>
</def>
</def-item>
<def-item>
<term>MD</term>
<def>
<p>molecular dynamics</p>
</def>
</def-item>
<def-item>
<term>MDR</term>
<def>
<p>multidrug-resistant</p>
</def>
</def-item>
<def-item>
<term>MIC</term>
<def>
<p>minimum inhibitory concentration</p>
</def>
</def-item>
<def-item>
<term>ML</term>
<def>
<p>machine learning</p>
</def>
</def-item>
<def-item>
<term>MRSA</term>
<def>
<p>methicillin-resistant <italic>Staphylococcus aureus</italic></p>
</def>
</def-item>
<def-item>
<term>NLP</term>
<def>
<p>natural language processing</p>
</def>
</def-item>
<def-item>
<term>pLM</term>
<def>
<p>protein language model</p>
</def>
</def-item>
<def-item>
<term>R&amp;D</term>
<def>
<p>research and development</p>
</def>
</def-item>
<def-item>
<term>RNN</term>
<def>
<p>recurrent neural networks</p>
</def>
</def-item>
<def-item>
<term>SVM</term>
<def>
<p>support vector machine</p>
</def>
</def-item>
</def-list>
</glossary>
<sec id="s9">
<title>Declarations</title>
<sec id="t-9-1">
<title>Author contributions</title>
<p>YZ: Conceptualization, Investigation, Writing—original draft, Writing—review &amp; editing. ZW: Investigation, Writing—original draft, Writing—review &amp; editing. HZ: Validation, Writing—review &amp; editing, Supervision, Funding acquisition. All authors read and approved the submitted version.</p>
</sec>
<sec id="t-9-2" sec-type="COI-statement">
<title>Conflicts of interest</title>
<p>The authors declare that they have no conflicts of interest.</p>
</sec>
<sec id="t-9-3">
<title>Ethical approval</title>
<p>Not applicable.</p>
</sec>
<sec id="t-9-4">
<title>Consent to participate</title>
<p>Not applicable.</p>
</sec>
<sec id="t-9-5">
<title>Consent to publication</title>
<p>Not applicable.</p>
</sec>
<sec id="t-9-6" sec-type="data-availability">
<title>Availability of data and materials</title>
<p>Not applicable.</p>
</sec>
<sec id="t-9-7">
<title>Funding</title>
<p>National Natural Science Foundation of China [82073767]; Priority Academic Program Development of Jiangsu Higher Education Institutions [PAPD 2023007]; 2026 College Students’ Innovation and Entrepreneurship Training Program [2026303]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.</p>
</sec>
<sec id="t-9-8">
<title>Copyright</title>
<p>© The Author(s) 2026.</p>
</sec>
</sec>
<sec id="s10">
<title>Publisher’s note</title>
<p>Open Exploration maintains a neutral stance on jurisdictional claims in published institutional affiliations and maps. All opinions expressed in this article are the personal views of the author(s) and do not represent the stance of the editorial team or the publisher.</p>
</sec>
<ref-list>
<ref id="B1">
<label>1</label>
<element-citation publication-type="web">
<article-title>2021 Antibacterial agents in clinical and preclinical development: an overview and analysis [Internet]</article-title>
<comment>WHO; c2026 [cited 2025 July 21]. Available from: <uri xlink:href="https://www.who.int/publications/i/item/9789240047655">https://www.who.int/publications/i/item/9789240047655</uri></comment>
</element-citation>
</ref>
<ref id="B2">
<label>2</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cook</surname>
<given-names>MA</given-names>
</name>
<name>
<surname>Wright</surname>
<given-names>GD</given-names>
</name>
</person-group>
<article-title>The past, present, and future of antibiotics</article-title>
<source>Sci Transl Med</source>
<year iso-8601-date="2022">2022</year>
<volume>14</volume>
<elocation-id>eabo7793</elocation-id>
<pub-id pub-id-type="doi">10.1126/scitranslmed.abo7793</pub-id>
<pub-id pub-id-type="pmid">35947678</pub-id>
</element-citation>
</ref>
<ref id="B3">
<label>3</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sinha</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Upadhyay</surname>
<given-names>LSB</given-names>
</name>
</person-group>
<article-title>Understanding antimicrobial resistance (AMR) mechanisms and advancements in AMR diagnostics</article-title>
<source>Diagn Microbiol Infect Dis</source>
<year iso-8601-date="2025">2025</year>
<volume>113</volume>
<elocation-id>116949</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.diagmicrobio.2025.116949</pub-id>
<pub-id pub-id-type="pmid">40544537</pub-id>
</element-citation>
</ref>
<ref id="B4">
<label>4</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Moretta</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Scieuzo</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Petrone</surname>
<given-names>AM</given-names>
</name>
<name>
<surname>Salvia</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Manniello</surname>
<given-names>MD</given-names>
</name>
<name>
<surname>Franco</surname>
<given-names>A</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Antimicrobial Peptides: A New Hope in Biomedical and Pharmaceutical Fields</article-title>
<source>Front Cell Infect Microbiol</source>
<year iso-8601-date="2021">2021</year>
<volume>11</volume>
<elocation-id>668632</elocation-id>
<pub-id pub-id-type="doi">10.3389/fcimb.2021.668632</pub-id>
<pub-id pub-id-type="pmid">34195099</pub-id>
<pub-id pub-id-type="pmcid">PMC8238046</pub-id>
</element-citation>
</ref>
<ref id="B5">
<label>5</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Brizuela</surname>
<given-names>CA</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Stokes</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>de la Fuente-Nunez</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>AI Methods for Antimicrobial Peptides: Progress and Challenges</article-title>
<source>Microb Biotechnol</source>
<year iso-8601-date="2025">2025</year>
<volume>18</volume>
<elocation-id>e70072</elocation-id>
<pub-id pub-id-type="doi">10.1111/1751-7915.70072</pub-id>
<pub-id pub-id-type="pmid">39754551</pub-id>
<pub-id pub-id-type="pmcid">PMC11702388</pub-id>
</element-citation>
</ref>
<ref id="B6">
<label>6</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Huan</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Kong</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Mou</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Yi</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Antimicrobial Peptides: Classification, Design, Application and Research Progress in Multiple Fields</article-title>
<source>Front Microbiol</source>
<year iso-8601-date="2020">2020</year>
<volume>11</volume>
<elocation-id>582779</elocation-id>
<pub-id pub-id-type="doi">10.3389/fmicb.2020.582779</pub-id>
<pub-id pub-id-type="pmid">33178164</pub-id>
<pub-id pub-id-type="pmcid">PMC7596191</pub-id>
</element-citation>
</ref>
<ref id="B7">
<label>7</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Torres</surname>
<given-names>MT</given-names>
</name>
<name>
<surname>de la Fuente-Nunez</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Toward computer-made artificial antibiotics</article-title>
<source>Curr Opin Microbiol</source>
<year iso-8601-date="2019">2019</year>
<volume>51</volume>
<fpage>30</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1016/j.mib.2019.03.004</pub-id>
<pub-id pub-id-type="pmid">31082661</pub-id>
</element-citation>
</ref>
<ref id="B8">
<label>8</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pirtskhalava</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Amstrong</surname>
<given-names>AA</given-names>
</name>
<name>
<surname>Grigolava</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Chubinidze</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Alimbarashvili</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Vishnepolsky</surname>
<given-names>B</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DBAASP v3: database of antimicrobial/cytotoxic activity and structure of peptides as a resource for development of new therapeutics</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2021">2021</year>
<volume>49</volume>
<fpage>D288</fpage>
<lpage>97</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkaa991</pub-id>
<pub-id pub-id-type="pmid">33151284</pub-id>
<pub-id pub-id-type="pmcid">PMC7778994</pub-id>
</element-citation>
</ref>
<ref id="B9">
<label>9</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>G</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Artificial intelligence in drug development</article-title>
<source>Nat Med</source>
<year iso-8601-date="2025">2025</year>
<volume>31</volume>
<fpage>45</fpage>
<lpage>59</lpage>
<pub-id pub-id-type="doi">10.1038/s41591-024-03434-4</pub-id>
<pub-id pub-id-type="pmid">39833407</pub-id>
</element-citation>
</ref>
<ref id="B10">
<label>10</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>M</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>AI-Guided Design of Antimicrobial Peptide Hydrogels for Precise Treatment of Drug-resistant Bacterial Infections</article-title>
<source>Adv Mater</source>
<year iso-8601-date="2025">2025</year>
<volume>37</volume>
<elocation-id>2500043</elocation-id>
<pub-id pub-id-type="doi">10.1002/adma.202500043</pub-id>
<pub-id pub-id-type="pmid">40159831</pub-id>
</element-citation>
</ref>
<ref id="B11">
<label>11</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mishra</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Basu</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Shehadeh</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Felix</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Kollala</surname>
<given-names>SS</given-names>
</name>
<name>
<surname>Chhonker</surname>
<given-names>YS</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Antimicrobial peptide developed with machine learning sequence optimization targets drug resistant <italic>Staphylococcus aureus</italic> in mice</article-title>
<source>J Clin Invest</source>
<year iso-8601-date="2025">2025</year>
<volume>135</volume>
<elocation-id>e185430</elocation-id>
<pub-id pub-id-type="doi">10.1172/JCI185430</pub-id>
<pub-id pub-id-type="pmid">40261713</pub-id>
<pub-id pub-id-type="pmcid">PMC12165799</pub-id>
</element-citation>
</ref>
<ref id="B12">
<label>12</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Darby</surname>
<given-names>EM</given-names>
</name>
<name>
<surname>Trampari</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Siasat</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Gaya</surname>
<given-names>MS</given-names>
</name>
<name>
<surname>Alav</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Webber</surname>
<given-names>MA</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Molecular mechanisms of antibiotic resistance revisited</article-title>
<source>Nat Rev Microbiol</source>
<year iso-8601-date="2023">2023</year>
<volume>21</volume>
<fpage>280</fpage>
<lpage>95</lpage>
<pub-id pub-id-type="doi">10.1038/s41579-022-00820-y</pub-id>
<pub-id pub-id-type="pmid">36411397</pub-id>
</element-citation>
</ref>
<ref id="B13">
<label>13</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Catalano</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Iacopetta</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Ceramella</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Scumaci</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Giuzio</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Saturnino</surname>
<given-names>C</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Multidrug Resistance (MDR): A Widespread Phenomenon in Pharmacological Therapies</article-title>
<source>Molecules</source>
<year iso-8601-date="2022">2022</year>
<volume>27</volume>
<elocation-id>616</elocation-id>
<pub-id pub-id-type="doi">10.3390/molecules27030616</pub-id>
<pub-id pub-id-type="pmid">35163878</pub-id>
<pub-id pub-id-type="pmcid">PMC8839222</pub-id>
</element-citation>
</ref>
<ref id="B14">
<label>14</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Kizhakkedathu</surname>
<given-names>JN</given-names>
</name>
<name>
<surname>Straus</surname>
<given-names>SK</given-names>
</name>
</person-group>
<article-title>Antimicrobial Peptides: Diversity, Mechanism of Action and Strategies to Improve the Activity and Biocompatibility In Vivo</article-title>
<source>Biomolecules</source>
<year iso-8601-date="2018">2018</year>
<volume>8</volume>
<elocation-id>4</elocation-id>
<pub-id pub-id-type="doi">10.3390/biom8010004</pub-id>
<pub-id pub-id-type="pmid">29351202</pub-id>
<pub-id pub-id-type="pmcid">PMC5871973</pub-id>
</element-citation>
</ref>
<ref id="B15">
<label>15</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>The Role and Mechanisms of Antimicrobial Peptides in Overcoming Multidrug-Resistant Bacteria</article-title>
<source>Molecules</source>
<year iso-8601-date="2025">2025</year>
<volume>30</volume>
<elocation-id>128</elocation-id>
<pub-id pub-id-type="doi">10.3390/molecules30010128</pub-id>
<pub-id pub-id-type="pmid">39795190</pub-id>
<pub-id pub-id-type="pmcid">PMC11721820</pub-id>
</element-citation>
</ref>
<ref id="B16">
<label>16</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>K</surname>
<given-names>R G</given-names>
</name>
<name>
<surname>Balenahalli</surname>
<given-names>Narasingappa R</given-names>
</name>
<name>
<surname>Vishnu</surname>
<given-names>Vyas G</given-names>
</name>
</person-group>
<article-title>Unveiling mechanisms of antimicrobial peptide: Actions beyond the membranes disruption</article-title>
<source>Heliyon</source>
<year iso-8601-date="2024">2024</year>
<volume>10</volume>
<elocation-id>e38079</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.heliyon.2024.e38079</pub-id>
<pub-id pub-id-type="pmid">39386776</pub-id>
<pub-id pub-id-type="pmcid">PMC11462253</pub-id>
</element-citation>
</ref>
<ref id="B17">
<label>17</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marquette</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Bechinger</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>Biophysical Investigations Elucidating the Mechanisms of Action of Antimicrobial Peptides and Their Synergism</article-title>
<source>Biomolecules</source>
<year iso-8601-date="2018">2018</year>
<volume>8</volume>
<elocation-id>18</elocation-id>
<pub-id pub-id-type="doi">10.3390/biom8020018</pub-id>
<pub-id pub-id-type="pmid">29670065</pub-id>
<pub-id pub-id-type="pmcid">PMC6023007</pub-id>
</element-citation>
</ref>
<ref id="B18">
<label>18</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Malanovic</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Lohner</surname>
<given-names>K</given-names>
</name>
</person-group>
<article-title>Antimicrobial Peptides Targeting Gram-Positive Bacteria</article-title>
<source>Pharmaceuticals (Basel)</source>
<year iso-8601-date="2016">2016</year>
<volume>9</volume>
<elocation-id>59</elocation-id>
<pub-id pub-id-type="doi">10.3390/ph9030059</pub-id>
<pub-id pub-id-type="pmid">27657092</pub-id>
<pub-id pub-id-type="pmcid">PMC5039512</pub-id>
</element-citation>
</ref>
<ref id="B19">
<label>19</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Panina</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Krylov</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Nolde</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Efremov</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Chugunov</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>Environmental and dynamic effects explain how nisin captures membrane-bound lipid II</article-title>
<source>Sci Rep</source>
<year iso-8601-date="2020">2020</year>
<volume>10</volume>
<elocation-id>8821</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41598-020-65522-y</pub-id>
<pub-id pub-id-type="pmid">32483218</pub-id>
<pub-id pub-id-type="pmcid">PMC7264305</pub-id>
</element-citation>
</ref>
<ref id="B20">
<label>20</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Oren</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Shai</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>Mode of action of linear amphipathic alpha-helical antimicrobial peptides</article-title>
<source>Biopolymers</source>
<year iso-8601-date="1998">1998</year>
<volume>47</volume>
<fpage>451</fpage>
<lpage>63</lpage>
<pub-id pub-id-type="doi">10.1002/(SICI)1097-0282(1998)47:6&lt;451::AID-BIP4&gt;3.0.CO;2-F</pub-id>
<pub-id pub-id-type="pmid">10333737</pub-id>
</element-citation>
</ref>
<ref id="B21">
<label>21</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Matsuzaki</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Sugishita</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Ishibe</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Ueha</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Nakata</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Miyajima</surname>
<given-names>K</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Relationship of membrane curvature to the formation of pores by magainin 2</article-title>
<source>Biochemistry</source>
<year iso-8601-date="1998">1998</year>
<volume>37</volume>
<fpage>11856</fpage>
<lpage>63</lpage>
<pub-id pub-id-type="doi">10.1021/bi980539y</pub-id>
<pub-id pub-id-type="pmid">9718308</pub-id>
</element-citation>
</ref>
<ref id="B22">
<label>22</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yoneyama</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Imura</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Ohno</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Zendo</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Nakayama</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Matsuzaki</surname>
<given-names>K</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Peptide-lipid huge toroidal pore, a new antimicrobial mechanism mediated by a lactococcal bacteriocin, lacticin Q</article-title>
<source>Antimicrob Agents Chemother</source>
<year iso-8601-date="2009">2009</year>
<volume>53</volume>
<fpage>3211</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="doi">10.1128/AAC.00209-09</pub-id>
<pub-id pub-id-type="pmid">19470516</pub-id>
<pub-id pub-id-type="pmcid">PMC2715615</pub-id>
</element-citation>
</ref>
<ref id="B23">
<label>23</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yasir</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Willcox</surname>
<given-names>MDP</given-names>
</name>
<name>
<surname>Dutta</surname>
<given-names>D</given-names>
</name>
</person-group>
<article-title>Action of Antimicrobial Peptides against Bacterial Biofilms</article-title>
<source>Materials (Basel)</source>
<year iso-8601-date="2018">2018</year>
<volume>11</volume>
<elocation-id>2468</elocation-id>
<pub-id pub-id-type="doi">10.3390/ma11122468</pub-id>
<pub-id pub-id-type="pmid">30563067</pub-id>
<pub-id pub-id-type="pmcid">PMC6317029</pub-id>
</element-citation>
</ref>
<ref id="B24">
<label>24</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Martinez</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Gonçalves</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Felício</surname>
<given-names>MR</given-names>
</name>
<name>
<surname>Maturana</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Santos</surname>
<given-names>NC</given-names>
</name>
<name>
<surname>Semorile</surname>
<given-names>L</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Synergistic and antibiofilm activity of the antimicrobial peptide P5 against carbapenem-resistant <italic>Pseudomonas aeruginosa</italic></article-title>
<source>Biochim Biophys Acta Biomembr</source>
<year iso-8601-date="2019">2019</year>
<volume>1861</volume>
<fpage>1329</fpage>
<lpage>37</lpage>
<pub-id pub-id-type="doi">10.1016/j.bbamem.2019.05.008</pub-id>
<pub-id pub-id-type="pmid">31095945</pub-id>
</element-citation>
</ref>
<ref id="B25">
<label>25</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nicolas</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>Multifunctional host defense peptides: intracellular-targeting antimicrobial peptides</article-title>
<source>FEBS J</source>
<year iso-8601-date="2009">2009</year>
<volume>276</volume>
<fpage>6483</fpage>
<lpage>96</lpage>
<pub-id pub-id-type="doi">10.1111/j.1742-4658.2009.07359.x</pub-id>
<pub-id pub-id-type="pmid">19817856</pub-id>
</element-citation>
</ref>
<ref id="B26">
<label>26</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ruffolo</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Madani</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>Designing proteins with language models</article-title>
<source>Nat Biotechnol</source>
<year iso-8601-date="2024">2024</year>
<volume>42</volume>
<fpage>200</fpage>
<lpage>2</lpage>
<pub-id pub-id-type="doi">10.1038/s41587-024-02123-4</pub-id>
<pub-id pub-id-type="pmid">38361067</pub-id>
</element-citation>
</ref>
<ref id="B27">
<label>27</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Benfield</surname>
<given-names>AH</given-names>
</name>
<name>
<surname>Henriques</surname>
<given-names>ST</given-names>
</name>
</person-group>
<article-title>Mode-of-Action of Antimicrobial Peptides: Membrane Disruption vs. Intracellular Mechanisms</article-title>
<source>Front Med Technol</source>
<year iso-8601-date="2020">2020</year>
<volume>2</volume>
<elocation-id>610997</elocation-id>
<pub-id pub-id-type="doi">10.3389/fmedt.2020.610997</pub-id>
<pub-id pub-id-type="pmid">35047892</pub-id>
<pub-id pub-id-type="pmcid">PMC8757789</pub-id>
</element-citation>
</ref>
<ref id="B28">
<label>28</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Escherichia coli PagP Enzyme-Based De Novo Design and In Vitro Activity of Antibacterial Peptide LL-37</article-title>
<source>Med Sci Monit</source>
<year iso-8601-date="2017">2017</year>
<volume>23</volume>
<fpage>2558</fpage>
<lpage>64</lpage>
<pub-id pub-id-type="doi">10.12659/msm.902095</pub-id>
<pub-id pub-id-type="pmid">28550277</pub-id>
<pub-id pub-id-type="pmcid">PMC5458668</pub-id>
</element-citation>
</ref>
<ref id="B29">
<label>29</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhattacharjya</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Ramamoorthy</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>LL-37: Structures, Antimicrobial Activity, and Influence on Amyloid-Related Diseases</article-title>
<source>Biomolecules</source>
<year iso-8601-date="2024">2024</year>
<volume>14</volume>
<elocation-id>320</elocation-id>
<pub-id pub-id-type="doi">10.3390/biom14030320</pub-id>
<pub-id pub-id-type="pmid">38540740</pub-id>
<pub-id pub-id-type="pmcid">PMC10968335</pub-id>
</element-citation>
</ref>
<ref id="B30">
<label>30</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schneider</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Kruse</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Wimmer</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Wiedemann</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Sass</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Pag</surname>
<given-names>U</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Plectasin, a fungal defensin, targets the bacterial cell wall precursor Lipid II</article-title>
<source>Science</source>
<year iso-8601-date="2010">2010</year>
<volume>328</volume>
<fpage>1168</fpage>
<lpage>72</lpage>
<pub-id pub-id-type="doi">10.1126/science.1185723</pub-id>
<pub-id pub-id-type="pmid">20508130</pub-id>
</element-citation>
</ref>
<ref id="B31">
<label>31</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Casteels</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Ampe</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Jacobs</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Vaeck</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Tempst</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>Apidaecins: antibacterial peptides from honeybees</article-title>
<source>EMBO J</source>
<year iso-8601-date="1989">1989</year>
<volume>8</volume>
<fpage>2387</fpage>
<lpage>91</lpage>
<pub-id pub-id-type="doi">10.1002/j.1460-2075.1989.tb08368.x</pub-id>
<pub-id pub-id-type="pmid">2676519</pub-id>
<pub-id pub-id-type="pmcid">PMC401180</pub-id>
</element-citation>
</ref>
<ref id="B32">
<label>32</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Skowron</surname>
<given-names>KJ</given-names>
</name>
<name>
<surname>Baliga</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Johnson</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Kremiller</surname>
<given-names>KM</given-names>
</name>
<name>
<surname>Castroverde</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Dean</surname>
<given-names>TT</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Structure-Activity Relationships of the Antimicrobial Peptide Natural Product Apidaecin</article-title>
<source>J Med Chem</source>
<year iso-8601-date="2023">2023</year>
<volume>66(17)</volume>
<fpage>11831</fpage>
<lpage>42</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jmedchem.3c00406</pub-id>
<pub-id pub-id-type="pmid">37603874</pub-id>
<pub-id pub-id-type="pmcid">PMC10768847</pub-id>
</element-citation>
</ref>
<ref id="B33">
<label>33</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xuan</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Bo</surname>
<given-names>L</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Antimicrobial peptides for combating drug-resistant bacterial infections</article-title>
<source>Drug Resist Updat</source>
<year iso-8601-date="2023">2023</year>
<volume>68</volume>
<elocation-id>100954</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.drup.2023.100954</pub-id>
<pub-id pub-id-type="pmid">36905712</pub-id>
</element-citation>
</ref>
<ref id="B34">
<label>34</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Qi</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Shan</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>H</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Pretreatment with cathelicidin-BF ameliorates <italic>Pseudomonas aeruginosa</italic> pneumonia in mice by enhancing NETosis and the autophagy of recruited neutrophils and macrophages</article-title>
<source>Int Immunopharmacol</source>
<year iso-8601-date="2018">2018</year>
<volume>65</volume>
<fpage>382</fpage>
<lpage>91</lpage>
<pub-id pub-id-type="doi">10.1016/j.intimp.2018.10.030</pub-id>
<pub-id pub-id-type="pmid">30380513</pub-id>
</element-citation>
</ref>
<ref id="B35">
<label>35</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marks</surname>
<given-names>LR</given-names>
</name>
<name>
<surname>Clementi</surname>
<given-names>EA</given-names>
</name>
<name>
<surname>Hakansson</surname>
<given-names>AP</given-names>
</name>
</person-group>
<article-title>The human milk protein-lipid complex HAMLET sensitizes bacterial pathogens to traditional antimicrobial agents</article-title>
<source>PLoS One</source>
<year iso-8601-date="2012">2012</year>
<volume>7</volume>
<elocation-id>e43514</elocation-id>
<pub-id pub-id-type="doi">10.1371/journal.pone.0043514</pub-id>
<pub-id pub-id-type="pmid">22905269</pub-id>
<pub-id pub-id-type="pmcid">PMC3419703</pub-id>
</element-citation>
</ref>
<ref id="B36">
<label>36</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Matsumoto</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Yamazaki</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Kawakami</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Miyoshi</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Ooi</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Hashimoto</surname>
<given-names>S</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>
<italic>In vivo</italic> target exploration of apidaecin based on Acquired Resistance induced by Gene Overexpression (ARGO assay)</article-title>
<source>Sci Rep</source>
<year iso-8601-date="2017">2017</year>
<volume>7</volume>
<elocation-id>12136</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41598-017-12039-6</pub-id>
<pub-id pub-id-type="pmid">28939819</pub-id>
<pub-id pub-id-type="pmcid">PMC5610309</pub-id>
</element-citation>
</ref>
<ref id="B37">
<label>37</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ghosh</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Kar</surname>
<given-names>RK</given-names>
</name>
<name>
<surname>Jana</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Saha</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Jana</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Krishnamoorthy</surname>
<given-names>J</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Indolicidin targets duplex DNA: structural and mechanistic insight through a combination of spectroscopy and microscopy</article-title>
<source>ChemMedChem</source>
<year iso-8601-date="2014">2014</year>
<volume>9</volume>
<fpage>2052</fpage>
<lpage>8</lpage>
<pub-id pub-id-type="doi">10.1002/cmdc.201402215</pub-id>
<pub-id pub-id-type="pmid">25044630</pub-id>
</element-citation>
</ref>
<ref id="B38">
<label>38</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lai</surname>
<given-names>PK</given-names>
</name>
<name>
<surname>Kaznessis</surname>
<given-names>YN</given-names>
</name>
</person-group>
<article-title>Insights into Membrane Translocation of Protegrin Antimicrobial Peptides by Multistep Molecular Dynamics Simulations</article-title>
<source>ACS Omega</source>
<year iso-8601-date="2018">2018</year>
<volume>3</volume>
<fpage>6056</fpage>
<lpage>65</lpage>
<pub-id pub-id-type="doi">10.1021/acsomega.8b00483</pub-id>
<pub-id pub-id-type="pmid">29978143</pub-id>
<pub-id pub-id-type="pmcid">PMC6026836</pub-id>
</element-citation>
</ref>
<ref id="B39">
<label>39</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tian</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Shao</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>W</given-names>
</name>
</person-group>
<article-title>From AI-Driven Sequence Generation to Molecular Simulation: A Comprehensive Framework for Antimicrobial Peptide Discovery</article-title>
<source>J Chem Inf Model</source>
<year iso-8601-date="2025">2025</year>
<volume>65</volume>
<fpage>9566</fpage>
<lpage>75</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jcim.5c00892</pub-id>
<pub-id pub-id-type="pmid">40879040</pub-id>
</element-citation>
</ref>
<ref id="B40">
<label>40</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>S</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DTBind: A Mechanism-Driven Deep Learning Framework for Accurate Prediction of Drug-Target Molecular Recognition</article-title>
<source>Research (Wash D C)</source>
<year iso-8601-date="2025">2025</year>
<volume>8</volume>
<elocation-id>1022</elocation-id>
<pub-id pub-id-type="doi">10.34133/research.1022</pub-id>
<pub-id pub-id-type="pmid">41341719</pub-id>
<pub-id pub-id-type="pmcid">PMC12669800</pub-id>
</element-citation>
</ref>
<ref id="B41">
<label>41</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Waterhouse</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Bertoni</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Bienert</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Studer</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Tauriello</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Gumienny</surname>
<given-names>R</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>SWISS-MODEL: homology modelling of protein structures and complexes</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2018">2018</year>
<volume>46</volume>
<fpage>W296</fpage>
<lpage>303</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gky427</pub-id>
<pub-id pub-id-type="pmid">29788355</pub-id>
<pub-id pub-id-type="pmcid">PMC6030848</pub-id>
</element-citation>
</ref>
<ref id="B42">
<label>42</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ramazi</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Mohammadi</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Allahverdi</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Khalili</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Abdolmaleki</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>A review on antimicrobial peptides databases and the computational tools</article-title>
<source>Database (Oxford)</source>
<year iso-8601-date="2022">2022</year>
<volume>2022</volume>
<elocation-id>baac011</elocation-id>
<pub-id pub-id-type="doi">10.1093/database/baac011</pub-id>
<pub-id pub-id-type="pmid">35305010</pub-id>
<pub-id pub-id-type="pmcid">PMC9216472</pub-id>
</element-citation>
</ref>
<ref id="B43">
<label>43</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abramson</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Adler</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Dunger</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Evans</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Green</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Pritzel</surname>
<given-names>A</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Accurate structure prediction of biomolecular interactions with AlphaFold 3</article-title>
<source>Nature</source>
<year iso-8601-date="2024">2024</year>
<volume>630</volume>
<fpage>493</fpage>
<lpage>500</lpage>
<pub-id pub-id-type="doi">10.1038/s41586-024-07487-w</pub-id>
<pub-id pub-id-type="pmid">38718835</pub-id>
<pub-id pub-id-type="pmcid">PMC11168924</pub-id>
</element-citation>
</ref>
<ref id="B44">
<label>44</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Nasif</surname>
<given-names>KFA</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Deng</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>S</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>AI-Driven Deep Learning Techniques in Protein Structure Prediction</article-title>
<source>Int J Mol Sci</source>
<year iso-8601-date="2024">2024</year>
<volume>25</volume>
<elocation-id>8426</elocation-id>
<pub-id pub-id-type="doi">10.3390/ijms25158426</pub-id>
<pub-id pub-id-type="pmid">39125995</pub-id>
<pub-id pub-id-type="pmcid">PMC11313475</pub-id>
</element-citation>
</ref>
<ref id="B45">
<label>45</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tian</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Shao</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>W</given-names>
</name>
</person-group>
<article-title>Flexible Tail of Antimicrobial Peptide PGLa Facilitates Water Pore Formation in Membranes</article-title>
<source>J Phys Chem B</source>
<year iso-8601-date="2025">2025</year>
<volume>129</volume>
<fpage>1453</fpage>
<lpage>61</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jpcb.4c06190</pub-id>
<pub-id pub-id-type="pmid">39847609</pub-id>
</element-citation>
</ref>
<ref id="B46">
<label>46</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiménez-Osés</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Peccati</surname>
<given-names>F</given-names>
</name>
</person-group>
<article-title>Structure Prediction of Alternate Frame Folding Systems with AlphaFold3</article-title>
<source>J Chem Inf Model</source>
<year iso-8601-date="2025">2025</year>
<volume>65</volume>
<fpage>8229</fpage>
<lpage>37</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jcim.5c00906</pub-id>
<pub-id pub-id-type="pmid">40785361</pub-id>
<pub-id pub-id-type="pmcid">PMC12344692</pub-id>
</element-citation>
</ref>
<ref id="B47">
<label>47</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Musin</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Asyanova</surname>
<given-names>E</given-names>
</name>
</person-group>
<article-title>How Machine Learning Helps in Combating Antimicrobial Resistance: A Review of AMP Analysis and Generation Methods</article-title>
<source>Int J Pept Res Ther</source>
<year iso-8601-date="2025">2025</year>
<volume>31</volume>
<elocation-id>59</elocation-id>
<pub-id pub-id-type="doi">10.1007/s10989-025-10716-z</pub-id>
</element-citation>
</ref>
<ref id="B48">
<label>48</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nawaz</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Huiyuan</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Akhtar</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Tianyue</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Deep learning in the discovery of antiviral peptides and peptidomimetics: databases and prediction tools</article-title>
<source>Mol Divers</source>
<year iso-8601-date="2025">2025</year>
<volume>29</volume>
<fpage>3753</fpage>
<lpage>88</lpage>
<pub-id pub-id-type="doi">10.1007/s11030-025-11173-y</pub-id>
<pub-id pub-id-type="pmid">40153158</pub-id>
</element-citation>
</ref>
<ref id="B49">
<label>49</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mondal</surname>
<given-names>RK</given-names>
</name>
<name>
<surname>Sen</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Arya</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Samanta</surname>
<given-names>SK</given-names>
</name>
</person-group>
<article-title>Developing anti-microbial peptide database version 1 to provide comprehensive and exhaustive resource of manually curated AMPs</article-title>
<source>Sci Rep</source>
<year iso-8601-date="2023">2023</year>
<volume>13</volume>
<elocation-id>17843</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41598-023-45016-3</pub-id>
<pub-id pub-id-type="pmid">37857659</pub-id>
<pub-id pub-id-type="pmcid">PMC10587344</pub-id>
</element-citation>
</ref>
<ref id="B50">
<label>50</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yao</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Guan</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Xie</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Chung</surname>
<given-names>CR</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>D</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>dbAMP 3.0: updated resource of antimicrobial activity and structural annotation of peptides in the post-pandemic era</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2025">2025</year>
<volume>53</volume>
<fpage>D364</fpage>
<lpage>76</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkae1019</pub-id>
<pub-id pub-id-type="pmid">39540425</pub-id>
<pub-id pub-id-type="pmcid">PMC11701527</pub-id>
</element-citation>
</ref>
<ref id="B51">
<label>51</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jhong</surname>
<given-names>JH</given-names>
</name>
<name>
<surname>Chi</surname>
<given-names>YH</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>WC</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>TH</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>KY</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>TY</given-names>
</name>
</person-group>
<article-title>dbAMP: an integrated resource for exploring antimicrobial peptides with functional activities and physicochemical properties on transcriptome and proteome data</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2019">2019</year>
<volume>47</volume>
<fpage>D285</fpage>
<lpage>97</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gky1030</pub-id>
<pub-id pub-id-type="pmid">30380085</pub-id>
<pub-id pub-id-type="pmcid">PMC6323920</pub-id>
</element-citation>
</ref>
<ref id="B52">
<label>52</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Hao</surname>
<given-names>C</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DRAMP 4.0: an open-access data repository dedicated to the clinical translation of antimicrobial peptides</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2025">2025</year>
<volume>53</volume>
<fpage>D403</fpage>
<lpage>10</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkae1046</pub-id>
<pub-id pub-id-type="pmid">39526377</pub-id>
<pub-id pub-id-type="pmcid">PMC11701585</pub-id>
</element-citation>
</ref>
<ref id="B53">
<label>53</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fan</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Lao</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>H</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DRAMP: a comprehensive data repository of antimicrobial peptides</article-title>
<source>Sci Rep</source>
<year iso-8601-date="2016">2016</year>
<volume>6</volume>
<elocation-id>24482</elocation-id>
<pub-id pub-id-type="doi">10.1038/srep24482</pub-id>
<pub-id pub-id-type="pmid">27075512</pub-id>
<pub-id pub-id-type="pmcid">PMC4830929</pub-id>
</element-citation>
</ref>
<ref id="B54">
<label>54</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>J</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DRAMP 2.0, an updated data repository of antimicrobial peptides</article-title>
<source>Sci Data</source>
<year iso-8601-date="2019">2019</year>
<volume>6</volume>
<elocation-id>148</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41597-019-0154-y</pub-id>
<pub-id pub-id-type="pmid">31409791</pub-id>
<pub-id pub-id-type="pmcid">PMC6692298</pub-id>
</element-citation>
</ref>
<ref id="B55">
<label>55</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Shi</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Kang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Hu</surname>
<given-names>Y</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DRAMP 3.0: an enhanced comprehensive data repository of antimicrobial peptides</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2022">2022</year>
<volume>50</volume>
<fpage>D488</fpage>
<lpage>96</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkab651</pub-id>
<pub-id pub-id-type="pmid">34390348</pub-id>
<pub-id pub-id-type="pmcid">PMC8728287</pub-id>
</element-citation>
</ref>
<ref id="B56">
<label>56</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pirtskhalava</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Gabrielian</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Cruz</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Griggs</surname>
<given-names>HL</given-names>
</name>
<name>
<surname>Squires</surname>
<given-names>RB</given-names>
</name>
<name>
<surname>Hurt</surname>
<given-names>DE</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DBAASP v.2: an enhanced database of structure and antimicrobial/cytotoxic activity of natural and synthetic peptides</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2016">2016</year>
<volume>44</volume>
<fpage>D1104</fpage>
<lpage>12</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkv1174</pub-id>
<pub-id pub-id-type="pmid">26578581</pub-id>
<pub-id pub-id-type="pmcid">PMC4702840</pub-id>
</element-citation>
</ref>
<ref id="B57">
<label>57</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ye</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Ge</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>G</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>LAMP2: a major update of the database linking antimicrobial peptides</article-title>
<source>Database (Oxford)</source>
<year iso-8601-date="2020">2020</year>
<volume>2020</volume>
<elocation-id>baaa061</elocation-id>
<pub-id pub-id-type="doi">10.1093/database/baaa061</pub-id>
<pub-id pub-id-type="pmid">32844169</pub-id>
<pub-id pub-id-type="pmcid">PMC7447557</pub-id>
</element-citation>
</ref>
<ref id="B58">
<label>58</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Waghu</surname>
<given-names>FH</given-names>
</name>
<name>
<surname>Barai</surname>
<given-names>RS</given-names>
</name>
<name>
<surname>Gurung</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Idicula-Thomas</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>CAMP<sub>R3</sub>: a database on sequences, structures and signatures of antimicrobial peptides</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2016">2016</year>
<volume>44</volume>
<fpage>D1094</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkv1051</pub-id>
<pub-id pub-id-type="pmid">26467475</pub-id>
<pub-id pub-id-type="pmcid">PMC4702787</pub-id>
</element-citation>
</ref>
<ref id="B59">
<label>59</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gawde</surname>
<given-names>U</given-names>
</name>
<name>
<surname>Chakraborty</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Waghu</surname>
<given-names>FH</given-names>
</name>
<name>
<surname>Barai</surname>
<given-names>RS</given-names>
</name>
<name>
<surname>Khanderkar</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Indraguru</surname>
<given-names>R</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>CAMP<sub>R4</sub>: a database of natural and synthetic antimicrobial peptides</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2023">2023</year>
<volume>51</volume>
<fpage>D377</fpage>
<lpage>83</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkac933</pub-id>
<pub-id pub-id-type="pmid">36370097</pub-id>
<pub-id pub-id-type="pmcid">PMC9825550</pub-id>
</element-citation>
</ref>
<ref id="B60">
<label>60</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
</person-group>
<article-title>APD3: the antimicrobial peptide database as a tool for research and education</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2016">2016</year>
<volume>44</volume>
<fpage>D1087</fpage>
<lpage>93</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkv1278</pub-id>
<pub-id pub-id-type="pmid">26602694</pub-id>
<pub-id pub-id-type="pmcid">PMC4702905</pub-id>
</element-citation>
</ref>
<ref id="B61">
<label>61</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>APD: the Antimicrobial Peptide Database</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2004">2004</year>
<volume>32</volume>
<fpage>D590</fpage>
<lpage>2</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkh025</pub-id>
<pub-id pub-id-type="pmid">14681488</pub-id>
<pub-id pub-id-type="pmcid">PMC308759</pub-id>
</element-citation>
</ref>
<ref id="B62">
<label>62</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Schmidt</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
</person-group>
<article-title>APD6: the antimicrobial peptide database is expanded to promote research and development by deploying an unprecedented information pipeline</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2026">2026</year>
<volume>54</volume>
<fpage>D363</fpage>
<lpage>74</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkaf860</pub-id>
<pub-id pub-id-type="pmid">40927993</pub-id>
<pub-id pub-id-type="pmcid">PMC12807733</pub-id>
</element-citation>
</ref>
<ref id="B63">
<label>63</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Piotto</surname>
<given-names>SP</given-names>
</name>
<name>
<surname>Sessa</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Concilio</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Iannelli</surname>
<given-names>P</given-names>
</name>
</person-group>
<article-title>YADAMP: yet another database of antimicrobial peptides</article-title>
<source>Int J Antimicrob Agents</source>
<year iso-8601-date="2012">2012</year>
<volume>39</volume>
<fpage>346</fpage>
<lpage>51</lpage>
<pub-id pub-id-type="doi">10.1016/j.ijantimicag.2011.12.003</pub-id>
<pub-id pub-id-type="pmid">22325123</pub-id>
</element-citation>
</ref>
<ref id="B64">
<label>64</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gómez</surname>
<given-names>EA</given-names>
</name>
<name>
<surname>Giraldo</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Orduz</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>InverPep: A database of invertebrate antimicrobial peptides</article-title>
<source>J Glob Antimicrob Resist</source>
<year iso-8601-date="2017">2017</year>
<volume>8</volume>
<fpage>13</fpage>
<lpage>7</lpage>
<pub-id pub-id-type="doi">10.1016/j.jgar.2016.10.003</pub-id>
<pub-id pub-id-type="pmid">27888793</pub-id>
</element-citation>
</ref>
<ref id="B65">
<label>65</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Das</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Jaiswal</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Khan</surname>
<given-names>FN</given-names>
</name>
<name>
<surname>Ahamad</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>PlantPepDB: A manually curated plant peptide database</article-title>
<source>Sci Rep</source>
<year iso-8601-date="2020">2020</year>
<volume>10</volume>
<elocation-id>2194</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41598-020-59165-2</pub-id>
<pub-id pub-id-type="pmid">32042035</pub-id>
<pub-id pub-id-type="pmcid">PMC7010657</pub-id>
</element-citation>
</ref>
<ref id="B66">
<label>66</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Long</surname>
<given-names>J</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>A database of anti-coronavirus peptides</article-title>
<source>Sci Data</source>
<year iso-8601-date="2022">2022</year>
<volume>9</volume>
<elocation-id>294</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41597-022-01394-3</pub-id>
<pub-id pub-id-type="pmid">35697698</pub-id>
<pub-id pub-id-type="pmcid">PMC9192597</pub-id>
</element-citation>
</ref>
<ref id="B67">
<label>67</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qureshi</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Thakur</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Tandon</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>AVPdb: a database of experimentally validated antiviral peptides targeting medically important viruses</article-title>
<source>Nucleic Acids Res</source>
<year iso-8601-date="2014">2014</year>
<volume>42</volume>
<fpage>D1147</fpage>
<lpage>53</lpage>
<pub-id pub-id-type="doi">10.1093/nar/gkt1191</pub-id>
<pub-id pub-id-type="pmid">24285301</pub-id>
<pub-id pub-id-type="pmcid">PMC3964995</pub-id>
</element-citation>
</ref>
<ref id="B68">
<label>68</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Qureshi</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Thakur</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>HIPdb: a database of experimentally validated HIV inhibiting peptides</article-title>
<source>PLoS One</source>
<year iso-8601-date="2013">2013</year>
<volume>8</volume>
<elocation-id>e54908</elocation-id>
<pub-id pub-id-type="doi">10.1371/journal.pone.0054908</pub-id>
<pub-id pub-id-type="pmid">23359817</pub-id>
<pub-id pub-id-type="pmcid">PMC3554673</pub-id>
</element-citation>
</ref>
<ref id="B69">
<label>69</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Nawaz</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Yao</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Akhtar</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>Y</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>DRAVP 2.0: A Curated and Genomically Annotated Database of Antiviral Peptides and Proteins</article-title>
<source>J Mol Biol</source>
<year iso-8601-date="2026">2026</year>
<volume> 5</volume>
<elocation-id>169628</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.jmb.2026.169628</pub-id>
<pub-id pub-id-type="pmid">41500382</pub-id>
</element-citation>
</ref>
<ref id="B70">
<label>70</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Lao</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Zheng</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>DCTPep, the data of cancer therapy peptides</article-title>
<source>Sci Data</source>
<year iso-8601-date="2024">2024</year>
<volume>11</volume>
<elocation-id>541</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41597-024-03388-9</pub-id>
<pub-id pub-id-type="pmid">38796630</pub-id>
<pub-id pub-id-type="pmcid">PMC11128002</pub-id>
</element-citation>
</ref>
<ref id="B71">
<label>71</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>The antimicrobial peptide database is 20 years old: Recent developments and future directions</article-title>
<source>Protein Sci</source>
<year iso-8601-date="2023">2023</year>
<volume>32</volume>
<elocation-id>e4778</elocation-id>
<pub-id pub-id-type="doi">10.1002/pro.4778</pub-id>
<pub-id pub-id-type="pmid">37695921</pub-id>
<pub-id pub-id-type="pmcid">PMC10535814</pub-id>
</element-citation>
</ref>
<ref id="B72">
<label>72</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>He</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Di</surname>
<given-names>Bonaventura I</given-names>
</name>
<name>
<surname>Visini</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Gan</surname>
<given-names>BH</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Probst</surname>
<given-names>D</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Design, crystal structure and atomic force microscopy study of thioether ligated D,L-cyclic antimicrobial peptides against multidrug resistant <italic>Pseudomonas aeruginosa</italic></article-title>
<source>Chem Sci</source>
<year iso-8601-date="2017">2017</year>
<volume>8</volume>
<fpage>7464</fpage>
<lpage>75</lpage>
<pub-id pub-id-type="doi">10.1039/c7sc01599b</pub-id>
<pub-id pub-id-type="pmid">29163899</pub-id>
<pub-id pub-id-type="pmcid">PMC5676089</pub-id>
</element-citation>
</ref>
<ref id="B73">
<label>73</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kumar</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Bhagwat</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Pillai</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>A review on the diversity of antimicrobial peptides and genome mining strategies for their prediction</article-title>
<source>Biochimie</source>
<year iso-8601-date="2024">2024</year>
<volume>227</volume>
<fpage>99</fpage>
<lpage>115</lpage>
<pub-id pub-id-type="doi">10.1016/j.biochi.2024.06.013</pub-id>
<pub-id pub-id-type="pmid">38944107</pub-id>
</element-citation>
</ref>
<ref id="B74">
<label>74</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaweewan</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Hemmi</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Komaki</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Harada</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Kodani</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>Isolation and structure determination of a new lasso peptide specialicin based on genome mining</article-title>
<source>Bioorg Med Chem</source>
<year iso-8601-date="2018">2018</year>
<volume>26</volume>
<fpage>6050</fpage>
<lpage>5</lpage>
<pub-id pub-id-type="doi">10.1016/j.bmc.2018.11.007</pub-id>
<pub-id pub-id-type="pmid">30448257</pub-id>
</element-citation>
</ref>
<ref id="B75">
<label>75</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Zeng</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Feng</surname>
<given-names>L</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Unveiling the evolution of antimicrobial peptides in gut microbes via foundation-model-powered framework</article-title>
<source>Cell Rep</source>
<year iso-8601-date="2025">2025</year>
<volume>44</volume>
<elocation-id>115773</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.celrep.2025.115773</pub-id>
<pub-id pub-id-type="pmid">40445833</pub-id>
</element-citation>
</ref>
<ref id="B76">
<label>76</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lata</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Sharma</surname>
<given-names>BK</given-names>
</name>
<name>
<surname>Raghava</surname>
<given-names>GP</given-names>
</name>
</person-group>
<article-title>Analysis and prediction of antibacterial peptides</article-title>
<source>BMC Bioinformatics</source>
<year iso-8601-date="2007">2007</year>
<volume>8</volume>
<elocation-id>263</elocation-id>
<pub-id pub-id-type="doi">10.1186/1471-2105-8-263</pub-id>
<pub-id pub-id-type="pmid">17645800</pub-id>
<pub-id pub-id-type="pmcid">PMC2041956</pub-id>
</element-citation>
</ref>
<ref id="B77">
<label>77</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Akhter</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>JH</given-names>
</name>
</person-group>
<article-title>BPAGS: a web application for bacteriocin prediction via feature evaluation using alternating decision tree, genetic algorithm, and linear support vector classifier</article-title>
<source>Front Bioinform</source>
<year iso-8601-date="2024">2024</year>
<volume>3</volume>
<elocation-id>1284705</elocation-id>
<pub-id pub-id-type="doi">10.3389/fbinf.2023.1284705</pub-id>
<pub-id pub-id-type="pmid">38268970</pub-id>
<pub-id pub-id-type="pmcid">PMC10807691</pub-id>
</element-citation>
</ref>
<ref id="B78">
<label>78</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lira</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Perez</surname>
<given-names>PS</given-names>
</name>
<name>
<surname>Baranauskas</surname>
<given-names>JA</given-names>
</name>
<name>
<surname>Nozawa</surname>
<given-names>SR</given-names>
</name>
</person-group>
<article-title>Prediction of antimicrobial activity of synthetic peptides by a decision tree model</article-title>
<source>Appl Environ Microbiol</source>
<year iso-8601-date="2013">2013</year>
<volume>79</volume>
<fpage>3156</fpage>
<lpage>9</lpage>
<pub-id pub-id-type="doi">10.1128/AEM.02804-12</pub-id>
<pub-id pub-id-type="pmid">23455341</pub-id>
<pub-id pub-id-type="pmcid">PMC3685263</pub-id>
</element-citation>
</ref>
<ref id="B79">
<label>79</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elalouf</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Elalouf</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Rosenfeld</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Maoz</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>Artificial intelligence in drug resistance management</article-title>
<source>3 Biotech</source>
<year iso-8601-date="2025">2025</year>
<volume>15</volume>
<elocation-id>126</elocation-id>
<pub-id pub-id-type="doi">10.1007/s13205-025-04282-w</pub-id>
<pub-id pub-id-type="pmid">40235844</pub-id>
<pub-id pub-id-type="pmcid">PMC11996750</pub-id>
</element-citation>
</ref>
<ref id="B80">
<label>80</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xiao</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>WZ</given-names>
</name>
<name>
<surname>Jia</surname>
<given-names>JH</given-names>
</name>
<name>
<surname>Chou</surname>
<given-names>KC</given-names>
</name>
</person-group>
<article-title>iAMP-2L: a two-level multi-label classifier for identifying antimicrobial peptides and their functional types</article-title>
<source>Anal Biochem</source>
<year iso-8601-date="2013">2013</year>
<volume>436</volume>
<fpage>168</fpage>
<lpage>77</lpage>
<pub-id pub-id-type="doi">10.1016/j.ab.2013.01.019</pub-id>
<pub-id pub-id-type="pmid">23395824</pub-id>
</element-citation>
</ref>
<ref id="B81">
<label>81</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bai</surname>
<given-names>LY</given-names>
</name>
<name>
<surname>Dai</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Junaid</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>SL</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>X</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Prediction of Effective Drug Combinations by an Improved Naïve Bayesian Algorithm</article-title>
<source>Int J Mol Sci</source>
<year iso-8601-date="2018">2018</year>
<volume>19</volume>
<elocation-id>467</elocation-id>
<pub-id pub-id-type="doi">10.3390/ijms19020467</pub-id>
<pub-id pub-id-type="pmid">29401735</pub-id>
<pub-id pub-id-type="pmcid">PMC5855689</pub-id>
</element-citation>
</ref>
<ref id="B82">
<label>82</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jana</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Sarkar</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Ganguli</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Mukherjee</surname>
<given-names>SK</given-names>
</name>
<name>
<surname>Mandal</surname>
<given-names>RS</given-names>
</name>
<name>
<surname>Das</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>ABDpred: Prediction of active antimicrobial compounds using supervised machine learning techniques</article-title>
<source>Indian J Med Res</source>
<year iso-8601-date="2024">2024</year>
<volume>159</volume>
<fpage>78</fpage>
<lpage>90</lpage>
<pub-id pub-id-type="doi">10.4103/ijmr.ijmr_1832_22</pub-id>
<pub-id pub-id-type="pmid">38345040</pub-id>
<pub-id pub-id-type="pmcid">PMC10954100</pub-id>
</element-citation>
</ref>
<ref id="B83">
<label>83</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>CC</given-names>
</name>
<name>
<surname>Hung</surname>
<given-names>YT</given-names>
</name>
<name>
<surname>Chou</surname>
<given-names>CY</given-names>
</name>
<name>
<surname>Hsuan</surname>
<given-names>SL</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>ZW</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>PY</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Using random forest to predict antimicrobial minimum inhibitory concentrations of nontyphoidal <italic>Salmonella</italic> in Taiwan</article-title>
<source>Vet Res</source>
<year iso-8601-date="2023">2023</year>
<volume>54</volume>
<elocation-id>11</elocation-id>
<pub-id pub-id-type="doi">10.1186/s13567-023-01141-5</pub-id>
<pub-id pub-id-type="pmid">36747286</pub-id>
<pub-id pub-id-type="pmcid">PMC9903507</pub-id>
</element-citation>
</ref>
<ref id="B84">
<label>84</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Han</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>J</given-names>
</name>
</person-group>
<article-title>Protocol for predicting peptides with anticancer and antimicrobial properties by a tri-fusion neural network</article-title>
<source>STAR Protoc</source>
<year iso-8601-date="2023">2023</year>
<volume>4</volume>
<elocation-id>102541</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.xpro.2023.102541</pub-id>
<pub-id pub-id-type="pmid">37660298</pub-id>
<pub-id pub-id-type="pmcid">PMC10491854</pub-id>
</element-citation>
</ref>
<ref id="B85">
<label>85</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bournez</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Riool</surname>
<given-names>M</given-names>
</name>
<name>
<surname>de Boer</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Cordfunke</surname>
<given-names>RA</given-names>
</name>
<name>
<surname>de Best</surname>
<given-names>L</given-names>
</name>
<name>
<surname>van Leeuwen</surname>
<given-names>R</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>CalcAMP: A New Machine Learning Model for the Accurate Prediction of Antimicrobial Activity of Peptides</article-title>
<source>Antibiotics (Basel)</source>
<year iso-8601-date="2023">2023</year>
<volume>12</volume>
<elocation-id>725</elocation-id>
<pub-id pub-id-type="doi">10.3390/antibiotics12040725</pub-id>
<pub-id pub-id-type="pmid">37107088</pub-id>
<pub-id pub-id-type="pmcid">PMC10135148</pub-id>
</element-citation>
</ref>
<ref id="B86">
<label>86</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lv</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Hesham</surname>
<given-names>AE</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>AMPpred-EL: An effective antimicrobial peptide prediction model based on ensemble learning</article-title>
<source>Comput Biol Med</source>
<year iso-8601-date="2022">2022</year>
<volume>146</volume>
<elocation-id>105577</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.compbiomed.2022.105577</pub-id>
<pub-id pub-id-type="pmid">35576825</pub-id>
</element-citation>
</ref>
<ref id="B87">
<label>87</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jaiswal</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S</given-names>
</name>
</person-group>
<article-title>PTPAMP: prediction tool for plant-derived antimicrobial peptides</article-title>
<source>Amino Acids</source>
<year iso-8601-date="2023">2023</year>
<volume>55</volume>
<fpage>1</fpage>
<lpage>17</lpage>
<pub-id pub-id-type="doi">10.1007/s00726-022-03190-0</pub-id>
<pub-id pub-id-type="pmid">35864258</pub-id>
</element-citation>
</ref>
<ref id="B88">
<label>88</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Xiong</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Cui</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>Deep Learning in Antimicrobial Peptide Prediction</article-title>
<source>J Chem Inf Model</source>
<year iso-8601-date="2025">2025</year>
<volume>65</volume>
<fpage>7373</fpage>
<lpage>92</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jcim.5c00530</pub-id>
<pub-id pub-id-type="pmid">40626654</pub-id>
</element-citation>
</ref>
<ref id="B89">
<label>89</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ma</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Xia</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Yu</surname>
<given-names>Y</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Identification of antimicrobial peptides from the human gut microbiome using deep learning</article-title>
<source>Nat Biotechnol</source>
<year iso-8601-date="2022">2022</year>
<volume>40</volume>
<fpage>921</fpage>
<lpage>31</lpage>
<pub-id pub-id-type="doi">10.1038/s41587-022-01226-0</pub-id>
<pub-id pub-id-type="pmid">35241840</pub-id>
</element-citation>
</ref>
<ref id="B90">
<label>90</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yan</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Cai</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Wong</surname>
<given-names>DF</given-names>
</name>
<name>
<surname>Siu</surname>
<given-names>SWI</given-names>
</name>
</person-group>
<article-title>Recent Progress in the Discovery and Design of Antimicrobial Peptides Using Traditional Machine Learning and Deep Learning</article-title>
<source>Antibiotics (Basel)</source>
<year iso-8601-date="2022">2022</year>
<volume>11</volume>
<elocation-id>1451</elocation-id>
<pub-id pub-id-type="doi">10.3390/antibiotics11101451</pub-id>
<pub-id pub-id-type="pmid">36290108</pub-id>
<pub-id pub-id-type="pmcid">PMC9598685</pub-id>
</element-citation>
</ref>
<ref id="B91">
<label>91</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xing</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Huo</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>G</given-names>
</name>
</person-group>
<article-title>iAMP-Attenpred: a novel antimicrobial peptide predictor based on BERT feature extraction method and CNN-BiLSTM-Attention combination model</article-title>
<source>Brief Bioinform</source>
<year iso-8601-date="2024">2024</year>
<volume>25</volume>
<elocation-id>bbad443</elocation-id>
<pub-id pub-id-type="doi">10.1093/bib/bbad443</pub-id>
<pub-id pub-id-type="pmid">38055840</pub-id>
<pub-id pub-id-type="pmcid">PMC10699745</pub-id>
</element-citation>
</ref>
<ref id="B92">
<label>92</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Zhuo</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Zou</surname>
<given-names>Q</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Diff-AMP: tailored designed antimicrobial peptide framework with all-in-one generation, identification, prediction and optimization</article-title>
<source>Brief Bioinform</source>
<year iso-8601-date="2024">2024</year>
<volume>25</volume>
<elocation-id>bbae078</elocation-id>
<pub-id pub-id-type="doi">10.1093/bib/bbae078</pub-id>
<pub-id pub-id-type="pmid">38446739</pub-id>
<pub-id pub-id-type="pmcid">PMC10939340</pub-id>
</element-citation>
</ref>
<ref id="B93">
<label>93</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Nam</surname>
<given-names>H</given-names>
</name>
</person-group>
<article-title>AMP-BERT: Prediction of antimicrobial peptide function based on a BERT model</article-title>
<source>Protein Sci</source>
<year iso-8601-date="2023">2023</year>
<volume>32</volume>
<elocation-id>e4529</elocation-id>
<pub-id pub-id-type="doi">10.1002/pro.4529</pub-id>
<pub-id pub-id-type="pmid">36461699</pub-id>
<pub-id pub-id-type="pmcid">PMC9793967</pub-id>
</element-citation>
</ref>
<ref id="B94">
<label>94</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lobo</surname>
<given-names>F</given-names>
</name>
<name>
<surname>González</surname>
<given-names>MS</given-names>
</name>
<name>
<surname>Boto</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Pérez</surname>
<given-names>de la Lastra JM</given-names>
</name>
</person-group>
<article-title>Prediction of Antifungal Activity of Antimicrobial Peptides by Transfer Learning from Protein Pretrained Models</article-title>
<source>Int J Mol Sci</source>
<year iso-8601-date="2023">2023</year>
<volume>24</volume>
<elocation-id>10270</elocation-id>
<pub-id pub-id-type="doi">10.3390/ijms241210270</pub-id>
<pub-id pub-id-type="pmid">37373415</pub-id>
<pub-id pub-id-type="pmcid">PMC10299371</pub-id>
</element-citation>
</ref>
<ref id="B95">
<label>95</label>
<element-citation publication-type="web">
<person-group person-group-type="author">
<name>
<surname>Vaswani</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Shazeer</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Parmar</surname>
<given-names>N</given-names>
</name>
<name>
<surname>Uszkoreit</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Gomez</surname>
<given-names>AN</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Attention Is All You Need</article-title>
<comment>arXiv:1706.03762 [Preprint]. 2017 [cited 2025 August 14]. Available from: <uri xlink:href="https://doi.org/10.48550/arXiv.1706.03762">https://doi.org/10.48550/arXiv.1706.03762</uri></comment>
</element-citation>
</ref>
<ref id="B96">
<label>96</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Akin</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Rao</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Hie</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>W</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Evolutionary-scale prediction of atomic-level protein structure with a language model</article-title>
<source>Science</source>
<year iso-8601-date="2023">2023</year>
<volume>379</volume>
<fpage>1123</fpage>
<lpage>30</lpage>
<pub-id pub-id-type="doi">10.1126/science.ade2574</pub-id>
<pub-id pub-id-type="pmid">36927031</pub-id>
</element-citation>
</ref>
<ref id="B97">
<label>97</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Elnaggar</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Heinzinger</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Dallago</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Rehawi</surname>
<given-names>G</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Jones</surname>
<given-names>L</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning</article-title>
<source>IEEE Trans Pattern Anal Mach Intell</source>
<year iso-8601-date="2022">2022</year>
<volume>44</volume>
<fpage>7112</fpage>
<lpage>27</lpage>
<pub-id pub-id-type="doi">10.1109/TPAMI.2021.3095381</pub-id>
<pub-id pub-id-type="pmid">34232869</pub-id>
</element-citation>
</ref>
<ref id="B98">
<label>98</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>LT</given-names>
</name>
<name>
<surname>Quinn</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Dumas</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Hong</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Lopez-Gonzalez</surname>
<given-names>M</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Target sequence-conditioned design of peptide binders using masked language modeling</article-title>
<source>Nat Biotechnol</source>
<year iso-8601-date="2025">2025</year>
<volume>[Epub ahead of print]</volume>
<pub-id pub-id-type="doi">10.1038/s41587-025-02761-2</pub-id>
<pub-id pub-id-type="pmid">40804173</pub-id>
</element-citation>
</ref>
<ref id="B99">
<label>99</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cao</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Ge</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Harvey</surname>
<given-names>PJ</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Ma</surname>
<given-names>Y</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Designing antimicrobial peptides using deep learning and molecular dynamic simulations</article-title>
<source>Brief Bioinform</source>
<year iso-8601-date="2023">2023</year>
<volume>24</volume>
<elocation-id>bbad058</elocation-id>
<pub-id pub-id-type="doi">10.1093/bib/bbad058</pub-id>
<pub-id pub-id-type="pmid">36857616</pub-id>
</element-citation>
</ref>
<ref id="B100">
<label>100</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Al-Omari</surname>
<given-names>AM</given-names>
</name>
<name>
<surname>Akkam</surname>
<given-names>YH</given-names>
</name>
<name>
<surname>Zyout</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Younis</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Tawalbeh</surname>
<given-names>SM</given-names>
</name>
<name>
<surname>Al-Sawalmeh</surname>
<given-names>K</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Accelerating antimicrobial peptide design: Leveraging deep learning for rapid discovery</article-title>
<source>PLoS One</source>
<year iso-8601-date="2024">2024</year>
<volume>19</volume>
<elocation-id>e0315477</elocation-id>
<pub-id pub-id-type="doi">10.1371/journal.pone.0315477</pub-id>
<pub-id pub-id-type="pmid">39705302</pub-id>
<pub-id pub-id-type="pmcid">PMC11661626</pub-id>
</element-citation>
</ref>
<ref id="B101">
<label>101</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yan</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Lv</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>B</given-names>
</name>
</person-group>
<article-title>sAMPpred-GAT: prediction of antimicrobial peptide by graph attention network and predicted peptide structure</article-title>
<source>Bioinformatics</source>
<year iso-8601-date="2023">2023</year>
<volume>39</volume>
<elocation-id>btac715</elocation-id>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btac715</pub-id>
<pub-id pub-id-type="pmid">36342186</pub-id>
<pub-id pub-id-type="pmcid">PMC9805557</pub-id>
</element-citation>
</ref>
<ref id="B102">
<label>102</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sagendorf</surname>
<given-names>JM</given-names>
</name>
<name>
<surname>Mitra</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Huang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>XS</given-names>
</name>
<name>
<surname>Rohs</surname>
<given-names>R</given-names>
</name>
</person-group>
<article-title>Structure-based prediction of protein-nucleic acid binding using graph neural networks</article-title>
<source>Biophys Rev</source>
<year iso-8601-date="2024">2024</year>
<volume>16</volume>
<fpage>297</fpage>
<lpage>314</lpage>
<pub-id pub-id-type="doi">10.1007/s12551-024-01201-w</pub-id>
<pub-id pub-id-type="pmid">39345796</pub-id>
<pub-id pub-id-type="pmcid">PMC11427629</pub-id>
</element-citation>
</ref>
<ref id="B103">
<label>103</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wan</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Torres</surname>
<given-names>MDT</given-names>
</name>
<name>
<surname>Peng</surname>
<given-names>J</given-names>
</name>
<name>
<surname>de la Fuente-Nunez</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Deep-learning-enabled antibiotic discovery through molecular de-extinction</article-title>
<source>Nat Biomed Eng</source>
<year iso-8601-date="2024">2024</year>
<volume>8</volume>
<fpage>854</fpage>
<lpage>71</lpage>
<pub-id pub-id-type="doi">10.1038/s41551-024-01201-x</pub-id>
<pub-id pub-id-type="pmid">38862735</pub-id>
<pub-id pub-id-type="pmcid">PMC11310081</pub-id>
</element-citation>
</ref>
<ref id="B104">
<label>104</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Gao</surname>
<given-names>X</given-names>
</name>
</person-group>
<article-title>UniproLcad: Accurate Identification of Antimicrobial Peptide by Fusing Multiple Pre-Trained Protein Language Models</article-title>
<source>Symmetry</source>
<year iso-8601-date="2024">2024</year>
<volume>16</volume>
<elocation-id>464</elocation-id>
<pub-id pub-id-type="doi">10.3390/sym16040464</pub-id>
</element-citation>
</ref>
<ref id="B105">
<label>105</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Rives</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Meier</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Sercu</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Goyal</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>J</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences</article-title>
<source>Proc Natl Acad Sci U S A</source>
<year iso-8601-date="2021">2021</year>
<volume>118</volume>
<elocation-id>e2016239118</elocation-id>
<pub-id pub-id-type="doi">10.1073/pnas.2016239118</pub-id>
<pub-id pub-id-type="pmid">33876751</pub-id>
<pub-id pub-id-type="pmcid">PMC8053943</pub-id>
</element-citation>
</ref>
<ref id="B106">
<label>106</label>
<element-citation publication-type="web">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Dong</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Zong</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>HMD-AMP: Protein Language-Powered Hierarchical Multi-label Deep Forest for Annotating Antimicrobial Peptides</article-title>
<comment>arXiv:2111.06023 [Preprint]. 2021 [cited 2025 August 18]. Available from: <uri xlink:href="https://doi.org/10.48550/arXiv.2111.06023">https://doi.org/10.48550/arXiv.2111.06023</uri></comment>
</element-citation>
</ref>
<ref id="B107">
<label>107</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Guntuboina</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Das</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Mollaei</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Barati</surname>
<given-names>Farimani A</given-names>
</name>
</person-group>
<article-title>PeptideBERT: A Language Model Based on Transformers for Peptide Property Prediction</article-title>
<source>J Phys Chem Lett</source>
<year iso-8601-date="2023">2023</year>
<volume>14</volume>
<fpage>10427</fpage>
<lpage>34</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jpclett.3c02398</pub-id>
<pub-id pub-id-type="pmid">37956397</pub-id>
<pub-id pub-id-type="pmcid">PMC10683064</pub-id>
</element-citation>
</ref>
<ref id="B108">
<label>108</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Xiong</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Cui</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>H</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>PepGraphormer: an ESM-GAT hybrid deep learning framework for antimicrobial peptide prediction</article-title>
<source>J Cheminform</source>
<year iso-8601-date="2026">2026</year>
<volume>18</volume>
<elocation-id>15</elocation-id>
<pub-id pub-id-type="doi">10.1186/s13321-025-01144-8</pub-id>
<pub-id pub-id-type="pmid">41491543</pub-id>
<pub-id pub-id-type="pmcid">PMC12870942</pub-id>
</element-citation>
</ref>
<ref id="B109">
<label>109</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Ren</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Luo</surname>
<given-names>X</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>A Foundation Model Identifies Broad-Spectrum Antimicrobial Peptides against Drug-Resistant Bacterial Infection</article-title>
<source>Nat Commun</source>
<year iso-8601-date="2024">2024</year>
<volume>15</volume>
<elocation-id>7538</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41467-024-51933-2</pub-id>
<pub-id pub-id-type="pmid">39214978</pub-id>
<pub-id pub-id-type="pmcid">PMC11364768</pub-id>
</element-citation>
</ref>
<ref id="B110">
<label>110</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>R</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>BroadAMP-GPT: AI-Driven generation of broad-spectrum antimicrobial peptides for combating multidrug-resistant ESKAPE pathogens</article-title>
<source>Gut Microbes</source>
<year iso-8601-date="2025">2025</year>
<volume>17</volume>
<elocation-id>2523811</elocation-id>
<pub-id pub-id-type="doi">10.1080/19490976.2025.2523811</pub-id>
<pub-id pub-id-type="pmid">40568788</pub-id>
<pub-id pub-id-type="pmcid">PMC12203862</pub-id>
</element-citation>
</ref>
<ref id="B111">
<label>111</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>BL</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Q</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>BatGPT-Chem: A Foundation Large Model for Chemical Engineering</article-title>
<source>Research (Wash D C)</source>
<year iso-8601-date="2025">2025</year>
<volume>8</volume>
<elocation-id>0827</elocation-id>
<pub-id pub-id-type="doi">10.34133/research.0827</pub-id>
<pub-id pub-id-type="pmid">40936797</pub-id>
<pub-id pub-id-type="pmcid">PMC12421729</pub-id>
</element-citation>
</ref>
<ref id="B112">
<label>112</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Szymczak</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Możejko</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Grzegorzek</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Jurczak</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Bauer</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Neubauer</surname>
<given-names>D</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Discovering highly potent antimicrobial peptides with deep generative model HydrAMP</article-title>
<source>Nat Commun</source>
<year iso-8601-date="2023">2023</year>
<volume>14</volume>
<elocation-id>1453</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41467-023-36994-z</pub-id>
<pub-id pub-id-type="pmid">36922490</pub-id>
<pub-id pub-id-type="pmcid">PMC10017685</pub-id>
</element-citation>
</ref>
<ref id="B113">
<label>113</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yu</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Qiao</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>Multi-CGAN: Deep Generative Model-Based Multiproperty Antimicrobial Peptide Design</article-title>
<source>J Chem Inf Model</source>
<year iso-8601-date="2024">2024</year>
<volume>64</volume>
<fpage>316</fpage>
<lpage>26</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jcim.3c01881</pub-id>
<pub-id pub-id-type="pmid">38135439</pub-id>
</element-citation>
</ref>
<ref id="B114">
<label>114</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Szymczak</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Zarzecki</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Duan</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Coelho</surname>
<given-names>LP</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>AI-Driven Antimicrobial Peptide Discovery: Mining and Generation</article-title>
<source>Acc Chem Res</source>
<year iso-8601-date="2025">2025</year>
<volume>58</volume>
<fpage>1831</fpage>
<lpage>46</lpage>
<pub-id pub-id-type="doi">10.1021/acs.accounts.0c00594</pub-id>
<pub-id pub-id-type="pmid">40459283</pub-id>
<pub-id pub-id-type="pmcid">PMC12177927</pub-id>
</element-citation>
</ref>
<ref id="B115">
<label>115</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Bernards</surname>
<given-names>MT</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Shao</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>Sequence-based peptide identification, generation, and property prediction with deep learning: a review</article-title>
<source>Mol Syst Des Eng</source>
<year iso-8601-date="2021">2021</year>
<volume>6</volume>
<fpage>406</fpage>
<lpage>28</lpage>
<pub-id pub-id-type="doi">10.1039/D0ME00161A</pub-id>
</element-citation>
</ref>
<ref id="B116">
<label>116</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chung</surname>
<given-names>CR</given-names>
</name>
<name>
<surname>Jhong</surname>
<given-names>JH</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wan</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Horng</surname>
<given-names>JT</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Characterization and Identification of Natural Antimicrobial Peptides on Different Organisms</article-title>
<source>Int J Mol Sci</source>
<year iso-8601-date="2020">2020</year>
<volume>21</volume>
<elocation-id>986</elocation-id>
<pub-id pub-id-type="doi">10.3390/ijms21030986</pub-id>
<pub-id pub-id-type="pmid">32024233</pub-id>
<pub-id pub-id-type="pmcid">PMC7038045</pub-id>
</element-citation>
</ref>
<ref id="B117">
<label>117</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fu</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Niu</surname>
<given-names>B</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Wu</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W</given-names>
</name>
</person-group>
<article-title>CD-HIT: accelerated for clustering the next-generation sequencing data</article-title>
<source>Bioinformatics</source>
<year iso-8601-date="2012">2012</year>
<volume>28</volume>
<fpage>3150</fpage>
<lpage>2</lpage>
<pub-id pub-id-type="doi">10.1093/bioinformatics/bts565</pub-id>
<pub-id pub-id-type="pmid">23060610</pub-id>
<pub-id pub-id-type="pmcid">PMC3516142</pub-id>
</element-citation>
</ref>
<ref id="B118">
<label>118</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kondratenko</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Korobeynikov</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Lapidus</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>CDSnake: Snakemake pipeline for retrieval of annotated OTUs from paired-end reads using CD-HIT utilities</article-title>
<source>BMC Bioinformatics</source>
<year iso-8601-date="2020">2020</year>
<volume>21</volume>
<elocation-id>303</elocation-id>
<pub-id pub-id-type="doi">10.1186/s12859-020-03591-6</pub-id>
<pub-id pub-id-type="pmid">32703166</pub-id>
<pub-id pub-id-type="pmcid">PMC7379820</pub-id>
</element-citation>
</ref>
<ref id="B119">
<label>119</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lertampaiporn</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Vorapreeda</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Hongsthong</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Thammarongtham</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Ensemble-AMPPred: Robust AMP Prediction and Recognition Using the Ensemble Learning Method with a New Hybrid Feature for Differentiating AMPs</article-title>
<source>Genes (Basel)</source>
<year iso-8601-date="2021">2021</year>
<volume>12</volume>
<elocation-id>137</elocation-id>
<pub-id pub-id-type="doi">10.3390/genes12020137</pub-id>
<pub-id pub-id-type="pmid">33494403</pub-id>
<pub-id pub-id-type="pmcid">PMC7911732</pub-id>
</element-citation>
</ref>
<ref id="B120">
<label>120</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Amerifar</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Norouzi</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Ghandi</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>A tool for feature extraction from biological sequences</article-title>
<source>Brief Bioinform</source>
<year iso-8601-date="2022">2022</year>
<volume>23</volume>
<elocation-id>bbac108</elocation-id>
<pub-id pub-id-type="doi">10.1093/bib/bbac108</pub-id>
<pub-id pub-id-type="pmid">35383372</pub-id>
</element-citation>
</ref>
<ref id="B121">
<label>121</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Das</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Sercu</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Wadhawan</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Padhi</surname>
<given-names>I</given-names>
</name>
<name>
<surname>Gehrmann</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Cipcigan</surname>
<given-names>F</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Accelerated antimicrobial discovery via deep generative models and molecular dynamics simulations</article-title>
<source>Nat Biomed Eng</source>
<year iso-8601-date="2021">2021</year>
<volume>5</volume>
<fpage>613</fpage>
<lpage>23</lpage>
<pub-id pub-id-type="doi">10.1038/s41551-021-00689-x</pub-id>
<pub-id pub-id-type="pmid">33707779</pub-id>
</element-citation>
</ref>
<ref id="B122">
<label>122</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gligorijević</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Renfrew</surname>
<given-names>PD</given-names>
</name>
<name>
<surname>Kosciolek</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Leman</surname>
<given-names>JK</given-names>
</name>
<name>
<surname>Berenberg</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Vatanen</surname>
<given-names>T</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Structure-based protein function prediction using graph convolutional networks</article-title>
<source>Nat Commun</source>
<year iso-8601-date="2021">2021</year>
<volume>12</volume>
<elocation-id>3168</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41467-021-23303-9</pub-id>
<pub-id pub-id-type="pmid">34039967</pub-id>
<pub-id pub-id-type="pmcid">PMC8155034</pub-id>
</element-citation>
</ref>
<ref id="B123">
<label>123</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>X</given-names>
</name>
</person-group>
<article-title>Graph Clustering with High-Order Contrastive Learning</article-title>
<source>Entropy (Basel)</source>
<year iso-8601-date="2023">2023</year>
<volume>25</volume>
<elocation-id>1432</elocation-id>
<pub-id pub-id-type="doi">10.3390/e25101432</pub-id>
<pub-id pub-id-type="pmid">37895553</pub-id>
<pub-id pub-id-type="pmcid">PMC10606795</pub-id>
</element-citation>
</ref>
<ref id="B124">
<label>124</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>K</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>W</given-names>
</name>
</person-group>
<article-title>Local structure-aware graph contrastive representation learning</article-title>
<source>Neural Netw</source>
<year iso-8601-date="2024">2024</year>
<volume>172</volume>
<elocation-id>106083</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.neunet.2023.12.037</pub-id>
<pub-id pub-id-type="pmid">38182463</pub-id>
</element-citation>
</ref>
<ref id="B125">
<label>125</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jiao</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Bao</surname>
<given-names>Q</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>Z</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Contrastive representation learning on dynamic networks</article-title>
<source>Neural Netw</source>
<year iso-8601-date="2024">2024</year>
<volume>174</volume>
<elocation-id>106240</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.neunet.2024.106240</pub-id>
<pub-id pub-id-type="pmid">38521019</pub-id>
</element-citation>
</ref>
<ref id="B126">
<label>126</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ramos-Llorens</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Bello-Madruga</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Valle</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Andreu</surname>
<given-names>D</given-names>
</name>
<name>
<surname>Torrent</surname>
<given-names>M</given-names>
</name>
</person-group>
<article-title>PyAMPA: a high-throughput prediction and optimization tool for antimicrobial peptides</article-title>
<source>mSystems</source>
<year iso-8601-date="2024">2024</year>
<volume>9</volume>
<elocation-id>e01358-23</elocation-id>
<pub-id pub-id-type="doi">10.1128/msystems.01358-23</pub-id>
<pub-id pub-id-type="pmid">38934543</pub-id>
<pub-id pub-id-type="pmcid">PMC11264690</pub-id>
</element-citation>
</ref>
<ref id="B127">
<label>127</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Warren</surname>
<given-names>RL</given-names>
</name>
<name>
<surname>Birol</surname>
<given-names>I</given-names>
</name>
</person-group>
<article-title>Models and data of AMPlify: a deep learning tool for antimicrobial peptide prediction</article-title>
<source>BMC Res Notes</source>
<year iso-8601-date="2023">2023</year>
<volume>16</volume>
<elocation-id>11</elocation-id>
<pub-id pub-id-type="doi">10.1186/s13104-023-06279-1</pub-id>
<pub-id pub-id-type="pmid">36732807</pub-id>
<pub-id pub-id-type="pmcid">PMC9896668</pub-id>
</element-citation>
</ref>
<ref id="B128">
<label>128</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhou</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Kong</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Ding</surname>
<given-names>B</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>TriNet: A tri-fusion neural network for the prediction of anticancer and antimicrobial peptides</article-title>
<source>Patterns (N Y)</source>
<year iso-8601-date="2023">2023</year>
<volume>4</volume>
<elocation-id>100702</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.patter.2023.100702</pub-id>
<pub-id pub-id-type="pmid">36960450</pub-id>
<pub-id pub-id-type="pmcid">PMC10028424</pub-id>
</element-citation>
</ref>
<ref id="B129">
<label>129</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yan</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Bhadra</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Sethiya</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Qin</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Tai</surname>
<given-names>HK</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Deep-AmPEP30: Improve Short Antimicrobial Peptides Prediction with Deep Learning</article-title>
<source>Mol Ther Nucleic Acids</source>
<year iso-8601-date="2020">2020</year>
<volume>20</volume>
<fpage>882</fpage>
<lpage>94</lpage>
<pub-id pub-id-type="doi">10.1016/j.omtn.2020.05.006</pub-id>
<pub-id pub-id-type="pmid">32464552</pub-id>
<pub-id pub-id-type="pmcid">PMC7256447</pub-id>
</element-citation>
</ref>
<ref id="B130">
<label>130</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lin</surname>
<given-names>TT</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>LY</given-names>
</name>
<name>
<surname>Lu</surname>
<given-names>IH</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>WC</given-names>
</name>
<name>
<surname>Hsu</surname>
<given-names>ZR</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>S</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>AI4AMP: an Antimicrobial Peptide Predictor Using Physicochemical Property-Based Encoding Method and Deep Learning</article-title>
<source>mSystems</source>
<year iso-8601-date="2021">2021</year>
<volume>6</volume>
<elocation-id>e0029921</elocation-id>
<pub-id pub-id-type="doi">10.1128/mSystems.00299-21</pub-id>
<pub-id pub-id-type="pmid">34783578</pub-id>
<pub-id pub-id-type="pmcid">PMC8594441</pub-id>
</element-citation>
</ref>
<ref id="B131">
<label>131</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fernández-Díaz</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Cossio-Pérez</surname>
<given-names>R</given-names>
</name>
<name>
<surname>Agoni</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Lam</surname>
<given-names>HT</given-names>
</name>
<name>
<surname>Lopez</surname>
<given-names>V</given-names>
</name>
<name>
<surname>Shields</surname>
<given-names>DC</given-names>
</name>
</person-group>
<article-title>AutoPeptideML: a study on how to build more trustworthy peptide bioactivity predictors</article-title>
<source>Bioinformatics</source>
<year iso-8601-date="2024">2024</year>
<volume>40</volume>
<elocation-id>btae555</elocation-id>
<pub-id pub-id-type="doi">10.1093/bioinformatics/btae555</pub-id>
<pub-id pub-id-type="pmid">39292535</pub-id>
<pub-id pub-id-type="pmcid">PMC11438549</pub-id>
</element-citation>
</ref>
<ref id="B132">
<label>132</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bhadra</surname>
<given-names>P</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Fong</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Siu</surname>
<given-names>SWI</given-names>
</name>
</person-group>
<article-title>AmPEP: Sequence-based prediction of antimicrobial peptides using distribution patterns of amino acid properties and random forest</article-title>
<source>Sci Rep</source>
<year iso-8601-date="2018">2018</year>
<volume>8</volume>
<elocation-id>1697</elocation-id>
<pub-id pub-id-type="doi">10.1038/s41598-018-19752-w</pub-id>
<pub-id pub-id-type="pmid">29374199</pub-id>
<pub-id pub-id-type="pmcid">PMC5785966</pub-id>
</element-citation>
</ref>
<ref id="B133">
<label>133</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Xu</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>X</given-names>
</name>
<name>
<surname>Landersdorfer</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>HH</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>iAMPCN: a deep-learning approach for identifying antimicrobial peptides and their functional activities</article-title>
<source>Brief Bioinform</source>
<year iso-8601-date="2023">2023</year>
<volume>24</volume>
<elocation-id>bbad240</elocation-id>
<pub-id pub-id-type="doi">10.1093/bib/bbad240</pub-id>
<pub-id pub-id-type="pmid">37369638</pub-id>
<pub-id pub-id-type="pmcid">PMC10359087</pub-id>
</element-citation>
</ref>
<ref id="B134">
<label>134</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wang</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Meng</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Xia</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Luan</surname>
<given-names>Y</given-names>
</name>
</person-group>
<article-title>PAMPred: A hierarchical evolutionary ensemble framework for identifying plant antimicrobial peptides</article-title>
<source>Comput Biol Med</source>
<year iso-8601-date="2023">2023</year>
<volume>166</volume>
<elocation-id>107545</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.compbiomed.2023.107545</pub-id>
<pub-id pub-id-type="pmid">37806057</pub-id>
</element-citation>
</ref>
<ref id="B135">
<label>135</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Saravanan</surname>
<given-names>KM</given-names>
</name>
<name>
<surname>Wei</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Jiao</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Pan</surname>
<given-names>Y</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Deep Learning-Based Bioactive Therapeutic Peptide Generation and Screening</article-title>
<source>J Chem Inf Model</source>
<year iso-8601-date="2023">2023</year>
<volume>63</volume>
<fpage>835</fpage>
<lpage>45</lpage>
<pub-id pub-id-type="doi">10.1021/acs.jcim.2c01485</pub-id>
<pub-id pub-id-type="pmid">36724090</pub-id>
</element-citation>
</ref>
<ref id="B136">
<label>136</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ghanbarzadeh</surname>
<given-names>Z</given-names>
</name>
<name>
<surname>Hemmati</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Mohagheghzadeh</surname>
<given-names>A</given-names>
</name>
</person-group>
<article-title>Humanizing plant-derived snakins and their encrypted antimicrobial peptides</article-title>
<source>Biochimie</source>
<year iso-8601-date="2022">2022</year>
<volume>199</volume>
<fpage>92</fpage>
<lpage>111</lpage>
<pub-id pub-id-type="doi">10.1016/j.biochi.2022.04.011</pub-id>
<pub-id pub-id-type="pmid">35472564</pub-id>
</element-citation>
</ref>
<ref id="B137">
<label>137</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Shen</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Chen</surname>
<given-names>L</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>J</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Analogs of the Cathelicidin-Derived Antimicrobial Peptide PMAP-23 Exhibit Improved Stability and Antibacterial Activity</article-title>
<source>Probiotics Antimicrob Proteins</source>
<year iso-8601-date="2021">2021</year>
<volume>13</volume>
<fpage>273</fpage>
<lpage>86</lpage>
<pub-id pub-id-type="doi">10.1007/s12602-020-09686-z</pub-id>
<pub-id pub-id-type="pmid">32666297</pub-id>
</element-citation>
</ref>
<ref id="B138">
<label>138</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lyu</surname>
<given-names>Y</given-names>
</name>
<name>
<surname>Tan</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Xue</surname>
<given-names>M</given-names>
</name>
<name>
<surname>Hou</surname>
<given-names>W</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>C</given-names>
</name>
<name>
<surname>Shan</surname>
<given-names>A</given-names>
</name>
<etal>et al.</etal>
</person-group>
<article-title>Broad-spectrum hybrid antimicrobial peptides derived from PMAP-23 with potential LPS binding ability</article-title>
<source>Biochem Pharmacol</source>
<year iso-8601-date="2023">2023</year>
<volume>210</volume>
<elocation-id>115500</elocation-id>
<pub-id pub-id-type="doi">10.1016/j.bcp.2023.115500</pub-id>
<pub-id pub-id-type="pmid">36921633</pub-id>
</element-citation>
</ref>
<ref id="B139">
<label>139</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schmidtchen</surname>
<given-names>A</given-names>
</name>
<name>
<surname>Frick</surname>
<given-names>IM</given-names>
</name>
<name>
<surname>Andersson</surname>
<given-names>E</given-names>
</name>
<name>
<surname>Tapper</surname>
<given-names>H</given-names>
</name>
<name>
<surname>Björck</surname>
<given-names>L</given-names>
</name>
</person-group>
<article-title>Proteinases of common pathogenic bacteria degrade and inactivate the antibacterial peptide LL-37</article-title>
<source>Mol Microbiol</source>
<year iso-8601-date="2002">2002</year>
<volume>46</volume>
<fpage>157</fpage>
<lpage>68</lpage>
<pub-id pub-id-type="doi">10.1046/j.1365-2958.2002.03146.x</pub-id>
<pub-id pub-id-type="pmid">12366839</pub-id>
</element-citation>
</ref>
<ref id="B140">
<label>140</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Gunasekera</surname>
<given-names>S</given-names>
</name>
<name>
<surname>Muhammad</surname>
<given-names>T</given-names>
</name>
<name>
<surname>Strömstedt</surname>
<given-names>AA</given-names>
</name>
<name>
<surname>Rosengren</surname>
<given-names>KJ</given-names>
</name>
<name>
<surname>Göransson</surname>
<given-names>U</given-names>
</name>
</person-group>
<article-title>Backbone Cyclization and Dimerization of LL-37-Derived Peptides Enhance Antimicrobial Activity and Proteolytic Stability</article-title>
<source>Front Microbiol</source>
<year iso-8601-date="2020">2020</year>
<volume>11</volume>
<elocation-id>168</elocation-id>
<pub-id pub-id-type="doi">10.3389/fmicb.2020.00168</pub-id>
<pub-id pub-id-type="pmid">32153522</pub-id>
<pub-id pub-id-type="pmcid">PMC7046553</pub-id>
</element-citation>
</ref>
<ref id="B141">
<label>141</label>
<element-citation publication-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wan</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Wong</surname>
<given-names>F</given-names>
</name>
<name>
<surname>Collins</surname>
<given-names>JJ</given-names>
</name>
<name>
<surname>de la Fuente-Nunez</surname>
<given-names>C</given-names>
</name>
</person-group>
<article-title>Machine learning for antimicrobial peptide identification and design</article-title>
<source>Nat Rev Bioeng</source>
<year iso-8601-date="2024">2024</year>
<volume>2</volume>
<fpage>392</fpage>
<lpage>407</lpage>
<pub-id pub-id-type="doi">10.1038/s44222-024-00152-x</pub-id>
<pub-id pub-id-type="pmid">39850516</pub-id>
<pub-id pub-id-type="pmcid">PMC11756916</pub-id>
</element-citation>
</ref>
</ref-list>
</back>
</article>