Yuna Oikawa; Takanori Uzawa; Francois Berenger; Noriko Minagawa; Akiko Yumoto; Hideaki Takaku; Ryo Tamura; Yoshihiro Ito; Koji Tsuda
Description:
(abstract) Language models have been increasingly popular in therapeutic peptide generation, but molecular diversity remains limited due to reliance on the 20 canonical amino acids. We propose a language model that generates peptidomimetics incorporating noncanonical elements like noncanonical amino acids and terminal modifications. To accomplish this, we created a vocabulary of over 17,000 noncanonical elements by extracting them from chemical formulas stored in the ChEMBL database. Our pretrained language model, GPepT, showed improved diversity in molecular structures and chemical properties. To demonstrate its real-world application, we fine-tuned the model for antimicrobial peptides. Experimental validation revealed that one of the generated peptidomimetics exhibited effective antimicrobial activity, marking a successful case of AI-driven peptide development. GPepT is fully accessible on HuggingFace: https://huggingface.co/Playingyoyo/GPepT.
Rights:
Keywords: Language model, amino acid
Publication date: 2025-07-22
Publisher: American Chemical Society (ACS)
Journal:
Funding:
Manuscript type: Publisher's version (Version of record)
MDR DOI:
Published URL: https://doi.org/10.1021/acsmedchemlett.5c00375
Related materials:
Other identifiers:
Contact:
Last updated: 2025-08-25 12:30:37 +0900
Published on MDR: 2025-08-25 12:19:24 +0900
| File name | Media type | Size |
|---|---|---|
| oikawa-et-al-2025-gpept-a-foundation-language-model-for-peptidomimetics-incorporating-noncanonical-amino-acids.pdf | application/pdf | 3 MB |
| ml5c00375_si_002.pdf | application/pdf | 841 KB |