34 Citations (Scopus)

Abstract

In recent years, the rapid growth of biological data has increased interest in using bioinformatics to analyze and interpret this data. Proteomics, which studies the structure, function, and interactions of proteins, is a crucial area of bioinformatics. Using natural language processing (NLP) techniques in proteomics is an emerging field that combines machine learning and text mining to analyze biological data. Recently, transformer-based NLP models have gained significant attention for their ability to process variable-length input sequences in parallel, using self-attention mechanisms to capture long-range dependencies. In this review paper, we discuss the recent advancements in transformer-based NLP models in proteome bioinformatics and examine their advantages, limitations, and potential applications to improve the accuracy and efficiency of various tasks. Additionally, we highlight the challenges and future directions of using these models in proteome bioinformatics research. Overall, this review provides valuable insights into the potential of transformer-based NLP models to revolutionize proteome bioinformatics.

Original languageEnglish
Article number2300011
JournalProteomics
Volume23
Issue number23-24
DOIs
Publication statusPublished - Dec 2023

Keywords

  • bioinformatics
  • deep learning
  • drug discovery
  • explainable artificial intelligence
  • natural language processing
  • protein expression
  • protein function prediction
  • transformer attention

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology

Fingerprint

Dive into the research topics of 'Leveraging transformers-based language models in proteome bioinformatics'. Together they form a unique fingerprint.

Cite this