Human Breathomics Database

Tien Chueh Kuo, Cheng En Tan, San Yuan Wang, Olivia A. Lin, Bo Han Su, Ming Tsung Hsu, Jessica Lin, Yu Yen Cheng, Ciao Sin Chen, Yu Chieh Yang, Kuo Hsing Chen, Shu Wen Lin, Chao Chi Ho, Ching Hua Kuo, Yufeng Jane Tseng

Research output: Contribution to journal › Article › peer-review

33 Citations (Scopus)


Breathomics is a specialized branch of metabolomics that quantifies volatile organic compounds (VOCs) in collected exhaled breath samples. Understanding how breath molecules identified from experimental analytical measurements relate to diseases, mechanisms and pathways is challenging because there is no organized resource describing breath molecules, related references and the biomedical information embedded in the literature. To provide breath VOCs, related references and biomedical information, we aimed to build a database composed of manually curated information and automatically extracted biomedical information. First, we manually curated and organized VOC-related disease information, including Medical Subject Headings (MeSH) terms, from 207 publications associated with 99 VOCs. Then, an automated text-mining pipeline was used to collect 2766 publications and extract biomedical information from breath research. Finally, we combined the manually curated information with the automatically extracted biomedical information to assemble a breath molecule database, the Human Breathomics Database (HBDB). The HBDB includes references, VOCs and diseases associated with human breathomics. Most of these VOCs were detected in human breath samples or exhaled breath condensate samples. So far, the database contains a total of 913 VOCs related to human exhaled breath research reported in 2766 publications. The HBDB is the most comprehensive database of VOCs in human exhaled breath to date. It is a useful and organized resource for researchers and clinicians to identify and further investigate potential biomarkers from the breath of patients. Database URL:

Original language: English
Journal: Database: the journal of biological databases and curation
Publication status: Published - Jan 1 2020

ASJC Scopus subject areas

  • Information Systems
  • General Biochemistry, Genetics and Molecular Biology
  • General Agricultural and Biological Sciences

