Human Pol II promoter prediction by using nucleotide property composition features

Wen Lin Huang, Chun Wei Tung, Shinn Ying Ho

研究成果: 書貢獻/報告類型會議貢獻

1 引文 斯高帕斯(Scopus)

摘要

RNA polymerase II (Pol II) promoter is a key region that regulates differential transcription of protein coding genes. The identification of the RNA polymerase II (Pol II) promoter is one of the most challenging problems in genome annotation. Though many promoter prediction methods and tools have been developed, they have not yet extracted informative features from large-scale DNA sequences to improve predictive accuracy. A prediction method ProPolyII, which involves mining informative nucleotide property composition (NPC) features, is proposed to design a support vector machine-based classifier. An existing data set HumP (1872 human promoters and 1870 non-promoters) is used to evaluate ProPolyII for promoter prediction. ProPolyII yields 70 informative NPC features with training and test accuracies of 99.1% and 95.1%, respectively. The 70 NPC features consist of 46 4-mer motifs, 3 nucleotide properties and 21 global descriptors. The accuracies are better than those of Prom-Machine (94.9% and 91.1%) and M1 (97.4% and 93.6%) which uses top 128 4-mer motifs and 36 global descriptors, respectively. The high predictive performance indicates that ProPolyII can be beneficial in the identification of promoters comparative to other methods.

原文英語
主出版物標題ISB 2010 Proceedings - International Symposium on Biocomputing
DOIs
出版狀態已發佈 - 5月 3 2010
對外發佈
事件International Symposium on Biocomputing, ISB 2010 - Calicut, Kerala, 印度
持續時間: 2月 15 20102月 17 2010

出版系列

名字ISB 2010 Proceedings - International Symposium on Biocomputing

會議

會議International Symposium on Biocomputing, ISB 2010
國家/地區印度
城市Calicut, Kerala
期間2/15/102/17/10

ASJC Scopus subject areas

  • 一般生物化學,遺傳學和分子生物學
  • 計算機理論與數學
  • 軟體
  • 藥學科學

指紋

深入研究「Human Pol II promoter prediction by using nucleotide property composition features」主題。共同形成了獨特的指紋。

引用此