TY - JOUR
T1 - Multiple-omic data analysis of Klebsiella pneumoniae MGH 78578 reveals its transcriptional architecture and regulatory features
AU - Seo, Joo Hyun
AU - Hong, Jay S.
AU - Kim, Donghyuk
AU - Cho, Byung Kwan
AU - Huang, Tzu Wen
AU - Tsai, Shih Feng
AU - Palsson, Bernhard O.
AU - Charusanti, Pep
N1 - Funding Information:
This study was funded from a grant from the National Research Institute of Taiwan through PH-099-SP-10.
PY - 2012/11/29
Y1 - 2012/11/29
N2 - Background: The increasing number of infections caused by strains of Klebsiella pneumoniae that are resistant to multiple antibiotics has developed into a major medical problem worldwide. The development of next-generation sequencing technologies now permits rapid sequencing of many K. pneumoniae isolates, but sequence information alone does not provide important structural and operational information for its genome.Results: Here we take a systems biology approach to annotate the K. pneumoniae MGH 78578 genome at the structural and operational levels. Through the acquisition and simultaneous analysis of multiple sample-matched -omics data sets from two growth conditions, we detected 2677, 1227, and 1066 binding sites for RNA polymerase, RpoD, and RpoS, respectively, 3660 RNA polymerase-guided transcript segments, and 3585 transcription start sites throughout the genome. Moreover, analysis of the transcription start site data identified 83 probable leaderless mRNAs, while analysis of unannotated transcripts suggested the presence of 119 putative open reading frames, 15 small RNAs, and 185 antisense transcripts that are not currently annotated.Conclusions: These findings highlight the strengths of systems biology approaches to the refinement of sequence-based annotations, and to provide new insight into fundamental genome-level biology for this important human pathogen.
AB - Background: The increasing number of infections caused by strains of Klebsiella pneumoniae that are resistant to multiple antibiotics has developed into a major medical problem worldwide. The development of next-generation sequencing technologies now permits rapid sequencing of many K. pneumoniae isolates, but sequence information alone does not provide important structural and operational information for its genome.Results: Here we take a systems biology approach to annotate the K. pneumoniae MGH 78578 genome at the structural and operational levels. Through the acquisition and simultaneous analysis of multiple sample-matched -omics data sets from two growth conditions, we detected 2677, 1227, and 1066 binding sites for RNA polymerase, RpoD, and RpoS, respectively, 3660 RNA polymerase-guided transcript segments, and 3585 transcription start sites throughout the genome. Moreover, analysis of the transcription start site data identified 83 probable leaderless mRNAs, while analysis of unannotated transcripts suggested the presence of 119 putative open reading frames, 15 small RNAs, and 185 antisense transcripts that are not currently annotated.Conclusions: These findings highlight the strengths of systems biology approaches to the refinement of sequence-based annotations, and to provide new insight into fundamental genome-level biology for this important human pathogen.
KW - Infectious disease
KW - Klebsiella pneumoniae
KW - Omics data
KW - Systems biology
KW - Transcriptional architecture
UR - http://www.scopus.com/inward/record.url?scp=84870057527&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84870057527&partnerID=8YFLogxK
U2 - 10.1186/1471-2164-13-679
DO - 10.1186/1471-2164-13-679
M3 - Article
C2 - 23194155
AN - SCOPUS:84870057527
SN - 1471-2164
VL - 13
JO - BMC Genomics
JF - BMC Genomics
IS - 1
M1 - 679
ER -