Classification of PICO elements by text features systematically extracted from PubMed abstracts

Ke Chun Huang, Charles Chih Ho Liu, Shung Shiang Yang, Furen Xiao, Jau Min Wong, Chun Chih Liao, I. Jen Chiang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

20 Citations (Scopus)

Abstract

We propose and evaluate a systematic approach to detecting and classifying Patient/Problem, Intervention, Comparison and Outcome (PICO) elements in the medical literature. The training and test corpora were generated systematically and automatically from structured PubMed abstracts: 23,472 sentences were selected by exact pattern matching of the head words of the P-I-O categories. The terms with the highest frequencies were then used as features for a Naïve Bayes classifier. This approach achieves F-measure values of 0.91 for Patient/Problem, 0.75 for Intervention and 0.88 for Outcome, comparable to previous studies based on mixed textual, paragraph-level, and semantic features. In conclusion, we show that, with stricter pattern-matching criteria for the training set, detection and classification of PICO elements can be made reproducible with minimal expert intervention. The results of this work are higher than those reported in previous studies.
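The pipeline outlined in the abstract (label sentences by exact match on section head words, keep the most frequent terms as features, train a Naïve Bayes classifier, score with F-measure) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the head-word lists, the tokenizer, the add-one smoothing, the value of `k`, and the toy sentences are all assumptions made for illustration, and every function name here is hypothetical.

```python
import math
import re
from collections import Counter

# ASSUMPTION: illustrative head words; the paper's actual lists are not given here.
HEAD_WORDS = {
    "P": {"patients", "participants", "subjects"},
    "I": {"intervention", "interventions"},
    "O": {"outcome", "outcomes"},
}

def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only (assumed tokenizer)."""
    return re.findall(r"[a-z]+", text.lower())

def label_by_heading(heading):
    """Return 'P', 'I' or 'O' only when the heading exactly matches a head word."""
    key = heading.strip().lower()
    for label, words in HEAD_WORDS.items():
        if key in words:
            return label
    return None

def top_terms(labelled, k):
    """Select the k most frequent terms in the labelled sentences as features."""
    counts = Counter(tok for text, _ in labelled for tok in tokenize(text))
    return {term for term, _ in counts.most_common(k)}

def train(labelled, vocab):
    """Count classes and per-class term occurrences, restricted to the vocabulary."""
    class_counts = Counter(label for _, label in labelled)
    term_counts = {label: Counter() for label in class_counts}
    for text, label in labelled:
        term_counts[label].update(t for t in tokenize(text) if t in vocab)
    return class_counts, term_counts

def predict(text, vocab, class_counts, term_counts):
    """Multinomial Naive Bayes with add-one smoothing (smoothing is an assumption)."""
    total = sum(class_counts.values())
    best, best_score = None, -math.inf
    for label, n in class_counts.items():
        denom = sum(term_counts[label].values()) + len(vocab)
        score = math.log(n / total)
        for tok in tokenize(text):
            if tok in vocab:
                score += math.log((term_counts[label][tok] + 1) / denom)
        if score > best_score:
            best, best_score = label, score
    return best

def f_measure(gold, pred, label):
    """Balanced F-measure (harmonic mean of precision and recall) for one class."""
    tp = sum(g == label and p == label for g, p in zip(gold, pred))
    fp = sum(g != label and p == label for g, p in zip(gold, pred))
    fn = sum(g == label and p != label for g, p in zip(gold, pred))
    if tp == 0:
        return 0.0
    precision, recall = tp / (tp + fp), tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

# Toy (heading, sentence) pairs standing in for structured PubMed abstracts.
sections = [
    ("Patients", "Adults with type 2 diabetes were enrolled."),
    ("Intervention", "Participants received metformin daily."),
    ("Outcomes", "Mean glucose decreased after treatment."),
    ("Background", "Diabetes is common."),  # no head-word match, so it is skipped
]
labelled = [(text, label_by_heading(h)) for h, text in sections if label_by_heading(h)]
vocab = top_terms(labelled, k=50)
class_counts, term_counts = train(labelled, vocab)
```

Calling `predict("Adults with diabetes", vocab, class_counts, term_counts)` then returns the most probable label under this toy model. Because labels come only from exact heading matches, no manual annotation is needed to build the training set, which is the property the abstract emphasizes.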

Original language: English
Title of host publication: Proceedings - 2011 IEEE International Conference on Granular Computing, GrC 2011
Pages: 279-283
Number of pages: 5
Publication status: Published - 2011
Event: 2011 IEEE International Conference on Granular Computing, GrC 2011 - Kaohsiung, Taiwan
Duration: Nov 8 2011 → Nov 10 2011

Other

Other: 2011 IEEE International Conference on Granular Computing, GrC 2011
Country/Territory: Taiwan
City: Kaohsiung
Period: 11/8/11 → 11/10/11

Keywords

  • information extraction
  • natural language processing
  • question answering
  • text mining

ASJC Scopus subject areas

  • Software
