Extracting eligibility criteria from the narrative text of scientific research articles

Ching Yun Lin, Der Ming Liou, Mei Lien Pan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Eligibility criteria across hundreds of National Health Insurance Research Database (NHIRD) research papers share similar constituent elements, such as demographic characteristics and diagnostic codes. Study results for the same disease can vary between studies because of differences in the criteria statements; a tool for analyzing narrative patterns would therefore help summarize the knowledge implicitly contained in the eligibility criteria. In this study, we developed a series of R-based text processing methods to extract the narrative eligibility criteria from NHIRD papers by simplifying the article titles and content paragraphs, identifying medical concepts and abbreviations, and then detecting basic demographic characteristics and ICD-9-CM diagnosis codes. Although there is still room for improvement in study type identification, the high performance in classifying the study type, detecting age restrictions, and extracting ICD-9-CM codes shows the system's usefulness for the analysis of eligibility criteria.
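To make the last two detection steps concrete, the following is a minimal illustrative sketch of pulling an age restriction and ICD-9-CM diagnosis codes out of a single eligibility sentence. It is written in Python for illustration only and is not the authors' R implementation; the function names, the regular expressions, and the sample sentences are all assumptions, and the patterns cover only a couple of common phrasings.

```python
import re

# Hypothetical patterns for illustration; not taken from the paper.
# ICD-9-CM diagnosis codes: three digits, V plus two digits, or E plus
# three digits, each optionally followed by one or two decimal digits.
ICD9_CODE = re.compile(r"\b(?:\d{3}|V\d{2}|E\d{3})(?:\.\d{1,2})?\b")
ICD9_MENTION = re.compile(r"ICD-9(?:-CM)?", re.IGNORECASE)

# Two common age phrasings: "aged 18 to 65" and "20 years or older".
AGE_RANGE = re.compile(r"\baged?\s+(\d{1,3})\s*(?:-|–|to)\s*(\d{1,3})", re.IGNORECASE)
AGE_MIN = re.compile(
    r"\b(?:aged?\s+)?(\d{1,3})\s+years?(?:\s+of\s+age)?\s+(?:or|and)\s+(?:older|above)",
    re.IGNORECASE,
)


def extract_icd9_codes(sentence):
    """Return code-like tokens that appear after the first ICD-9 mention."""
    mention = ICD9_MENTION.search(sentence)
    if mention is None:
        # Without an explicit ICD-9 mention, bare numbers are too ambiguous.
        return []
    return ICD9_CODE.findall(sentence[mention.end():])


def extract_age_restriction(sentence):
    """Return (min_age, max_age); max_age is None for open-ended limits."""
    match = AGE_RANGE.search(sentence)
    if match:
        return int(match.group(1)), int(match.group(2))
    match = AGE_MIN.search(sentence)
    if match:
        return int(match.group(1)), None
    return None


if __name__ == "__main__":
    # Made-up example sentence, not quoted from any NHIRD paper.
    text = (
        "We included patients aged 20 years or older with diabetes mellitus "
        "(ICD-9-CM code 250) or hypertension (ICD-9-CM codes 401-405)."
    )
    print(extract_age_restriction(text))
    print(extract_icd9_codes(text))
```

Restricting the code search to text that follows an explicit ICD-9 mention is a design choice of this sketch: it keeps unrelated three-digit numbers, such as sample sizes, from being reported as diagnosis codes when they precede the mention. Code ranges such as "401-405" are returned as their two endpoints rather than expanded.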

Original language: English
Title of host publication: MEDINFO 2017
Subtitle of host publication: Precision Healthcare through Informatics - Proceedings of the 16th World Congress on Medical and Health Informatics
Editors: Zhao Dongsheng, Adi V. Gundlapalli, Jaulent Marie-Christine
Publisher: IOS Press
Pages: 481-485
Number of pages: 5
ISBN (Electronic): 9781614998297
Publication status: Published - Jan 1 2017
Externally published: Yes
Event: 16th World Congress of Medical and Health Informatics: Precision Healthcare through Informatics, MedInfo 2017 - Hangzhou, China
Duration: Aug 21 2017 → Aug 25 2017

Publication series

Name: Studies in Health Technology and Informatics
Volume: 245
ISSN (Print): 0926-9630
ISSN (Electronic): 1879-8365

Other

Other: 16th World Congress of Medical and Health Informatics: Precision Healthcare through Informatics, MedInfo 2017
Country/Territory: China
City: Hangzhou
Period: 8/21/17 → 8/25/17

Keywords

  • Database
  • Natural language processing

ASJC Scopus subject areas

  • Biomedical Engineering
  • Health Informatics
  • Health Information Management
