TY - JOUR
T1 - Comparison of a semi-automatic annotation tool and a natural language processing application for the generation of clinical statement entries
AU - Lin, Ching Heng
AU - Wu, Nai Yuan
AU - Lai, Wei Shao
AU - Liou, Der Ming
N1 - Publisher Copyright:
© The Author 2014. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved.
PY - 2015/1/1
Y1 - 2015/1/1
N2 - Background and objective Electronic medical records with encoded entries should enhance the semantic interoperability of document exchange. However, it remains a challenge to encode the narrative concept and to transform the coded concepts into a standard entry-level document. This study aimed to use a novel approach for the generation of entrylevel interoperable clinical documents. Methods Using HL7 clinical document architecture (CDA) as the example, we developed three pipelines to generate entry- level CDA documents. The first approach was a semi-automatic annotation pipeline (SAAP), the second was a natural language processing (NLP) pipeline, and the third merged the above two pipelines. We randomly selected 50 test documents from the i2b2 corpora to evaluate the performance of the three pipelines. Results The 50 randomly selected test documents contained 9365 words, including 588 Observation terms and 123 Procedure terms. For the Observation terms, the merged pipeline had a significantly higher F-measure than the NLP pipeline (0.89 vs 0.80, p < 0.0001), but a similar F-measure to that of the SAAP (0.89 vs 0.87). For the Procedure terms, the F-measure was not significantly different among the three pipelines. Conclusions The combination of a semi-automatic annotation approach and the NLP application seems to be a solution for generating entry-level interoperable clinical documents.
AB - Background and objective Electronic medical records with encoded entries should enhance the semantic interoperability of document exchange. However, it remains a challenge to encode the narrative concept and to transform the coded concepts into a standard entry-level document. This study aimed to use a novel approach for the generation of entrylevel interoperable clinical documents. Methods Using HL7 clinical document architecture (CDA) as the example, we developed three pipelines to generate entry- level CDA documents. The first approach was a semi-automatic annotation pipeline (SAAP), the second was a natural language processing (NLP) pipeline, and the third merged the above two pipelines. We randomly selected 50 test documents from the i2b2 corpora to evaluate the performance of the three pipelines. Results The 50 randomly selected test documents contained 9365 words, including 588 Observation terms and 123 Procedure terms. For the Observation terms, the merged pipeline had a significantly higher F-measure than the NLP pipeline (0.89 vs 0.80, p < 0.0001), but a similar F-measure to that of the SAAP (0.89 vs 0.87). For the Procedure terms, the F-measure was not significantly different among the three pipelines. Conclusions The combination of a semi-automatic annotation approach and the NLP application seems to be a solution for generating entry-level interoperable clinical documents.
KW - Auto-complete technique
KW - CDA entry level
KW - Natural language processing
UR - http://www.scopus.com/inward/record.url?scp=84929502116&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84929502116&partnerID=8YFLogxK
U2 - 10.1136/amiajnl-2014-002991
DO - 10.1136/amiajnl-2014-002991
M3 - Article
C2 - 25332357
AN - SCOPUS:84929502116
SN - 1067-5027
VL - 22
SP - 132
EP - 142
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 1
ER -