charts

Publications

Publication details

Unsupervised Method for Automatic Construction of a Disease Dictionary from a Large Free Text Collection
Conference Proceeding
Reference:
R. Xu, K. S. Supekar, A. A. Morgan, A. K. Das, A. M. Garber. AMIA Annual Symposium, Washington DC. Published in 2008.
Abstract:

Concept specific lexicons (e.g. diseases, drugs, anat-
omy) are a critical source of background knowledge for
many medical language-processing systems. However,
the rapid pace of biomedical research and the lack of
constraints on usage ensure that such dictionaries are
incomplete. Focusing on disease terminology, we have
developed an automated, unsupervised, iterative pattern
learning approach for constructing a comprehensive
medical dictionary of disease terms from randomized
clinical trial (RCT) abstracts, and we compared differ-
ent ranking methods for automatically extracting con-
textual patterns and concept terms. When used to
identify disease concepts from 100 randomly chosen,
manually annotated clinical abstracts, our disease dic-
tionary shows significant performance improvement
(F1 increased by 35-88%) over available, manually
created disease terminologies.

Back to Search Results
 
Information last updated: Fri Jun 27 2008
Make Corrections to this Publication
Stanford School of Medicine