charts

Publications

Publication details

Ontology-driven Indexing of Public Datasets for Translational Bioinformatics
Conference Proceeding
Reference:
N. H. Shah, A. P. Chiang, A. J. Butte, R. Chen, M. A. Musen. American Medical Informatics Association Symposium on Translational Bioinformatics, San Francisco, CA, March 10-12, 2008. Published in 2008.
Abstract:

The volume of publicly available genomic scale data is increasing. Genomic datasets in public repositories are annotated with free-text fields describing the pathological state of the studied sample. These annotations are not mapped to concepts in any ontology, making it difficult to integrate these datasets across repositories. We have previously developed methods to map text-annotations of tissue microarrays to concepts in the NCI thesaurus and SNOMED-CT.

In this work we generalize our methods to map text annotations of gene expression datasets to concepts in the UMLS. We demonstrate the utility of our methods by processing annotations of datasets in the Gene Expression Omnibus. We demonstrate that we enable ontology-based querying and integration of tissue and gene expression microarray data. We enable identification of datasets on specific diseases across both repositories. Our approach provides the basis for ontology-driven data integration for translational research on gene and protein expression data.

Full PDF version available here
View the NCBO project
Back to Search Results
 
Information last updated: Fri May 9 2008
Make Corrections to this Publication
Stanford School of Medicine