charts

Publications

Publication details

The computational analysis of scientific literature to define and recognize gene expression clusters
Journal Article
Reference:
S. Raychaudhuri, J. T. Chang, F. Imam, R. B. Altman. Nucleic Acids Research, 31, August 1, 2003, 4553-4560. Published in 2003.
Abstract:

A limitation of many gene expression analytic
approaches is that they do not incorporate comprehensive
background knowledge about the genes
into the analysis. We present a computational
method that leverages the peer-reviewed literature
in the automatic analysis of gene expression data
sets. Including the literature in the analysis of gene
expression data offers an opportunity to incorporate
functional information about the genes when deŽning
expression clusters. We have created a method
that associates gene expression proŽles with
known biological functions. Our method has two
steps. First, we apply hierarchical clustering to the
given gene expression data set. Secondly, we use
text from abstracts about genes to (i) resolve hierarchical
cluster boundaries to optimize the functional
coherence of the clusters and (ii) recognize
those clusters that are most functionally coherent.
In the case where a gene has not been investigated
and therefore lacks primary literature, articles about
well-studied homologous genes are added as references.
We apply our method to two large gene
expression data sets with different properties. The
Žrst contains measurements for a subset of well-studied
Saccharomyces cerevisiae genes with
multiple literature references, and the second contains
newly discovered genes in Drosophila melanogaster;
many have no literature references at all. In
both cases, we are able to rapidly deŽne and identify
the biologically relevant gene expression proŽles
without manual intervention. In both cases, we
identiŽed novel clusters that were not noted by the
original investigators.

Full PDF version available here
Back to Search Results
 
Information last updated: Sat Jun 2 2007
Make Corrections to this Publication
Stanford School of Medicine