Publication details

GAPSCORE: finding gene and protein names one word at a time
Journal Article
Reference:
J. T. Chang, H. Schutze, R. B. Altman. Bioinformatics, Vol. 20, no. 2, pp. 216-225, January 22, 2004.
Abstract:

Motivation: New high-throughput technologies have
accelerated the accumulation of knowledge about genes and
proteins. However, much knowledge is still stored as written
natural language text. Therefore, we have developed a new
method, GAPSCORE, to identify gene and protein names in
text. GAPSCORE scores words based on a statistical model
of gene names that quantifies their appearance, morphology
and context.
Results: We evaluated GAPSCORE against the Yapex data
set and achieved an F-score of 82.5% (83.3% recall, 81.5%
precision) for partial matches and 57.6% (58.5% recall, 56.7%
precision) for exact matches. Since the method is statistical,
users can choose score cutoffs that adjust the performance
according to their needs.
Availability: GAPSCORE is available at http://bionlp.stanford.edu/gapscore/
Contact: russ.altman@stanford.edu
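
Illustration: the abstract notes that users can choose score cutoffs to adjust performance. The sketch below shows how moving a cutoff trades precision against recall. It assumes the F-score is the usual balanced harmonic mean of precision and recall, which the abstract does not state explicitly. The function name, the words and the scores are hypothetical illustration data, not GAPSCORE output or its API.

```python
from typing import List, Tuple

# Hypothetical (word, score, is_gene_name) triples for illustration only.
# These are NOT produced by GAPSCORE.
EXAMPLE: List[Tuple[str, float, bool]] = [
    ("BRCA1", 0.95, True),
    ("p53", 0.80, True),
    ("kinase", 0.60, False),
    ("Raf-1", 0.55, True),
    ("cell", 0.30, False),
    ("the", 0.05, False),
]


def precision_recall_f(
    scored: List[Tuple[str, float, bool]], cutoff: float
) -> Tuple[float, float, float]:
    """Return (precision, recall, F) when words scoring >= cutoff are called names."""
    tp = sum(1 for _, s, gold in scored if s >= cutoff and gold)
    fp = sum(1 for _, s, gold in scored if s >= cutoff and not gold)
    fn = sum(1 for _, s, gold in scored if s < cutoff and gold)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    # Balanced F-score: harmonic mean of precision and recall (assumed definition).
    total = precision + recall
    f = 2 * precision * recall / total if total else 0.0
    return precision, recall, f


if __name__ == "__main__":
    for cutoff in (0.5, 0.7):
        p, r, f = precision_recall_f(EXAMPLE, cutoff)
        print(f"cutoff={cutoff:.1f} precision={p:.2f} recall={r:.2f} F={f:.2f}")
```

On this toy data, raising the cutoff from 0.5 to 0.7 removes the one false positive but also drops one true name, so precision rises while recall falls.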

Information last updated: Sat Jun 2 2007
Stanford School of Medicine