Semantic similarity measures as tools for exploring the gene ontology

P W Lord; R D Stevens; A Brass; C A Goble

doi:10.1142/9789812776303_0056

Semantic similarity measures as tools for exploring the gene ontology

Pac Symp Biocomput. 2003:601-12. doi: 10.1142/9789812776303_0056.

Authors

P W Lord¹, R D Stevens, A Brass, C A Goble

Affiliation

¹ Department of Computer Science, University of Manchester, Oxford Road, Manchester, M13 9PL, UK. p.lord@russet.org.uk

PMID: 12603061
DOI: 10.1142/9789812776303_0056

Abstract

Many bioinformatics resources hold data in the form of sequences. Often this sequence data is associated with a large amount of annotation. In many cases this data has been hard to model, and has been represented as scientific natural language, which is not readily computationally amenable. The development of the Gene Ontology provides us with a more accessible representation of some of this data. However it is not clear how this data can best be searched, or queried. Recently we have adapted information content based measures for use with the Gene Ontology (GO). In this paper we present detailed investigation of the properties of these measures, and examine various properties of GO, which may have implications for its future design.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Classification
Computational Biology*
Databases, Protein
Genomics / statistics & numerical data*
Humans
Proteomics / statistics & numerical data
Sequence Alignment / statistics & numerical data