A computational strategy for protein function assignment which addresses the multidomain problem

AJ Pérez, A Rodríguez, O Trelles… - … and functional genomics, 2002 - Wiley Online Library
Comparative and functional genomics, 2002 - Wiley Online Library
Abstract
A method for assigning functions to unknown sequences based on finding correlations between short signals and functional annotations in a protein database is presented. This approach is based on keyword (KW) and feature (FT) information stored in the SWISS‐PROT database. The former refers to particular protein characteristics and the latter locates these characteristics at a specific sequence position. In this way, a certain keyword is only assigned to a sequence if sequence similarity is found in the position described by the FT field. Exhaustive tests performed over sequences with homologues (cluster set) and without homologues (singleton set) in the database show that assigning functions is much ‘cleaner’ when information about domains (FT field) is used, than when only the keywords are used. Copyright © 2002 John Wiley & Sons, Ltd.
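The position-aware rule described in the abstract (transfer a keyword only when the similarity falls on the region the FT field points to) can be illustrated with a minimal sketch. This is not the authors' implementation: the `Feature` and `Hit` types, the pairing of a keyword with a single residue range, and the `min_coverage` threshold are all assumptions introduced here for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Feature:
    """Hypothetical FT-style annotation: a keyword tied to a residue range."""
    keyword: str
    start: int  # 1-based, inclusive
    end: int    # 1-based, inclusive


@dataclass(frozen=True)
class Hit:
    """Hypothetical similarity hit: the part of the database entry that aligned."""
    subject_start: int  # 1-based, inclusive
    subject_end: int    # 1-based, inclusive


def overlap(a_start, a_end, b_start, b_end):
    """Number of positions shared by two inclusive ranges (0 if disjoint)."""
    return max(0, min(a_end, b_end) - max(a_start, b_start) + 1)


def transfer_keywords(hit, features, min_coverage=0.8):
    """Return keywords whose feature range is sufficiently covered by the hit.

    min_coverage is an assumed threshold, not a value taken from the paper.
    """
    assigned = set()
    for ft in features:
        length = ft.end - ft.start + 1
        covered = overlap(hit.subject_start, hit.subject_end, ft.start, ft.end)
        if covered / length >= min_coverage:
            assigned.add(ft.keyword)
    return assigned


# Toy multidomain entry: two annotated regions, but the hit spans only the first.
features = [Feature("Kinase", 10, 60), Feature("Zinc-finger", 200, 230)]
hit = Hit(subject_start=5, subject_end=80)
assigned = transfer_keywords(hit, features)
print(sorted(assigned))
```

In this toy case a keyword-only transfer would copy both annotations to the query, whereas the position-aware rule keeps only the keyword whose region actually aligned, which is the multidomain issue the title refers to.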