Matching carbohydrate-binding domains in Arabidopsis thaliana genome: development of a lectin database

MAIN

©1996-2019 All Rights Reserved. Online Journal of Bioinformatics. You may not store these pages in any form except for your own personal use. All other usage or distribution is illegal under international copyright treaties. Permission to use any of these pages in any other way besides the before mentioned must be gained in writing from the publisher. This article is exclusively copyrighted in its entirety to OJB publications. This article may be copied once but may not be reproduced or re-transmitted without the express permission of the editors.

OJB©

Online Journal of Bioinformatics©

Volume 4 : 96-105, 2003.

Matching carbohydrate-binding domains in Arabidopsis thaliana genome: development of a lectin database.

Moreno FB¹, Facó F¹, Ceccatto VM², Sampaio AH³, Costa ASB¹, Freitas JLT¹, Nogueira LL¹, Lima ME⁴, Lima-Filho JL⁵, Cavada BS¹*

¹Departamento de Bioquímica e Biologia Molecular, Universidade Federal do Ceará, Fortaleza-CE Campus do Pici S/N CEP: 60451-970 Caixa Postal 6033 Brasil. ²Universidade Estadual do Ceará (UECE). ³Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Fortaleza-CE, ⁴LIKA – Universidade Federal do Pernambuco (UFPE), ⁵Cin – Universidade Federal do Pernambuco(UFPE) *Correspondence bscavada@ufc.br

ABSTRACT

Moreno FB, Facó F, Ceccatto VM, Sampaio AH, Costa ASB, Freitas JLT, Nogueira LL, Lima ME, Lima-Filho JL, Cavada SB., Matching carbohydrate-binding domains in Arabidopsis thaliana genome: development of a lectin database, Onl J Bioinform., 4: 96-195, 2003. Processing of databases used for homology searching requires great computational power. Processing time can be reduced by integrating databases. PERL was used to filter specific sequences from a non-redundant protein database by counting and classifying sequences taxonomically. A regular expression was matched against a string. The script was used to build the first lectin database with 1,639 sequences entries in FASTA format. The program was applied to the analysis of Arabidopsis thaliana genome. All the unclassified open reading frames from this genome were catalogued and analyzed by homology searching. Six possible proteins containing carbohydrate domains were found. The proposed lectin database and PERL scripts could be used as a generic proteomic tool.

KEYWORDS: lectin, Arabdopsis, database, genome, proteome.

MAIN

FULL-TEXT (SUBSCRIPTION OR PURCHASE TITLE $25USD)