Protein sequences derived from cDNA and genomic sequences can be found in two main databases, SWISSPROT/TrEMBL (Bairoch and Apweiler 2000) and PIR (Barker et al. 2001). SWISSPROT is a highly curated database of protein sequences derived from the EMBL nucleotide database. It keeps redundancy in the data to a minimum, and each entry is extensively cross-referenced to other bioinformatics databases. TrEMBL (translated EMBL) is a supplement to SWISSPROT that contains the translations of EMBL nucleotide sequences not yet integrated into SWISSPROT (http://www.ebi.ac.uk/swissprot/ and http://www.expasy.ch/sprot/sprot-top.html). The Protein Information Resource Protein Sequence Database (PIR-PSD) provides a resource similar to SWISSPROT (http://pir.georgetown.edu/).