Bioinformatics and Functional Genomics

Chapter: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | App 1 | App 2


Chapter 10: Multiple sequence alignment


Web resources from Chapter 10
Website URL
ClustalW http://www2.ebi.ac.uk/clustalw/
HMMER http://hmmer.wustl.edu
PFAM-Washington University of St Louis http://Pfam.wustl.edu/
PFAM-The Sanger Center http://www.sanger.ac.uk/Software/Pfam/
PFAM http://www.cgb.ki.se/Pfam
PFAM http://pfam.jouy.inra.fr
PFAM DNA query http://www.sanger.ac.uk/Software/Pfam/dnasearch.shtml
SMART http://smart.embl-heidelberg.de/
CDD http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml
DART http://www.ncbi.nlm.nih.gov/Structure/lexington/html/overview.html
BLOCKS http://www.blocks.fhcrc.org/
PROSITE http://www.expasy.org/prosite/
InterPro http://www.ebi.ac.uk/interpro/
MetaFam http://metafam.ahc.umn.edu/
iProClass http://pir.georgetown.edu/iproclass/
PopSet http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=popset
BALiBASE http://www-igbmc.u-strasbg.fr/BioInfo/BAliBASE/index.html

 

Tables

Table 10-1. Databases of multiple sequence alignments. See text for details. Abbreviations: HMM, Hidden Markov Model; MSA, multiple sequence alignment.
Name Description
BLOCKS HMM-like profile HMM library; ungapped
CDD Conserved domain database (PFAM plus SMART)
Interpro A unified resource combining PROSITE, PRINTS, ProDom and Pfam, SMART, and TIGRfam
iProClass database From the Protein Information Resource
MetaFAM Proteins compiled from six databases
PFAM Profile HMM library
PIR-ALN Database of alignments
PRINTS Protein fingerprints from SWISS-PROT/TrEMBL
ProDom Uses PSI-BLAST to cluster Swiss-Prot
Prosite A dictionary of protein motifs
SMART Simple Modular Architecture Research Tool
TIGRFAMs HMM library of protein families
 
Table 10-2. Databases on which Interpro (Release 3.2) is based (http://www.ebi.ac.uk/interpro/release_notes.html), (October, 2002).
Database Contents
Pfam 7.3 3,865 domains
PRINTS 33.0 1,650 fingerprints
PROSITE 17.5 1,565 profiles
ProDom 2001.3 1,346 domains
SMART 3.1 509 domains
SWISS-PROT 40.27 113,470 entries
TIGRFAMs 1.2 814 families
TrEMBL 21.12 685,610 entries
 
Table 10-3. Multiple sequence alignment programs available on the world wide web. Additional algorithms are listed at ExPASy (http://kr.expasy.org/tools/#align). Abbreviation: MSA, multiple sequence alignment.
Program Description
AMAS (Analyse Multiply Aligned Sequences) At the European Bioinformatics Institute; used to analyze pre-made MSAs
CINEMA Colour INteractive Editor for Multiple Alignments
ClustalW At the European Bioinformatics Institute and other sites
ClustalX Download by FTP
DIALIGN Especially useful for local MSA. From the University of Bielefeld, Germany
Match-Box Web Server 1.3 From the University of Namur, Belgium
MultAlin From INRA (http://www.inra.fr/), Toulouse
Multiple Sequence alignment Version 2.0 At the GeneStream server of the Institut de Génétique Humaine (http://www2.igh.cnrs.fr/home.eng.html)
Multiple Sequence Alignment version 2.1 From Washington University, St. Louis
Musca From the IBM Bioinformatics Group
PileUp Available through the SeqWeb or UNIX versions of GCG
T-COFFEE Slower but more accurate than ClustalW for distantly related proteins
 
 
 

Return to Contents