Bioinformatics & Genomics
Genomic & DNA
| Opens resource in a 2nd browser window.
[EMBL] A database of aligned protein structures and related sequences.
[Protein Information Resources] This site provides access to two databases of protein sequence alignments. The PIR-ALN database is curated and the MIPS-ALN database is computer generated.
EF-Hand Calcium-Binding Proteins Data Library
[Vanderbilt University] The EF-Hand Calcium-Binding Proteins Data Library is a growing collection of published sequence, structural, functional, and other
information about EF-hand calcium-binding proteins and their roles in cellular signal transduction.
[Protein Information Resource] FAM database developed at MIPS. The purpose of the database is to reduce the number of sequences in the search database as well as to increase sensitivity programs like FASTA to identify distantly related sequence families.
Globin Gene Server
[Computer Science & Engineering and Biochemistry & Molecular Biology, Pennsylvania State University] This site houses a prototype database of sequence alignments and experimental results for the beta-like globin gene cluster of mammals. The backbone of this database consists of a large (73 kb) simultaneous alignment of DNA sequences from the beta-like globin gene cluster of humans and a few other mammals, which is annotated to indicate highly-conserved regions and known sequence features.
Histone Database Search
[The National Human Genome Research Institute and The National Center for Biotechnology Information] This site affords the user access to histone gene sequences and give the ability to align the sequences.
Histone Sequence Database
[National Human Genome Research Institute] Database of histone sequences, structures, post-translational modification and gene loci.
[NHGRI] An annotated collection of non-redundant protein sequences, three-dimensional structures, and genomic information for the homeodomain protein family.
Influenza Sequence Database
[Los Alamos National Laboratory] A curated database of nucleotide and amino acid sequences intended to provide an easy sequence deposit and retrieval capabilities to the analysis of hemagglutinin and neuraminidase sequences.
Kabat Database of Sequences of Proteins of Immunological Interest
[Northwestern University] (Elvin A. Kabat) Contains a variety of immunologic protein sequences not found elsewhere.
Top of Page
Metalloprotein Database and Browser
[Scripps Reasearch Institute] TSRI's Metalloprotein site Database and Browser (MDB), is a database that aims to contain quantitative
information on all the metal containing sites available from structures in the PDB distribution, as well as from
in-house (TSRI) structures.
NRL_3D Sequence-Structure Database
[Protein Information Resource] A sequence-structure database derived from the Brookhaven Protein Data Bank.
[Center for Biological Sequence Analysis] Database of O-glycosylated proteins. Site also includes a version of the database which contains no identical O-glycosylation sites for prediction purposes.
[University College London] A non-redundant composite of the SWISS-PROT, PIR, Genbank (translation), and NRL-3D databases. The OWL server includes a BLAST search and other analysis tools.
[Birkbeck College, London] A database which gives each sequence giving the name, sequence, and references peptaibol sequences where atomic coordinates are available, diagrams are given for peptaibols.
[Washington University, St. Louis] Pfam is a large collection of multiple sequence alignments and hidden Markov models covering many common protein domains based on the Swissprot 38 and SP-TrEMBL 11 protein sequence databases.
PIR - Archive Protein Sequence Database
[Protein Information Resource] A database of protein sequences as originally reported in a publication or submission.
PIR - International Protein Sequence Database
[Protein Information Resource] A database containing information concerning all naturally occurring wild-type proteins, of which the primary structure is known.
PRINTS: Protein Fingerprint Database
[University of Manchester] "PRINTS is a compendium of protein fingerprints. A fingerprint is a group of conserved motifs used
to characterise a protein family; its diagnostic power is refined by iterative scanning of a
ProDom: The Protein Domain Database
[INRA] "The ProDom protein domain database consists of an automatic compilation of homologous domains."
Top of Page
[ExPASy] A searchable dictionary of proteins and patterns. Identifies to which family of proteins a sequence belongs.
Protein Information Resource
[National Biomedical Research Foundation] (PIR) Collects, organizes, and distributes the International Protein Sequence Database.
SWISS-PROT and TrEMBL
[ExPASy] SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotations (such as the description of the function of a protein, its domain structure, post-translational modifications, variants, etc.), a minimal level of redundancy, and high level of integration with other databases. TrEMBL is a computer-annotated supplement of SWISS-PROT that contains all the translations of EMBL nucleotide sequence entries not yet integrated in SWISS-PROT.