Protein Sequence |
|| Browse |
Internet Resources
Bioinformatics & Genomics
Databases
Genomic & DNA
Informational
Protein Sequence
![]() |
3D-ALI
[EMBL] A database of aligned protein structures and related sequences.
[more info][12643]
Alignment Databases
[Protein Information Resources] This site provides access to two databases of protein sequence alignments. The PIR-ALN database is curated and the MIPS-ALN database is computer generated.
[more info][12674]
EF-Hand Calcium-Binding Proteins Data Library
[Vanderbilt University] The EF-Hand Calcium-Binding Proteins Data Library is a growing collection of published sequence, structural, functional, and other
information about EF-hand calcium-binding proteins and their roles in cellular signal transduction.
[more info][13790]
FAMBASE
[Protein Information Resource] FAM database developed at MIPS. The purpose of the database is to reduce the number of sequences in the search database as well as to increase sensitivity programs like FASTA to identify distantly related sequence families.
[more info][12651]
Globin Gene Server
[Computer Science & Engineering and Biochemistry & Molecular Biology, Pennsylvania State University] This site houses a prototype database of sequence alignments and experimental results for the beta-like globin gene cluster of mammals. The backbone of this database consists of a large (73 kb) simultaneous alignment of DNA sequences from the beta-like globin gene cluster of humans and a few other mammals, which is annotated to indicate highly-conserved regions and known sequence features.
[more info][12654]
Histone Database Search
[The National Human Genome Research Institute and The National Center for Biotechnology Information] This site affords the user access to histone gene sequences and give the ability to align the sequences.
[more info][12655]
Histone Sequence Database
[National Human Genome Research Institute] Database of histone sequences, structures, post-translational modification and gene loci.
[more info][10996]
Homeodomain Resource
[NHGRI] An annotated collection of non-redundant protein sequences, three-dimensional structures, and genomic information for the homeodomain protein family.
[more info][12011]
Influenza Sequence Database
[Los Alamos National Laboratory] A curated database of nucleotide and amino acid sequences intended to provide an easy sequence deposit and retrieval capabilities to the analysis of hemagglutinin and neuraminidase sequences.
[more info][12627]
Kabat Database of Sequences of Proteins of Immunological Interest
[Northwestern University] (Elvin A. Kabat) Contains a variety of immunologic protein sequences not found elsewhere.
[more info][12659]

Top of Page

Metalloprotein Database and Browser
[Scripps Reasearch Institute] TSRI’s Metalloprotein site Database and Browser (MDB), is a database that aims to contain quantitative
information on all the metal containing sites available from structures in the PDB distribution, as well as from
in-house (TSRI) structures.
[more info][13791]
NRL_3D Sequence-Structure Database
[Protein Information Resource] A sequence-structure database derived from the Brookhaven Protein Data Bank.
[more info][12664]
O-GLYCBASE
[Center for Biological Sequence Analysis] Database of O-glycosylated proteins. Site also includes a version of the database which contains no identical O-glycosylation sites for prediction purposes.
[more info][12666]
OWL
[University College London] A non-redundant composite of the SWISS-PROT, PIR, Genbank (translation), and NRL-3D databases. The OWL server includes a BLAST search and other analysis tools.
[more info][12145]
Peptaibol Database
[Birkbeck College, London] A database which gives each sequence giving the name, sequence, and references peptaibol sequences where atomic coordinates are available, diagrams are given for peptaibols.
[more info][12672]
Pfam
[Washington University, St. Louis] Pfam is a large collection of multiple sequence alignments and hidden Markov models covering many common protein domains based on the Swissprot 38 and SP-TrEMBL 11 protein sequence databases.
[more info][12195]
PIR – Archive Protein Sequence Database
[Protein Information Resource] A database of protein sequences as originally reported in a publication or submission.
[more info][12645]
PIR – International Protein Sequence Database
[Protein Information Resource] A database containing information concerning all naturally occurring wild-type proteins, of which the primary structure is known.
[more info][12682]
PRINTS: Protein Fingerprint Database
[University of Manchester] “PRINTS is a compendium of protein fingerprints. A fingerprint is a group of conserved motifs used
to characterise a protein family; its diagnostic power is refined by iterative scanning of a
SWISS-PROT/TrEMBL composite.”
[more info][12197]
ProDom: The Protein Domain Database
[INRA] “The ProDom protein domain database consists of an automatic compilation of homologous domains.”
[more info][12199]

Top of Page

ProSite Protein
[ExPASy] A searchable dictionary of proteins and patterns. Identifies to which family of proteins a sequence belongs.
[more info][12680]
Protein Information Resource
[National Biomedical Research Foundation] (PIR) Collects, organizes, and distributes the International Protein Sequence Database.
[more info][12681]
SWISS-PROT and TrEMBL
[ExPASy] SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotations (such as the description of the function of a protein, its domain structure, post-translational modifications, variants, etc.), a minimal level of redundancy, and high level of integration with other databases. TrEMBL is a computer-annotated supplement of SWISS-PROT that contains all the translations of EMBL nucleotide sequence entries not yet integrated in SWISS-PROT.
[more info][12694]