Table 6.5. Databases available on BLAST Web server |
Database/Description |
|
A. Peptide Sequence Databases |
|
nr |
|
All non-redundant GenBank CDS translations+RefSeq Proteins+PDB+SwissProt+PIR+PRF |
|
swissprot |
|
Last major release of the SwissProt protein sequence database (no updates) |
|
pat |
|
Proteins from the Patent division of GenPept |
|
Yeast |
|
Yeast (Saccharomyces cerevisiae) genomic CDS translations |
|
ecoli |
|
Escherichia coli genomic CDS translations |
|
pdb |
|
Sequences derived from the three-dimensional structure from Brookhaven Protein Data Bank |
|
Drosophila genome |
|
Drosophila genome proteins provided by Celera and Berkeley Drosophila Genome Project (BDGP) |
|
month |
|
All new or revised GenBank CDS translation+PDB+SwissProt+PIR+PRF released in the last 30 days |
B. Nucleotide Sequence Databases |
|
nr |
|
All GenBank+RefSeq Nucleotides+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, or phase 0, 1, or 2 HTGS sequences); no longer "non-redundant" |
|
est |
|
Database of GenBank+EMBL+DDBJ sequences from EST Divisions |
|
est_human |
|
Human subset of GenBank+EMBL+DDBJ sequences from EST Divisions |
|
est_mouse |
|
Mouse subset of GenBank+EMBL+DDBJ sequences from EST Divisions |
|
est_others |
|
Non-Mouse, non-Human sequences of GenBank+EMBL+DDBJ sequences from EST Divisions |
|
gss |
|
Genome survey sequence, includes single-pass genomic data, exon-trapped sequences, and Alu PCR sequences |
|
htgs |
|
Unfinished high-throughput genomic sequences: phase 0, 1, and 2 (finished, phase 3 HTG sequences are in nr) |
|
pat |
|
Nucleotides from the Patent division of GenBank |
|
|
yeast |
|
Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences |
|
|
mito |
|
Database of mitochondrial sequences |
|
|
vector |
|
Vector subset of GenBank(R), NCBI, in ftp://ftp.ncbi.nih.gov/blast/db |
|
|
E. coli |
|
Escherichia coli genomic nucleotide sequences |
|
|
pdb |
|
Sequences derived from the three-dimensional structure from Brookhaven Protein Data Bank |
|
|
Drosophila genome |
|
Drosophila genome provided by Celera and Berkeley Drosophila Genome Project (BDGP) |
|
|
month |
|
All new or revised GenBank+EMBL+DDBJ+PDB sequences released in the last 30 days |
|
|
alu |
|
Select Alu repeats from REPBASE, suitable fro masking Alu repeats from query sequences. It is available by anonymous FTP from ftp.ncbi.nih.gov (under the /pub/jmc/alu directory). See "Alu alert" by Claverie and Makalowski (1994) |
|
|
dbsts |
|
Database of GenBank+EMBL+DDBJ sequences from STS Divisions |
|
|
chromosome |
|
Searches complete genomes, complete chromosome, or contigs from the NCBI Reference Sequence project |
|
C. Human Genome Blast Databases |
|
genome |
|
Human genomic contig sequences with NT_#### accessions |
|
|
mrna |
|
Human RefSeq mrna with NM_#### or XM_#### accessions |
|
|
protein |
|
Human RefSeq proteins with NP_#### or XP_#### accessions |
|
|
gscan mrna |
|
Predicted mRNA sequences generated by running GenomeScan program on human genomic contigs |
|
|
gscan protein |
|
CDS translations from gscan mrna set |
|
D. CDD Search |
Compares protein sequences to the conserved Domain Database. The CDD is a database containing a collection of functional and/or structural domain derived from two popular collections, Smart and Pfam, plus contributions from colleagues at NCBI. For more information, see the CDD homepage. |
|
Source: http://www.ncbi.nlm.nih.gov/blast/html/blastcgihelp.html#protein_databases
|