NCBI在线blast数据库的简要说明


Peptide Sequence Databases蛋白序列的数据库

nr
All non-redundant GenBank CDS translations + RefSeq Proteins + PDB + SwissProt + PIR + PRF
所有非冗余的的GenBank CDS区的翻译序列 + 参考序列的蛋白 + PDB数据库 + SwissProt蛋白数据库 + PRF蛋白数据库

refseq
RefSeq protein sequences from NCBI’s Reference Sequence Project.
所有NCBI的参考序列

swissprot
Last major release of the SWISS-PROT protein sequence database (no updates).
swissprot的蛋白数据库

pat
Proteins from the Patent division of GenPept.
专利的蛋白数据库

pdb
Sequences derived from the 3-dimensional structure from Brookhaven Protein Data Bank.
PDB数据库

month
All new or revised GenBank CDS translation+PDB+SwissProt+PIR+PRF released in the last 30 days.
一个月内新增加的蛋白序列

env_nr
Protein sequences from environmental samples.
来自environmental samples的蛋白序列

Nucleotide Sequence Databases核酸数据库
nr
All GenBank + RefSeq Nucleotides + EMBL + DDBJ + PDB sequences (excluding HTGS0,1,2, EST, GSS, STS, PAT, WGS). No longer “non-redundant”.
所有GenBank的核酸序列 + 参考序列中的核酸序列+ EMBL +DDBJ +PDB核酸序列(但不包括HTG,EST,GSS等序列)

refseq_rna
RNA entries from NCBI’s Reference Sequence project
NCBI参考序列中的核酸序列

refseq_genomic
Genomic entries from NCBI’s Reference Sequence project
NCBI参考序列中的基因组序列

est
Database of GenBank + EMBL + DDBJ sequences from EST Divisions
来自GenBank + EMBL + DDBJ 的EST序列

est_human
Human subset of est.
人的EST序列

est_mouse
Mouse subset.
小鼠的EST序列

est_others
Non-Mouse, non-Human subset of est.、
除了人与小鼠之外的EST序列

gss
Genome Survey Sequence, includes single-pass genomic data, exon-trapped sequences, and Alu PCR sequences.

htgs
Unfinished High Throughput Genomic Sequences: phases 0, 1 and 2 (finished, phase 3 HTG sequences are in nr)
未发布的高通量的基因组测序

pat
Nucleotides from the Patent division of GenBank.
专利的核酸序列

pdb
Sequences derived from the 3-dimensional structure from Brookhaven Protein Data Bank
PDB核酸序列

month
All new or revised GenBank + EMBL + DDBJ + PDB sequences released in the last 30 days.
一个月内新增的核酸序列

dbsts
Database of GenBank+EMBL+DDBJ sequences from STS Divisions .
STS数据库

chromosome
A database with complete genomes and chromosomes from the NCBI Reference Sequence project..
NCBI参考序列计划中所有的完整基因组和染色体序列

wgs
A database for whole genome shotgun sequence entries.
基因组鸟枪法测序得到的序列

env_nt
Nucleotide sequences from environmental samples, including those from Sargasso Sea and Mine Drainage
projects.
来自environmental samples的核酸序列。

附:NCBI在线Blast的图文说明


《“NCBI在线blast数据库的简要说明”》 有 3 条评论

  1. 你好,可以请教下如何比对两种蛋白的不同氨基酸序列吗?
    我用ncbi上面的blast2,结果它仅仅是把两种蛋白的氨基酸序列并排放在一起,没有任何的标记。
    这两种蛋白的其实是一种蛋白的不同亚型,可以氨基酸序列的差异非常的小,但是还是有差异的,所以进化树确实是做不出来的,求教……

  2. 现在核酸比对时没有env_nt这个库了,但有一个16S ribosomal RNA sequences (Bacteria and Aechaea),请问这两个库是同一个吗?环境样本用哪一个库比较合适?因为发现同样的序列用nr和16S出来的result不一样。