NCBI Gene GeneID:
# Download
wget ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz
# Extract the fields (gzcat ships with macOS/BSD; on most Linux systems use zcat or gzip -dc)
gzcat gene_info.gz | cut -f2,3,5,9,13
# This prints five tab-separated columns:
# 1. the GeneID
# 2. the current official symbol, or a database identifier if no official symbol is available
# 3. a pipe-delimited set of aliases
# 4. the full name
# 5. the nomenclature status of the name, where O = official from a nomenclature committee, I = interim from a nomenclature committee, - = NCBI-supplied
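For readers who would rather do this extraction in a script, the following is a minimal Python sketch of the `cut -f2,3,5,9,13` step. It assumes the tab-separated layout implied by the command above (GeneID in column 2, symbol in 3, aliases in 5, full name in 9, nomenclature status in 13). The names `extract_fields` and `read_gene_info` and the dictionary keys are illustrative choices, not part of any NCBI tool. Unlike `cut`, the sketch skips the `#` header line and splits the aliases into a list.

```python
import gzip
from typing import Dict, Iterable, Iterator, List, Union

Row = Dict[str, Union[str, List[str]]]


def extract_fields(lines: Iterable[str]) -> Iterator[Row]:
    """Yield the five fields of each data row, skipping '#' header lines."""
    for line in lines:
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) < 13:
            continue  # truncated or malformed row
        aliases = cols[4]
        yield {
            "gene_id": cols[1],               # column 2
            "symbol": cols[2],                # column 3
            # column 5: pipe-delimited aliases; "-" is treated as "none recorded"
            "aliases": [] if aliases == "-" else aliases.split("|"),
            "full_name": cols[8],             # column 9
            "nomenclature_status": cols[12],  # column 13
        }


def read_gene_info(path: str = "gene_info.gz") -> Iterator[Row]:
    """Stream rows from the gzip-compressed file fetched with wget."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        yield from extract_fields(handle)
```

Because `read_gene_info` is a generator, the file is processed row by row instead of being loaded into memory, which matters for a file that covers every organism in Gene.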
Symbols beginning with LOC. When a published symbol is not available, and orthologs have not yet been determined, Gene will provide a symbol that is constructed as ‘LOC’ + the GeneID.
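The ‘LOC’ + GeneID rule is simple enough to check in code. Here is a small sketch, assuming the symbol and GeneID have already been read from gene_info; both function names are hypothetical helpers.

```python
def placeholder_symbol(gene_id: int) -> str:
    """Build the placeholder symbol Gene uses when no published symbol exists."""
    return f"LOC{gene_id}"


def is_placeholder_symbol(symbol: str, gene_id: int) -> bool:
    """Return True if the symbol is just 'LOC' followed by the record's GeneID."""
    return symbol == placeholder_symbol(gene_id)
```

A check like this can be used to flag records that still lack a published symbol before joining on symbol names.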
The sequences represented by NCBI’s predictions are provided in accessions beginning with XM_, XP_, or XR_.
For a limited number of genes in the human genome, gene-specific genomic RefSeqs, termed RefSeqGenes, have been created. These have a RefSeq accession beginning with NG_ and can be retrieved from the Nucleotide database using the query refseqgene[keyword].
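The accession prefixes mentioned above can be told apart with a simple lookup. This sketch covers only the four prefixes named in the text. The molecule types in the descriptions follow the general RefSeq prefix conventions and are not stated in this post, and `describe_accession` is a hypothetical helper name.

```python
from typing import Optional

# Only the prefixes discussed in the text; RefSeq defines others as well.
REFSEQ_PREFIXES = {
    "XM_": "predicted mRNA (model)",
    "XR_": "predicted non-coding RNA (model)",
    "XP_": "predicted protein (model)",
    "NG_": "genomic region (the prefix used by RefSeqGene records)",
}


def describe_accession(accession: str) -> Optional[str]:
    """Return a description for a known prefix, or None if it is not listed."""
    return REFSEQ_PREFIXES.get(accession.strip()[:3].upper())
```

For example, `describe_accession("XM_000000.1")` (a made-up accession) returns the predicted-mRNA description, while an accession with an unlisted prefix returns `None`.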
RefSeq RNA records are often based on cDNA sequences submitted to GenBank. They therefore can differ from the reference genomic sequence, either for biological reasons (variation or RNA editing) or some unresolved sequence discrepancy.
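NCBI's automated genome annotation pipeline compares the mRNA sequence of a gene model with the protein sequence of its coding region. If comparison against proteins from several model organisms finds a highly similar protein with an established name, the pipeline names the automatically annotated protein “similar to” + name of similar protein. More recently it appears to append “like” to the end of the name instead.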
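HGNC (the HUGO Gene Nomenclature Committee) assigns names to human genes only.

HUGO Gene Symbol: the HUGO Gene Symbol (also called the HGNC Symbol, or simply the gene symbol) is the abbreviated identifier that HGNC assigns to a gene (for example, TP53). Each approved symbol is unique.

Gene Name: the full gene name approved by HGNC, corresponding to the approved Gene Symbol. For example, the Gene Name for TP53 is tumor protein p53.

HGNC ID: the gene identifier assigned by the HGNC database. Every approved symbol has a corresponding HGNC ID, which can be used to look the gene up in the HGNC database, for example HGNC:11998.

HGNC sometimes reviews and renames genes that already have names, so that the nomenclature describes gene function more accurately. When a gene is assigned a new Gene Symbol, its previous symbol stays in use as a synonym, so it is generally recommended to use the HGNC ID rather than the HGNC Symbol as the unique identifier when processing data.

Ensembl: IDs beginning with ENSG00000 are Ensembl database identifiers.

MIM: the number assigned by the OMIM database (Online Mendelian Inheritance in Man).

Vega: IDs beginning with OTTHUMG00000 come from the Vega database (Vertebrate Genome Annotation).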
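To illustrate why the HGNC ID makes a safer key than the symbol, the sketch below resolves both current and retired symbols to the same HGNC ID. The TP53 / HGNC:11998 pair comes from the text above. The retired symbol `OLDTP53` is invented purely for illustration, and in practice both tables would be loaded from an HGNC download rather than hard-coded.

```python
from typing import Dict, Optional

# Approved symbol -> HGNC ID (the TP53 entry is the example from the text).
CURRENT_SYMBOL_TO_ID: Dict[str, str] = {"TP53": "HGNC:11998"}

# Previous symbol -> HGNC ID. "OLDTP53" is a made-up retired symbol.
PREVIOUS_SYMBOL_TO_ID: Dict[str, str] = {"OLDTP53": "HGNC:11998"}


def resolve_hgnc_id(symbol: str) -> Optional[str]:
    """Map a current or previous symbol to its HGNC ID; None if unknown."""
    key = symbol.strip()
    if key in CURRENT_SYMBOL_TO_ID:
        return CURRENT_SYMBOL_TO_ID[key]
    return PREVIOUS_SYMBOL_TO_ID.get(key)


# Data sets that use different symbols for the same gene still join on the ID.
assert resolve_hgnc_id("TP53") == resolve_hgnc_id("OLDTP53") == "HGNC:11998"
```

Checking the approved symbols first means that a string which is both a current symbol for one gene and a retired symbol for another resolves to the current assignment. That is a design choice of this sketch, not an HGNC rule.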
