Your browser version may not work well with NCBI's Web applications. More information here...
HomoloGene is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes.
HomoloGene Release 63 Statistics



Initial numbers of genes from complete genomes, numbers of genes placed in a homology group, and the numbers of groups for each species.

Species   Number of Genes   HomoloGene
  Input Grouped   groups
Homo sapiens 22,849  19,978   19,235
Pan troglodytes 25,096  17,474   16,777
Canis lupus familiaris 19,766  16,827   16,023
Bos taurus 23,797  18,443   16,153
Mus musculus 25,388  21,797   19,043
Rattus norvegicus 21,991  19,277   17,523
Gallus gallus 17,959  13,220   11,982
Danio rerio 26,288  20,774   13,916
Drosophila melanogaster 14,085  9,256   7,712
Anopheles gambiae 13,909  9,263   7,634
Caenorhabditis elegans 20,077  8,693   4,827
Schizosaccharomyces pombe 5,043  3,242   2,953
Saccharomyces cerevisiae 5,880  4,855   4,373
Kluyveromyces lactis 5,335  4,464   4,387
Eremothecium gossypii 4,722  3,930   3,886
Magnaporthe grisea 12,832  7,293   6,360
Neurospora crassa 10,079  6,171   6,035
Arabidopsis thaliana 26,981  19,968   11,207
Oryza sativa 26,887  17,323   10,656
Plasmodium falciparum 5,266  2,130   819


'*' indicates organisms where new genome annotation data is used in this build.


Last updated on: Fri Dec 5 2008



We have recently adopted a new build procedure that makes use of amino acid sequence searching (blastp) to find more distant relationships, but the procedure still refers to the DNA sequence for computation of some of the statistics. The matching strategy is guided by the taxonomic tree such that more closely related organisms are compared first. Moreover, HomoloGene entries now include paralogs in addition to orthologs.




Sources of Additional Information



HomoloGene entries have been augumented with homology and phenotype information drawn from the following sources.

Online Mendelian Inheritance in Man (OMIM)

Mouse Genome Informatics (MGI)

Zebrafish Information Network (ZFIN)

Saccharomyces Genome Database (SGD)

Clusters of Orthologous Groups (COG)

FlyBase

 

What's New
HomoloGene release 63 is now public. It includes an improved approach for predicting putative paralogs.


Tip of The Day




Related Resources


Entrez Genomes


A collection of complete genome sequences that includes more than 1000 viruses and over hundred microbes

  Archaea

  Bacteria

  Eukaryota

  Viruses



  COGs

Phylogenetic classification of proteins encoded in complete genomes.