Description

Gene predictions on the May 2003 release of C. elegans sequences. Fgenesh++ predictions are based on Softberry's gene-finding software.

The 19112 predicted genes include 4682 genes corresponding to known mRNAs, 13085 genes with similarity to proteins in the NR database, and 1345 ab initio predictions.
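
As a quick sanity check on these counts, the three categories can be summed and expressed as shares of the total. The snippet below is only an illustrative sketch: the category labels are shorthand chosen here, not identifiers used by Fgenesh++ or the browser.

```python
# Counts as stated in the description above; the dictionary keys are
# informal labels picked for this example (an assumption, not track fields).
counts = {
    "known mRNA": 4682,
    "NR protein similarity": 13085,
    "ab initio": 1345,
}

# The three categories should account for every predicted gene.
total = sum(counts.values())
assert total == 19112

# Share of each category, as a percentage rounded to one decimal place.
shares = {name: round(100 * n / total, 1) for name, n in counts.items()}

for name, n in counts.items():
    print(f"{name}: {n} genes ({shares[name]}%)")
```

By this breakdown, roughly a quarter of the predictions are supported by known mRNAs, about two thirds by protein similarity, and the remainder are purely ab initio.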

Methods

Fgenesh++ uses both hidden Markov models (HMMs) and protein similarity to find genes in a completely automated manner. For more information, see Solovyev VV (2001), "Statistical approaches in Eukaryotic gene prediction", in the Handbook of Statistical Genetics (eds. Balding D. et al.), John Wiley & Sons, Ltd., pp. 83-127.

Credits

The Fgenesh++ gene predictions were produced by Softberry Inc. Commercial use of these predictions is restricted to viewing in this browser. Please contact Softberry Inc. to make arrangements for further commercial access.