Finished genomic sequence is analysed on a clone by clone basis using a combination of similarity searches against DNA and protein databases as well as a series of ab initio gene predictions (GENSCAN, FGENESH). Gene structures are annotated on the basis of human interpretation of the combined supportive evidence generated during sequence analysis. In parallel, experimental methods are being applied to extend incomplete gene structures and discover new genes. The latter is initiated by comparative analysis of the finished sequence with vertebrate datasets such as the Riken mouse cDNAs, mouse whole-genome shotgun data and GenescopeTetraodon Ecores.
Currently over 10% of the human genome including two complete chromosomes, 20 ( http://www.sanger.ac.uk/HGP/Chr20/) and 22 ( http://www.sanger.ac.uk/HGP/Chr22/) have been annotated by The Sequence Annotation Team in collaboration with the individual Chromosome Project Teams ( http://www.sanger.ac.uk/HGP/Humana/). Email enquires can be sent to humquery@sanger.ac.uk.