Abstract: The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.... [Click above reference link for full abstract]
A genome position can be specified by chromosomal coordinate range, COG
ID, or keywords from the GenBank or TIGR description of a gene.
The available chromosome/plasmid names are:
Browser Chrom/Plasmid Name | Length (bp) | GC Content (%) | Gene Count | NCBI RefSeq Accession |
---|---|---|---|---|
chr | 4639675 | 50.79 | 4410 | NC_000913 |
The following list shows examples of valid position queries for this genome:
Request: | Genome Browser Response: |
---|---|
chr | Displays the entire sequence "chr" in the browser window |
chr:1-10000 | Displays first ten thousand bases of the sequence "chr" |
transporter | Lists all genes with "transporter" in the name or description |
b0010 | Display genome at position of gene b0010 |
If you use the browser in your published research, please cite our publication in the Nucleic Acids Research Database Issue. Citations and positive feedback will help us obtain funding to continue development of this community resource.