(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.soe.ucsc.edu/research/compbio/sam.html
Citations (SAM, SAM-T99, HMMs)
- R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
- K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting remote protein homologies, Bioinformatics 14(10):846-856, 1998.
- A. Krogh et al., Hidden Markov models in computational biology: Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
Sequence numbers correspond to the following labels:
-
- T0358 NESG:ER397, Escherichia coli, 87 res
-
- gi|83586571|ref|ZP_00925204.1|_1:77 hypothetical protein Ecol1_01002225 [Escherichia coli 101-1]
-
- gi|110345587|gb|ABG71824.1|_1:77 putative cytoplasmic protein [Escherichia coli 536]
- gi|26110819|gb|AAN83003.1| Hypothetical protein ykfF [Escherichia coli CFT073]
- gi|26250389|ref|NP_756429.1| Hypothetical protein ykfF [Escherichia coli CFT073]
- gi|24528054|emb|CAD33784.1| hypothetical protein [Escherichia coli]
- gi|70608383|gb|AAZ04455.1| conserved hypothetical protein [Escherichia coli]
-
- gi|75259093|ref|ZP_00730461.1|_1:77 hypothetical protein EcolE2_01001358 [Escherichia coli E22]
-
- gi|75230630|ref|ZP_00717105.1|_1:77 hypothetical protein EcolB7_01001278 [Escherichia coli B7A]
-
- gi|75208230|ref|ZP_00708597.1|_1:77 hypothetical protein EcolB_01004504 [Escherichia coli B171]
-
- gi|75238464|ref|ZP_00722461.1|_1:77 hypothetical protein EcolF_01003511 [Escherichia coli F11]
-
- gi|89033311|gb|ABD59989.1|_1:77 hypothetical protein [Escherichia coli]
- gi|50346328|ref|NP_052923.2| hypothetical protein R100p043 [Plasmid R100]
-
- gi|75512277|ref|ZP_00734839.1|_1:77 hypothetical protein Ecol5_01003544 [Escherichia coli 53638]
-
- gi|37962737|gb|AAR05684.1|_3:79 conserved hypothetical protein [Salmonella typhimurium]
- gi|58383258|ref|YP_194828.1| hypothetical protein pU302L_021 [Salmonella typhimurium]
-
- gi|45758187|gb|AAS76399.1|_1:77 YkfF [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67]
- gi|60115629|ref|YP_209420.1| YkfF [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67]
-
- gi|73476837|gb|AAZ76452.1|_1:77 hypothetical protein LH0085 [Escherichia coli]
- gi|73853253|ref|YP_308749.1| hypothetical protein LH0085 [Escherichia coli]
-
- gi|26111397|gb|AAN83579.1|_1:77 Hypothetical protein ykfF [Escherichia coli CFT073]
- gi|26250965|ref|NP_757005.1| Hypothetical protein ykfF [Escherichia coli CFT073]
-
- gi|91213937|ref|YP_543923.1|_1:77 hypothetical protein YkfF [Escherichia coli UTI89]
- gi|91075511|gb|ABE10392.1| hypothetical protein YkfF [Escherichia coli UTI89]
-
- gi|88770148|gb|ABD51585.1|_10:86 conserved hypothetical protein [Escherichia coli]
-
- gi|91206338|ref|YP_538692.1|_3:79 hypothetical protein UTI89_P093 [Escherichia coli UTI89]
- gi|91075789|gb|ABE10669.1| hypothetical protein UTI89_P093 [Escherichia coli UTI89]
-
- gi|5702179|gb|AAD47188.1|_1:77 unknown; orf77 [Escherichia coli]
- gi|75234993|ref|ZP_00719258.1| hypothetical protein EcolE1_01003133 [Escherichia coli E110019]
-
- gi|38606100|gb|AAR25064.1|_3:79 YdeA [Escherichia coli]
- gi|41056963|ref|NP_957583.1| YdeA [Escherichia coli]
- gi|5103191|dbj|BAA78827.1| 94% identical to gp:AF074613_85[hypothetical protein of plasmid pO157], 61% identical (1 gap) to 64 residues of 79 aa protein gp:ECAE000133_3[hypothetical protein of E. coli] [Plasmid R100]
-
- gi|8918884|dbj|BAA97931.1|_3:79 yfjA [Plasmid F]
- gi|9507774|ref|NP_061440.1| hypothetical protein Fpla063 [Plasmid F]
-
- gi|41223299|ref|NP_958724.1|_1:77 hypothetical protein ECp054 [Escherichia coli O157:H7 str. Sakai]
-
- gi|75234748|ref|ZP_00719058.1|_1:77 hypothetical protein EcolE1_01003386 [Escherichia coli E110019]
-
- gi|75226917|ref|ZP_00713803.1|_1:77 hypothetical protein EcolB7_01004569 [Escherichia coli B7A]
-
- gi|110344756|gb|ABG70993.1|_1:77 hypothetical protein YkfF [Escherichia coli 536]
-
-
- gi|3822199|gb|AAC70153.1|_10:86 hypothetical protein [Escherichia coli O157:H7]
- gi|75994531|ref|YP_325645.1| hypothetical protein L7085 [Escherichia coli O157:H7 EDL933]
-
- gi|47154999|emb|CAE85198.1|_13:89 YkfF protein [Escherichia coli]
-
- gi|71559064|ref|YP_271791.1|_1:73 hypothetical protein pOU1113_67 [Salmonella enterica]
- gi|68166363|gb|AAY88124.1| hypothetical protein [Salmonella enterica]
-
- gi|16445257|gb|AAL23475.1|_6:78 putative cytoplasmic protein [Salmonella typhimurium LT2]
- gi|17233436|ref|NP_490555.1| putative cytoplasmic protein [Salmonella typhimurium LT2]
-
- gi|16128234|ref|NP_414783.1|_1:79 CP4-6 prophage; predicted protein [Escherichia coli K12]
- gi|1786443|gb|AAC73352.1| CP4-6 prophage; predicted protein [Escherichia coli K12]
- gi|89107122|ref|AP_000902.1| hypothetical protein [Escherichia coli W3110]
- gi|3025208|sp|P75677|YKFF_ECOLI Hypothetical protein ykfF
- gi|85674398|dbj|BAA77918.2| hypothetical protein [Escherichia coli W3110]
-
- gi|31795419|ref|NP_857872.1|_1:75 hypothetical protein Y1091 [Yersinia pestis KIM]
- gi|45357335|gb|AAS58729.1| hypothetical protein pMT098 [Yersinia pestis biovar Medievalis str. 91001]
- gi|108793825|ref|YP_636672.1| hypothetical protein YPN_MT0093 [Yersinia pestis Nepal516]
- gi|108793621|ref|YP_636780.1| hypothetical protein YPA_MT0089 [Yersinia pestis Antiqua]
- gi|108782168|gb|ABG16225.1| hypothetical protein YPA_MT0089 [Yersinia pestis Antiqua]
- gi|108777889|gb|ABG20407.1| hypothetical protein YPN_MT0093 [Yersinia pestis Nepal516]
- gi|3883089|gb|AAC82749.1| unknown [Yersinia pestis KIM]
- gi|2996363|gb|AAC13243.1| unknown [Yersinia pestis KIM]
- gi|31795259|ref|NP_857714.1| hypothetical protein YPKMT101 [Yersinia pestis KIM]
- gi|5834744|emb|CAB55242.1| hypothetical protein YPMT1.60c [Yersinia pestis CO92]
- gi|52538124|emb|CAG27550.1| hypothetical protein [Yersinia pestis]
- gi|52788195|ref|YP_094023.1| hypothetical protein pG8786_144 [Yersinia pestis]
- gi|16082850|ref|NP_395404.1| hypothetical protein YPMT1.60c [Yersinia pestis CO92]
- gi|89103272|ref|ZP_01175857.1| hypothetical protein Ypesb_01002735 [Yersinia pestis biovar Orientalis str. IP275]
- gi|45478682|ref|NP_995538.1| hypothetical protein pMT098 [Yersinia pestis biovar Medievalis str. 91001]
-
- gi|57434425|emb|CAI43842.1|_1:77 hypothetical protein [Escherichia coli]
- gi|57545662|gb|AAW51753.1| Aec70 [Escherichia coli]
-
- gi|26109909|gb|AAN82114.1|_1:77 Hypothetical protein ykfF [Escherichia coli CFT073]
- gi|26249501|ref|NP_755541.1| Hypothetical protein ykfF [Escherichia coli CFT073]
-
- gi|26106581|gb|AAN78767.1|_1:77 Hypothetical protein ykfF [Escherichia coli CFT073]
- gi|26246184|ref|NP_752223.1| Hypothetical protein ykfF [Escherichia coli CFT073]
-
-
- gi|18265878|gb|AAL67351.1|_1:77 intergenic-region protein [Escherichia coli]
-
- gi|16505971|emb|CAD09857.1|_1:76 hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18]
- gi|18466655|ref|NP_569463.1| hypothetical protein HCM1.279bc [Salmonella enterica subsp. enterica serovar Typhi str. CT18]
- gi|10957300|ref|NP_058324.1| hypothetical protein [Salmonella typhi]
- gi|7800353|gb|AAF69949.1| orf; hypothetical protein [Salmonella typhi]
-
- gi|75259064|ref|ZP_00730432.1|_1:52 hypothetical protein EcolE2_01001328 [Escherichia coli E22]
- gi|75212146|ref|ZP_00712186.1| hypothetical protein EcolB_01000748 [Escherichia coli B171]
- gi|75208370|ref|ZP_00708708.1| hypothetical protein EcolB_01004372 [Escherichia coli B171]
-
- gi|75189020|ref|ZP_00702287.1|_13:64 hypothetical protein EcolE_01002298 [Escherichia coli E24377A]
10 20 30 40
| | | |
1 ...M..TQ.S..VL....LPP.GPFTRRQAQAVTTTYSNITLED.DQGSHFRLVVRD...TEGRM
2 ...M..SE.Y..FRifqgLPD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DGTL
3 ...M..SD.ChpVL....LPE.GPFSREQAVAVTTAYRNVLIED.DQGTHFRLVIRN...AEGQL
4 ...M..SE.Y..FRilqgLPD.GPFTRKHAEAVAAQYRNVFIED.DHGEHFRLVVRN...-NGTM
5 ...M..SE.Y..FRilqgLPD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRN...NNG-L
6 ...M..SE.Y..FRifqgLPD.GSFTREQAEAVAAQYQNVFIED.DQGTHFRLVVRQ...-DGTL
7 ...M..SE.Y..FRilqgLPD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRN...NSG-L
8 ...M..SE.Y..FRilqgLPD.GPFTRKQAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NGTM
9 ...M..SE.Y..FRifqgLPD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DGTL
10 mr.M..SE.Y..FRilqgLPD.GPFTRKQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DGTL
11 ...M..SE.Y..FRilqgLPD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DGTL
12 ...M..SE.Y..FRilqgLPD.GSFTREQAEAVAAQYRNVFIED.EQGTHFRLVVRN...NSG-L
13 ...M..SD.ChpVL....LPE.GPFSREQAMAVTTAYRNVLIED.DQGTHFRLVIHN...AEGQL
14 ...M..SD.ChpVL....LPE.GPFSREQAVAVTTAYRNVLIED.DQGTHFRLVIRN...AGGQL
15 emrM..SE.Y..FRilqgLPD.GPFTRKQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DGTL
16 mr.M..SE.Y..FRilqgLPD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRN...NSG-L
17 ...M..SE.Y..FRilqgLPD.GPFTRKHAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NGAM
18 mr.M..SE.Y..FRilqgLPD.GPFTRKQAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NGTM
19 mr.M..SE.Y..FRilqgLPD.GPFTRKHAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NGAM
20 ...M..SE.Y..FRilqgLPD.GPFTRKHAEAVAAQYRNVFIEN.DHGEQFRLVVRN...-NGAM
21 ...M..SE.Y..FRilqgLPD.GPFTRKHAEAVAAQYRNVFIED.DHGEQFRLVVRN...NE-TM
22 ...M..SE.Y..FRilqgLPD.GPFTRKQAEAVAAQYRNVFIGD.DHGEQFRLVVRN...-NGAM
23 ...M..PG.CtsRL....LPE.GPFSRNQALAVTTAYLNVLIED.DQGTHFRLVIRN...AEGQL
24 emrM..SE.Y..FRilqgLPD.GSFTREQAEAVAVQYRNVFIED.DHGEQFRLVVRN...-NGAM
25 emrM..SE.Y..FRilqgLPD.GPFTRKHAEAVAAQYRNVFIEN.DHGEQFRLVVRN...-NGAM
26 atdM..PG.CtsRL....LPE.GPFSRNQALAVTTAYLNVLIED.DQGTHFRLVIRN...AEGQL
27 ...M..IP.L..KS....LPD.GTFTHEQAEAVAAQYQNVAIED.DQGTHLRLVVRK...-DGEM
28 dniM..SP.L..KS....LPD.GTFTHEQAEAVAAQYQNVAIED.DQGTHLRLVVRK...-DGEM
29 ...M..TQ.S..VL....LPP.GPFTRRQAQAVTTTYSNITLED.DQGSHFRLVVRD...TEGRM
30 ...M..SEaL..AV....LPD.DTFTREQAEVVAAQYTNVAIED.DQGAHFRLVVRQ...-NGEM
31 ...M..PG.YteYV....LAE.GSFSYGQAVAVITAYRNVFIQD.DPGMHFRRVIRN...AEGQR
32 ...M..PG.YteYV....LAE.GSFSYGQAVAVITAYRNVFIQD.DPGMHFRRVIRN...AEGQR
33 ...MrgCT.L..LL....FSE.APFLREQEVAVITAYRNVFIQD.DPGMHFRWVIRN...AEGQR
34 xsfX..QT.L..SG....LPQwASADCVAGPLVSAGITDINIED.DQGIHVRLIVRD...AEGRM
35 ...MrgCT.L..LL....FSE.APFLREQEVAVITAYRNVFIQD.DPGMHFRWVIRN...AEGQR
36 ...M..SD.L..FS....-SE.SPVTLAQARTVAAGYQNVFIENlQPAGHFQIVIRDhrdHDSQL
37 ...M..PG.CtsRL....LPE.GPFSREQAVAVKTAYRNVFTED.DQGTYSRLVIRN...AEGQL
38 atdM..PG.CtsRL....LPE.GPFSREQAVAVKTAYRNVFTED.DQGTYSRLVIRN...AEGQL
50 60 70
| | |
1 VWRAWNFEPDAGEGLNRYIRTSGIRTDTATR.
2 IWRSWNFEDCAGYWMNRYIRDFGIRK-----.
3 RWRCWNFEPDAGKQLNSYLASEGILRQ----.
4 VWRTWNFEDGAGYWMNHAIRDFGILK-----.
5 VWRTWNFEDGAGYWMNHVIRNFGILK-----.
6 IWRSWNFEDCAGYWMNQYIRDFGIRK-----.
7 VWRTWNFEDGAGYWMNHVIRNFGILK-----.
8 VWRTWNFEDGAGYWMNHVIRDFGILK-----.
9 IWRSWNFEDCAGYWMNRYIRDFGILK-----.
10 IWRSWNFEDCAGYWMNRYIRDFGILK-----.
11 IWRSWNFEDCAGYWMNQYIRDFGILK-----.
12 VWRTWNFEDGAGYWMNHVIRDFGILK-----.
13 RWRCWNFEPDAGKQLNPYLASEGILRQ----.
14 RWRCWNFEPDAGKQLNSYLASEGILRQ----.
15 IWRSWNFEDCAGYWMNRYIRDFGILK-----.
16 VWRTWNFEDGAGYWMNHVIRNFGILK-----.
17 VWRTWNFEDGAGYWMNHVIRDFGILK-----.
18 VWRTWNFEDGAGYWMNHVIRDFGILK-----.
19 VWRTWNFEDGAGYWMNHVIRDFGILK-----.
20 VWRTWNFEDGAGYWMNHVIRDFGIIK-----.
21 VWRTWNFEDGAGYWMNHVIRDFGILK-----.
22 VWRTWNFEDGAGYWMNRYIRDFGILK-----.
23 RWRCWNFEPDAGKQLNPYLASEGILRQ----.
24 VWRTWNFEDGAGYWMNHVIRDFGILK-----.
25 VWRTWNFEDGAGYWMNHVIRDFGIIK-----.
26 RWRCWNFEPDAGKQLNPYLASEGILRQ----.
27 VWRGWNFEPGGEYWLNRCIESHGIRKTQ---.
28 VWRGWNFEPGGEYWLNRCIESHGIRKTQ---.
29 VWRAWNFEPDAGEGLNRYIRTSGIRTDTATR.
30 VWRTWNFEPGGTYWLNRYIADYGIRKPQ---.
31 RWRCRNSEPDAGKVLNTRLASDGLLRQ----.
32 RWRCRNSEADAGKQLNAWLASGGLLRQ----.
33 RWRCRNSEPDAGKVLNTRLASDGPLRQ----.
34 VWRAWNFEPDAGEGFNRYIHRSGIRTDTFPR.
35 RWRCRNS-P---LILNPMPERCLIRGSPVTVl
36 VWRNWNYESGANDALNSYLQSHGLKAS----.
37 RW-----------------------------.
38 RW-----------------------------.