(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.soe.ucsc.edu/research/compbio/sam.html
Citations (SAM, SAM-T99, HMMs)
- R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
- K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting remote protein homologies, Bioinformatics 14(10):846-856, 1998.
- A. Krogh et al., Hidden Markov models in computational biology: Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
Sequence numbers correspond to the following labels:
- 1:
  - T0358 NESG:ER397, Escherichia coli, 87 res
- 2:
  - gi|110345587|gb|ABG71824.1|_1:77 putative cytoplasmic protein [Escherichia coli 536]
  - gi|26110819|gb|AAN83003.1| Hypothetical protein ykfF [Escherichia coli CFT073]
  - gi|26250389|ref|NP_756429.1| Hypothetical protein ykfF [Escherichia coli CFT073]
  - gi|24528054|emb|CAD33784.1| hypothetical protein [Escherichia coli]
  - gi|70608383|gb|AAZ04455.1| conserved hypothetical protein [Escherichia coli]
- 3:
  - gi|83586571|ref|ZP_00925204.1|_1:77 hypothetical protein Ecol1_01002225 [Escherichia coli 101-1]
- 4:
  - gi|75259093|ref|ZP_00730461.1|_1:77 hypothetical protein EcolE2_01001358 [Escherichia coli E22]
- 5:
  - gi|75230630|ref|ZP_00717105.1|_1:77 hypothetical protein EcolB7_01001278 [Escherichia coli B7A]
- 6:
  - gi|89033311|gb|ABD59989.1|_1:77 hypothetical protein [Escherichia coli]
  - gi|50346328|ref|NP_052923.2| hypothetical protein R100p043 [Plasmid R100]
- 7:
  - gi|75238464|ref|ZP_00722461.1|_1:77 hypothetical protein EcolF_01003511 [Escherichia coli F11]
- 8:
  - gi|75512277|ref|ZP_00734839.1|_1:77 hypothetical protein Ecol5_01003544 [Escherichia coli 53638]
- 9:
  - gi|75208230|ref|ZP_00708597.1|_1:77 hypothetical protein EcolB_01004504 [Escherichia coli B171]
- 10:
  - gi|37962737|gb|AAR05684.1|_3:79 conserved hypothetical protein [Salmonella typhimurium]
  - gi|58383258|ref|YP_194828.1| hypothetical protein pU302L_021 [Salmonella typhimurium]
- 11:
  - gi|45758187|gb|AAS76399.1|_1:77 YkfF [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67]
  - gi|60115629|ref|YP_209420.1| YkfF [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67]
- 12:
  - gi|73476837|gb|AAZ76452.1|_1:77 hypothetical protein LH0085 [Escherichia coli]
  - gi|73853253|ref|YP_308749.1| hypothetical protein LH0085 [Escherichia coli]
- 13:
  - gi|26111397|gb|AAN83579.1|_1:77 Hypothetical protein ykfF [Escherichia coli CFT073]
  - gi|26250965|ref|NP_757005.1| Hypothetical protein ykfF [Escherichia coli CFT073]
- 14:
  - gi|91213937|ref|YP_543923.1|_1:77 hypothetical protein YkfF [Escherichia coli UTI89]
  - gi|91075511|gb|ABE10392.1| hypothetical protein YkfF [Escherichia coli UTI89]
- 15:
  - gi|88770148|gb|ABD51585.1|_10:86 conserved hypothetical protein [Escherichia coli]
- 16:
  - gi|91206338|ref|YP_538692.1|_3:79 hypothetical protein UTI89_P093 [Escherichia coli UTI89]
  - gi|91075789|gb|ABE10669.1| hypothetical protein UTI89_P093 [Escherichia coli UTI89]
- 17:
  - gi|38606100|gb|AAR25064.1|_3:79 YdeA [Escherichia coli]
  - gi|41056963|ref|NP_957583.1| YdeA [Escherichia coli]
  - gi|5103191|dbj|BAA78827.1| 94% identical to gp:AF074613_85[hypothetical protein of plasmid pO157], 61% identical (1 gap) to 64 residues of 79 aa protein gp:ECAE000133_3[hypothetical protein of E. coli] [Plasmid R100]
- 18:
  - gi|5702179|gb|AAD47188.1|_1:77 unknown; orf77 [Escherichia coli]
  - gi|75234993|ref|ZP_00719258.1| hypothetical protein EcolE1_01003133 [Escherichia coli E110019]
- 19:
  - gi|8918884|dbj|BAA97931.1|_3:79 yfjA [Plasmid F]
  - gi|9507774|ref|NP_061440.1| hypothetical protein Fpla063 [Plasmid F]
- 20:
  - gi|41223299|ref|NP_958724.1|_1:77 hypothetical protein ECp054 [Escherichia coli O157:H7 str. Sakai]
- 21:
  - gi|75234748|ref|ZP_00719058.1|_1:77 hypothetical protein EcolE1_01003386 [Escherichia coli E110019]
- 22:
  - gi|110344756|gb|ABG70993.1|_1:77 hypothetical protein YkfF [Escherichia coli 536]
- 23:
  - gi|75226917|ref|ZP_00713803.1|_1:77 hypothetical protein EcolB7_01004569 [Escherichia coli B7A]
- 24:
- 25:
  - gi|3822199|gb|AAC70153.1|_10:86 hypothetical protein [Escherichia coli O157:H7]
  - gi|75994531|ref|YP_325645.1| hypothetical protein L7085 [Escherichia coli O157:H7 EDL933]
- 26:
  - gi|47154999|emb|CAE85198.1|_13:89 YkfF protein [Escherichia coli]
- 27:
  - gi|71559064|ref|YP_271791.1|_1:75 hypothetical protein pOU1113_67 [Salmonella enterica]
  - gi|68166363|gb|AAY88124.1| hypothetical protein [Salmonella enterica]
- 28:
  - gi|16445257|gb|AAL23475.1|_6:80 putative cytoplasmic protein [Salmonella typhimurium LT2]
  - gi|17233436|ref|NP_490555.1| putative cytoplasmic protein [Salmonella typhimurium LT2]
- 29:
  - gi|16128234|ref|NP_414783.1|_1:79 CP4-6 prophage; predicted protein [Escherichia coli K12]
  - gi|1786443|gb|AAC73352.1| CP4-6 prophage; predicted protein [Escherichia coli K12]
  - gi|89107122|ref|AP_000902.1| hypothetical protein [Escherichia coli W3110]
  - gi|3025208|sp|P75677|YKFF_ECOLI Hypothetical protein ykfF
  - gi|85674398|dbj|BAA77918.2| hypothetical protein [Escherichia coli W3110]
- 30:
  - gi|31795419|ref|NP_857872.1|_1:76 hypothetical protein Y1091 [Yersinia pestis KIM]
  - gi|45357335|gb|AAS58729.1| hypothetical protein pMT098 [Yersinia pestis biovar Medievalis str. 91001]
  - gi|108793825|ref|YP_636672.1| hypothetical protein YPN_MT0093 [Yersinia pestis Nepal516]
  - gi|108793621|ref|YP_636780.1| hypothetical protein YPA_MT0089 [Yersinia pestis Antiqua]
  - gi|108782168|gb|ABG16225.1| hypothetical protein YPA_MT0089 [Yersinia pestis Antiqua]
  - gi|108777889|gb|ABG20407.1| hypothetical protein YPN_MT0093 [Yersinia pestis Nepal516]
  - gi|3883089|gb|AAC82749.1| unknown [Yersinia pestis KIM]
  - gi|2996363|gb|AAC13243.1| unknown [Yersinia pestis KIM]
  - gi|31795259|ref|NP_857714.1| hypothetical protein YPKMT101 [Yersinia pestis KIM]
  - gi|5834744|emb|CAB55242.1| hypothetical protein YPMT1.60c [Yersinia pestis CO92]
  - gi|52538124|emb|CAG27550.1| hypothetical protein [Yersinia pestis]
  - gi|52788195|ref|YP_094023.1| hypothetical protein pG8786_144 [Yersinia pestis]
  - gi|16082850|ref|NP_395404.1| hypothetical protein YPMT1.60c [Yersinia pestis CO92]
  - gi|89103272|ref|ZP_01175857.1| hypothetical protein Ypesb_01002735 [Yersinia pestis biovar Orientalis str. IP275]
  - gi|45478682|ref|NP_995538.1| hypothetical protein pMT098 [Yersinia pestis biovar Medievalis str. 91001]
- 31:
  - gi|57434425|emb|CAI43842.1|_1:77 hypothetical protein [Escherichia coli]
  - gi|57545662|gb|AAW51753.1| Aec70 [Escherichia coli]
- 32:
  - gi|26109909|gb|AAN82114.1|_1:77 Hypothetical protein ykfF [Escherichia coli CFT073]
  - gi|26249501|ref|NP_755541.1| Hypothetical protein ykfF [Escherichia coli CFT073]
- 33:
  - gi|26106581|gb|AAN78767.1|_1:77 Hypothetical protein ykfF [Escherichia coli CFT073]
  - gi|26246184|ref|NP_752223.1| Hypothetical protein ykfF [Escherichia coli CFT073]
- 34:
- 35:
  - gi|18265878|gb|AAL67351.1|_1:77 intergenic-region protein [Escherichia coli]
- 36:
  - gi|16505971|emb|CAD09857.1|_1:76 hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18]
  - gi|18466655|ref|NP_569463.1| hypothetical protein HCM1.279bc [Salmonella enterica subsp. enterica serovar Typhi str. CT18]
  - gi|10957300|ref|NP_058324.1| hypothetical protein [Salmonella typhi]
  - gi|7800353|gb|AAF69949.1| orf; hypothetical protein [Salmonella typhi]
- 37:
  - gi|75259064|ref|ZP_00730432.1|_1:52 hypothetical protein EcolE2_01001328 [Escherichia coli E22]
  - gi|75212146|ref|ZP_00712186.1| hypothetical protein EcolB_01000748 [Escherichia coli B171]
  - gi|75208370|ref|ZP_00708708.1| hypothetical protein EcolB_01004372 [Escherichia coli B171]
- 38:
  - gi|75189020|ref|ZP_00702287.1|_13:64 hypothetical protein EcolE_01002298 [Escherichia coli E24377A]
                          10        20        30         40
                           |         |         |          |
1  ...MTQ.S..V..L....L..PP.GPFTRRQAQAVTTTYSNITLED.DQGSHFRLVVRD...TEG
2  ...MSD.ChpV..L....L..PE.GPFSREQAVAVTTAYRNVLIED.DQGTHFRLVIRN...AEG
3  ...MSE.Y..F..RifqgL..PD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DG
4  ...MSE.Y..F..RilqgL..PD.GPFTRKHAEAVAAQYRNVFIED.DHGEHFRLVVRN...-NG
5  ...MSE.Y..F..RilqgL..PD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRN...NNG
6  ...MSE.Y..F..RilqgL..PD.GPFTRKQAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NG
7  ...MSE.Y..F..RilqgL..PD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRN...NSG
8  ...MSE.Y..F..RifqgL..PD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DG
9  ...MSE.Y..F..RifqgL..PD.GSFTREQAEAVAAQYQNVFIED.DQGTHFRLVVRQ...-DG
10 mr.MSE.Y..F..RilqgL..PD.GPFTRKQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DG
11 ...MSE.Y..F..RilqgL..PD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DG
12 ...MSE.Y..F..RilqgL..PD.GSFTREQAEAVAAQYRNVFIED.EQGTHFRLVVRN...NSG
13 ...MSD.ChpV..L....L..PE.GPFSREQAMAVTTAYRNVLIED.DQGTHFRLVIHN...AEG
14 ...MSD.ChpV..L....L..PE.GPFSREQAVAVTTAYRNVLIED.DQGTHFRLVIRN...AGG
15 emrMSE.Y..F..RilqgL..PD.GPFTRKQAEAVAAQYRNVFIED.DQGTHFRLVVRQ...-DG
16 mr.MSE.Y..F..RilqgL..PD.GSFTREQAEAVAAQYRNVFIED.DQGTHFRLVVRN...NSG
17 mr.MSE.Y..F..RilqgL..PD.GPFTRKQAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NG
18 ...MSE.Y..F..RilqgL..PD.GPFTRKHAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NG
19 mr.MSE.Y..F..RilqgL..PD.GPFTRKHAEAVAAQYRNVFIED.DHGEQFRLVVRN...-NG
20 ...MSE.Y..F..RilqgL..PD.GPFTRKHAEAVAAQYRNVFIEN.DHGEQFRLVVRN...-NG
21 ...MSE.Y..F..RilqgL..PD.GPFTRKHAEAVAAQYRNVFIED.DHGEQFRLVVRN...NE-
22 ...MPG.C..TsrL....L..PE.GPFSRNQALAVTTAYLNVLIED.DQGTHFRLVIRN...AEG
23 ...MSE.Y..F..RilqgL..PD.GPFTRKQAEAVAAQYRNVFIGD.DHGEQFRLVVRN...-NG
24 emrMSE.Y..F..RilqgL..PD.GSFTREQAEAVAVQYRNVFIED.DHGEQFRLVVRN...-NG
25 emrMSE.Y..F..RilqgL..PD.GPFTRKHAEAVAAQYRNVFIEN.DHGEQFRLVVRN...-NG
26 atdMPG.C..TsrL....L..PE.GPFSRNQALAVTTAYLNVLIED.DQGTHFRLVIRN...AEG
27 ...MIP.L..K..S....L..PD.GTFTHEQAEAVAAQYQNVAIED.DQGTHLRLVVRK...-DG
28 dniMSP.L..K..S....L..PD.GTFTHEQAEAVAAQYQNVAIED.DQGTHLRLVVRK...-DG
29 ...MTQ.S..V..L....L..PP.GPFTRRQAQAVTTTYSNITLED.DQGSHFRLVVRD...TEG
30 ...MSEaL..A..V....L..PD.DTFTREQAEVVAAQYTNVAIED.DQGAHFRLVVRQ...-NG
31 ...MPG.YteY..V....L..AE.GSFSYGQAVAVITAYRNVFIQD.DPGMHFRRVIRN...AEG
32 ...MPG.YteY..V....L..AE.GSFSYGQAVAVITAYRNVFIQD.DPGMHFRRVIRN...AEG
33 ...MRG.C..T..L....LlfSE.APFLREQEVAVITAYRNVFIQD.DPGMHFRWVIRN...AEG
34 xsfXQT.L..S..G....L..PQwASADCVAGPLVSAGITDINIED.DQGIHVRLIVRD...AEG
35 ...MRG.C..T..L....LlfSE.APFLREQEVAVITAYRNVFIQD.DPGMHFRWVIRN...AEG
36 ...MSD.L..F..S....-..SE.SPVTLAQARTVAAGYQNVFIENlQPAGHFQIVIRDhrdHDS
37 ...MPG.C..TsrL....L..PE.GPFSREQAVAVKTAYRNVFTED.DQGTYSRLVIRN...AEG
38 atdMPG.C..TsrL....L..PE.GPFSREQAVAVKTAYRNVFTED.DQGTYSRLVIRN...AEG
     50        60        70
      |         |         |
1  RMVWRAWNFEPDAGEGLNRYIRTSGIRTDTATR.
2  QLRWRCWNFEPDAGKQLNSYLASEGILRQ----.
3  TLIWRSWNFEDCAGYWMNRYIRDFGIRK-----.
4  TMVWRTWNFEDGAGYWMNHAIRDFGILK-----.
5  -LVWRTWNFEDGAGYWMNHVIRNFGILK-----.
6  TMVWRTWNFEDGAGYWMNHVIRDFGILK-----.
7  -LVWRTWNFEDGAGYWMNHVIRNFGILK-----.
8  TLIWRSWNFEDCAGYWMNRYIRDFGILK-----.
9  TLIWRSWNFEDCAGYWMNQYIRDFGIRK-----.
10 TLIWRSWNFEDCAGYWMNRYIRDFGILK-----.
11 TLIWRSWNFEDCAGYWMNQYIRDFGILK-----.
12 -LVWRTWNFEDGAGYWMNHVIRDFGILK-----.
13 QLRWRCWNFEPDAGKQLNPYLASEGILRQ----.
14 QLRWRCWNFEPDAGKQLNSYLASEGILRQ----.
15 TLIWRSWNFEDCAGYWMNRYIRDFGILK-----.
16 -LVWRTWNFEDGAGYWMNHVIRNFGILK-----.
17 TMVWRTWNFEDGAGYWMNHVIRDFGILK-----.
18 AMVWRTWNFEDGAGYWMNHVIRDFGILK-----.
19 AMVWRTWNFEDGAGYWMNHVIRDFGILK-----.
20 AMVWRTWNFEDGAGYWMNHVIRDFGIIK-----.
21 TMVWRTWNFEDGAGYWMNHVIRDFGILK-----.
22 QLRWRCWNFEPDAGKQLNPYLASEGILRQ----.
23 AMVWRTWNFEDGAGYWMNRYIRDFGILK-----.
24 AMVWRTWNFEDGAGYWMNHVIRDFGILK-----.
25 AMVWRTWNFEDGAGYWMNHVIRDFGIIK-----.
26 QLRWRCWNFEPDAGKQLNPYLASEGILRQ----.
27 EMVWRGWNFEPGGEYWLNRCIESHGIRKTQ---.
28 EMVWRGWNFEPGGEYWLNRCIESHGIRKTQ---.
29 RMVWRAWNFEPDAGEGLNRYIRTSGIRTDTATR.
30 EMVWRTWNFEPGGTYWLNRYIADYGIRKPQ---.
31 QRRWRCRNSEPDAGKVLNTRLASDGLLRQ----.
32 QRRWRCRNSEADAGKQLNAWLASGGLLRQ----.
33 QRRWRCRNSEPDAGKVLNTRLASDGPLRQ----.
34 RMVWRAWNFEPDAGEGFNRYIHRSGIRTDTFPR.
35 QRRWRCRNS-P---LILNPMPERCLIRGSPVTVl
36 QLVWRNWNYESGANDALNSYLQSHGLKAS----.
37 QLRW-----------------------------.
38 QLRW-----------------------------.
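
The alignment above follows SAM's a2m-style conventions: uppercase letters and "-" occupy match columns, while lowercase letters and "." occupy insert columns. The following is a minimal Python sketch, not part of SAM itself, for rejoining the blocks of each numbered row and dropping the insert columns. The function names `join_blocks` and `match_columns` are hypothetical, and the parser assumes each alignment row consists of a sequence number followed by one whitespace-free segment.

```python
from collections import defaultdict


def join_blocks(lines):
    """Concatenate the segments of each numbered row across alignment blocks."""
    rows = defaultdict(str)
    for line in lines:
        parts = line.split()
        # Assumption: an alignment row has exactly two fields, a sequence
        # number and a segment. Ruler lines such as "10 20 30 40" or
        # "| | | |" fail these checks and are skipped.
        if len(parts) == 2 and parts[0].isdigit() and not parts[1].isdigit():
            rows[int(parts[0])] += parts[1]
    return dict(rows)


def match_columns(aligned):
    """Keep only match-column characters: uppercase residues and '-' deletions."""
    return "".join(c for c in aligned if c.isupper() or c == "-")


# Rows 1 and 36 copied from the two blocks of the alignment.
sample = [
    "10 20 30 40",
    "| | | |",
    "1 ...MTQ.S..V..L....L..PP.GPFTRRQAQAVTTTYSNITLED.DQGSHFRLVVRD...TEG",
    "36 ...MSD.L..F..S....-..SE.SPVTLAQARTVAAGYQNVFIENlQPAGHFQIVIRDhrdHDS",
    "50 60 70",
    "| | |",
    "1 RMVWRAWNFEPDAGEGLNRYIRTSGIRTDTATR.",
    "36 QLVWRNWNYESGANDALNSYLQSHGLKAS----.",
]

rows = join_blocks(sample)
for number, aligned in sorted(rows.items()):
    print(number, match_columns(aligned))
```

After insert columns are removed, every row has the same length, so the rows can be compared position by position.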