(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.soe.ucsc.edu/research/compbio/sam.html
Citations (SAM, SAM-T99, HMMs)
- R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
- K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting remote protein homologies, Bioinformatics 14(10):846-856, 1998.
- A. Krogh et al., Hidden Markov models in computational biology: Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
Sequence numbers correspond to the following labels:
- 1: T0383 Hypothetical Protein SP_1558, Streptococcus pneumoniae TIGR4, 127 res
- 2: gi|62528163|ref|ZP_00389421.1|_1:124 COG4835: Uncharacterized protein conserved in bacteria [Streptococcus thermophilus LMD-9]
- 3: gi|55738381|gb|AAV62022.1|_1:124 conserved hypothetical protein [Streptococcus thermophilus CNRZ1066]
  - gi|55822396|ref|YP_140837.1| hypothetical protein str0421 [Streptococcus thermophilus CNRZ1066]
  - gi|55820508|ref|YP_138950.1| conserved hypothetical protein, [Streptococcus thermophilus LMG 18311]
  - gi|55736493|gb|AAV60135.1| conserved hypothetical protein, [Streptococcus thermophilus LMG 18311]
- 4: gi|14973049|gb|AAK75645.1|_2:128 hypothetical protein SP_1558 [Streptococcus pneumoniae TIGR4]
  - gi|15901401|ref|NP_346005.1| hypothetical protein SP1558 [Streptococcus pneumoniae TIGR4]
  - gi|66879340|ref|ZP_00404363.1| COG4835: Uncharacterized protein conserved in bacteria [Streptococcus pneumoniae TIGR4]
- 5: gi|15459070|gb|AAL00220.1|_2:128 Hypothetical protein [Streptococcus pneumoniae R6]
  - gi|15903459|ref|NP_359009.1| hypothetical protein spr1416 [Streptococcus pneumoniae R6]
- 6: gi|71853266|gb|AAZ51289.1|_1:124 transposase [Streptococcus pyogenes MGAS5005]
  - gi|50903109|gb|AAT86824.1| Transposase [Streptococcus pyogenes MGAS10394]
  - gi|13622031|gb|AAK33787.1| hypothetical protein SPy0864 [Streptococcus pyogenes M1 GAS]
  - gi|94994158|ref|YP_602256.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10750]
  - gi|94992239|ref|YP_600338.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS2096]
  - gi|94990238|ref|YP_598338.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10270]
  - gi|94988357|ref|YP_596458.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS9429]
  - gi|94547666|gb|ABF37712.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10750]
  - gi|94545747|gb|ABF35794.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS2096]
  - gi|94543746|gb|ABF33794.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS10270]
  - gi|94541865|gb|ABF31914.1| hypothetical cytosolic protein [Streptococcus pyogenes MGAS9429]
  - gi|71802411|gb|AAX71764.1| transposase [Streptococcus pyogenes MGAS6180]
  - gi|50914035|ref|YP_060007.1| Transposase [Streptococcus pyogenes MGAS10394]
  - gi|19748091|gb|AAL97569.1| hypothetical protein [Streptococcus pyogenes MGAS8232]
  - gi|28811431|dbj|BAC64363.1| hypothetical protein [Streptococcus pyogenes SSI-1]
  - gi|21904313|gb|AAM79192.1| conserved hypothetical protein [Streptococcus pyogenes MGAS315]
  - gi|21910121|ref|NP_664389.1| hypothetical protein SpyM3_0585 [Streptococcus pyogenes MGAS315]
  - gi|28896180|ref|NP_802530.1| hypothetical protein SPs1268 [Streptococcus pyogenes SSI-1]
  - gi|19745934|ref|NP_607070.1| hypothetical protein spyM18_0925 [Streptococcus pyogenes MGAS8232]
  - gi|15674892|ref|NP_269066.1| hypothetical protein SPy0864 [Streptococcus pyogenes M1 GAS]
  - gi|71903316|ref|YP_280119.1| transposase [Streptococcus pyogenes MGAS6180]
  - gi|71910484|ref|YP_282034.1| transposase [Streptococcus pyogenes MGAS5005]
- 7: gi|76563355|gb|ABA45939.1|_1:124 conserved hypothetical protein [Streptococcus agalactiae A909]
  - gi|22534369|gb|AAN00214.1| protein of unknown function [Streptococcus agalactiae 2603V/R]
  - gi|22537490|ref|NP_688341.1| hypothetical protein SAG1343 [Streptococcus agalactiae 2603V/R]
  - gi|24412993|emb|CAD47072.1| unknown [Streptococcus agalactiae NEM316]
  - gi|76788298|ref|YP_329984.1| hypothetical protein SAK_1374 [Streptococcus agalactiae A909]
  - gi|25011455|ref|NP_735850.1| hypothetical protein gbs1413 [Streptococcus agalactiae NEM316]
  - gi|77414519|ref|ZP_00790666.1| protein of unknown function [Streptococcus agalactiae 515]
  - gi|77411141|ref|ZP_00787493.1| protein of unknown function [Streptococcus agalactiae CJB111]
  - gi|77162759|gb|EAO73718.1| protein of unknown function [Streptococcus agalactiae CJB111]
  - gi|77159442|gb|EAO70606.1| protein of unknown function [Streptococcus agalactiae 515]
  - gi|76798772|ref|ZP_00780988.1| Protein of unknown function (DUF1149) superfamily [Streptococcus agalactiae 18RS21]
  - gi|76585879|gb|EAO62421.1| Protein of unknown function (DUF1149) superfamily [Streptococcus agalactiae 18RS21]
- 8: gi|77409291|ref|ZP_00785996.1|_1:124 protein of unknown function [Streptococcus agalactiae COH1]
  - gi|77172088|gb|EAO75252.1| protein of unknown function [Streptococcus agalactiae COH1]
- 9: gi|24379350|ref|NP_721305.1|_1:124 hypothetical protein SMU.898 [Streptococcus mutans UA159]
  - gi|24377276|gb|AAN58611.1| conserved hypothetical protein [Streptococcus mutans UA159]
- 10: gi|81096796|ref|ZP_00875126.1|_1:124 conserved hypothetical protein [Streptococcus suis 89/1591]
  - gi|80977148|gb|EAP40701.1| conserved hypothetical protein [Streptococcus suis 89/1591]
- 11: gi|62464138|ref|ZP_00383437.1|_1:130 COG4835: Uncharacterized protein conserved in bacteria [Lactococcus lactis subsp. cremoris SK11]
- 12: gi|12724566|gb|AAK05661.1|_1:130 UNKNOWN PROTEIN [Lactococcus lactis subsp. lactis Il1403]
  - gi|15673545|ref|NP_267719.1| hypothetical protein L5610 [Lactococcus lactis subsp. lactis Il1403]
- 13: gi|23003673|ref|ZP_00047327.1|_1:126 hypothetical protein Lgas_03001712 [Lactobacillus gasseri ATCC 33323]
- 14: gi|42518757|ref|NP_964687.1|_1:126 hypothetical protein LJ0832 [Lactobacillus johnsonii NCC 533]
  - gi|41583043|gb|AAS08653.1| hypothetical protein LJ_0832 [Lactobacillus johnsonii NCC 533]
- 15: gi|58254298|gb|AAV42535.1|_1:125 hypothetical protein LBA0658 [Lactobacillus acidophilus NCFM]
  - gi|58336981|ref|YP_193566.1| hypothetical protein LBA0658 [Lactobacillus acidophilus NCFM]
- 16: gi|29342796|gb|AAO80560.1|_1:122 conserved hypothetical protein [Enterococcus faecalis V583]
  - gi|29375336|ref|NP_814490.1| hypothetical protein EF0742 [Enterococcus faecalis V583]
- 17: gi|81428094|ref|YP_395093.1|_1:126 hypothetical protein LSA0480 [Lactobacillus sakei subsp. sakei 23K]
  - gi|78609735|emb|CAI54781.1| Hypothetical protein [Lactobacillus sakei subsp. sakei 23K]
- 18: gi|104773702|ref|YP_618682.1|_1:127 hypothetical protein Ldb0592 [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842]
  - gi|103422783|emb|CAI97422.1| Conserved hypothetical protein [Lactobacillus delbrueckii subsp. bulgaricus ATCC 11842]
  - gi|62515590|ref|ZP_00386991.1| COG4835: Uncharacterized protein conserved in bacteria [Lactobacillus delbrueckii subsp. bulgaricus ATCC BAA-365]
- 19: gi|62514719|ref|ZP_00386216.1|_1:125 COG4835: Uncharacterized protein conserved in bacteria [Lactobacillus casei ATCC 334]
- 20: gi|16412859|emb|CAC95663.1|_1:124 lin0430 [Listeria innocua]
  - gi|16799507|ref|NP_469775.1| hypothetical protein lin0430 [Listeria innocua Clip11262]
- 21: gi|47096598|ref|ZP_00234187.1|_1:124 conserved hypothetical protein [Listeria monocytogenes str. 1/2a F6854]
  - gi|47015058|gb|EAL06002.1| conserved hypothetical protein [Listeria monocytogenes str. 1/2a F6854]
  - gi|16409785|emb|CAC98486.1| lmo0407 [Listeria monocytogenes]
  - gi|16802452|ref|NP_463937.1| hypothetical protein lmo0407 [Listeria monocytogenes EGD-e]
- 22: gi|46906645|ref|YP_013034.1|_1:124 hypothetical protein LMOf2365_0427 [Listeria monocytogenes str. 4b F2365]
  - gi|47091553|ref|ZP_00229350.1| conserved hypothetical protein [Listeria monocytogenes str. 4b H7858]
  - gi|47020230|gb|EAL10966.1| conserved hypothetical protein [Listeria monocytogenes str. 4b H7858]
  - gi|46879910|gb|AAT03211.1| conserved hypothetical protein [Listeria monocytogenes str. 4b F2365]
- 23: gi|77409470|ref|ZP_00786162.1|_1:88 protein of unknown function [Streptococcus agalactiae COH1]
  - gi|77171913|gb|EAO75090.1| protein of unknown function [Streptococcus agalactiae COH1]
- 24: gi|56808916|ref|ZP_00366625.1|_1:93 COG4835: Uncharacterized protein conserved in bacteria [Streptococcus pyogenes M49 591]
- 25: gi|77406292|ref|ZP_00783358.1|_1:71 protein of unknown function [Streptococcus agalactiae H36B]
  - gi|77175104|gb|EAO77907.1| protein of unknown function [Streptococcus agalactiae H36B]
              10           20                  30             40
               |            |                   |              |
 1 .SNAMNLKR.E.QEFVSQYHFD...A..R..NFE.W..E.N..ENG..APETK.V..DVNFQLLQ
 2 .---MDLIH.E.KVFVNRYHYD...V..R..NHE.W..E.K..ENG..VPKTN.V..EVTFQLHH
 3 .---MDLIH.E.KVFVNRYHYD...V..R..NHE.W..E.K..ENG..VPKTN.V..EVTFQLHH
 4 mEKDMNLKR.E.QEFVSQYHFD...A..R..NFE.W..E.N..ENG..APETK.V..DVNFQLLQ
 5 mEKDMNLKR.E.QEFVSQYHFD...A..R..NFE.W..E.N..ENG..APETK.V..DVNFQLLQ
 6 .---MQLVR.E.KEFVNQYHYD...A..R..NLE.W..E.K..ENG..TPETN.F..EVTFQLID
 7 .---MEVIR.E.QEFVNQYHYD...A..R..NLE.W..E.E..ENG..TPKTN.F..EVTFQLAN
 8 .---MEVIR.E.QEFVNQYHYD...A..R..NLE.W..E.E..ENG..TPKTN.F..EVTFQLAN
 9 .---MEIVR.Q.KEFVNQYHYD...A..R..NLE.W..E.K..ENG..TPETD.V..EVTFQLVE
10 .---MEIIR.D.KEFVNQYHFD...A..R..NHA.W..E.K..ENG..IPETK.L..KVDFQLIE
11 .-MTLTIER.E.QEFVNQFHYD...A..R..NYE.W..E.K..ENG..TPETN.L..NVQFQLVP
12 .-MALTIER.E.QEFVNQFHYD...A..R..NYE.W..E.K..ENG..TPETN.L..NVQFQLVP
13 .---MDFKNmT.PIIVRSFHYD...L..N..DEQkV..K.N..EVN..VSLRQ.VyqDLDDGSQD
14 .---MDFKNmT.PIVVRSFHYD...L..N..DEQkV..K.N..EVN..VSLRQ.VyqDLEDGSQD
15 .---MDFEK.QtPVTVQSFHYD...L..V..DEG.VaaK.S..EVN..PGIRK.L..DVSGDDEH
16 .---MEIKR.Q.QEIVEAYHYD...M..RvpDSE.V..E.T..DLR..VSFSP.I..EVEEENYP
17 .---MQTKK.D.PISVDAFHFD...R..V..SPD.A..E.P..QQNiqVSLVK.I..DADDEYLQ
18 .---MEFNK.QtPIIVRAFHYD...L..L..DEP.E..EkN..EVN..VAIRQ.VvaTGEDGVED
19 .---METKR.Y.PIAVESFHYD...LvkQ..GTP.V..K.N..DLQ..VAMRQ.I..EWS-DPAK
20 .---MDIVT.N.KIVVEKYNFEttmE..E..NQP.F..E.NkiELE..VHEVEpV..DGNVELMA
21 .---MDIVT.N.KIVVEKYNFEtivE..E..NEQ.F..E.NkiELE..VHEVEpV..NGNVELMS
22 .---MDIVT.N.KIVVEKYNFEtimE..E..NEQ.F..E.NkiELE..VHEVEpV..NGNVELMS
23 .--------.-.----------...-..-..---.-..-.-..---..-----.-..---FQLAN
24 .---MQLVR.E.KEFVNQYHYD...A..R..NLE.W..E.K..ENG..TPETN.F..EVTFQLID
25 .--------.-.----------...-..-..---.-..-.-..---..-----.-..--------
         50            60          70        80        90       100
          |             |           |         |         |         |
 1 H.D.QENQV....TSLIVILSFMIVFDK..FVISGTISQVNHIDGRIVNEPSELNQEEVETLARP
 2 K.D.KEANN....TSVVAVLQFMIVRDE..FVISGIISQMNHIQNRIVNNPKDFSQEEAEYLAAP
 3 K.D.KEANN....TSVVAVLQFMIVRDE..FVISGIVSQMNHIQNRIVNNPKDFSQEEAEYLAAP
 4 H.D.QENQV....TSLIVILSFMIVFDK..FVISGTISQVNHIDGRIVNEPSELNQEEVETLARP
 5 H.D.QENQV....TSLIVILSFMIVFDK..FVISGTISQVNHIDGRIVNEPNELNQEEVETLARP
 6 K.D.EQQKE....TVIVSVLQFVIVKEE..FVISGVISQMVRILDRLVDKPSEFTQEEVESLAAP
 7 R.D.EAAKV....TSIVAVLQFVIVRDE..FVISGVISQMAHIQGRLINEPSEFSQDEVENLAAP
 8 R.D.EAAKV....TSIVAVLQFVIVRDE..FVISGVISQIAHIQGRLINEPSEFSQDEVENLAAP
 9 K.N.EELNE....TTVVAVLQFMIVRDE..FVLSGVLSQMVKIKDRLVNQPSEFSQEEVATLAAP
10 Q.N.RAENR....TSMITILRFMIVLDH..FVISGAMSQAVHLPNRLVEEPTEFTDEEKRVLVEP
11 K.E.QLEKLgdrdTGIHAVLTYLIVLDN..IVLSGFVSQLNYVRGKIIKEQEELEQEELAQLAAP
12 K.E.QLEKLgegdTGIHAVLTYLIVLDN..IVLSGFVSQLNYVRGQIIKEQEELEQEELAQLAAP
13 E.G.KTGKY....FEIAVPFEVSPAPGD..FTVSGVITRVVQFVD-YFGDGTDLEPSDYQLLSRP
14 E.G.KNGKY....FEIAVPFEVSPAPGD..FTVSGVITRVVQFVD-YFGDGSDLEPSEYQLLSRP
15 S.E.EEGSY....YDVAVFFDVIPAPAE..FEVSGAIHQIVQIKN-YHGDGTDISNADWQLLSRP
16 E.-.---NS....SALVARLEFRIVFDE..FVLSGAISQINHIIDRKIEKQEDISQEEVDELVRP
17 EaDlKAGNI....YQIVTPFQVMPQGSG..FAVSGQISRVVQLLD-FFGTPDEIEQKEMMKLSRP
18 A.G.EAGGY....YEVAVVYDVTPDEDNi.IEISGVNTQVVQLLG-YHGDGQDLDQETYRLLSRP
19 Q.D.ELKKG....NLFQMMIPFDVVPDDagFEISGKITQIVQVLD-YFGEANELPQAELGKLSRP
20 K.G.KIFKI....T-----IPFLLVLEN..FRIDGRISRIIQLKD-FFGQFSDLEAKDVEGLSNP
21 K.G.KIFKI....T-----IPFLLVLEN..FRIDGRISRIIQLKD-FFGDFSDLEAVDVEGLSNP
22 K.G.KIFKI....T-----IPFLLVLEN..FRIDGRISRIIQLKD-FFGDFSDLEAVDVEGLSNP
23 R.D.EAAKV....TSIVAVLQFVIVRDE..XVISGVISQIAHIQGRLINEPSEFSQDEVENLAAP
24 K.D.EQQKE....TVIVSVLQFVIVKEE..FVISGVISQMVRILDRLVDKPSELRKRKLNP----
25 -.-.-----....-----MLQFVIVRDE..FVISGVISQMAHIQGRLINEPSEFSQDEVENLAAP
         110       120
           |         |
 1 CLNMLNRLTYEVTEIALDLPGINLEF...
 2 LLDTLQRLTYEVTEIALDRPGINLEF...
 3 LLDTLQRLTYEVTEIALDRPGINLEF...
 4 CLNMLNRLTYEVTEIALDLPGINLEF...
 5 CLNMLNRLTYEVTEIALDLPGINLEF...
 6 LLDMVKRLTYEVTEIALDRPGIHLEFkn.
 7 LLEIVKRLTYEVTEIALDRPGVTLEFns.
 8 LLEIVKRLTYEVTEIALDRPGVTLEFns.
 9 LLDILKRMTYEVTEIALDKEGVSLEF...
10 LLDILKRMTYEVTEIAFDAPGVNLEF...
11 LFDLLQRLVYETTEVALDHPGINLEF...
12 LFELLKRLVYETTEVALDQPGINLGF...
13 LVEQIETLTYEITQLTLDHP-VNLSFksn
14 LVEEIETLTYEITQLTLDHP-VNLSFksn
15 LVEYIETLTYEVTQVTFDKP-VNLNFkae
16 LFSIVERLSYEVTEIALDRPGVQLNFqqs
17 LIEYIETLTYQVTAVALNE-GIQLQFtah
18 LVEYIETLTYEVSQVALDEP-INLDFepn
19 LVETIETLTYQVTAVALDQ-GVNLQFgas
20 LIDYIKRLTYDVTEIAFDEPGVSLDFnan
21 LIDYIKRLTYDVTEIAFDEPGVSLDFnan
22 LIDYIKRLTYDVTEIAFDEPGVSLDFnan
23 LLEIVKRLTYEVTEIALDRPGVTLEFns.
24 --------------------------...
25 LLEIVKRLTYEVTEIALDRPGVTLEFns.