(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html
Citations (SAM, SAM-T99, HMMs)
- R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
- K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting remote protein homologies, Bioinformatics 14(10):846-856, 1998.
- A. Krogh et al., Hidden Markov models in computational biology: Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
Sequence numbers correspond to the following labels:
-
- T0148 HI1034, Haemophilus influenzae
-
- gi|16272968|ref|NP_439194.1|_1:163 (NC_000907) conserved hypothetical protein [Haemophilus influenzae Rd]
- gi|1175383sp|P44096|YAJQ_HAEIN Protein HI1034
- gi|1074618|pir||F64018 conserved hypothetical protein HI1034 - Haemophilus influenzae (strain Rd KW20)
- gi|1574067|gb|AAC22694.1| (U32784) conserved hypothetical protein [Haemophilus influenzae Rd]
-
- gi|15603521|ref|NP_246595.1|_1:163 (NC_002663) unknown [Pasteurella multocida]
- gi|12722060|gb|AAK03740.1| (AE006202) unknown [Pasteurella multocida]
-
- gi|15800156|ref|NP_286168.1|_7:169 (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
- gi|15829734|ref|NP_308507.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
- gi|16128411|ref|NP_414960.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
- gi|2495529sp|P77482|YAJQ_ECOLI Protein yajQ
- gi|7429439|pir||B64772 yajQ protein - Escherichia coli
- gi|1773110|gb|AAB40182.1| (U82664) similar to H. influenzae HI1034 [Escherichia coli]
- gi|1786629|gb|AAC73529.1| (AE000149) orf, hypothetical protein [Escherichia coli K12]
- gi|12513285|gb|AAG54776.1|AE005222_1 (AE005222) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
- gi|13359937|dbj|BAB33903.1| (AP002551) hypothetical protein [Escherichia coli O157:H7]
-
- gi|16759412|ref|NP_455029.1|_7:169 (NC_003198) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
- gi|16501703|emb|CAD08891.1| (AL627266) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
-
- gi|16763815|ref|NP_459430.1|_7:169 (NC_003197) putative cytoplasmic protein [Salmonella typhimurium LT2]
- gi|16418940|gb|AAL19389.1| (AE008715) putative cytoplasmic protein [Salmonella typhimurium LT2]
-
- gi|16123332|ref|NP_406645.1|_4:166 (NC_003143) conserved hypothetical protein [Yersinia pestis]
- gi|15981108|emb|CAC92405.1| (AJ414155) conserved hypothetical protein [Yersinia pestis]
-
- gi|21244396|ref|NP_643978.1|_1:161 (NC_003919) conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306]
- gi|21110056|gb|AAM38514.1| (AE012017) conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306]
-
- gi|17547268|ref|NP_520670.1|_1:161 (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
- gi|17429570|emb|CAD16256.1| (AL646070) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
-
-
- gi|15614950|ref|NP_243253.1|_4:164 (NC_002570) BH2387~unknown conserved protein [Bacillus halodurans]
- gi|10175007|dbj|BAB06106.1| (AP001515) BH2387~unknown conserved protein [Bacillus halodurans]
-
- gi|21233061|ref|NP_638978.1|_1:161 (NC_003902) conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913]
- gi|21114912|gb|AAM42902.1| (AE012484) conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913]
-
- gi|15599591|ref|NP_253085.1|_1:159 (NC_002516) conserved hypothetical protein [Pseudomonas aeruginosa]
- gi|11348171|pir||C83096 conserved hypothetical protein PA4395 [imported] - Pseudomonas aeruginosa (strain PAO1)
- gi|9950626|gb|AAG07783.1|AE004855_5 (AE004855) conserved hypothetical protein [Pseudomonas aeruginosa]
-
- gi|15641517|ref|NP_231149.1|_1:160 (NC_002505) conserved hypothetical protein [Vibrio cholerae]
- gi|11278644|pir||D82191 conserved hypothetical protein VC1508 [imported] - Vibrio cholerae (group O1 strain N16961)
- gi|9656012|gb|AAF94663.1| (AE004229) conserved hypothetical protein [Vibrio cholerae]
-
- gi|17232154|ref|NP_488702.1|_4:163 (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
- gi|17133799|dbj|BAB76361.1| (AP003597) ORF_ID:all4662~hypothetical protein [Nostoc sp. PCC 7120]
-
- gi|16078166|ref|NP_388983.1|_4:163 (NC_000964) similar to hypothetical proteins [Bacillus subtilis]
- gi|7429438|pir||D69840 conserved hypothetical protein yitK - Bacillus subtilis
- gi|2145403|emb|CAA70666.1| (Y09476) YitK [Bacillus subtilis]
- gi|2633438|emb|CAB12942.1| (Z99109) similar to hypothetical proteins [Bacillus subtilis]
-
- gi|15607706|ref|NP_215080.1|_3:163 (NC_000962) hypothetical protein Rv0566c [Mycobacterium tuberculosis H37Rv]
- gi|15839964|ref|NP_335001.1| (NC_002755) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
- gi|7445082|pir||E70932 hypothetical protein Rv0566c - Mycobacterium tuberculosis (strain H37RV)
- gi|2909625|emb|CAA17437.1| (AL021942) hypothetical protein Rv0566c [Mycobacterium tuberculosis H37Rv]
- gi|13880105|gb|AAK44815.1| (AE006957) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
-
- gi|21222997|ref|NP_628776.1|_3:162 (NC_003888) conserved hypothetical protein [Streptomyces coelicolor A3(2)]
- gi|10129710|emb|CAC08267.1| (AL392146) conserved hypothetical protein [Streptomyces coelicolor A3(2)]
-
- gi|15791741|ref|NP_281564.1|_4:162 (NC_002163) hypothetical protein Cj0374 [Campylobacter jejuni]
- gi|11278645|pir||D81380 hypothetical protein Cj0374 [imported] - Campylobacter jejuni (strain NCTC 11168)
- gi|6967848|emb|CAB74210.1| (AL139075) hypothetical protein Cj0374 [Campylobacter jejuni]
-
- gi|10802673|gb|AAG23559.1|AF244610_1_4:49 (AF244610) putative oxo-tricarboxilic-pentene acid decarboxylase/isomerase [Carboxydothermus hydrogenoformans]
10 20 30 40 50 60
| | | | | |
1 MPSFDIVSEITLHEVRNAVENANRVLSTRYDFRGVEAVIELNEKNETIKITTESDFQLEQLIEIL
2 MPSFDIVSEITLHEVRNAVENANRVLSTRYDFRGVEAVIELNEKNETIKITTESDFQLEQLIEIL
3 MPSFDIVSEITLHEVRNAVENANRDLTNRWDFRNVQAAIELNEKNESIKVSSESDFQVEQLVDIL
4 MPSFDIVSEVDLQEARNAVDNASREVESRFDFRNVEASFELNDASKTIKVLSESDFQVNQLLDIL
5 MPSFDIVSEVDLQEARNGVDNAVREVESRFDFRGVEATIELNDANKTIKVLSESDFQVNQLLDIL
6 MPSFDIVSEVDLQEARNGVDNAVREVESRFDFRGVEATIELNDANKTIKVLSESDFQVNQLLDIL
7 MPSFDIVSEIDMQEVRNAVENATRDLANRWDFRNVPASFELNEKNESIKVVSESDFQVEQLLDIL
8 MPSFDVVSEVDKHELTNAVDQANRELDTRFDFKGVEARFEL-EDGKVINQSAPSDFQIKQMTDIL
9 MPSFDVVCEANMVELKNAVEQANKEISTRFDFKGSDARVEH--KDQELTLFGDDDFKLGQVKDVL
10 ----DIVSEVDTVELRNAVDNANRELSTRFDFRNVEAGFEL--KDEVVKLSAEDDFQLGQMMDIL
11 EHSFDLVSEVNLQEVDNAINLAMKEITNRYDFKGSKSSIER-TGDEQVTLISDDEYKLESVIDIL
12 MPSFDVISEVDKHELTNAVDQANRELDTRFDFKGVEAKFEL-EDGKVINQSAPSDFQVKQMTDIL
13 MPSFDVVSELDKHELTNAVDNAIKELDRRFDLKG-KCSFEA--KDKSVTLTAEADFMLEQMLDIL
14 MPSFDIVSEIDAVELRNAVENSTRELASRFDFRNVDASFEL--KEETVKLAAEDDFQLGQMMDIL
15 TYSFDIVSDFDRQELVNAVDQVIRDLKSRYDLKDTQTTVEL--GEEKITIGTDSEFTLESVHNIL
16 ESSFDIVSKVELPEVQNAIQIALKEISTRYDFKGSKSDISL--DKEELVLVSDDEFKLSQLKDVL
17 DSSFDIVSKVDRQEVDNALNQAAKELATRFDFRGTDTKIAW-KGDEAVELTSSTEERVKAAVDVF
18 DSSFDIVSKVERQEVDNALNQAAKEISQRYDFKGVGASISW--SGEKILMEANSEDRVTAVLDVF
19 EHSFDISAALDKQELKNAFEQAKKELDSRYDLKGIKCEIDLSEKESIFKLSSSSEGKLDVLKDIV
20 DYSFDIVSEVNLPEVKNAVNQALKEISQRYDFKGSNVEIELNEKDK-------------------
70 80 90 100 110 120 13
| | | | | |
1 IGSCIKRGIEHSSLDIPAESEHHGKLYSKEIKLKQGIETEMAKKITKLVKDSKIK.VQTQIQGEQ
2 IGSCIKRGIEHSSLDIPAESEHHGKLYSKEIKLKQGIETEMAKKITKLVKDSKIK.VQTQIQGEQ
3 RNACIKRGIDSGSLDIPTEYEHSGKTYSKEIKLKQGIASEMAKKITKLIKDSKLK.VQTQIQGEQ
4 RAKLLKRGIEGSSLDVPENIVHSGKTWFVEAKLKQGIESATQKKIVKMIKDSKLK.VQAQIQGDE
5 RAKLLKRGIEGASLDVPDEFVHSGKTWYVEAKLKQGIESAVQKKIVKLIKDSKLK.VQAQIQGEE
6 RAKLLKRGIEGASLDVPDEFVHSGKTWYVEAKLKQGIESAVQKKIVKLIKDSKLK.VQAQIQGEE
7 RAQLSKRGIEGAALEIPEEMARSGKTYSVDAKLKQGIESVQAKKLVKLIKDSKLK.VQAQIQGEQ
8 RARLLARGIDIRCLEFGDVETNLAG-ARQKVTVKQGIEQKQAKQLVAKLKEAKLK.VEAQINGDK
9 LTKLAKRGVDVRFLDYQDKQKIGGDKMKQVVKIKKGVSGELSKKIVKLIKDSKIK.VQGSIQGDA
10 RGNLAKRGVDAKAMEAKD-SVHSGKRWFKDVQFKQGLDPLTSKKVVKAIKDAKLK.VQASIQGEK
11 KSKFIKRGLSQKTMDFGKIERAAGGTVRQVVTLLSGIEGERAKKLTKLIRDSKLK.VKAQIQNDQ
12 RARLLARGIDVRCLEFGDVETNLAG-ARQKVTVKQGIEQKQAKQLVAKLKEAKLK.VEAQINGDK
13 RSNLVKRKVDSQCMEIKD-AYPSGKVVKQDVNFREGIDKDLAKKIVGLIKERKLK.VQAAIQGEQ
14 RGNLAKRGVDARAMKAKD-SVHIGKNWYKEAEFKQGLEALLAKKIVKLIKDAKIK.VQASIQGDK
15 REKAAKRNLSQKIFDFGKVESASGNRVRQEITLKKGISQDIAKQISKLIRDEFKK.VQASIQGDA
16 VSKLIKRNVPTKNIDYGKVENASGGTVRQRAKLVQGIDKDNAKKINTIIKNSGLK.VKSQVQDDQ
17 KEKLIRRDISLKAFEAGE-PQASGKTYKVTGALKQGISSENAKKITKLIRDAGPKnVKTQIQGDE
18 QSKLIKRGISLKALDAGE-PQLSGKEYKIFASIEEGISQENAKKVAKLIRDEGPKgVKAQVQGEE
19 ISKLIKRGINPKA--IKELSRESGAMFRLNLKANDAIDSENAKKINKVIKDSKLK.VNSSIRGEE
20 -------------------------------------------------------.---------
0 140 150 160
| | | |
1 VRVTGKSRDDLQAVIQLVKSAELGQPFQFNNFRD
2 VRVTGKSRDDLQAVIQLVKSAELGQPFQFNNFRD
3 VRVTGKSRDDLQAVIQLVKGAELGQPFQFNNFRD
4 IRVTGKSRDDLQAVMAMVRGGDLGQPFQFKNFRD
5 IRVTGKSRDDLQSVMALVRGDDLGQPFQFKNFRD
6 IRVTGKSRDDLQSVMALVRGGDLGQPFQFKNFRD
7 VRVTGKARDDLQAVMALVRAADLGQPFQFNNFRD
8 LRVTGKKRDDLQDAIALLKKADFELPLQFDNFRD
9 VRVSGAKRDDLQAVIAMLRKDVTDTPLDFNNFRD
10 VRVTGKKRDDLQSVIALMRESDMGQPFQYDNFRD
11 IRVTGKNIDDLQQVIQLVKEQDLDFPVQFVNMR-
12 LRVTGKKRDDLQDAIAVLKKADFELPLQFDNFRD
13 VRVTGKKRDDLQEAIALLRGESLGMPLQFTNFRD
14 VRVTGKKRDDLQEVMAMLREANLEQPLQYNNFRE
15 VRVSAKAKDDLQIVIQRLKQEDYPVALQFTNYR-
16 VRVTGKNKDDLQQIISAVRGADLPIDVQFINFR-
17 VRVTSKKRDDLQAVIAMLKKADLDVALQFVNYR-
18 LRVSSKSRDDLQTVISLLKGQDFDFALQFVNYR-
19 IRVAAKQIDDLQAVMKLVKELDLELNISFKNL--
20 ----------------------------------