(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0148 HI1034, Haemophilus influenzae
    • gi|16272968|ref|NP_439194.1|_1:163 (NC_000907) conserved hypothetical protein [Haemophilus influenzae Rd]
    • gi|1175383|sp|P44096|YAJQ_HAEIN Protein HI1034
    • gi|1074618|pir||F64018 conserved hypothetical protein HI1034 - Haemophilus influenzae (strain Rd KW20)
    • gi|1574067|gb|AAC22694.1| (U32784) conserved hypothetical protein [Haemophilus influenzae Rd]
    • gi|15603521|ref|NP_246595.1|_1:163 (NC_002663) unknown [Pasteurella multocida]
    • gi|12722060|gb|AAK03740.1| (AE006202) unknown [Pasteurella multocida]
    • gi|15800156|ref|NP_286168.1|_7:169 (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
    • gi|15829734|ref|NP_308507.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
    • gi|16128411|ref|NP_414960.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
    • gi|2495529|sp|P77482|YAJQ_ECOLI Protein yajQ
    • gi|7429439|pir||B64772 yajQ protein - Escherichia coli
    • gi|1773110|gb|AAB40182.1| (U82664) similar to H. influenzae HI1034 [Escherichia coli]
    • gi|1786629|gb|AAC73529.1| (AE000149) orf, hypothetical protein [Escherichia coli K12]
    • gi|12513285|gb|AAG54776.1|AE005222_1 (AE005222) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
    • gi|13359937|dbj|BAB33903.1| (AP002551) hypothetical protein [Escherichia coli O157:H7]
    • gi|16759412|ref|NP_455029.1|_7:169 (NC_003198) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
    • gi|16501703|emb|CAD08891.1| (AL627266) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
    • gi|16763815|ref|NP_459430.1|_7:169 (NC_003197) putative cytoplasmic protein [Salmonella typhimurium LT2]
    • gi|16418940|gb|AAL19389.1| (AE008715) putative cytoplasmic protein [Salmonella typhimurium LT2]
    • gi|16123332|ref|NP_406645.1|_4:166 (NC_003143) conserved hypothetical protein [Yersinia pestis]
    • gi|15981108|emb|CAC92405.1| (AJ414155) conserved hypothetical protein [Yersinia pestis]
    • gi|21244396|ref|NP_643978.1|_1:161 (NC_003919) conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306]
    • gi|21110056|gb|AAM38514.1| (AE012017) conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306]
    • gi|17547268|ref|NP_520670.1|_1:161 (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
    • gi|17429570|emb|CAD16256.1| (AL646070) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
    • gi|15614950|ref|NP_243253.1|_4:164 (NC_002570) BH2387~unknown conserved protein [Bacillus halodurans]
    • gi|10175007|dbj|BAB06106.1| (AP001515) BH2387~unknown conserved protein [Bacillus halodurans]
    • gi|21233061|ref|NP_638978.1|_1:161 (NC_003902) conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913]
    • gi|21114912|gb|AAM42902.1| (AE012484) conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913]
    • gi|15599591|ref|NP_253085.1|_1:159 (NC_002516) conserved hypothetical protein [Pseudomonas aeruginosa]
    • gi|11348171|pir||C83096 conserved hypothetical protein PA4395 [imported] - Pseudomonas aeruginosa (strain PAO1)
    • gi|9950626|gb|AAG07783.1|AE004855_5 (AE004855) conserved hypothetical protein [Pseudomonas aeruginosa]
    • gi|15641517|ref|NP_231149.1|_1:160 (NC_002505) conserved hypothetical protein [Vibrio cholerae]
    • gi|11278644|pir||D82191 conserved hypothetical protein VC1508 [imported] - Vibrio cholerae (group O1 strain N16961)
    • gi|9656012|gb|AAF94663.1| (AE004229) conserved hypothetical protein [Vibrio cholerae]
    • gi|17232154|ref|NP_488702.1|_4:163 (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
    • gi|17133799|dbj|BAB76361.1| (AP003597) ORF_ID:all4662~hypothetical protein [Nostoc sp. PCC 7120]
    • gi|16078166|ref|NP_388983.1|_4:163 (NC_000964) similar to hypothetical proteins [Bacillus subtilis]
    • gi|7429438|pir||D69840 conserved hypothetical protein yitK - Bacillus subtilis
    • gi|2145403|emb|CAA70666.1| (Y09476) YitK [Bacillus subtilis]
    • gi|2633438|emb|CAB12942.1| (Z99109) similar to hypothetical proteins [Bacillus subtilis]
    • gi|15607706|ref|NP_215080.1|_3:163 (NC_000962) hypothetical protein Rv0566c [Mycobacterium tuberculosis H37Rv]
    • gi|15839964|ref|NP_335001.1| (NC_002755) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
    • gi|7445082|pir||E70932 hypothetical protein Rv0566c - Mycobacterium tuberculosis (strain H37RV)
    • gi|2909625|emb|CAA17437.1| (AL021942) hypothetical protein Rv0566c [Mycobacterium tuberculosis H37Rv]
    • gi|13880105|gb|AAK44815.1| (AE006957) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
    • gi|21222997|ref|NP_628776.1|_3:162 (NC_003888) conserved hypothetical protein [Streptomyces coelicolor A3(2)]
    • gi|10129710|emb|CAC08267.1| (AL392146) conserved hypothetical protein [Streptomyces coelicolor A3(2)]
    • gi|15791741|ref|NP_281564.1|_4:162 (NC_002163) hypothetical protein Cj0374 [Campylobacter jejuni]
    • gi|11278645|pir||D81380 hypothetical protein Cj0374 [imported] - Campylobacter jejuni (strain NCTC 11168)
    • gi|6967848|emb|CAB74210.1| (AL139075) hypothetical protein Cj0374 [Campylobacter jejuni]
    • gi|10802673|gb|AAG23559.1|AF244610_1_4:49 (AF244610) putative oxo-tricarboxilic-pentene acid decarboxylase/isomerase [Carboxydothermus hydrogenoformans]
              10        20        30        40        50        60     
              |         |         |         |         |         |     
   1 MPSFDIVSEITLHEVRNAVENANRVLSTRYDFRGVEAVIELNEKNETIKITTESDFQLEQLIEIL
   2 MPSFDIVSEITLHEVRNAVENANRVLSTRYDFRGVEAVIELNEKNETIKITTESDFQLEQLIEIL
   3 MPSFDIVSEITLHEVRNAVENANRDLTNRWDFRNVQAAIELNEKNESIKVSSESDFQVEQLVDIL
   4 MPSFDIVSEVDLQEARNAVDNASREVESRFDFRNVEASFELNDASKTIKVLSESDFQVNQLLDIL
   5 MPSFDIVSEVDLQEARNGVDNAVREVESRFDFRGVEATIELNDANKTIKVLSESDFQVNQLLDIL
   6 MPSFDIVSEVDLQEARNGVDNAVREVESRFDFRGVEATIELNDANKTIKVLSESDFQVNQLLDIL
   7 MPSFDIVSEIDMQEVRNAVENATRDLANRWDFRNVPASFELNEKNESIKVVSESDFQVEQLLDIL
   8 MPSFDVVSEVDKHELTNAVDQANRELDTRFDFKGVEARFEL-EDGKVINQSAPSDFQIKQMTDIL
   9 MPSFDVVCEANMVELKNAVEQANKEISTRFDFKGSDARVEH--KDQELTLFGDDDFKLGQVKDVL
  10 ----DIVSEVDTVELRNAVDNANRELSTRFDFRNVEAGFEL--KDEVVKLSAEDDFQLGQMMDIL
  11 EHSFDLVSEVNLQEVDNAINLAMKEITNRYDFKGSKSSIER-TGDEQVTLISDDEYKLESVIDIL
  12 MPSFDVISEVDKHELTNAVDQANRELDTRFDFKGVEAKFEL-EDGKVINQSAPSDFQVKQMTDIL
  13 MPSFDVVSELDKHELTNAVDNAIKELDRRFDLKG-KCSFEA--KDKSVTLTAEADFMLEQMLDIL
  14 MPSFDIVSEIDAVELRNAVENSTRELASRFDFRNVDASFEL--KEETVKLAAEDDFQLGQMMDIL
  15 TYSFDIVSDFDRQELVNAVDQVIRDLKSRYDLKDTQTTVEL--GEEKITIGTDSEFTLESVHNIL
  16 ESSFDIVSKVELPEVQNAIQIALKEISTRYDFKGSKSDISL--DKEELVLVSDDEFKLSQLKDVL
  17 DSSFDIVSKVDRQEVDNALNQAAKELATRFDFRGTDTKIAW-KGDEAVELTSSTEERVKAAVDVF
  18 DSSFDIVSKVERQEVDNALNQAAKEISQRYDFKGVGASISW--SGEKILMEANSEDRVTAVLDVF
  19 EHSFDISAALDKQELKNAFEQAKKELDSRYDLKGIKCEIDLSEKESIFKLSSSSEGKLDVLKDIV
  20 DYSFDIVSEVNLPEVKNAVNQALKEISQRYDFKGSNVEIELNEKDK-------------------


         70        80        90       100       110       120
         |         |         |         |         |         |          
   1 IGSCIKRGIEHSSLDIPAESEHHGKLYSKEIKLKQGIETEMAKKITKLVKDSKIK.VQTQIQGEQ
   2 IGSCIKRGIEHSSLDIPAESEHHGKLYSKEIKLKQGIETEMAKKITKLVKDSKIK.VQTQIQGEQ
   3 RNACIKRGIDSGSLDIPTEYEHSGKTYSKEIKLKQGIASEMAKKITKLIKDSKLK.VQTQIQGEQ
   4 RAKLLKRGIEGSSLDVPENIVHSGKTWFVEAKLKQGIESATQKKIVKMIKDSKLK.VQAQIQGDE
   5 RAKLLKRGIEGASLDVPDEFVHSGKTWYVEAKLKQGIESAVQKKIVKLIKDSKLK.VQAQIQGEE
   6 RAKLLKRGIEGASLDVPDEFVHSGKTWYVEAKLKQGIESAVQKKIVKLIKDSKLK.VQAQIQGEE
   7 RAQLSKRGIEGAALEIPEEMARSGKTYSVDAKLKQGIESVQAKKLVKLIKDSKLK.VQAQIQGEQ
   8 RARLLARGIDIRCLEFGDVETNLAG-ARQKVTVKQGIEQKQAKQLVAKLKEAKLK.VEAQINGDK
   9 LTKLAKRGVDVRFLDYQDKQKIGGDKMKQVVKIKKGVSGELSKKIVKLIKDSKIK.VQGSIQGDA
  10 RGNLAKRGVDAKAMEAKD-SVHSGKRWFKDVQFKQGLDPLTSKKVVKAIKDAKLK.VQASIQGEK
  11 KSKFIKRGLSQKTMDFGKIERAAGGTVRQVVTLLSGIEGERAKKLTKLIRDSKLK.VKAQIQNDQ
  12 RARLLARGIDVRCLEFGDVETNLAG-ARQKVTVKQGIEQKQAKQLVAKLKEAKLK.VEAQINGDK
  13 RSNLVKRKVDSQCMEIKD-AYPSGKVVKQDVNFREGIDKDLAKKIVGLIKERKLK.VQAAIQGEQ
  14 RGNLAKRGVDARAMKAKD-SVHIGKNWYKEAEFKQGLEALLAKKIVKLIKDAKIK.VQASIQGDK
  15 REKAAKRNLSQKIFDFGKVESASGNRVRQEITLKKGISQDIAKQISKLIRDEFKK.VQASIQGDA
  16 VSKLIKRNVPTKNIDYGKVENASGGTVRQRAKLVQGIDKDNAKKINTIIKNSGLK.VKSQVQDDQ
  17 KEKLIRRDISLKAFEAGE-PQASGKTYKVTGALKQGISSENAKKITKLIRDAGPKnVKTQIQGDE
  18 QSKLIKRGISLKALDAGE-PQLSGKEYKIFASIEEGISQENAKKVAKLIRDEGPKgVKAQVQGEE
  19 ISKLIKRGINPKA--IKELSRESGAMFRLNLKANDAIDSENAKKINKVIKDSKLK.VNSSIRGEE
  20 -------------------------------------------------------.---------


    130       140       150       160
     |         |         |         |   
   1 VRVTGKSRDDLQAVIQLVKSAELGQPFQFNNFRD
   2 VRVTGKSRDDLQAVIQLVKSAELGQPFQFNNFRD
   3 VRVTGKSRDDLQAVIQLVKGAELGQPFQFNNFRD
   4 IRVTGKSRDDLQAVMAMVRGGDLGQPFQFKNFRD
   5 IRVTGKSRDDLQSVMALVRGDDLGQPFQFKNFRD
   6 IRVTGKSRDDLQSVMALVRGGDLGQPFQFKNFRD
   7 VRVTGKARDDLQAVMALVRAADLGQPFQFNNFRD
   8 LRVTGKKRDDLQDAIALLKKADFELPLQFDNFRD
   9 VRVSGAKRDDLQAVIAMLRKDVTDTPLDFNNFRD
  10 VRVTGKKRDDLQSVIALMRESDMGQPFQYDNFRD
  11 IRVTGKNIDDLQQVIQLVKEQDLDFPVQFVNMR-
  12 LRVTGKKRDDLQDAIAVLKKADFELPLQFDNFRD
  13 VRVTGKKRDDLQEAIALLRGESLGMPLQFTNFRD
  14 VRVTGKKRDDLQEVMAMLREANLEQPLQYNNFRE
  15 VRVSAKAKDDLQIVIQRLKQEDYPVALQFTNYR-
  16 VRVTGKNKDDLQQIISAVRGADLPIDVQFINFR-
  17 VRVTSKKRDDLQAVIAMLKKADLDVALQFVNYR-
  18 LRVSSKSRDDLQTVISLLKGQDFDFALQFVNYR-
  19 IRVAAKQIDDLQAVMKLVKELDLELNISFKNL--
  20 ----------------------------------
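
The alignment above is printed in three blocks, so each numbered sequence is split across three rows. The following is a minimal Python sketch, not part of SAM, of how the fragments could be rejoined and the gaps stripped. The names ROW, join_blocks and ungapped are hypothetical. It assumes that "-" marks a deletion, that "." pads an insert column, and that lowercase letters are inserted residues.

```python
import re
from collections import defaultdict

# An alignment row: optional indent, the sequence number, one space,
# then aligned residues (letters, "-" and "."). Ruler, tick, and label
# lines do not fit this pattern and are skipped.
ROW = re.compile(r"^\s*(\d+) ([A-Za-z.\-]+)\s*$")

def join_blocks(lines):
    """Concatenate the per-block fragments of each numbered sequence."""
    rows = defaultdict(list)
    for line in lines:
        m = ROW.match(line)
        if m:
            rows[int(m.group(1))].append(m.group(2))
    return {num: "".join(parts) for num, parts in rows.items()}

def ungapped(aligned):
    """Drop gap and padding characters and upper-case inserted residues."""
    return aligned.replace("-", "").replace(".", "").upper()

if __name__ == "__main__":
    # Two rows copied from the last block of the alignment above.
    sample = [
        "   1 VRVTGKSRDDLQAVIQLVKSAELGQPFQFNNFRD",
        "  19 IRVAAKQIDDLQAVMKLVKELDLELNISFKNL--",
    ]
    for num, seq in sorted(join_blocks(sample).items()):
        print(num, len(seq), len(ungapped(seq)))
```

Applied to the whole listing, join_blocks would return one full-length aligned string per sequence number. The numbers are the row labels of the alignment, not positions in the label list.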