(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.soe.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0361 A putative transcriptional regulator, Shigella flexneri 2a str. 2457T, 169 res
              10        20        30        40        50        60     
              |         |         |         |         |         |     
   1 MATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   2 -AKLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   3 -ATLTEDDVLEQLDAQDNLFSFMKTAHTILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   4 -ATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   5 -ATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   6 -ATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   7 -ATLTEDDVLEQLDAQDNLFSFMKTAHTILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   8 -ATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGICQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
   9 -ATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
  10 -ATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
  11 -ATLTEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
  12 -ATLSEDDVLEQLDAQDNLFSFMKTAHSILLQGIRQFLPSLFVDNDEESVEYAVKPLLAQSGPLD
  13 -ATLTEDDVLEQLDAQDNLFSFMNTAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
  14 --NINESEIIERLNSAPSVRGFFIATVDVFNESIDGLIQRIFRKDN-FAVQSVVGPLLQDSGPLG
  15 --NINETEIIERLNSAPSVRGFFIAAVDVFNDSIDGLIQRIFRKDN-FAVQSVVGPLLQDSGPLG
  16 --NINETEIIERLNSAPSVRGFFIAAVDVFNDSIDGLVQRIFRKDN-FAVQSVVGPLLQDSGPLG
  17 ---TNESDIIERLNHAPSVRGFFVETVDILTEAVDGLVQRIFRKDN-FAVQSVVGPLLQDSGPLG
  18 ---TNESDIIERLNHAPSVRGFFVETVDILTEAVDGLVQRIFRKDN-FAVQSVVGPLLQDSGPLG
  19 --QAFENRVLERLNAGKTVRSFLITAVELLTEAVNLLVLQVFRKDD-YAVKYAVEPLLDGDGPLG
  20 --QAFENRVLERLNAGKTVRSFLITAVELLTEAVNLLVLQVFRKDD-YAVKYAVEPLLDGDGPLG
  21 --QAFENRVLEHLNAGKTVRSFLMAAVELLAEALNILVVQVFRKDD-YAVKYAVEPLLVGDGPLG
  22 --QAVENRVLERLNAGKTVRSFLITAVELLTEAVNLLVLQVFRKDD-YAVKYAVEPLLDGDGPLG
  23 --QAFENRVLEHLNAGKTVRSFLMAAVELLAEALNILVVQVFRKDD-YAVKYAVEPLLVGDGPLG
  24 --QAFENRVLERLNAGKTVRSFLITAVELLTEAVNILVLQVFRKDD-YAVKYAVEPLLDGDGPLG
  25 --QAFENRVLEHLNAGKTVRSFLMAAVELLAEALNILVVQVFRKDD-YAVKYAVEPLLVGDGPLG
  26 --QAFENRVLERLNAGKTVRSFLIATVELLTEAVNILVLQVFRKDD-YAVKYAVEPLLEGSGPLG
  27 --QAFENRVLEHLNAGKTVRSFLMAAVELLAEALNILVVQVFRKDD-YAVKYAVEPLLVGDGPLG
  28 --QAFENRVLERLNAGKTVRSFLITAVELLTEAVNILVLQVFRKDD-YAVKYAVEPLLDGDGPLG
  29 --QAFENRVLEHLNAGKTVRSFLMAAVDLLAEALNILVVQIFRKDD-YAVKYAVEPLLVGDGPLS
  30 --KINESDILERLNQTHTVRGFFITTVDVLTEAIDALMQRIFRKDN-FAVKSVVEPLLHDTGPLG
  31 --KINESDILERLNQTHTVRGFFITTVDVLTEAIDALMQRIFRKDN-FAVKSVVEPLLHDTGPLG
  32 ------------------------TAHSILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
  33 ----QESDILERLSDTSNVRGFIITSVEVFEEAVDSLIQRVFRKDD-FAVKSVVGPLLDNSGPLG
  34 ----QESDILERLSDTSNVRGFIITSVEVFEEAVDSLIQRIFRKDD-FAVKSVVGPLLDNSGPLG
  35 --QAFENRVLEALNSGKTVRDFMLCAVELLAEAASILMLQVFRKDD-YAVKYAVEPLMTGTGPLG
  36 --QLLENSVLERLNAGTTVREFLRAAITLLAEAVAILVTQVFRKDD-YAVKYAVEPLLVGAGPLV
  37 --ALTEDDVLERLASPENLNDFLLNANEILFQGIKSLLPYLFINNDEDIQEYAVKPLLAKSGPLD
  38 -----EDPIYEKLNSTVSVRGFMIATVAIFEEAVDSLINRVFRKTD-FVVQSVIDSLFTNDGPLG
  39 ----FDIDIVETLSEAETPYDFFTASFDLFEDAVDILIQNVFRKDD-YAVKYAVEPLLSRDGPLA
  40 -----EDPIYEKLNGTVSVRGFMIAIVAIFEEAVDSLINRVFRKTD-FVVQSVIDSLFTNDGPLG
  41 --TTHESELLEELAESADAAECLLCAYDVLEDMLDALLKSIFYKDD-YAVKFVVDPLLTTDGPLG
  42 ------DPIYEKLNEATSIRGFITASVAIFDEAVDKLINRVFRKTD-FAVKSVVDSLLSNSGPLC
  43 -----EDELLEKLDECENATAFLQVSNKIINLKLKALLPSVFVQDD-LVKEYAVDPLLKEDGPLV
  44 -----EDELLEKLDECENATAFLQVSNKIINLKLKALLPSVFVQDD-LVKEYAVDPLLREDGPLV
  45 ------------------------TAHTILLQGIRQFLPSLFVDNDEEIVEYAVKPLLAQSGPLD
  46 --------LLETVEECDDATSFLCASNKLINLKLKALLPNIFVQDE-LVMEYAVEPLLKEDGPLV
  47 --TSHETELLEALSEAESASACLMAAYDALDDTVDAVLKNIFKKDD-TAIKFVVEPLLNSGGPLG
  48 ----FDIDIVETLSEAETAYVFFTASFDLFEDAIDILIQNVFRKDD-YAVKYAVEPLLNNDGPLA
  49 ------DNFIEQLSEVPSLRGFFALSVNQFAQNIERLIQRVFRKTD-FALKSVVDSLFEHQGPLA


         70        80        90       100       110       120          
         |         |         |         |         |         |          
   1 DIDVALRLIYALGKMDKWLYADITHFSQYWHYLNEQDETPGFADDITWDFISNVNSI..TRNATL
   2 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TRNAML
   3 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TRNAML
   4 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TRNATL
   5 DIDVALRLIYALGKMDKWLYADITHFSQYWHYLNEQDETPGFADDMTWDFISNVNSI..TCNATL
   6 DIDVALRLIYALGKMDKWLYADITHFSQYWHYLNEQDETPGFADDITWDFISNVNSI..TRNATL
   7 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TCNATL
   8 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDITWDFISNVNSI..TRNATL
   9 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TCNATL
  10 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMAWDFISNVNSI..IRNASL
  11 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQNEMPGFADDMAWDFISNVNSI..TRNATL
  12 DIDVALRLIYALGKMDKWLYADITHFSQYWHYLKEQDETLGFADDMTWDFISNVNSI..TCNATL
  13 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TCNATL
  14 DLSVRLKLLFGLGVLPDDIYHDIEDIIKLKNHLNSDASDYEFTDPNILEPIKKLHLV..KKMGMV
  15 DLSVRLKLLFGLGVLPDDIYHDIEDIIKLKNRLNSDASDYEFTDPNILEPIKKLHLV..KKMGMV
  16 DLSVRLKLLFGLGVLPDDIYHDIEDIIKLKNQLNSDASDYEFTDPNILEPIKKLHLV..KKMGMV
  17 DVSVRLKLLFGLGVLSDHHYHDIEDIIKLKNKLNSDSTEYEFTDPQILEPIKKLHLV..QKMGMV
  18 DVSVRLKLLFGLGVLSDHHYHDIEDIIKLKNKLNSDSTEYEFTDPQILEPIKKLHLV..QKMGMV
  19 DLSVRLKLIYGLGVINRQEYEDAELLMALREELNHDGNEYAFTDDEILGPFGELHCV..AALPP-
  20 DLSVRLKLIYGLGVINRQEYEDAELLMALREELNHDGNEYAFTDDEILGPFGELHCV..AALPP-
  21 ELSVRLKLVYALGVITRHEYEDAELLMALREELNHDGTEYRFTDDEILGPFGELHCV..AELPPA
  22 DLSVRLKLIYGLGVINRQEYEDAELLMALREELNHDGNEYAFTDDEILGPFGELHCV..AALPP-
  23 ELSVRLKLVYALGVITRHEYEDAELLMALREELNHDGTEYRFTDDEILGPFGELHCV..AELPPV
  24 DLSVRLKLIYGLGVLSRTEYEDAELLMALREELNHDGNEYAFTDDEILGPFGELHCV..MALPP-
  25 ELSVRLKLVYALGVITRHEYEDAELLMALREELNHDGTEYRFTDDEILGPFGELHCV..VELPPV
  26 DLSVRLKLIYGLGVISRAEYEDAELLMALREELNHDGNEYSFTDDEIIGPFGELHCV..AALPP-
  27 ELSVRLKLVYALGVITRHEYEDAELLMALREELNHDGTEYRFTDDEILGPFGELHCV..AELPPV
  28 DLSVRLKLIYGLGVLSRTEYEDAELLMALREELNHDGNEYAFTDDEILGPFGELHCV..MALPP-
  29 ELSVRLKLVYALGVITRHEYEDAELLMALREELNHDGTEYRFTDDEILGPFGELHCV..DELPPV
  30 DLTVRLKLLFGLGVIPDEVFHDIEHLIKLRNQLNHDATEYQFTDPQILAPIKALNLV..KKMGML
  31 DLTVRLKLLFGLGVIPDEVFHDIEHLIKLRNQLNHDATEYQFTDPQILAPIKALNLV..KKMGML
  32 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TRNAML
  33 EITVRLKLLLGLGVVSHNIYQDIDAFLKLRDFLNSDGNDYCFTDPKILQPIKKMHAV..QNMGVV
  34 EITVRLKLLLGLGVVSHNIYQDIDAFLKLRDFLNSDGSDYCFTDPKILQPIKKMHAV..QNMGVV
  35 DLSVRLKLIYGLGMISRKEYEDAELLMALGEELAHDDRHYRFTDDEILGPIGELHCV..AALPAE
  36 ELSVRLKLVYALGVISRQEYEDAELLMALHEELKQDPGDYRFTDDEILGPFGELHCV..AAFPPS
  37 SLDVSLRLIYALGQITKAVYADILLFSQLSEHLQETGEIAEFHDDIVYEFMGNLNAV..TQNKTL
  38 DLSVRLKVLLGLGVIQHNLFSDVNAFIQFKETLNKDEKEYTFDDEIVLTFLQKLTLL..MDKSAL
  39 HLPVRIKLLYGLGLLSQGSYQDIEKFIGLKEFVQSEGESIEFLSLALIERLNAIRAV..AKILPP
  40 DLSVRLKVLLGLGVIQHNLFSDVNAFIQFKETLNKDEKEYTFDDEIVLTFLQKLTLL..MDKSAL
  41 EILVRTKLLLGLGVISKEVYDDIEVFVTLKEWVNIQEGQVNFWDQDVVFELNRINAI..QKMMPI
  42 DLSIRLKVLLGLGIIEHHVFSDISHFIEIKEKLNNDEKEYDFADLIIIDFIQQLSCQ..NDKSLL
  43 TTDVVSKLMFAMGKISLETYADIGLYDQVLEYVVSQPEKVEFADDVIYDFIKNQAVLssQQDSFY
  44 TTDVVSKLMFAMGKISLQTYADIGLYDQVLEYVVTQPEKIAFTDDVIYDFIKNQAVLssQQDCFY
  45 DIDVALRLIYALGKMDKWLYADITHFSQFWHYLNEQDETPGFADDMTWDFISNVNSI..TRNAML
  46 TTDVVSKLMFAMGKISLETYADIGLYAQVLEYAQSQPNKLSFGDDMIYDFISNQAVLstQQDSFY
  47 EIMIRAKLLLGLGVISKELYDDLEIFVTLKEWAKIQGEDTSFTEVDVIFELNKVQAI..QRIMPI
  48 QLAIRIKLLYGLGLLSQDAYQDIEKFIALKTFVQSEGESIDFLSPLLFERINAISAV..AEMIPI
  49 ELTVRLKLLLGLGVISAAVFEDISLFIEVKQQLSDEVEELPFSHPAIVQFAKDLHHI..DLSPVA


         130            140       150       160         
          |              |         |         |         
   1 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELTP
   2 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTILKELTP
   3 ....YDALK..AMKFA...DFSVWSEARFSGMVKTALTLAVTTTLKELT-
   4 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
   5 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
   6 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
   7 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
   8 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
   9 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
  10 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALMLAVTTTLKELT-
  11 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
  12 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTTLKELT-
  13 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLVVTTTLKELT-
  14 ....QLEVN..EPDDDidlEFYQLQLQRQQQIIKSGLSLAIVEICNELGK
  15 ....QLEVN..EPDDDidlEFYQLQLQRQQQIIKSGLSLAIVEICNELGK
  16 ....QLEVS..EPDDDidlEFYQLQLQRQQQIIKSGLSLAIVEICNELGK
  17 ....QLEVD..DPDDDielSFYQLQLQRKQQIIRSGLSLAIVEICHELGK
  18 ....QLEVD..DPDDDielSFYQLQLQRKQQIIRSGLSLAIVEICHELGK
  19 ....-PPQF..EPADS...SLYAMQIQRYQQAVRSTMVLSLTELISKI--
  20 ....-PPQF..EPADS...SLYAMQIQRYQQAVRSTMVLSLTELISRI--
  21 ....PTFLRp.DEADA...SLIAMQRQRYQQMVRSTMVLSITELLSRIS-
  22 ....-PPQF..EPADS...SLYAMQIQRYQQAVRSTMVLSLTELISKI--
  23 ....PTFLRp.DEADA...SLIAMQRQRYQQMVRSTMVLSITELLSRI--
  24 ....-PPHF..DTSDA...ALYAMQIQRYQQAVRSTMILSLTELISKI--
  25 ....PTFLRp.DEADA...SLIAMQRQRYQQMVRSTMVLSITELLSRI--
  26 ....-TPQF..DDSDA...ELLAMQKLRYQQMVRSTMVLSLTELISRI--
  27 ....PTFLRp.DEADA...SLIAMQRQRYQQMVRSTMVLSITELLSRI--
  28 ....-PPHF..DTSDA...ALYAMQIQRYQQAVRSTMVLSLTELISKI--
  29 ....PTFLRp.DEADA...SLIAMQRQRYQQMVRSTMVLSITELLSRI--
  30 ....HLNVV..EPDDDidlSFYHLQLQRQQQVIKSGLSLAIIQICNALN-
  31 ....HLNVV..EPDDDidlSFYHLQLQRQQQVIKSGLSLAIIQICNALN-
  32 ....YDALK..AMKFA...DFAVWSEARFSGMVKTALTLAVTTILKELTP
  33 ....QLEIA..SPGDDvdlIFYQMQLARQEQIIKSALALAIAGICNELG-
  34 ....QLEIA..SPSDDvdlIFYQMQLARQEQIIKSALALAIAGICNELG-
  35 p...TLPLG..ADADP...LLVSMQQQRYQQMVRSTLVLSLTTLIAQI--
  36 ....PQISR..APEDE...ALTKMQRQRYQQMVRSTMVLSATEYLVRI--
  37 ....FQLIE..KMKFS...DFTIFNQTRYGNMVKTGLTLAVTSLLQEL--
  38 d...FEPLI..EDKNS...LTYQMKSLRREKIIRSCLILTIADIYRQL--
  39 nl..GDNQL..EESSH...YIVQMKLEQNRQVIKSSLILAITEIIKELH-
  40 d...FEPLI..EDKNS...LTYQMKSLRREKIIRSCLILTIADIYRQL--
  41 dy..APELV..EGMSD...DMKRMFLERHFQKVRSTIVLAVNDIHQQLA-
  42 ....NFPMEkdENPDS...LLYQVKALRREKLIRSYLTLAITEIYEQL--
  43 ....LESIN..QLKFS...SFESFSQMRYESLIKTVLKLSCEMLIERIE-
  44 ....LESIN..QLKFS...SFESFSQMRYESLIKTVLKLSCEMLIERIE-
  45 ....YDALK..AMKFA...DFSVWSEARFSGMVKT---------------
  46 ....LESIN..QLKFS...SFEVFSQMRYESLIKTVLKLSCEMLLEKIE-
  47 ey..DSEMV..ETMSG...PMLQMFLGRHNQKVKSTIVLAITDIITTL--
  48 na..DDNQP..EESSH...FIVQMKLDHNRQVIKSSLILAITEIIKELH-
  49 dllkFASSV..ENKDS...MLYQMQQIRLERVMRSSLILAITEINEKL--