(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0474 SfT1, Shigella flexneri, 80 residues
    • gi|191166466|ref|ZP_03028297.1|_2:71 ribbon-helix-helix protein, copG family [Escherichia coli B7A]
    • gi|190903566|gb|EDV63284.1| ribbon-helix-helix protein, copG family [Escherichia coli B7A]
              10        20        30         40        50          60  
              |         |         |          |         |           |  
   1 MNSLAGIDMGRILLDLSNEVIKQLDDLEVQ.RNLPRADLLREAVDQYLINQSQ..TARTSVPGIW
   2 MNSLAGIDMGRIILDLSNEVIKQLDDLEVQ.RNLPRTELLREAVDQYLVNQSQ..TVRTSAFGIW
   3 MNSLAGIDMGRILLDLSNEVIKQLDDLEVQ.RNLPRADLLREAVDQYLINQSQ..TARTSVPGIW
   4 MSMMAGMDMGRILLDLSDDVIKRLDDLKVQ.RNLPRAELLREAVEQYLERQDRaeTTISKALGLW
   5 -SAMAGMDMGRILLDLSDEVIKRLDDLKVQ.RNLPRAELLREAVDQYLENQSK..TTISSALGIW
   6 MSMMAGMEMGRILLDLSDEVIKRLDDLKQQ.RNLPRAELLREAVEQYLERQGQaeTTISDALGLW
   7 MSMMAERDMGRILLDLSDDVIQRLDDLKVQ.RNIPRAELLREAVEQYLEKQDRakDTISSALGLW
   8 --------MGRILLDLSNEVIKQLDDLEVQ.RNLPRADLLREAVDQYLINQSQ..TARTSVPGIW
   9 ----AGMDMGRILLDLSDDVIKRLDDLKVQ.RNLPRAELLREAVEQYLERQDRaeTTISKALGLW
  10 --------MGRILLNLSNELFKQLDDLEVQ.RNLPRAELLREAVDQYLVNQSQ..TARASAFGIW
  11 ICKVVVMAMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  12 ICKVLVMAMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  13 ICKVVVMAMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  14 ICKVVVMAMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  15 --------MSRILVDLSDGQLDELAVIVET.QHRPRAAIIRDAIDAYIALNKH..RLADDVFGLW
  16 --------MSRILIDLSNGQLDELAAIVDT.EQRPRAAIIRDAIDAYIAQHKR..SHADIVFGLW
  17 --------MSRILIDLPDAQVEELAVLVET.EQRPRAAVIRDAIEAYIAQHRR..ARGADVFGLW
  18 --------MNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  19 -------AMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  20 -------AMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  21 -------AMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKRGRQ
  22 ----------RALIDMNDTQVEALDTLAKR.VRRSRAALIREAIDDYLNRHHR..EQIEDGFGLW
  23 --TSDNADCMRTIIDLPEDERAVLDAHCRQ.RGLSRAAAIREALHLWL-QHQH..PRSADVFGLW
  24 ----------RALVDMSDAQIEALDTLAKR.VGQSRAALIRAAIDDYLDRHHR..EQVADGFGLW
  25 --------MNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  26 -------AMNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  27 --------MNTVFLHLSEEAIKRLNKLRGW.RKVSRSAILREAVEQYLERQQF..PVRKAKGGRQ
  28 ----------RTLVDIGDPEVKALDRLAQR.EKMSRAALIRKAINDFLARNNA..DIEAEAFGLW
  29 -------------IDLPEEDLRQLDNLKNI.RHVPRAEIIRQAVSRYLVVNRV..DDPSNAFGLW
  30 ----------RTIIDLPEDERAVLDAHCRQ.RGLSRAAAIREALHLWL-QHQQ..PRSDNVFGLW
  31 ---VTEVAMIRIPVDLSNDQHAALTRIATR.QSSSPAEIIRDAIDAYIAQQDR..TLADNVFGLW
  32 ----------RTIVDLPEPEREQLDAQCRQ.LGISRAAALREALRLWL-KQQH..PRHEKVFGLW
  33 ----------RTIVDLPEKELQAIKALAKR.ERISQAEAVRRAVRHYLDTHPP..QPTEEAFGVW
  34 ----------RTIINIADSQIKILDKISK-.KKISRDKIIGQALTSYIASNDHnnKAFENAFGLW
  35 --------MVEININLLEETIKKREIVEKCsPDKSRATQSRRAVDTYLDANIP..EQNNSVFGLW
  36 ---------RRISINLPATVVAEPTRIAES.EHRALDAVIRDAIGRHVARIGE..ARSADVFGLW
  37 ----------RTIINIADSQIKILDKVS-K.KKISRDKIIGQALTSYIASNDHnnKAFENAFGLW
  38 --------------EILPEQAEALDQIAAR.EHTSRDALVRTLIADFVAQHAPk.PDIQSFFGIW
  39 -----------TIINIADSQIKILDKISK-.KKISRDKIIGQALTSYIASNDHnnKAFENAFGLW
  40 ---------TRILADLPDEDIEKLDARAAA.LGKSRAALVREAVKLFLVQGSSsnDWIDRFAGLW
  41 --------MSRILIDLPDDDIRSLDAMARA.NGRSRAAEMREAVALYLRRQADg.NWIAQGSGYW


              70        80
              |         |
   1 QGCE..EDGVEYQRKLREEW
   2 QGCE..EDGVEYQRKLREEW
   3 QGCE..EDGVEYQRKLREEW
   4 QGCE..EDGVEYQRKLREEW
   5 QGCE..EDGVEYQRKLREEW
   6 QGCE..EDGVEYERKLREEW
   7 QDCE..EDGMEYQRQLRKEW
   8 QGCE..EDGVEYQRKLREEW
   9 QGCE..EDGVEYQRKLREEW
  10 QGSE..EDGVEYQRKLREEW
  11 KGEV..VGVDDQCKEHK---
  12 KGEV..VGVDDQCKEHK---
  13 KGEV..VGVDDQCKEHK---
  14 RDEA..VGVEELCKQHK---
  15 KDRT..VDGLAYQEELRSEW
  16 KDRA..VDGLTYQEALRSEW
  17 KSKK..VDGLEYQQELRSEW
  18 KGEV..VGVDDQCKEHK---
  19 KGEV..VGVDDQCKEHK---
  20 RGET..VGVDDQCKEHK---
  21 RGET..VGVDDQCKEHK---
  22 GKRK..VDGLAYQEKVRGEW
  23 RDRN..ADALTLESELRQEW
  24 GKRK..VDGLAYQEKARSEW
  25 RDEA..VGVEELCKQHK---
  26 RGET..VVVDDQCKEHK---
  27 RGET..VVVDDQCKEHK---
  28 GDRK..IDGLAYQENMRREW
  29 SGVL..GDGVDYENQLREEW
  30 RDRN..TDALTLESELRQEW
  31 KGRD..V---VDQEDLRSEW
  32 KDRP..ADALVLQEAFRQEW
  33 ANR-..CDGLSYQEALRQEW
  34 KDKN..LDSVEYQTKLCNEW
  35 GDKK..LDGLKYQGKLREEW
  36 RHRA..RSGIDYQRDARAEW
  37 KDKN..LDSVEYQAKLCNEW
  38 RERG..VDGLEYQERLRAEW
  39 KDKN..LDSVEYQAKLCNEW
  40 AERDdiPDSVAYQRAIRE--
  41 AD--..--------------