(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Sequence numbers correspond to the following labels:

    • T0473 CmR9, Clostridium thermocellum, 68 residues
    • gi|191160896|ref|ZP_03022779.1|_2:64 conserved hypothetical protein [Geobacter sp. M21]
    • gi|190997627|gb|EDV73072.1| conserved hypothetical protein [Geobacter sp. M21]
    • gi|191160909|ref|ZP_03022792.1|_1:59 conserved hypothetical protein [Geobacter sp. M21]
    • gi|190997640|gb|EDV73085.1| conserved hypothetical protein [Geobacter sp. M21]
                        10             20                30            
                        |              |                 |            
   1 .....MK...IT.K.DMIIADVLQMD....RG.TAPIF....IN.N..GMH.CL....GCPSSMG
   2 .....MK...IT.K.DMIIADVLQMD....RG.TAPIF....IN.N..GMH.CL....GCPSSMG
   3 .....MQ...IT.K.DMTIGEIVRNF....PN.SIEIL....MS.F..GMG.CV....GCPSAQG
   4 m....VQ...IT.K.DTIIGDILDIA....PE.TAPLF....LS.I..GMH.CL....GCPSSRG
   5 .....MT...IT.K.EMTITEVVTKY....PK.TIPIF....YK.H..GMG.CL....GCAAAQF
   6 .....MQ...IT.K.DMTIGEIVRNF....PS.SIEIL....MS.F..GMG.CV....GCPSAQG
   7 ms...EK...VT.K.DMTISEVLKKN....PK.TAEVF....MK.H..GMQ.CL....GCPSAAG
   8 .....MK...IT.K.DMIIADIIAID....QN.LIPIL....LD.T..GMH.CI....GCPSAQG
   9 .....MK...IS.K.DMTIGEVVRNH....PE.CVEVL....FN.F..GLG.CV....GCPSAQA
  10 m....AT...IT.K.EMSITEVVSKY....PQ.TVPVF....ME.H..GMG.CL....GCAAARF
  11 .....MK...IT.E.TMGIGECVAKF....PN.TVPVF....MS.F..GMS.CL....GCSAARF
  12 m....AK...VT.K.DMIISDVLNMD....KG.TVPIF....LE.S..GMH.CL....GCPSSSG
  13 .....M-...IT.K.DMTIGEILRVK....PE.SAQVL....MD.M..GMG.CL....GCPSAQF
  14 .....M-...IT.K.DWTITDIVEKY....PK.TTEIL....MN.H..GMH.CF....GCMAARF
  15 .....MK...IN.R.DMTIMDVMQLD....RE.VATIF....MK.Y..GLH.CL....GXPGATM
  16 .....MT...IT.R.DMLIGDLLRMK....PE.AASIL....MG.F..GMG.CL....GCPSSQM
  17 m....PR...IT.T.DTIIADVLRID....RG.TIPIF....LN.N..GLH.CL....GCPSAQG
  18 tap..KQ...VT.R.DTIIGDILDMD....QT.TAPYF....ME.I..GMH.CL....GCPASRG
  19 m....SK...IT.R.DMTMGYIVKEF....PQ.TVEVF....QR.Y..GMG.CL....SCPTAQL
  20 vvp..LQ...IT.K.DLSIMDVLRAY....PQ.VRPVF....IR.H..GMG.CL....ECMGAMD
  21 mal..--...IT.K.DMTVGQVLRSY....PQ.TVQTF....LE.L..GMH.CL....GCPSSTM
  22 .....M-...IT.K.DMTVGQVLRSY....PQ.TVQTF....LE.L..GMH.CL....GCPSSTM
  23 .....M-...IT.K.DMVIQEIVTKY....PQ.TLPVF....GQ.F..NMG.CL....GCSGALF
  24 m....EK...IT.K.DTLIGNALKIN....PN.SASIL....MS.F..GMG.CL....GCPSSQM
  25 m....AE...FS.K.DTKIGELIDQF....PE.SAPIL....ME.I..GMH.CL....GCPASQM
  26 tne..MA...IS.K.DMIIADLIALD....PN.YAAIL....MA.S..GMG.CV....GCPSSQG
  27 m....AK...VT.K.DMLIGQLITLD....PN.IAPIL....MR.A..GMH.CL....GCPSSQM
  28 .....MQ...IT.K.DMGIMDIVNKY....PQ.AVSVF....QA.Y..GMG.CI....GCMAARF
  29 stsd.-E...IT.K.DTVIGDILKIN....PE.SASTL....ME.A..GMH.CL....GCPASQM
  30 .....MT...IT.K.DSIIGDILDAY....GEvTAPFF....LE.M..GMH.CL....GCPASRG
  31 mad..-K...IT.K.DMTFFAVMQAY....PQ.SLDVL....RK.H..RLG.CV....GCMGAQN
  32 vqpg.-K...IT.K.DSIIGQVIRDN....PR.TIAVF....RA.H..GMG.CL....GCPSASG
  33 .....M-...VT.K.DMTIGEVIQKN....PG.AAEIL....MS.F..GMG.CV....CCPSALG
  34 .....M-...IT.K.DMTIGEVVKND....SS.KAEVL....MS.F..GMG.CV....GCPSAQA
  35 m....AT...VT.K.DTIIGDILDMD....RT.TAPFF....LE.M..GMH.CL....GCPASRG
  36 msd..-K...IT.K.DMKFSEILNYG....QP.VVQVF....MK.Y..QMG.CL....GCAVAKF
  37 m....AK...IT.K.DMIIKDIININ....MG.CIPIL....LN.E..GMH.CV....GCPASQG
  38 .....M-...IT.K.DMTIGEVIRKM....PT.AAEVL....MS.F..GMG.CV....GCPSAQA
  39 m....AQ...IS.K.TMTISEILSVD....KV.VIPVL....MN.S..GMH.CL....GCPSAQG
  40 m....AQ...VT.K.DMTFAAVMRMH....PD.VVKVL....AK.Y..NLG.CI....GCMGAQN
  41 .....MK...IT.K.DMLIGDIIQIH....PD.AVEIL....FN.F..GLS.CV....GCPASQM
  42 m....NT...IT.K.DMVIGDLLAID....EN.FAAIL....MA.S..GMH.CV....GCPSSQG
  43 .....MK...IT.K.DMTIGEIVRNH....EG.AAEVL....MS.F..GMG.CV....GCPSAQS
  44 .....MP...IR.H.DSVVDDLMRTQ....PA.TIRTF....LD.F..RMG.CC....GCPIATF
  45 ma...VN...IT.K.EMTMGELLSID....RG.VAVVL....MN.A..GMH.CI....GCPSSIG
  46 .....M-...IT.K.DMTIGEVVSAD....QS.KAQVL....MS.F..GMG.CV....GCPSAQA
  47 .....MP...FG.S.DDLVDDIMRTA....PH.TIRVF....LA.F..RLA.CV....GCPIATF
  48 sengv--...IT.K.DMIIADIVSED....AE.NTKIL....ME.F..GMH.CI....GCPSSQM
  49 .....M-...IT.K.DLTIGEIIRIK....EN.APQIL....MS.F..GMG.CV....GCPSAQA
  50 .....MK...IK.K.EMLIGQILSEK....PE.SIGTL....MS.F..GMG.CI....MCPSSQM
  51 .....MT...IR.D.DLPVDEVMRSW....PA.TIRTF....LD.F..RMQ.CC....GCPIAAF
  52 m....SQ...VT.K.DMTFAAVMRMH....PD.VVKVL....AK.Y..NLG.CI....GCMGAQN
  53 .....M-...LT.G.AEKITDVVEKY....PQ.SVEVF....QK.Y..GMH.CF....GCMAARF
  54 rrki.M-...IT.K.DMTVGEIIRIK....EN.AAEIL....MS.F..GMG.CI....GCPSAQS
  55 m....AQ...IT.K.DMTFGELLSKYystcPK.LVDDL....ME.A..GMG.CI....GCPHSQM
  56 .....M-...IS.K.EMTIGEIIRRY....PQ.TLPVF....EK.Y..GLD.CH....DCQIADF
  57 .....MS...ID.R.TLVVEDVMSRW....PA.TIRVF....LD.F..KLA.CV....GCPIATF
  58 ms...QQ...FT.K.DMTFAQALQAN....PE.VAKVL....RK.Y..NLG.CI....GCMGAQN
  59 m....KK...VT.E.DMTIAEVLKMD....RE.VAGIF....MK.Y..GLH.CL....GCPGATM
  60 .....MEt..FT.K.NTTIGELLSVY....PE.CAPIL....ME.I..GMH.CL....GCPSAQM
  61 avr..QT...LH.D.DMTMDAIMREW....PA.TIRVV....LD.H..GLL.CV....GCPIAPF
  62 vmm..AQ...VS.R.DTTIGEALSMN....PG.IAPIL....QE.I..GMH.CL....GCPASQG
  63 .....M-...IT.K.DMIIGDIIRKH....PR.TLTVF....VK.Y..GLD.CN....ECQIADY
  64 .....M-...IT.K.DMIIGDIIRQH....PA.TVQVF....AR.H..GLE.CY....ECQIADL
  65 md...-K...IN.K.DTTVGEVIRMN....PA.NAQKL....MN.F..GMG.CV....GCPSAQS
  66 ear..MP...IS.F.DELVDDVMRRR....PE.TIRVF....LA.F..QMR.CV....GCPIACF
  67 mpp..PK...LD.DpDLPLDVLMTTW....PE.TVRVF....MD.H..DML.CV....GCMVSPF
  68 .....M-...IT.K.TMRIGDIIRTY....PQ.SLKIF....EK.Y..GLD.CY....ECQVADY
  69 .....M-...IT.R.DMIIADIIRKY....PE.TLPVF....KK.H..RLE.CY....ECQISDL
  70 ms...QQ...VT.K.DMTFAQVMRMH....QD.AVKVL....AK.Y..NLG.CV....GCMGAQN
  71 .....MH...LD.P.DMTLEEIMRAW....PP.AISVI....LR.H..HML.CV....GCPITAF
  72 mpp..PK...LD.DpDLPLDVLMTTW....PE.TVRVF....MD.H..DML.CV....GCMVSPF
  73 .....MK...YT.K.DSLVGEVLDND....ES.LARYF....LE.M..GMH.CL....GCPSSRG
  74 m....AR...VT.K.EMTMGELLQTYyeqcPE.IVDVL....TG.L..GMH.CI....GCPSSIG
  75 hpq..PD...-D.P.DIPLIELMALW....PQ.TIPVF....VR.H..RML.CV....GCLVSPF
  76 m....PE...IDlS.TVTVGEWLRRW....PE.TVRVF....LN.Y..KMN.CP....ACPIAPF
  77 mt...QK...FT.K.DMTFAQALQTH....PG.VAGVL....RS.Y..NLG.CI....GCMGAQN
  78 m....AK...IS.K.DMLINDILAVD....AG.NAAIL....MA.A..GMH.CI....GCLAAAG
  79 qat..AK...IT.K.DMTFLEMLRTY....PE.TAKVL....KK.Y..NLA.CA....GCMGAQS
  80 kked.-K...FH.R.DMLVGSIIGMD....PQ.AAQIL....SD.S..GMG.CL....GCPASQS
  81 m....EK...VT.K.DMNIMEAVEKY....PI.IAQVL....MR.Y..GLG.CV....GCIISSA
  82 m....AK...IS.K.DMLINDILAID....AG.NAAIL....MA.A..GMH.CI....GCLAAAG
  83 eke..VL...IT.K.KMSTGEVTKKY....PA.TKEVF....AKyF..GKG.CF....DCPSFGT
  84 asp..CE...ID.A.ATLVDDLMRQR....PQ.TIGVF....LR.R..RLY.CV....GCPVGHF
  85 .....M-...VT.G.DMNIMEAVEKY....PV.IVEVL....QR.N..GLG.CV....GCMIASG
  86 etk..PK...IT.K.KTSIGDVIQNY....PE.TESVV....KKyF..GAG.CY....TCPGSKT
  87 .....M-...VT.G.DMNIMEAVEKY....PI.IVEVL....QR.N..GLG.CV....GCMIASG
  88 .....M-...VT.G.DMNIMEAVEKY....PV.IVEVL....QR.N..GLG.CV....GCMIASG
  89 .....MA...LT.A.DSTIAELLREK....PE.SAQVL....FR.F..GMG.CL....GCAIANN
  90 vvk..PR...FY.K.EMTVGEAMAVH....PE.AGLVF....SS.Y..HLGgCS....HCSINEL
  91 vvk..PR...FY.K.EMTVGEAMAVH....PE.AGLVF....SS.Y..HLGgCS....HCSINEL
  92 .....--...--.-.-MTISEILRRY....PE.TLPVF....ER.H..HLD.CY....DCQLADF
  93 mr...PD...LD.DpDLPLSRLFDRW....PA.TAAVF....LT.R..RML.CP....GCPIAPF
  94 m....AD...LT.A.DSTIYDLLQAK....PE.ATEAL....FK.F..GMG.CV....GCAIARG
  95 .....--...--.-.-MTISEILRRY....PE.TLPVF....ER.H..HLD.CY....DCQLADF
  96 .....MA...LS.K.DSTILEVLQEK....PD.AGAIF....AR.F..GMG.CV....GCAISRG
  97 .....--...--.-.-------MRRK....PE.TIRVF....LA.F..QMR.CV....GCPIACF
  98 .....MK...FT.L.EMKLKDIMAAN....PK.TVEAM....QE.L..GLH.CL....GCPFSVN
  99 .....MK...LD.S.KMTVGELVTRH....PS.VMEVF....IK.R..RMP.CV....GCPTERF
 100 .....MA...IT.L.DSTIADLLREK....PE.SAATL....QS.F..GMG.CL....GCAIANN
 101 vfk..MK...FT.L.EMKLKDIMAAN....PK.TVEAM....QE.L..GLH.CL....GCPFSVN
 102 kpd..PD...-D.P.DLPLARLLQTW....PA.SAGVF....LE.R..RML.CP....GCPIAPF
 103 kpd..PD...-D.P.DLPLARLFQTW....PA.SAGVF....LE.R..RML.CP....GCPIAPF
 104 stk..PR...FF.K.EMTVGEAIAIH....PE.AGLVF....SS.Y..HLGgCS....HCSINEV
 105 .....MKkhiIN.G.EMKIWDVIQDY....PE.TYGIF....RQ.F..GYP.DIrkgdTAVTSHF
 106 prr..PR...FD.DpDLPLSTLFGEW....PD.MVEVF....LA.K..QML.CP....GCPVAPF
 107 kaa..TE...IS.R.SMTIEDILGMF....PY.KAQKLsqeiTN.A..GLH.CV....GCHAAVW
 108 ms...M-...FD.K.TTKMAAVLKGH....PK.AKEVL....ES.F..GLQ.CS....TCSGAKH
 109 genkv--...IT.K.DMIINDVIQKY....SK.TIGIF....KD.F..GVD.--....SCCGGGF
 110 mt...IT...LT.A.DTSVLELVEHH....PE.TEAVF....ER.YtkRLGiCI....-CCECLF
 111 m....PV...IT.K.EMSIIEVVQKY....PE.TVEVF....RK.Y..GMG.CF....G------


      40         50          60           
      |          |           |           
   1 ESIEDACAVHG.ID..ADKLVKELNEYFEKKEV...
   2 ESIEDACAVHG.ID..ADKLVKELNEYFEKKEV...
   3 ESLEQAAMVHG.MD..IEKLLEALNKAI-----...
   4 ETVEQACMVHG.VD..VDALLAELNKMTAGAAQ...
   5 ENIEQGARAHG.IN..IDELIADLNKVVAEQAQs..
   6 ESLEQAAMVHG.MD..IEKLLEALNKAI-----...
   7 ETVEQAAMVHG.AD..ADKLLEELNKVFENEE-...
   8 ESLEEACMVHG.ID..VDELVAKLNAFEEAK--...
   9 ETIEEACSVHG.MD..VNELVEALNKEAK----...
  10 ENIEQGALAHG.ID..VDGLIADLNKVANKAE-...
  11 ENIGQGARAHG.ID..VDKLIEELNKVVGKDDDacg
  12 ESIEDACAIHG.ID..ADQLIDNLNKYLENK--...
  13 ETLEQACEVHG.QD..VEDILAKLNK-------...
  14 ENIEQGAMAHG.IN..VDELMKELNDAIKE---...
  15 ESISDAGNVHG.ID..VDKLVDDLNKFFEEKGN...
  16 ESLEQAAAVHG.IN..IEQLLEKLNA-------...
  17 ESIEEACALHG.ID..AQKLVDELNEYLKSKGLld.
  18 ETIEEACEVHG.VN..CDELLEKLNTHLAAKKA...
  19 ESLEKGAMLHG.LD..VQELLEELNKVVQ----...
  20 ETIASGARMHG.LD..LDQLLKDLNEAIKNRDQe..
  21 ESIEGAALTHG.KK..PDELVEKLNKVIAAN--...
  22 ESIEGAALTHG.KK..PDELVEKLNKVIAAN--...
  23 ETLEQGALAHG.ID..VDAMLKALNDLIKK---...
  24 ETIEQAAAVHG.ID..AEALLEKLNA-------...
  25 ETLEEAAMVHG.ID..CGLLVEKINAAAKAMGK...
  26 ESIEQAAYVHG.MD..LDELLGRLNEYAQTKEA...
  27 ESLEEAAMVHG.MD..ADVLVQQINDFLGE---...
  28 ETLEEGANAHG.IN..VDDLVDDLNENI-----...
  29 ETLEEACSVHG.ID..VEELLNKLNA-------...
  30 ETVAQACDVHG.VD..ADELVKKLNEAVGN---...
  31 ESLEQGANAHG.ID..VNALLKDLNDAVA----...
  32 ESVEKAAGIHG.ID..LEELLSELNKV------...
  33 ETIEEAAMVHG.ID..ADEIIKSLNYSKEENN-...
  34 ETIEEAAMVHG.IN..LDELIEALNK-------...
  35 ESLEQACLVHN.VD..PDELVEKLNEHLAGK--...
  36 ETLEQGANAHG.VD..VDALLKDLNAAIDND--...
  37 ETLEEACIVHG.LD..ADVLAKKLNDFVVSVDGe..
  38 ETLEEAAIVHG.IN..LDDLIEAINNIEY----...
  39 ETLEEACMHHG.LN..ADELETQINDALAGI--...
  40 ESLEQGCAAHG.IN..VDEIVADINKLF-----...
  41 ETLEEATMVHG.LN..LDLLLDVLNENNT----...
  42 ETLEEAAFVHG.MN..VNELLGRLNEYMETKQA...
  43 ETLAEAAMVHG.ME..LDALLEALNK-------...
  44 HTVDDACREHD.VD..RDVFLVALRDAMADQDSpga
  45 ESLEEACMVHG.IE..VDELLKNINEYFANK--...
  46 ETIAEAATVHG.LN..LDDLLEALNR-------...
  47 HTVEDACREHG.ID..RDKFLAALCDCVPA---...
  48 ETLEDACAVHG.LN..VEELIKKLNK-------...
  49 ETIEDAVKVHG.IN..LEELLEQLNK-------...
  50 ETLEEAAMVHG.ID..PNTIVAALNEDHKEEAEa..
  51 HTMKDACREHG.VD..RDSFVAALEATIAG---...
  52 ESLEQGCAAHG.IS..VDEIVADINKLF-----...
  53 ENVEQGAMAHG.ID..VPSLIKDLNKAIGNQS-...
  54 ESLEDAANVHG.LN..LDDLLKALN--------...
  55 ESIEEGAMGHG.ID..PDLLVAKLNATLEASQA...
  56 EAVEHGASVHK.VD..IGRLMEDLNRIINA---...
  57 HTIEDSCHEHG.IA..EAPFLAALRKAVAKSQEvls
  58 ESLEQGCSAHG.LD..VNEVLKDLNAIGQ----...
  59 ESISDAGNVHG.ID..VSLLIADLNKHFESN--...
  60 ETLGEAAMVHG.ID..ADLLVEKINAARAA---...
  61 HTIIDAAREHD.LD..PASLARDLKRAVAEEDTgts
  62 ESLAEAAMVHG.ID..AELLVEKINAFLNA---...
  63 EELEHGAGVHK.VN..IEQLLSELNEHIGSGTE...
  64 ETLEHGAGHHK.LD..IEALLEELNRTVITP--...
  65 ETLREASLVHG.ID..LDRLIKALSEDKN----...
  66 HNVADACREHG.VD..ADTFLSALCACT-----...
  67 HSVSEACAEYH.LD..EEVFRAALAEAVEAAHRrwg
  68 EELEHGAGVHK.TD..LEKLLKELNELIQS---...
  69 ETLEHGAEVHR.VG..IDGLLEELNRSIA----...
  70 ESLEQGCGAHG.LN..VDDVVRDLNALF-----...
  71 HTVNDACREHM.ID..EGAFLEELRAAIAAVEGaii
  72 HSVSEACAEYH.LD..EVVFRAALADAVEAAHRrwg
  73 ETIEQACEVHG.AD..CARLLEQLNGTA-----...
  74 ESLADAAYVHG.ID..SDLLVEKLNATINAKLGe..
  75 HTVTDACAEYD.LD..EGEFLAELKMAIGMA--...
  76 MTIDEAASEYR.VD..ANLLKRDLMQMLKERAPqer
  77 ESLEQGANAHG.LN..VEDILRDLNALA-----...
  78 ETLEEAAAVHG.LD..AAELEVEINDYLAKKEEqqa
  79 EPIDLGAINHG.LD..PEQLLADLNAAVK----...
  80 ETLADACLVHG.LD..VEEILKQLNQ-------...
  81 ETLGEGIAVHG.LN..PDIIIEEVNMILEKQEE...
  82 ESLEEAAAVHG.LD..AVELEQEINDYLAKKEAe..
  83 EDINLACMMHN.TD..VDKFVQELNEAAYKEINkt.
  84 HTIEEAAREHG.LE..PKALLAELRFIP-----...
  85 ETLAEGIEAHG.LD..TKAILAEINSLIKE---...
  86 EDIAFGATMHN.VD..PEVIIKELNEIIEKHK-s..
  87 ETLAEGIEAHG.LD..AKAILDEINSLIKE---...
  88 ETLAEGIEAHG.LD..TKAILDEINSLIKE---...
  89 ETIREAAQAHG.IP..LEEMLSALGVAE-----...
  90 ETIEQVCMGYG.VE..VDVLVESLNNLLEDSED...
  91 ETIEQVCMGYG.VE..VDVLIESLNNLLEDSQD...
  92 EQLEHGATVHK.ID..VESLLCELNCSIKK---...
  93 HTVVEACAEYG.LD..EDEFRRALRLLAGI---...
  94 ETIREAAEAHG.IP..LAELLNALG--IKE---...
  95 EQLEHGATVHK.ID..VESLLCELNCNIKK---...
  96 ETVAEAAAAHG.IP..LEELMSALGISA-----...
  97 HNVADACREHG.VD..PDLFLSALSAAT-----...
  98 ETLLNAAQMHK.LD..PEKLLEAVNSVEQGEMSeaa
  99 HTIEDIARING.IV..LEHLLKDLLDAIGVGEEt..
 100 ETIREAAMVHG.IP..LEELAKKLGL-------...
 101 ETLLNAAQMHK.LD..PEKLLEAVNSVEQGEMSeaa
 102 HTVIEACAEYG.LD..EGEVRRALRLVVMP---...
 103 HTVIEACAEYG.LD..EGEVRRALRLVVMP---...
 104 ETIEQVCMGYG.VE..VDTLIDSLNNLFAEE--...
 105 MKLRTAAHAYH.ID..LDKLLEALNDAIAGHDRgac
 106 HAITDACEEYE.LD..EEVFRAELRRAARL---...
 107 ETLEAGMMTHGkTDaqIDELVRRLNALLQEPVDqss
 108 ESIELGATNHG.LD..VNELLTHLNALFDEPPGk..
 109 -SIEKTAAMSG.GD..MEKLLEKLNKAIDE---...
 110 CSLKDVAARYE.LD..LDELMARLNSAIESDLKesp
 111 -----------.--..-----------------...
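
The alignment above is printed in two column blocks, so each numbered row has to be joined across blocks before it can be used as one sequence. The following is a minimal Python sketch of that step, not part of SAM itself. It assumes the usual A2M-style convention for this kind of output: uppercase letters and '-' are alignment (match) columns, while lowercase letters and '.' are insert columns. The function names are hypothetical and chosen only for illustration.

```python
import re
from collections import defaultdict

# A data row is "<number> <segment>"; ruler lines ("10   20   30") and
# tick lines ("|   |") do not fit this pattern and are skipped.
ROW = re.compile(r"^\s*(\d+) (\S+)\s*$")


def stitch_blocks(text):
    """Join each numbered row's segments across all alignment blocks."""
    rows = defaultdict(str)
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            rows[int(m.group(1))] += m.group(2)
    return dict(rows)


def match_columns(row):
    """Keep only alignment columns (assumed: uppercase letters and '-')."""
    return "".join(c for c in row if c.isupper() or c == "-")


def ungapped(row):
    """Recover the plain residue string: drop '.' and '-', uppercase the rest."""
    return "".join(c for c in row if c.isalpha()).upper()


if __name__ == "__main__":
    # Row 1 of the alignment above, copied from both blocks.
    sample = (
        "   1 .....MK...IT.K.DMIIADVLQMD....RG.TAPIF....IN.N..GMH.CL....GCPSSMG\n"
        "\n"
        "   1 ESIEDACAVHG.ID..ADKLVKELNEYFEKKEV...\n"
    )
    rows = stitch_blocks(sample)
    print(len(ungapped(rows[1])))  # 68, the length listed for T0473
```

If the case convention assumed here holds, match_columns should return strings of equal length for every row, which gives a quick sanity check on the stitched alignment.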