(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0108 Family 17 carbohydrate binding module, C. cellulovorans
              10        20           30        40        50          60
              |         |            |         |         |           |
   1 MGASGQYVRARIKGAYYATPVDPVTNQ...PTAPKDFSSGFWDFNDGTTQGFGVNPDSPIT..AI
   2 LSLSGEYVRARIKGAKY-EPIDRT---...-----RYTKVLWDFNDGTKQGFGVNSDSPNKe.AI
   3 LSLSGEYVRARIKGAKY-EPIDRT---...-----RYTKVLWDFNDGTKQGFGVNSDSPNKe.AI
   4 LSVSGEYARARIKGIKY-EPIERS---...--EKEEFTTNVWDFNDGTTQGFGINGDSPIKadSI
   5 LSVSGEYVRSRILGEEY-QPIDRT---...--PREEFSEVIWDFNDGTTQGFVQNSDSPL-..DV
   6 LTTSGQYVRARIKGAYYATPVDPVTNQ...PTAPKDFSSGFWDFNDGTTQGFGVNPDSPIT..AI
   7 -------------GTEVEIPVVHD---...---PKGEAVLPSVFEDGTRQGWDWAGESGVKt.AL
   8 -------------GTEVEIPVVHD---...---PKGEAVLPSVFEDGTRQGWDWAGESGVKt.AL
   9 -------------GTEVEIPVVHD---...---PKGEAVLPSVFEDGTRQGWDWAGESGVKt.AL
  10 -------------GKIVEIPVVHS---...---PKGDAALPSNFEYGTRQGWDWAGESGVKt.AL
  11 -------------GKIVEIPVVHS---...---PKGDAALPSNFEYGTRQGWDWAGESGVKt.AL
  12 LSLSGEYVRARIKGIEY-TPIDRT---...-----KFTKLVWDFNDGTTQGFQVNGDSPNKe.SI
  13 LSISGEYVRARIKGIAY-QPIKRDNKIkegENAPLGEKVLPSTFEDDTRQGWDWDGPSGVKg.PI
  14 LSLSGEYVRARIKGVNY-EPIDRT---...-----KYTKVLWDFNDGTKQGFGVNGDSPVE..DV
  15 LSLSGEYVRARIKGVNY-EPIDRT---...-----KYTKVLWDFNDGTKQGFGVNGDSPVE..DV
  16 LSLSGEYVRARIKGVNY-EPIDRT---...-----KYTKVLWDFNDGTKQGFGVNGDSPVE..DV
  17 -------------GTEIEIEVIHD---...---EKGTATLPSTFEDGTRQGWDWHTESGVKt.AL
  18 ----------------VPEPVEHD---...---TKGDSALPSDFEDGTRQGWEWDSESAVRt.AL
  19 ---------------VVEAPVEHA---...---PIGKATLPSTFEDSTRQDWAWDATSGVQs.AL


              70        80        90          100       110         120
              |         |         |            |         |           |
   1 NVENANNALKISNLNSKGSNDLSEGNFWAN...VRISADIWGQSINIYGDTKLTMDVI..APTPV
   2 EVENENGTLRISGLNV--SNDLSDGNFWAN...FRLSANGWGKSVDILGAEKLTMDVI..VDEPT
   3 EVENENGTLRISGLNV--SNDLSDGNFWAN...VRLSANGWGKSVDILGAEKLTMDVI..VDEPT
   4 TLANEKNALKITGLNN--SNDLTEGNYWAN...VRLSADGTSNKPNIFGAEKLTMDVI..TAAPA
   5 TIENVNDALQITGLDESNAIAGEEEDYWSN...VRISADEWEETFDILGAEELSMDVV..VDDPT
   6 NVENANNALKISNLNSKGSNDLSEGNFWAN...VRISADIWGQSINIYGDTKLTMDVI..APTPV
   7 TIEEANGSNALSW--EFGYPEVKPSDNWATaprLDFWKSDLVRGENDYVTFDFYLDPV..RATEG
   8 TIEEANGSNALSW--EFGYPEVKPSDNWATaprLDFWKSDLVRGENDYVTFDFYLDPV..RATEG
   9 TIEEANGSNALSW--EFGYPEVKPSDNWATaprLDFWKSDLVRGENDYVTFDFYLDPV..RATEG
  10 TIEEANGSQALSW--EFGYPEVKPSDNWASaprLDFHKDNLVRGENDYVAFDFYIDPA..RATEG
  11 TIEEANGSQALSW--EFGYPEVKPSDNWASaprLDFHKDNLVRGENDYVAFDFYIDPA..RATEG
  12 TLSNNNDALQIEGLNV--SNDISEGNYWDN...VRLSADGWSENVDILGATELTIDVI..VEEPT
  13 TIESANGSKALSF--NVEYPEKKPQDGWATaarLILKDINVERGNNKYLAFDFYLKPD..RASKG
  14 VIENEAGALKLSGLDA--SNDVSEGNYWAN...ARLSADGWGKSVDILGAEKLTMDVI..VDEPT
  15 VIENEAGALKLSGLDA--SNDVSEGNYWAN...ARLSADGWGKSVDILGAEKLTMDVI..VDEPT
  16 VIENEAGALKLSGLDA--SNDVSEGNYWAN...ARLSADGWGKSVDILGAEKLTMDVI..VDEPT
  17 TIEEANGSNALSW--EYAYPEVKPSDGWATaprLDFWKDELVRGTSDYISFDFYIDAV..RASEG
  18 TIEEANGSNALSW--EYAYPEVKPSDDWATaprLTLYKDDLVRGDYEFVAFDFYIDPIedRATEG
  19 TIKDANESKAISW--EVKYPEVKPVDGWASaprIMLGNVNTTRGNNKYLTFDFYLKPT..QASKG


             130        140       150           160       170       180
              |          |         |             |         |         |
   1 NVSIAAIPQSSTHG.WGNPTRARIVWTNNFVA....QTDGTYKATLTISTNDSPNFNTIATDAAD
   2 TVAIAAIPQSTKHG.WANPERSVKVTEADFVK....QDDGKYKALLTITGDDAPNLKNIGFDDEN
   3 TVAIAAIPQSTKHG.WANPERSVKVTEADFVK....QDDGKYKALLTITGDDAPNLKNIGKDDEN
   4 TVSIAAIPQSSTHG.WANPTRAIAVKPADFVK....QEDGTYKAVLTITPADSPNFDSIAKDSKD
   5 TVAIAAIPQSSAHE.WANASNSVLITEDDFEE....QEDGTYKALLTITGEDAPNLTNIAEDPEG
   6 NVSIAAIPQSSTHG.WGNPTRAIRVWTNNFVA....QTDGTYKATLTISTNDSPNFNTIATDAAD
   7 AMNINLVFQPPTNGyWVQAPKTYTINFDELEEpn..QVNGLYHYEVKI------NVRDITNIQDD
   8 AMNINLVFQPPTNGyWVQAPKTYTINFDELEEan..QVNGLYHYEVKI------NVRDITNIQDD
   9 AMNINLVFQPPTNGyWVQAPKTYTINFDELEEan..QVNGLYHYEVKI------NVRDITNIQDD
  10 AMNINLVFQPPANGyWVQAPKTFTINFEELEEan..QVNGLYHYEVKI------NVRDIANIQDD
  11 AMNINLVFQPPANGyWVQAPKTFTINFEELEEan..QVNGLYHYEVKI------NVRDIANIQDD
  12 TVSIAAIPQGPAAG.WANPTRAIKVTEDDFES....FGDG-YKALVTITSEDSPSLETIATSPED
  13 MIQMFLAFSPPSLGyWAQVQDSFNIDLGKTVKckkdRRTEVYKFNVFF---DLDKIQDNKVLSPD
  14 TVSIAAIPQGPSAN.WVNPNRAIKVEPTNFVP....LGDK-FKAELTITSADSPSLEAIAMHAEN
  15 TVSIAAIPQGPSAN.WVNPNRAIKVEPTNFVP....LGDK-FKAELTITSADSPSLEAIAMHAEN
  16 TVSIAAIPQGPSAN.WVNPNRAIKVEPTNFVP....LEDK-FKAELTITSADSPSLEAIAMHAEN
  17 AISINAVFQPPANGyWQEVPTTFEIDLTELDSatv.TSDELYHYEVKI------NIRDIEAITDD
  18 AIDINLIFQPPAAGyWAQASETFEIDLEELDSatv.TDDGLYHYEVEI---------NIEDIEND
  19 SLTISLAFAPPSLGfWAQATGDVNIPLSSLSKmkk.TTDGLYHFQVKY---DLDKINDGKVLTAN


               190           200      
                |             |      
   1 SVVTNM.IL.FVGSN.SDN...ISLDNIKFTK
   2 NNMNNI.IL.FVGTEaADV...IYLDNIKVT-
   3 NNMNNI.IL.FVGTEaADV...IYLDNIKVT-
   4 STMTNI.IL.FVGAD.TDV...ISLDNITV--
   5 SELNNI.IL.FVGTEnADV...ISLDNITVT-
   6 SVVTNM.IL.FVGSN.SDN...ISLDNIKFTK
   7 TLLRNMmII.FADVE.SDFagrVFVDNVRF--
   8 TLLRNMmII.FADVE.SDFagrVFVDNVRF--
   9 TLLRNMmII.FADVE.SDFagrVFVDNVRF--
  10 TVLRNM.ILiFADVQ.SDFagrVFVDNVRF--
  11 TVLRNM.ILiFADVQ.SDFagrVFVDNVRF--
  12 NTMSNI.IL.FVGTEdADV...ISLDNITV--
  13 TLLRDI.IVvIADGN.SDFkgkMYIDNVRFT-
  14 NNINNI.IL.FVGTEgADV...IYLDNIKV--
  15 NNINNI.IL.FVGTEgADV...IYLDNIKV--
  16 NNINNI.IL.FVGTEgADV...IYLDNIKV--
  17 TELRNL.LLiFADED.SDFagrVFVDNVRF--
  18 IELRNL.MLiFADDE.SDFagrVFLDNVRM--
  19 TVLRDItIV.VADGN.SDFpgtMYLDNIRF--