(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0107 Family 9 carbohydrate binding module, T. maritima
              10        20          30        40         50        60  
              |         |           |         |          |         |  
   1 VATAKYGTPVIDGEIDEIWNTTEEI..ERKAVAMGSLDKNATAKVRVLWD.ENYLYVLAIVKDPV
   2 VATAKYGTPVIDGEIDEIWNTTEEI..ETKAVAMGSLDKNATAKVRVLWD.ENYLYVLAIVKDPV
   3 VATAKYGTPVIDGEIDEIWNTTEEI..ETKAVAMGSLDKNATAKVRVLWD.ENYLYVLAIVKDPV
   4 IATAIYGTPVIDGKVDGIWNNAEAI..STNTWVLGS--NGATATAKMMWD.DKYLYILADVTDNN
   5 IATAIYGTPVIDGKVDDIWNNVEPI..STNTWILGS--NGATATQKMMWD.DKYLYVLADVTDSN
   6 IATAIYGTPVIDGKVDDIWNNVEPI..STNTWILGS--NGATATQKMMWD.DKYLYVLADVTDSN
   7 VATAKYGTPVIDGEIDDIWNTTEEI..ETKSVAMGSLEKNATAKVRVLWD.EENLYVLAIVKDPV
   8 VATAKYGTPVIDGEIDDIWNTTEEI..ETKSVAMGSLEKNATAKVRVLWD.EENLYVLAIVKDPV
   9 VATAIYGTPVIDGKIDDIWNKVDAI..TTNTWVLGS--DGATATAKMMWD.DKYLYVLADVTDSN
  10 IATAIYGTPVIDGKVDGVWNNPEAI..STNTWVLGS--NGATATAKMMWD.DKYLYILADVTDNN
  11 --KVTYGKPVIDAKKDDIWKKAASI..KTDVWVIGN--SGATATAQLLWD.EKYLYVLADVKDPL
  12 -AKALEGSPTIGANVDSSWKLVKPL..YVNTYVEGTV--GATATVKSMWD.TKNLYLLVQVSDNT
  13 -AKALEGSPTIGANVDSSWKLVKPL..YVNTYVEGTV--GATATVKSMWD.TKNLYLLVQVSDNT
  14 -AKVMYGTPTVDGKEDKLWKKAVTI..TTDVKVTGN--SGAKAKAKLLWD.EKYLYVLAEVKDPL
  15 -AKALEGSPTIGANVDSSWKLVKPL..YANTYVEGTV--GATATVKSMWD.TKNLYLLVQVSDNT
  16 FASAQKGTPKIDAELDDAWKNAQEIvtDTKVTVTGTVYDSAYAKARMMWD.ENCIYVYAVVYDPL
  17 -AKALEGSPTIGANVDSSWKLVKPL..DANTYVKGTI--GATAAVKSMWD.TKNLYLLVQISDNT
  18 -AKALEGSPTIGANVDSSWKLVKPL..YANTYVKGTI--GATAAVKSMWD.TKNLYLLVQVSDNT
  19 -SKISEGEAVVVGMMDDSYMMSKPI..EIY-----DEEGNVKATIRAIWK.DSTIYVYGEVQDAT
  20 -STISEGEAVVVGKMDDSYLMSKPI..EIY-----DEEGNVKATIRAIWK.GSTIYVYGEVQDAT
  21 -SRISEGEAVVVGMMDDSYLMSKPI..EIL-----DEEGNVKATIRAVWK.DSTIYIYGEVQDKT
  22 -SRISEGEAVVVGMMDDSYLMSKPI..EIL-----DEEGNVKATIRAVWK.DSTIYIYGEVQDKT
  23 -AYAVYGTPEIDGKTDEVWNKAPEL..KINRYQTAWH--GADGTARVLYD.ENNLYVLIKVNDTQ
  24 -INIKKGSATVDGDIEEAWDAAEAV..SLSI----KLGSDISADGKLLWD.EENLYVLVDVKDSV
  25 -------APVIDGVVDEAWADAPVL..TTDVQVEGTP--GATAEIRVLWH.DDAVDVLATVADPV
  26 -SQAKVSLPDRKGQEDIIWGAVRAL..PFSHVIEGAV--GTTGEVKTLWD.GKQLNLRIEVKDAT
  27 -LKASPSFPNHKGQEDILWGAVKGL..EINHLADGT--SAVTGEARVMWD.AKKVNLRVEVKDTT
  28 --------PEIDGQVDDAWADAEVV..STGKTVEGGA-DGATAQVRTLWSgDDTLYVLAEVTDPV
  29 --YANNAQPRIDGIMDKEYKGTIPL..SVLNDAGQDI-----AQVRALWS.GNELCLYVTVNDSS


            70        80          90            100       110          
            |         |           |              |         |          
   1 LNKDNSNPWEQDSVEIFIDENNHKTG..YYEDDDAQF.....RVNYMNEQTFGTGG...SPARFK
   2 LNKDNSNPWEQDSVEIFIDENNHKTG..YYEDDDAQF.....RVNYMNEQTFGTGG...SPARFK
   3 LNKDNSNPWEQDSVEIFIDENNHKTG..YYEDDDAQF.....RVNYMNEQTFGTGG...SPARFK
   4 LNKSSVNPYEQDSVEVFVDQNNDKTT..YYENDDGQF.....RVNYDNEQSFGGST...NSNGFK
   5 LNKSSINPYEQDSVEVFVDQNNDKTT..YYENDDGQY.....RVNYDNEQSFGGST...NSNGFK
   6 LNKSSINPYEQDSVEVFVDQNNDKTT..YYENDDGQY.....RVNYDNEQSFGGST...NSNGFK
   7 LNKDNSNPWEQDSVEIFIDENNHKTG..YYEDDDAQF.....RVNYMNEQSFGTGA...SAARFK
   8 LNKDNSNPWEQDSVEIFIDENNHKTG..YYEDDDAQF.....RVNYMNEQSFGTGA...SAARFK
   9 LNKSSVNPYEQDSVEVFVDQNNDKTS..YYESDDGQY.....RVNYDNEQSFGGST...NSNGFK
  10 LNKSSVNPYEQDSVEVFVDQNNDKTT..YYENDDGQL.....RVNYDNEQSFGGST...NSNGFK
  11 RSKLSSNAHEQDSIEIFIDPSKDQTT..FYQEDDAQY.....RVNFDNETSFGGNA...RKESFK
  12 -------PSNNDGIEIFVDKNDDKST..SYETDDERY.....TIKRDGTGSS----...---DIT
  13 -------PSNNDGIEIFVDKNDDKST..SYETDDERY.....TIKRDGTGSS----...---DIT
  14 LSKKSANAHEQDSIELFIDLNKNQTN..SYEEDDAQY.....RVNFDNETSFGGSP...RKELFK
  15 -------PSSNDGIEIFVDKNDNKST..SYETDDEHY.....TIKSDGTGSS----...---DIT
  16 LNKANTNPWEQDSIEIFIDENNHKTP..YYEDDDVQY.....RVSYENTQTFGTNG...DAKNFI
  17 -------PSNNDGIEISVDKNDNKST..TYESDDEHY.....IAKRDGTGSS----...---NIT
  18 -------PSNNDGIEIFVDKNDNKST..TYESDDEHY.....IVKRDGTGSS----...---NIT
  19 ------KKPAEDGVAIFINPNNERTP..YLQPDDTYV.....VLWTNWKSEVNRED...V--EVK
  20 ------KKPAEDGVAIFINPNNERTP..YLQPDDTYV.....VLWTNWKSEVNRKD...V--EVK
  21 ------KKPAEDGVAIFINPNNERTP..YLQPDDTYA.....VLWTNWKTEVDRED...V--QVK
  22 ------KKPAEDGVAIFINPNNERTP..YLQPDDTYA.....VLWTNWKTEVNRED...V--QVK
  23 LDKGSPNPWEQDSVEIFIDENNAKTS..FYEGDDGQY.....RVNFENETSFNPES...IAGGFE
  24 LNEDSEDDYQQDSVEIFVDENNGKSG..GYEADDKQY.....RISFSNKQSFNGEKc..VAENIT
  25 VDETATNAWEQDSVEIFVDPVNAKAG..AYTPQDGQY.....RISASNAQSVSGDLav.IGERLT
  26 -------RLKGDQVEVFVSPEDMTAGkkNSTPKDGQY.....IFNRDG------GK...GKDQKL
  27 -------RRKSDQVQVFLAEEVKGTG..AATSEADLTakkknPPSANGQYTFKRDGgk.GKDKNI
  28 VDVSSADPWNQDSVELFLDLGNTKPA..AYGPNVSQM.....RISADNVTSFGTGDaaaQAARLT
  29 V------DANNDRVVIFIDQDNGKLP..ELKDDDFWV.....SISRNGTKNQSKTG...Y--VKD


      120       130       140            150        160       170      
       |         |         |              |          |         |      
   1 TAVKLIEGGYIVEAAIKWKTIKPTPNTVI.....GFNIQVNDAN.EKGQRVGIISWSDPTNNSWR
   2 TAVKLIEGGYIVEAAIKWKTIKPTPNTVI.....GFNIQVNDAN.EKGQRVGIISWSDPTNNSWR
   3 TAVKLIEGGYIVEAAIKWKTIKPTPNTVI.....GFNIQVNDAN.EKGQRVGIISWSDPTNNSWR
   4 SATSLTQNGYIVEEAIPWTSITPSNGTII.....GFDLQVNDAD.ENGKRTGIVTWCDPSGNSWQ
   5 SATSLTQSGYIVEEAIPWTSITPSNGTII.....GFDLQVNNAD.ENGKRTGIVTWCDPSGNSWQ
   6 SATSLTQSGYIVEEAIPWTSITPSNGTII.....GFDLQVNNAD.ENGKRTGIVTWCDPSGNSWQ
   7 TAVKLIEGGYIVEAAIKWKTIKPSSNTVI.....GFNVQVNDAN.EKGQRVGIISWSDPTNNSWR
   8 TAVKLIEGGYIVEAAIKWKTIKPSPNTVI.....GFNVQVNDAN.EKGQRVGIISWSDPTNNSWR
   9 SATSLTQSGYIVEEAIPWTSITLLNGTII.....GFDLQVNDAD.ENGKRTGIVTWCDPSGNSWQ
  10 SATSLTQNGYIVEEAIPWTSITPLNGTII.....GFDLQVNDAD.ENGKRTGIVTWCDPSGNSWQ
  11 SATRLTNGGYIVEVAIPLDSVRAEGQRWI.....GFDLQVNDDGaGDGKRSSVSIWSDSSGNSYQ
  12 KYVTSNADGYVAQLAIPIEDISPAVNDKI.....GFDIRINDDK.GNGKIDAITVWNDYTNSQNT
  13 KYVTSNADGYVAQLAIPIEDISPAVNDKI.....GFDIRINDDK.GNGKIDAITVWNDYTNSQNT
  14 SATRLTKEGYIVEAAIPLENVRTKESKWI.....GFDLQVNDDGaGDGKRSSVFMWSDPSGNSYR
  15 KYVTSNADGYIVQLAIPIEDISPTLNDKI.....GLDVRLNDDK.GSGSIDTVTVWNDYTNSQDT
  16 TATKIIPNGYVVEVQVYMRTTKLSEGMVI.....GFDIQVNDAD.HTGGRVGVLTWNDKVGNNWR
  17 KYVMSNADGYVAQIAIPIEDISPVLNDKL.....GFDIRINDDQ.GSGNVTAITAWNDYTNSQDT
  18 KYVMSNADGYVAQIAIPIEDISPVLNDKI.....GFDIRINDDQ.GSGNINAITVWNDYTNSQDT
  19 KFVGPGFRRYSFEMSITIPGVEFKKDSYI.....GFDVAVIDDG.K------WYSWSDTTNSQKT
  20 KFVGPGFRRYSFEMSITIPGVEYRKDSYI.....GFDVAVIDDG.K------WYSWSDTTNSQKT
  21 KFVGPGFRRYSFEMSITIPGVEFKKDSYI.....GFDAAVIDDG.K------WYSWSDTTNSQKT
  22 KFVGPGFRRYSFEMSITIPGVEFKKDSYI.....GFDAAVIDDG.K------WYSWSDTTNSQKT
  23 SAAEVSGQLYLGSEN-TVQDREPVSNMQI.....GFDVQINDG-.KNGVRQSIATWNDPTGNAWQ
  24 SATKKTDDGYVVEAAIKWTDITPEVGGKA.....GVELQVNDAT.AEGVRCGTISWADDTGTGYM
  25 SATALVDGGYVVEASIALGR-DVTVGDLV.....GLDFQVNDAT.A-GVRGSVRTWTDPTGRSYQ
  26 YQVKENKSGYVVYASLPLSSADLAAGKVL.....SLDFRITDKQ.PNGK-TSIVVWNDVNNQQPQ
  27 YNVQETKTGYVVYASLPMSAALLTEGKVL.....SLDFRITDEH.AAGK-TATIVWNDISNQQPD
  28 SATARTDTGYVVELAVTLRGQSGGQDDVAlg8fqGLDVQVNDGR.D-GARYAVHTWADPTGTGYQ
  29 YVVLQQLNGYTMEVKLLLNN-SLAINTNI.....GFDIAVIDNG.Q------QYSWNDRTNSQYF


       180        
        |        
   1 DPSKFGNLRLIK
   2 DPSKFGNLRLIK
   3 DPSKFGNLRLIK
   4 DTSGFGNLML--
   5 DTSGFGNLLL--
   6 DTSGFGNLLL--
   7 DPSKFGNLKLLK
   8 DPSKFGNLRLIK
   9 DTSGFGNLLL--
  10 ATSGFGNLML--
  11 DTSGFGSLLL--
  12 NTSYFGDIVLSK
  13 NTSYFGDIVLSK
  14 DTSGFGSLLLMK
  15 NTSYFGDIVLSK
  16 DTTKFGCLELV-
  17 NTAYFGDLVLSK
  18 NTAYFGDLVLSK
  19 NTMNYGTLKL--
  20 NTMNYGTLKL--
  21 NTMNYGTLKL--
  22 NTMNYGTLKL--
  23 DTSVFGILTL--
  24 SPEVFGTVV---
  25 STARWGVAELV-
  26 KTENRGKLKL--
  27 KPANRGKLKL--
  28 TGARWGVAHLV-
  29 ETDNYGILTM--