(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0109 Protein HI1715, H. influenzae
              10        20         30        40             50        6
              |         |          |         |              |         
   1 MSFDKQNLIWIDLEMTGLDPEK.ERIIEIATIVTDKNLNILA.....EGPVLAVHQSDELLNKMN
   2 MSFDKQNLIWIDLEMTGLDPEK.ERIIEIATIVTDKNLNILA.....EGPVLAVHQSDELLNKMN
   3 MSANENNLIWIDLEMTGLDPER.DRIIEIATLVTDANLNILA.....EGPTIAVHQSDEQLALMD
   4 MSANENNLIWIDLEMTGLDPER.DRIIEIATLVTDANLNILA.....EGPTIAVHQSDEQLALMD
   5 -ESMAQRMVWVDLEMTGLDIEK.DQIIEMACLITDSDLNILA.....EGPNLIIKQPDELLDSMS
   6 ----KNNLCWLDMEMTGLNPET.DRIIEVAMIITDSDLNVLA.....QSEVYAIHQSDELLDNMD
   7 ----KNNLCWLDMEMTGLNPET.DRIIEVAMIITDSDLNVLA.....QSEVYAVHQSDDVLNKMD
   8 -ESMAQRMVWVDLEMTGLDIEK.DQIIEMACLITDSDLNILA.....EGPNLIIKQPDELLDSMS
   9 CDKIEQRIIWIDCEMTGLDVEK.QTLCEIALIVTDSELNTIA.....TGPDIVIHQPKEVLDNME
  10 --KLFKPLVWIDCEMTGLDHVN.DRIIEICCIITDGHLAPVKaa7gdSHYESVIHYGPEVMNKMN
  11 ----QDELVWIDCEMTGLDLGS.DKLIEIAALVTDADLNILG.....DGVDVVMHADDAALSGMI
  12 ----HDELVWIDCEMTGLDLGS.DQLIEIAALVTDADLNILG.....DGVDVVIHIDSTALSSMI
  13 -GDYKQPLVWIDLEMTGLNVEV.DRILEIACIITNGDLTQSV.....EGPDLVVRQTKDCLDKMD
  14 -CGLDTDIVWMDLEMTGLDIEK.DKILEVACIITDQDLNVKS.....EGPCFAINHPQEVYDSMN
  15 MSNLKQPLVWIDCEMTGLEVGK.HVLMEVAAIITDGNLRPVE.....EKFDAVIKLDEKQLSEMN
  16 ----NDRMVWIDCEMTGLSLAD.DALIEVAALVTDSELNVLG.....EGVDIVIRPPDAALETMP
  17 -TKLFKPLVWIDCEMTGLDHVN.DRIIEICCIITDGHLAPVKaa7gdSHYETVIHYGPEVMNKMN
  18 ----DVEFVCLDCETTGLDVKK.DRVIEFAAIRF--TFDEII.....DSVEFLIHPERAV-----
  19 ----DIEFVCLDCETTGLDVKK.DRVIEFAAIRF--TFDEVI.....DSVEFLIHPERAV-----
  20 SSQTMDVLIFYDTETTGTQIER.DRIIEIAAYNS-----VTD.....ESFLTYVNPEIPI-----
  21 -NLLDGTFVVIDLEATGFDVEK.SEVIDLAAVRVEGG--IIT.....EKFSTLVYPGYFI-----
  22 ------ALIFYDTETTGTQIDK.DRIVELAAYNG-----TTS.....ESFQTLVNPEIPI-----
  23 -----PDLIFYDTETTGTQIDK.DRIVEIAAYNG-----TTG.....ESFQTLVNPEIPI-----
  24 GSLRKVQFLSIDLETTGLNQKK.DEIIAIGAVPIIGTRILAG.....ESYYRLLRPEKFK-----
  25 -TFGDATFVVLDFETTGLDPQV.DEIIEIGAVKIQGG--QIV.....DEYHTLIKPSREI-----
  26 ---KDTVFTCLDCEMTGLDVKK.DRIIEIAAVRF--TFDSVI.....SSIEFLINPERVV-----
  27 -------FTAFDTETTGLKAEE.DRIIEIGAVTFDRK--GII.....ARFSTLIFPDRAI-----
  28 TSWFEGPLAAFDTETTGVDTET.DRIVSAALVVQDAP-GLRP.....RVTRWLVNPGVPV-----
  29 ----KQRFVVIDVETTGNSPKKgDKIIQIAAVVIENG--QIT.....ERFSKYINPNKSI-----
  30 -DARELDTLVLDFETTGFNPEV.DRVISIGWVEIRNSNIRLN.....SARHVFINHAIDI-----
  31 -SHQDRGWAVIDVETSGFRPGQ.ARIISLAVLGLDAA-GRLE.....QSVVSLLNPKVDP-----
  32 --------IILDTETTGLDPQQgHRIVEIGAIEMVNKV-LTG.....KHFHFYINPERDM-----
  33 ----TRQLVVVDCETTGL-HDG.AAILEVAAVNIDTG---AE.....LHFVPFVTREQLAQAQPM


      0        70         80         90        100       110       120 
     |         |          |          |          |         |         | 
   1 DWCQKTHSENGLIERIK.ASKLTERA.AELQTLDFLKKWVP.KGASPICGNSIAQDKRFLVKYMP
   2 DWCQKTHSENGLIERIK.ASKLTERA.AELQTLDFLKKWVP.KGASPICGNSIAQDKRFLVKYMP
   3 DWNVRTHTASGLVERVK.ASTMGDRE.AELATLEFLKQWVP.AGKSPICGNSIGQDRRFLFKYMP
   4 DWNVRTHTASGLVERVK.ASTMGDRE.AELATLEFLKQWVP.AGKSPICGNSIGQDRRFLFKYMP
   5 DWCKEHHGKSGLTKAVK.ESTITLQQ.AEYEFLSFVRQQTP.PGLCPLAGNSVHEDKKFLDKYMP
   6 EWNTATHGRTGLTQRVR.ESSHTEAE.VEQKLLDFMSEWVP.RRATPMCGNSIHQDRRFMVKYMP
   7 EWNTATHGRTGLTQRVR.ESSHTEAE.VEQKLLDFMSEWVP.GRATPMCGNSIHQDRRFMVKYMP
   8 DWCKEHHGRSGLTKAVK.ESTITLQQ.AEYEFLSFVRQQTP.PGLCPLAGNSVHEDKKFLDKYMP
   9 EWPRNTFHENGLMEKII.ASKYSMAD.AENEVIDFLKLHAL.PGKSPIAGNSIYMDRLFIKKYMP
  10 EWCIEHHGNSGLTAKVL.ASEKTLAQ.VEDELLEYIQRYIPdKNVGVLAGNSVHMDRLFMVREFP
  11 DVVAEMHSRSGLIDEVK.ASTVDLAT.AEAMVLDYINEHVKqPKTAPLAGNSIATDRAFIARDMP
  12 DVVAEMHSRSGLINEVE.SSIVDLVT.AESIVLDYINNHVKqPKTAPLAGNSIATDRSFIARDMP
  13 DWCQTHHGASGLTKKVL.LSAITERE.AEQKVIEFVKKHVG.SGNPLLAGNSVYVDFLFLKKYMP
  14 EWCMKHHYNSGLIDRCK.SSDVNLEE.ASNLVLSYLEKNIP.KRACPLGGNSVYTDRLFIMKFMP
  15 DWCIEQHGKSGLTERCR.QSNLTVKD.VENQLLAYIKKYIPkKREALIAGNSVHADVRFLSVEMP
  16 EVVRQMHTASGLLDE-L.AGGTTLAD.AEEQVLAYVREHVKePGKAPLCGNSVGTDRGFLARDMR
  17 EWCIEHHGNSESHPKVL.ASEKTLAQ.VEDELLEYIQRYIPdKNVGVLAGNSVHMDRLFMVREFP
  18 --SAESQKIHKISDAML.RDKPKFGE.VFSRIKGFFKE---.--RDHIVGHHVGFDLQVLSQESE
  19 --SAESQKIHKISDAML.KDKPKFSE.VFSTIKGFFKE---.--RDYIVGHHVGFDLQVLSQESE
  20 --PDEASKIHGITTDAV.LSAPKFPE.AYEGFRKFCGE---.-DSILVAHNNDGFDFPLLGKECR
  21 --PERIKKLTGITNAML.VGQPTIEE.VLPEFLEFVGD---.---NIVVGHFVEQDIKFINKYTK
  22 --PAEATKIHGITTAEV.ADAPRFPE.AYQKFIEFCGT---.-DNILVAHNNNAFDYPLLVRECR
  23 --PAEATKIHGITTSEV.ANAPKFPE.AYQQFSDFCGT---.-DNILVAHNNNAFDYPLLLRECR
  24 ---HESMKFHGLDPARL.KTAHDFSE.IAEEVADLLRG---.---KVLVGYAIELDYGFLKRALK
  25 --SRKSSEITGITQEML.ENKRSIEE.VLPEFLGFLED---.---SIIVAHNANFDYRFLRLWIK
  26 --SAESQRVHHISNAML.RDQPKIAE.VFPQIKAFFKE---.--GDYIVGHSVGFDLQVLAQEME
  27 --PPDVSKINHITDDML.VNKPRFCE.IVSDFSRFIKG---.---TVLVAHNANFDVEFLNAELS
  28 --PESATAVHGLTEEYV.QRHGRWPApVMYEMAEALTEQAR.AG-RPLVVMNAPFDLTLLDRELR
  29 --PAFIEQLTGISNQMV.ENEQPFEA.VAEEVFQLLDG---.---AYFVAHNIHFDLGFVKYELH
  30 --CHESVKVHHIRPETLhVSGISEQA.AFTQLLDVIAG---.---KILVAHGCIMEQRFLEQYIK
  31 ----GPTHVHGLTAAML.DGQPQFAD.IAGEVVDVLRG---.---RTLVAHNVAFDYAFLAAEAE
  32 --PFEAYKIHGISGEFL.KDKPLFKT.IANDFLKFIAD---.---STLIIHNAPFDIKFLNHELS
  33 AMQMNRYYERGIWQRRL.SPDST-DA.AYWKLANMLAG---.---NTFGGSNPAFDSRLLAAAMP


               130       140            150        160       170       
                |         |              |          |         |       
   1 DLADY...FHYRHLDVSTLKELAARWKPEI.....LEG.FKKENTHLALDDIRESIKELAYYREH
   2 DLADY...FHYRHLDVSTLKELAARWKPEI.....LEG.FKKENTHLALDDIRESIKELAYYREH
   3 ELEAY...FHYRYLDVSTLKELARRWKPEI.....LDG.FTKQGTHQAMDDIRESVAELAYYREH
   4 ELEAY...FHYRYLDVSTLKELARRWKPEI.....LDG.FTKQGTHQAMDDIRESVAELAYYREH
   5 QFMKH...LHYRIIDVSTVKELCRRWYPEEy....EFA.PKKAASHRALDDISESIKELQFYRNN
   6 KLENY...FHYRNLDVSTLKELAKRWNPPV.....AKS.VVKRGSHKALDDILESIEEMRHYREH
   7 KLENY...FHYRNLDVSTLKELAKRWNPPV.....AKS.VVKRGSHKALDDILESIEEMRHYREH
   8 QFMKH...LHYRIIDVSTVKELCRRWYPEEy....EFA.PKKAASHRALDDISESIKELQFYRNN
   9 KLDKF...AHYRCIDVSTIKGLVQRWYPDY.....-KH.PKKQCTHRAFDDIMESIAELKNYRES
  10 KVIDH...LFYRIVDVSSIMEVARRHNPALq....ARN.PKKEAAHTAYSDIKESIAQLQWYMDN
  11 TLDSF...LHYRMIDVSSIKELCRRWYPRI.....YFGqPPKGLTHRALADIHESIRELRFYRRT
  12 TLDSF...LHYRMIDVSSIKELCRRWYPRI.....YFGqPAKGLTHRALADIHESIRELRFYRRT
  13 ELAAL...FPHILVDVSSVKALCARWFPIEr....RKA.PAKKNNHRAMDDIRESIKELKYYKKT
  14 LVDAY...LHYRIVDVSTIKELAKRWHPAIl....DSA.PKKSFTHRSLDDIRESIKELAYYKAN
  15 KIIEH...LHYRIIDVSTIKELAKRWCPDI.....-PA.YDKKGDHRALSDILESIGELQHYRSY
  16 ELEGY...LHYRIVDVSSVKELARRWYPRA.....YFNsPAKNGNHRALADIRDSITELRYYREA
  17 KVIDH...LFYRIVDVSSIMEVARRHNPALq....ARN.PKKEAAHTAYSDIK------------
  18 RLGET...LLPKHHYVIDTLRLAKGYGDSPnn9arHFN.VPHQGNHRAMKDVEMNVKVFKHLTKR
  19 RLGET...LLPKQHYVIDTLRLAKEYGDSPnn9arHFN.VPHQGNHRAMKDVEMNVKVFKHLTKR
  20 RH-SL...EPLTNRTIDSLK-WAQKYRPDLp10rqVYG.FAENQAHRALDDVVILHKVFT-----
  21 QY-RG...KKFRNPSLCTLK-LARKVFPGLk10aeNFG.FETNGVHRALKDATLTAEIFIKI---
  22 RH-GL...SEPQLRTIDSLK-WAKKYRTDLp10rqVYG.FEENQAHRALDDVITLYRVF------
  23 RH-GL...PEPQLRTIDSLK-WAKKYRTDLp10rqVYG.FEENNAHRALDDVITLHRVF------
  24 REGYK...VENKRIDVIDFEKAVCYILGERp12akKYR.VEVSYRHNALADAFITAQIFQVQ---
  25 KVMG-...LDWERPYIDTLA-LAKSLLKLRsy9veKLG.LGPFRHHRALDDARVTAQVFLRFVE-
  26 RIGET...FLSKYTIIDTLR-LAKEYGDSPnn9avHFN.VPYDGNHRAMKDVEININIFKHLCKR
  27 LC-KK...QPLSHKVVDTYA-MAQAVFPGLg12alQFG.LTVHAAHRAEDDARVCMELFTT----
  28 RHRAS...SLGRWLERTPLHVLDPHVLDKHl16caHYG.VELAGAHDAAADAQAALEVVRAVGRR
  29 KA-GF...QLPDCEVLDTVE-LSRIVFPGFe10seELQ.LRHDQPHRADSDAEVTGLI-------
  30 MKYQN...LKLPLIWLDTLK--IEQYRTQLr14rkELN.LPTYQAHNALNDAIATAEL-------
  31 IAEAE...LPVDF--VMCTVELARRLQLGVd10aaHWG.VPQQRPHDAFDDVRVLTGIL------
  32 LLKRTeikFLELTNTIDTLV-MARNMFPGAr11rfKVD.NSGRQLHGALKDAAL-----------
  33 DGAPE...WHHRLADLAAFT--AGKLNLDPv11ceRLG.VTVSDRHSALADAHATATCFTILR--


      180  
       |  
   1 FMKLD
   2 FMKL-
   3 FI---
   4 FI---
   5 IFK--
   6 FL---
   7 FL---
   8 IFK--
   9 IFV--
  10 YLKPP
  11 AFVPQ
  12 AFVPP
  13 IFK--
  14 L----
  15 -----
  16 VFVPQ
  17 -----
  18 F----
  19 F----
  20 -----
  21 -----
  22 -----
  23 -----
  24 -----
  25 -----
  26 F----
  27 -----
  28 -----
  29 -----
  30 -----
  31 -----
  32 -----
  33 -----