(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0110 Protein HI1288, H. influenzae
              10        20              30          40             50  
              |         |               |           |              |  
   1 MAREFKRSDRVAQEIQKEIAVIL.QREV.....KDPRIG..MVTVSDVEVSSDLS.....YAKIF
   2 --KEFGRPQRVAQEMQKEIALIL.QREI.....KDPRLGm.MTTVSGVEMSRDLA.....YAKVY
   3 -AREFKRSDRVAQEIQKEIAVIL.QREV.....KDPRIG..MVTVSDVEVSSDLS.....YAKIF
   4 -----SRSQRVSQEMQKEIALIL.QREI.....KDPRVG..MATVSGIELSRDLA.....YAKVF
   5 -MTTHSRPERVGQEIQAAIGQLLtRGEL.....RDPRIG..FITITGVKVSPDLR.....VAQVF
   6 -----MRANRVGEQMKKELGDII.SRKL.....KDPRIG..FLTVTDVRVSGDLQ.....IAKVY
   7 -MATSRRVSRVSSLIKREVSQML.LHEI.....KDDRVGtgMVSVTEVEVSGDLQ.....HAKIF
   8 -MNPAYRKAMLESEIQKLLMEAL.Q-QL.....RDPRLKkdFVTFSRVELSKDKR.....YADVY
   9 CMTENRRIKRVNALLQEAIAKVI.LKDV.....KHPKISnlWITVTRVSLSKDLH.....SARVY
  10 -MADQARARRLAKRICTIVASAI.EFEI.....KDPGLD..GVTIVDVKVTADLH.....DATVF
  11 -MTENRRIKRVNALLQEAIAKVI.LKDV.....KHPKISnlWITVTRVSLSKDLH.....SARVY
  12 -MADAARARRLAKRIAAIVASAI.EYEI.....KDPGLA..GVTITDAKVTADLH.....DATVY
  13 -MADNARAKRLADLIREVVAQKL.QRGI.....KDPRLGs.HVTITDTRVTGDLR.....EATVF
  14 -QVSQLRVRKLGEHIRAEIAQLImLGKI.....KDPRVSp.FLSVNWVDVSGGMV.....CARVY
  15 FMYKNIKKFKLESFIAQEIGNLIvSGGI.....KDPRIHs.FLTVVKVEFSKDLI.....NAKVF
  16 -MTENRRMKKVNAMLRESIAKVI.LKDV.....KHPKISnrWITITRVSLSRDLQ.....SARVY
  17 -QRGYARQDRVKEQIMRELAELV.RTGL.....KDPRAG..FITVNEVEVTRDYS.....HATVF
  18 ---ASYRKQRIENDIIRLINRTI.INEI.....YDPVVK..LGHVSHVKLSADFF.....HAVVY
  19 -QRGYARQDRVKEQIMRELAELV.RTGL.....KDPRAG..FITVNEVEVTRDYS.....HATVF
  20 ---ASYKKERLENDIIRLINRTV.IHEI.....YNETVK..TGHVTHVKLSDDLL.....HVTVY
  21 -KTNSHRQQKLASIINEVLIEILkRGKM.....LDKRLFdcPLTITKIIVTADLK.....IANCY
  22 -MAENRRMKKVNAMLREAIAKVI.LKDV.....KHPKISnrWITITRVSLSRDLQ.....SACVY
  23 LIMANHRIDRVGMEIKREVNEIL.RLRV.....NDPRVQ..DVTITDVQMLGDLS.....MAKVF
  24 -----AHKERLESNLLELLQEAL.A-SL.....NDSELN..SLSVTKVECSKGKH.....HALVF
  25 -----AHKERLESNLLELLQEAL.A-SL.....NDSELN..SLSVTKVECSKGKH.....HAYVF
  26 -MGNSFRSDRVAVEIQREINDIL.RNKV.....RDPRVQ..DVNITDVQLTGDLS.....QATVY
  27 --MDNKRLSKIERLLQKELSEIF.LRDA.....KSLPGV..IVSVTNVRVSPDLS.....IARIH
  28 -MANEVRVARLESLIKDVINNAL.ANEI.....NDKIAK..LARVTAVRLSNDLS.....VAKIF
  29 -RLDDKKVVQLSRILEERIAEVV.A---.....TDEMLGrlQLQITRVRVDRAFT.....QVSVY
  30 CMANPRRVKMVAKQIMRELSDML.LTDTv13lgADRYLSs.LTTISDVEVSNDLQv12cnVVKVY
  31 --GPLMKPEQVQAQMSRVLSSAI.A-EL.....RDPRVPl.IVTVERVHVTPDYG.....QARVY


                  60        70        80          90       100       11
                  |         |         |           |         |         
   1 VTFLF....D..HDEMAIEQGMKGLEKASPYIRSLLGKAMR..LRIVPEIRFIYDQSLVEGMRMS
   2 VTFLN....D..KDEDAVKAGIKALQEASGFIRSLLGKAMR..LRIVPELTFFYDNSLVEGMRMS
   3 VTFLF....D..HDEMAIEQGMKGLEKASPYIRSLLGKAMR..LRIVPEIRFIYDQSLVEGMRMS
   4 VTFLNvltdN..ADPDTAKNGIKALQDASGYIRTLLGKAMR..LRIVPELTFAYDNSLIEGMRMS
   5 YSMMG....T..--AQERAETQKGLDAAKGFVRREVTAAVN..LRVSPEVFFTFDESVGEGDKID
   6 ISVLG....D..--EKKREEALKGLAKAKGFIRSEIGSRIR..LRKTPEIEFEFDESIDYGNRIE
   7 VSIYG....S..--PEAKASTMAGLHSAAPFVRRELGQRMR..LRRTPEVSFLEDRSLERGDKIL
   8 VSFLG....T..--PEERKETVEILNRAKGFFRTFIAKNLR..LYVAPEIRFYEDKGIEASVKVH
   9 VSVMP....H..--ENTKEEALEALKVSAGFIAHRASKNVV..LKYFPELHFYLDDIFSPQDYIE
  10 YTVMG....RtlEDAPDYTAATAALNRAKGTLRSKVGAGTG..VRFTPTLTFIRDTTSDSVARME
  11 VSVMP....H..--ENTKEEALEALKVSAGFIAHRASKNVV..LKYFPELHFYLDDIFSPQDYIE
  12 YTVMG....RtlHDEPNCAGAAAALERAKGVLRTKVGAGTG..VRFTPTLTFTLDTISDSVHRMD
  13 YTVYG....D..--DEERKAATAGLESAKGILRSEVGKAAG..VKFTPTLTFVMDALPDTARNIE
  14 VSSFM....G..--KYKTKQGVQGLESAAGFIRSVLAKKLR..LRQCPRLSFVYDESVRDGFSLS
  15 MGSIK....E..--GASLDNAVKALNNAKGFIQSQIIKRIK..VRSTPKLLFVKDDSLSKSFYVN
  16 VSIMP....H..--ENSQEETLAALKASAGFIACQASKDVV..LKYFPDLNFYMEDIFSPQDHIE
  17 YTILN....Q..---DAREITEEVLEHARGHLRSELAKRIK..LFKTPELHFKYDESLERGLNLS
  18 LDCYD....R..---SQIQTVVNAFKKAQGVFSQMLAQNLY..LAKSVKLHFVKDDAIDNALKIE
  19 YTVLN....Q..---DAREITEEVLEHARGHLRSELAKRIK..LFKIPELHFKYDESLERGLNLS
  20 LDCYN....R..---EQIDRVVGAFNQAKGVFSRVLAHNLY..LAKAVQIHFVKDKAIDNAMRIE
  21 FFPFN....T..--KLTFDEIMDALNNSKHAIRNFITNRIN..MKFSPEIRFYYDYGFDNAIKIE
  22 VSIMP....H..--ENSQEETLAALKASAGFIAFQASKDLV..LKYFPDLNFYVEDIFSPQDHIE
  23 YTIHS....T..-LASDNQKAQIGLEKATGTIKRELGKNLT..MYKIPDLQFVKDESIEYGNKID
  24 VLSSD....H..-------KILSKLKKAEGLIRQFVLQASG..WFKCPKLSFVLDDSLEKQLRLD
  25 VLSSD....H..-------KILSKLKKAEGLIRQFVLQASG..WFKCPKLSFVSDNSLEKQLRLD
  26 YSLLS....N..-LASDNEKAATALKKATGLFKSELAKRMT..IFKIPDLTFAKDESVEYGSKID
  27 LSIFP....S..---EKSSEILESIKHNTKTIRYDLGQQVRtqLRKIPDLTFYIDDSLDYLENID
  28 LDAHK....R..---ESMLKVLENVNKVSGLLRSKLAAEWT..SYKVPELRFVIDETIDYANHID
  29 WMCRG....D..GDS----EIVDFLEESKHQIRRRVEESIG..I-TCPEVKFIGDKALLMEQEMD
  30 VSVFG....D..--DRGKDVAIAGLKSKAKYVRSELGKRMK..LRLTPEVRFIEDEAMERG----
  31 VSAIG....A..----DMPELLDALTHARGRLQRELSAHVR..MRRTPTLEF-------------


      0       120        
     |         |        
   1 NLVTNVVREDEKKHVEESN
   2 NLVTSVVKHDEERRVNPDD
   3 NLVTNVVREDEKKHVEES-
   4 NLVTNVIKNDVERQVNP--
   5 RLLREVKQ-----------
   6 TLIHELHSEK---------
   7 NLLNNLPQAIATEDLEDDD
   8 QLLVQLGYDPLKDKEK---
   9 NLLWQIQEK----------
  10 ELLARARAA----------
  11 NLLWQIQEK----------
  12 ELLARARAA----------
  13 DLLDKARQSDEK-------
  14 RKIDRLES-----------
  15 KLIEGLN------------
  16 SLLLKIAEQDKKLTHNNN-
  17 ALIDQVAAEK---------
  18 QIINSL-------------
  19 ALIDQVAAEK---------
  20 SIINSLKKSK---------
  21 ELLKN--------------
  22 SLLLKIAEQDK--------
  23 EMLRNLDK-----------
  24 AIFNEIAKGKD--------
  25 AIFNEIAKGKD--------
  26 EIL----------------
  27 RLLN---------------
  28 ELFKKIKQ-----------
  29 KLFREADYG----------
  30 -------------------
  31 -------------------