(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0091 HI0442, H. influenzae
    • gi|43322|emb|CAA28176.1|_1:73 (X04487) open reading frame 2 (75AA) (2775 is 1st base in codon) [Escherichia coli] (Escherichia coli)
              10         20        30        40        50        60    
              |          |         |         |         |         |    
   1 MFGKGGLGGLMKQAQQ.MQEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLME.
   2 MFGKGGLGGLMKQAQQ.MQEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLME.
   3 MFGKGGLGGLMKQAQQ.MQEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLME.
   4 MFGKGGLGNLMKQAQQ.MQEKMQKMQEEIAQLEVTGESGAGLVKVTINGAHNCRRVEIDPSLLE.
   5 RGGMGNMQKMMKQMQK.MQKDMAKAQEELAEKVVEGTAGGGMVTVKANGQKEILDVIIKEEVVDp
   6 MFGKAGLGGLMKQAQQ.MQENMKKAQAKLAETEIEGEAGNGLVKITMTCAHEVRKIDISPDLIQe
   7 MFGKAGLGGLMKQAQQ.MQENMKKAQAKLAETEIEGEAGNGLVKITMTCAHEVRKIDISPDLIQe
   8 ---MFDPSKLSEMLTQ.FQDKAKEMEEKSHNTSFTAKSGGGLVSVSMSGAGELLDVSIDDSLLE.
   9 ---MKNMGQMMKQMQK.MQKQMMKAQEELKEKTVEATAGGGMVTVVASGDKRILDVRISEDVVDp
  10 ----VNFNQFLKQAQS.MQKKMQEAQEQMANTRYTGKAGGMLVEIIITGKGEVEKISIDESLLKt
  11 MFENMDFSKMGELLNQ.VQEKAKNIELELANREFSAKSGAGLVKVSANGKGEIIDVSIDDSLLE.
  12 MQPGGDMSALLAQAQQ.MQQKLLEAQQQLANSEVHGQAGGGLVKVVVKGSGEVIGVTIDPKVVDp
  13 LGNMQNLYETVKKAQMvVQVEAVRVQKELAVAEFDGYCQGELVKVTLSGNQQPIRTDITDAAME.
  14 ----MSFKKITEMMRQ.AERQSKQKALDFEQKLFEYSYKNAAIKIIIFGNLTIKSITIDPALIDp
  15 ----MSFKKIAEMMRQ.AERETKKKTLAFEQQAFEYNYKNGAIKITILGDLTLKSINIDPVLIDa
  16 ----MDFSQLGGLLDG.MKKEFSQLEEKNKDTIHTSKSGGGMVSVSFNGLGELVDLQIDDSLLE.
  17 MQPGGDMSALLAQAQQ.MQQKLLETQQQLANAQVHGQGGGGLVEVVVKGSGEVVSVAIDPKVVDp
  18 FGNMQNMYETVKKAQMvVQVEAVRVQKELAAAEFDGYCAGELVKVTLSGNQQPIRTDITEAAME.
  19 ---FSQLGGLSGLLDG.VKKEFSQLEEKNKDTIHTSKSGGGMVSVSFNGLGELVDLQIDDSLLE.
  20 LGKIKELQEAFQKAQQ.VQEGAKVLQEELERMEIPGKSADGLVTVLMSGNQEPLSIEIDPSALE.
  21 -GGQPNMQQLLQQAQK.MQQDLAKAQEELARTEVDGQAGGGLVKATVTGSGELRGLVIDPKAVDp
  22 ----MDMKKLMKQMQQ.AQVAAGKIQDELAAQSVEGTA-SGLVTVQMNGHGKVTSLKIKPEAVDg
  23 ----VNPLDFLKNMSS.VKNNIDNIKKEISKITVCGKAGSNIVTIEMDGEFNVKKVSINKEFFDd
  24 ------LKDFARMQEE.LQKKIQELEESFSQIEVEASVGGGAVRIVATCDRRVKDIEIDEDLKE.
  25 ----MDMKKLMKQMQQ.AQVAAGKIQDELAAQSVEGTA-SGLVTVQMNGHGKVTSLKIKPEAVDg
  26 ----MDFQKLAQELKK.MQNTLSKKQKEFEEKVFDFDYK-GYVLVKIKGDLNIEAIEIKTEIVDp
  27 MFGKGGLGNLMKQAQQ.MQEKMQKMQEEIAQLEVTGESGAGLVKVTINGAHNCRRVEIDPSLLE.
  28 --------KKKKEAKL.MERQFMEMEASLEQKRFSGEAGNGLVSVTINGKCNLVDVKIKPDCLDp
  29 --------KKKKEAKI.MEQQFLEMEASLLEKRYEGQAGNGLVSVVINGKCDLISVKVQPTCLDp
  30 --------KKKKEAKL.MERQFMEMEASLEQKRFSGEAGNGLVSVTINGKCDLVDVRIKPDCLDp


             70        80        90       100         
             |         |         |         |         
   1 ..DDKEMLEDLIAAAFNDAVRRAEELQKEKMASVTAGMPLPPGMKFPF
   2 ..DDKEMLEDLIAAAFNDAVRRAEELQKEKMASVTAGMPLPPGMKFPF
   3 ..DDKEMLEDLIAAAFNDAVRRAEELQKEKMASVTAGMPLPPGMKFPF
   4 ..DDKEMLEDLVAAAFNDAARRIEETQKEKMASVSSGMQLPPGFKMPF
   5 ..EDIDMLQDLVLAATNEALKKVDEITNETMGQFTKGMNM-PGL----
   6 aaDDKEMLEDLILAALKSARDKAEETANKTMGAFTQGL--PPGVGDF-
   7 aaDDKEMLEDLILAALKSARGKAEETANKTMGAFTQGL--PPGVGDF-
   8 ..-DKESLQILLISAINDVYKSVEENKKSMTLGMLGGMAPF-------
   9 ..DDVEMLQDLILAATNEALKKVDELVEQDMGKFTKGLNM-PGM----
  10 ..EEKEMLEDLIKVAFNDAKQKCDEDSQNSLSGALNGMSLPPGFKIPF
  11 ..-DKESLQILLISAINDVLAMVAQNRSSMANDVLGGFG---------
  12 ..DDIETLQDLIVGAMRDASQQVTKMAQERLGALAGAMRP--------
  13 ..LGSEKLSLLVTEAYKDAHSKSVLAMKERMSDLAQSLGMPPGL----
  14 ..EDKVTLEEMITEAVNEAVGDVKAKYDQLMEE---AMPQMPGL----
  15 ..SDKVILEEMIIEATNEAVSDVKTKYDNLVEK---TMPKVPGL----
  16 ..-DKEAMQIYLMSALNDGYKAVEENRKNLAFNMLGNF----------
  17 ..GDIETLQDLIVGAMADASKQVTKLAQERLGALTSAMR---------
  18 ..LGSEKLSQLVTEAYKDAHAKSVVAMKERMSDLAQSLGMPPGL----
  19 ..-DKEAMQIYLMSALNDGYKAVEENRKNLAFNMLGNF----------
  20 ..KGAEGLSASVTEAMKAAYAESTETMRSKMEELTSGLNL-PG-----
  21 ..EDTETLADLVVAAVQAANENAQNLQQQKLGPLAQGMGG--------
  22 ..DDVEALEDLILAAINDAAEKAEGLQREATAGL--------------
  23 ..LDNDAFEQMIKSALNDAVSKVKEEIKLKTMGV--------------
  24 ..-DFDTLKDLLIAGMNEVMEKIEKRREEEMSKITQQFGI-PG-----
  25 ..DDVEALEDLILAAINDAAEKAEGLQRE-------------------
  26 ..EDKETLQDILRAAINEAISITCKERDAIMN----------------
  27 ..DDKEMLEDLV------------------------------------
  28 ..EDPEVVADLFRAAFKAA----KAALDSEMSAMHMGMP---------
  29 ..EDPEVIEDLFRAAFKLA----KEQMDQEMSLMRSTM----------
  30 ..EDPEVVADLFRAAFKAA----KAALDSEMSAMQMGMP---------