; SAM: prettyalign v3.2 (July 31, 2000) compiled 08/11/00_16:27:51 ; (c) 1992-2000 Regents of the University of California, Santa Cruz ; ; Sequence Alignment and Modeling Software System ; http://www.cse.ucsc.edu/research/compbio/sam.html ; ; ----------------- Citations (SAM, SAM-T99, HMMs) -------------------- ; R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: ; Extension and analysis of the basic method, CABIOS 12:95-107, 1996. ; K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting ; remote protein homologies, Bioinformatics 14(10):846-856, 1999. ; A. Krogh et al., Hidden Markov models in computational biology: ; Applications to protein modeling, JMB 235:1501-1531, Feb 1994. ; --------------------------------------------------------------------- ; Sequences correspond to the following labels: ; 1 gi|125391|sp|P08210|KHSE_CORGL_116:302 ; 2 gi|125390|sp|P07128|KHSE_BRELA_116:302 ; 3 gi|1730046|sp|Q10603|KHSE_MYCTU_131:285 ; 4 gi|1170651|sp|P45836|KHSE_MYCLE_120:287 ; 5 gi|7450509|pir||B64651_109:275 ; 6 gi|7450510|pir||C71940_110:275 ; 7 gi|6967628|emb|CAB72618.1|_111:287 ; 8 gi|4614|emb|CAA37083.1|_129:341 ; 9 gi|6321814|ref|NP_011890.1|_130:342 ; 10 gi|7490646|pir||T40495_119:323 ; 11 gi|7472024|pir||G75280_117:267 ; 12 gi|2497514|sp|P73646|KHSE_SYNY3_122:267 ; 13 gi|125393|sp|P04947|KHSE_FREDI_113:268 ; 14 gi|6225592|sp|O67332|KHSE_AQUAE_107:273 ; 15 gi|1730045|sp|P52991|KHSE_LACLA_105:266 ; 16 gi|2497513|sp|P72535|KHSE_STRPN_100:266 ; 17 gi|80285|pir||B25364_111:283 ; 18 gi|6648061|sp|P04948|KHSE_BACSU_112:284 ; 19 T0115-116-end ; 20 gi|2497515|sp|Q58504|KHSE_METJA_116:300 ; 21 gi|7434441|pir||H75080_110:282 ; 22 gi|7450512|pir||D71103_88:260 ; 23 gi|4927412|gb|AAD33097.1|AF082525_1_170:353 ; 24 gi|7532410|gb|AAF63224.1|_170:353 ; 25 gi|7521143|pir||E72511_150:326 ; 26 gi|9107373|gb|AAF85023.1|AE004035_2_133:284 ; 27 gi|7447990|pir||E69207_115:279 ; 28 gi|2128859|pir||G64479_112:270 ; 29 gi|7450508|pir||F72364_109:272 ; 30 gi|7387824|sp|O66132|KHSE_BUCAI_1:95 ; 31 gi|147980|gb|AAA83915.1|_117:291 ; 32 gi|266419|sp|P00547|KHSE_ECOLI_117:292 ; 33 gi|125395|sp|P27722|KHSE_SERMA_117:292 ; 34 gi|9656932|gb|AAF95506.1|_123:298 ; 35 gi|1170650|sp|P44504|KHSE_HAEIN_122:297 ; 36 gi|586879|sp|P37550|IPK_BACSU_120:276 10 20 30 40 50 | | | | | 1 GLADFPLTQ-EQI..VQLSSAFEGHPDNAAASVLGGAVVSWtnlsidgkSQPQYAA..VPLEVQD 2 GLADFPLTQ-EQI..VQLSSAFEGHPDNAAASVLGGAVVSWtnlsidgkSQPQYAA..VPLEVQD 3 ----------AEL..IQLASEFEGHPDNAAAAVLGGAVVSWtdhsg...DRPNYSA..VSLRLHP 4 QIDSTPLSN-AQL..IQLASEFEGHPDNAAAAVLGGAVVSWvdrsy...DQPDYCA..VPLRLHP 5 DFDRENIVNT---..---ALIYENHPDNITPAVFGGYNAAFv.......EKKKVIS..LKTKIPS 6 -FDRENILNT---..---ALIYENHPDNITPAVFGGYNAAFv.......EKKKVIS..LKTKIPS 7 -VEKECILDE---..---ALIYENHPDNIAPATLGGFVCSLv.......EKNKVYS..IKKEIDK 8 GFSKQRMLDY---..---CLMIERHPDNITAAMMGGFCGSFlrd36plpPTDIGRH..VKYQWNP 9 GFSKQRMLDY---..---CLMIERHPDNITAAMMGGFCGSFlrd36plpPTDIGRH..VKYQWNP 10 GLSKLQMMDY---..---VLMIERHPDNVMASMMGGFVGSFlre28tlpPKSLGTF..ARLPWAS 11 PLDDETVLDV---..---TAREEGHPDNVAPALFGGIVVATl.......DKLGTHY..VRLDPPA 12 -----------EI..LQMAIAMEGHPDNVAPALLGGCQLAVk.......NGDHWQL..VALDWPS 13 QLAGEPLSQ-LQV..MELAIAMEGHPDNVVPALLGGCRLAAt.......SAEGWEI..CDVPWDE 14 --HKVELPL-KEK..LKIAFEFEKHPDNIIPAFVGGFTVCAt.......SESGVIF..KKLPFPE 15 QLAKLNLTS-DEK..LKLACEIEGHPDNVAPALLGNLVIAS........TVAGKTS..HIVADFP 16 QLGQLNLSD-HEK..LQLATKIEGHPDNVAPAIYGNLVIAS........SVEGQVS..AIVADFP 17 ELCGLKLSE-ADK..LHLASLEEGHPDNAGASLVGGLVIGL........HEDDETQ..MIRVPNA 18 ELCGLKLSE-ADK..LHLASLEEGHPDNAGASLVGGLVIGL........HEDDETQ..MIRVPNA 19 NLDKLKLVDYASY..GELASSGAKHADNVAPAIFGGFTMVT........NYEPLEV..LHIPIDF 20 NLDKLKLVDYASY..GELASSGAKHADNVAPAIFGGFTMVT........NYEPLEV..LHIPIDF 21 --NDELIIMAALE..GEKAASGSPHGDNVIPSYYGGFNILE........SLNPLRV..HRVDV-- 22 --NDELILKAAMK..GEEKASGEPHPDNVVPSYYGGFTVIE........SKSPLRV..HFVDA-- 23 KLGSDQLVLAGLE..SEAKVSG-YHADNIAPAIMGGFVLIR........NYEPLDLkpLKFPSDK 24 KLGSDQLVLAGLE..SEAKVSG-YHADNIAPAIMGGFVLIR........NYEPLDLkpLRFPSDK 25 --PVDRLVFYAGL..GERAAAGQPHFDNAAASILGGLAVVA........SDAAGKL..RVFRVPF 26 PLRREHLYRYALD..GEAVASGSRHGDNLGPLFLGGLVLCT........---LERL..VPVTVPA 27 PMEDFEMLNMAVDasLQAGVSVTGAYDDASASFYGGLTVTD........NMERRII..LREPME- 28 KIDDELILNLGIKssFDEKLTVTGAYDDATASYYGGITITD........NIERKIL..KRDKMRD 29 NLSREDLMKL---..---AVELEGHPDNVVPAFTGGLVVCY........Q-NGSHL..DFEKFEI 30 -------------..-----------------YLGGLQLIL........EDSKIIS..QTIPNFK 31 PLNDTRLLALMGE..LEGRISGSIHYDNVAPCFLGGMQLMI........EENDIIS..QQVQGLM 32 PLNDTRLLALMGE..LEGRISGSIHYDNVAPCFLGGMQLMI........EENDIIS..QQVPGFD 33 PLDKTTLLGLMGE..LEGRISGSVHYDNVAPCYLGGLQLML........EEEGIIS..QEVPCFD 34 PLDETELLALMGE..MEGKISGSIHYDNVAPCYLGGVQLML........EELGIIS..QSVPSFD 35 PFSKMELLEMMGE..LEGRISGSIHYDNVAPCYLGGVQFMV........QSLGNIC..QKLPFFD 36 NLSAETLAEL---..------GAEIGSDVSFCVYGGTALAT........G-RGEKI..KHISTPP 60 70 80 90 100 110 | | | | | | 1 NIRATALVPNFHAS...TEAVRRVLP.TEVTHIDARFNVSRVAVMIVALQQRPDLLW--E.GTRD 2 NIRATALVPNFHAS...TEAVRRVLP.TEVTHIDARFNVSRVAVMIVALQQRPDLLW--E.GTRD 3 DIRLFTAIPEQRSS...TAETRVLLP.AQVSHDDARFNVSRAALLVVALTERPDLLM--A.ATED 4 DIHLFAAIPEERSS...TAESRVLLP.ARVSHDDARFNVSRAALLVVALTERPDLLM--A.ATED 5 FLKAVMVIPNRAIS...TKQSRHLLP.KRYSVQESVFNLSHASLMTMAIVQGKWDLL-RC.CSKD 6 FLKAVMVIPNRVIS...TKQSRHLLP.KRYSVQESVFNLSHASLMTMAIVQGKWDLL-RC.CSKD 7 DLAAVVVIPNLAMS...TEQSRQALA.KNLSFNDAVFNLSHASFLTACFLEKKYEFL-KF.ASQD 8 AIKCIAIIPQFELS...TADSRGVLP.KAYPTQDLVFNLQRLAVLTTALTMDPPNADLIYpAMQD 9 AIKCIAIIPQFELS...TADSRGVLP.KAYPTQDLVFNLQRLAVLTTALTMDPPNADLIYpAMQD 10 ELKAIVVIPEFHLA...TSKARSVLP.TSYGRTDVVYNLQRLALLTTALGQTPINPHLVYeVMKD 11 HLGVTVLVPDFELS...TSKARAVLP.REYSRADTVHALSHAALLAAALAQGRLDLL-RH.AMQD 12 KFVPVLAIPNFELS...TEAARAVLP.HQYDRSAAIFNASHLALLVQAFSQGRGDWL-AL.ALQD 13 NVVPVVAIPDFELS...TQEARRVLP.TEFSRADAIFNTAHLGLLLRGLATGKGEWL-KT.ALQD 14 DIKIVFVIPDFEVS...TSEARRVLP.KKVELKEAVFNVQRSALFVSALLTKDYKLL-RE.AVRD 15 SCALLAFVPDYELK...TVESRKVLP.NELTYKEAVAASSIANVLTASLLTNNLEVAGQM.MEAD 16 ECDFLAYIPNYELR...TRDSRSVLP.KKLSYKEAVAASSIANVAVAALLAGDMVTAGQA.IEGD 17 DIDVVVVIPFYEVL...TRDARDVLP.KEFPYADAVKASAVSNILIAAIMSKDWPLVGKI.MKKD 18 DIDVVVVIPFYEVL...TRDARDVLP.KEFPYADAVKASAVSNILIAAIMSKDWPLVGKI.MKKD 19 KLDILIAIPNISIN...TKEAREILP.KAVGLKDLVNNVGKACGMVYALYNKDKSLFGRY.MMSD 20 KLDILIAIPNISIN...TKEAREILP.KAVGLKDLVNNVGKACGMVYALYNKDKSLFGRY.MMSD 21 ELNVVVVLPEVEVP...TKEARRIVP.EKVPLKDAIKNLAMASSLVLALKEGDIETV-GR.LLDD 22 KLRGVVVLPEVEIP...TAKARKILP.SMVPLKDAVKNIAMASSLILALKEGDLETI-GR.LLDD 23 DLFFVLVSPEFEAP...TKKMRAALP.TEIPMVHHVWNSSQAAALVAAVLEGDAVMLGKA.LSSD 24 DLFFVLVSPDFEAP...TKKMRAALP.TEIPMVHHVWNSSQAAALVAAVLEGDAVMLGKA.LSSD 25 KAWFAVVTPMNPVPqgkTGVMRKVLP.ENVSFRDAVRNFSRAAGIVAAAVNGDLKSMGAL.MMSD 26 AWHSLLVHPDTLLE...TRRAREVLK.DPYLLPDIVTQSANLALVLAGCYHGDAELV-RA.GLRD 27 NQKVLIYMPDRKSL...TAQS-----.---------------DVPRMKLLAPWVDMAFRE.VLDG 28 DLNVLILIPNLEKN...VDVNRMKLI.KDY--VEIAFNEAINGNYFKALFLNGILYA---.---- 29 DLSLTFLVPNFPVC...TNEMRKILP.EKVPFEDAVFNIKNSCQFLAKIAAGKIKEA-LK.YVGD 30 NWFWIVAWPGTKVP...TAEARDILP.KKYKKETCIKNSRYLAGFIHASYSQQPHLA-AR.LMQD 31 SGCGCSRIRGLK-S...RRQKQGYLP.AQYRRQDCIAHGRHLAGFIHACYSRQPELA-AK.LMKD 32 EWLWVLAYPGIKVS...TAEARAILP.AQYRRQDCIAHGRHLAGFIHACYSRQPELA-AK.LMKD 33 DWLWVMAYPGIKVS...TAEARAILP.AQYRRQDCISHGRYLAGFIHACHTRQPQLA-AK.LMQD 34 DWYWVMAYPGIKVS...TAEARAILP.AQYRRQDIVAHGRYLAGFIHACHTQQPELA-AK.MIKD 35 NWYWVLAYPGIEVS...TAEARAILP.KSYTRQNVIAHGRHLGGFVHACHTHQENLA-AI.MMKD 36 HCWVILAKPTIGVS...TAEVYRALKlDGIEHPDV-------QGMIEAIEEKSFQKM--C.SRLG 120 130 140 150 160 | | | | | 1 RLHQPYRAEVLPITS.......EWVN....RLRNRGYAAYLSGAGPTAMVL.STE.---PIPDKV 2 RLHQPYRAEVLPVTS.......EWVN....RLRNRGYAAYLSGAGPTAMVL.STE.---PIPDKV 3 LLHQPQRAAAMTASA.......EYLR....LLRRHNVAAALSGAGPSLIAL.STD.S--ELPTD- 4 VLHQPHRASAMSASA.......EYLR....LLRRHNVAATLSGAGPSLIAL.STQ.S--ELPREA 5 RMHQYKRMQTYPVLF.......AIQKi...ALENNALMSTLSGSGSSFFNM.CYE.EDAPKLKQV 6 RMHQYKRMQTYPVLF.......AIQKl...ALENNALMSTLSGSGSSFFNM.CYE.EDAPKLKQV 7 KLHEINRMKNLPELF.......EVQKf...ALENKALMSTLSGSGSSFFSLaFKD.DALALAKKI 8 RVHQPYRKTLIPGLT.......EILScvtpSTYPGLLGICLSGAGPTILAL.ATE.NFEEISQEI 9 RVHQPYRKTLIPGLT.......EILScvtpSTYPGLLGICLSGAGPTILAL.ATE.NFEEISQEI 10 KVHQPYRASLIPGLQ.......NILAtlnpDTQPGLCGICLSGAGPTVLAL.ATG.NFDEIAHAM 11 YVHQVWRAPLVPGLS.......DILEh...AHEYGALGAALSGAGPTVLCF.HDQ.--------- 12 QIHQPYRQSLIPAYD.......QLHQa...ALAAGAYNLVISGAGPTLLAI.ADE.--------- 13 KLHQPYRKALIPGYD.......AVNQa...AVAAGAYGMVISGAGPTLLAL.ADA.--------- 14 KLHQPYREKLVPGLS.......EAILv...SYKEGALATFLSGAGPTICSL.TTE.NEEKIGEAI 15 RFHESYRASLIPELQ.......LLREi...GHEFGAYGTYLSGAGPTVMLL.VPD.DKLTLL--- 16 LFHERYRQDLVREFA.......MIKQv...TKENGAYATYLSGAGPTVMVL.ASH.DKMPTIKAE 17 MFHQPYRAMLVPELS.......KVEHv...AEMKGAYGTALSGAGPTILVM.TEK.GKGEELKEQ 18 MFHQPYRAMLVPELS.......KVEHv...AEMKGAYGTALSGAGPTILVM.TEK.GKGEELKEQ 19 KVIEPVRGKLIPNYF.......KIKE....EVKDKVYGITISGSGPSIIAF.PKE.EFIDEVENI 20 KVIEPVRGKLIPNYF.......KIKE....EVKDKVYGITISGSGPSIIAF.PKE.EFIDEVENI 21 NLALPYRKKLMPWFD.......EVRKa...GLEAGAYGVTVSGSGPSLFAI.-GE.NLKDIGKAM 22 NLALPYRKKLMPWFD.......EIRRv...ALETGAYGITVSGSGPALFAI.-GE.NLKDIGKTI 23 KIVEPTRAPLIPGME.......AVKKa...ALEAGAFGCTISGAGPTAVAV.IDS.E--EKGQVI 24 KIVEPTRAPLIPGME.......AVKKa...ALEAGAFGCTISGAGPTAVAV.IDS.E--EKGQVI 25 EIVEPRRRSYVPCYT.......QVRKa...ALQAGALGFSLSGAGPSMIAL.APS.S--EAAREI 26 VLIEPRRAPLIAGFT.......AAQQa...ALQADAMGASISGAGPSVFAW.FQ-.--------- 27 RVHSALTLNGILYCAslgfdpgIALD....ALEAGALAAGLSGTGPSFVAL.THE.DSEADIIDA 28 -------SALNFPTN.......IAID....ALDAGAITAGLSGTGPSYIAM.VED.ENVEKVKEK 29 RLHQNYRINGNKKMK.......EFVEa...ILSKNPEYWFVSGSGPSVCSN.INDfEGIPYLKDV 30 FIAEPYRIKLLPN--.......----....---------------------.---.--------- 31 VIAEPYRERLLPGFR.......QARQa...VAEIGAVASGISGSGPTLFAL.CDK.P--ETAQRV 32 VIAEPYRERLLPGFR.......QARQa...VAEIGAVASGISGSGPTLFAL.CDK.P--ETAQRV 33 VIAEPYRTRLLPGFA.......EARKa...AQEIGALACGISGSGPTLFAV.CND.G--ATAQRM 34 VIAEPYREKLLPGFA.......KARNy...AASAGALATGISGSGPTLFSV.CKE.Q--AVAERV 35 VIAEPYRESLLPNFA.......EVKQa...TRDLGALATGISGSGPTIFSI.APD.L--QTAIKL 36 NVLESVTLDMHPEVA.......MIKNq...MKRFGADAVLMSGSGPTVFGL.VQY.E--SKVQRI 170 180 | | 1 LEDARESGIKVLELEVAGPV 2 LEDARESGIKVLELEVAGPV 3 -------------------- 4 AEY----------------- 5 LSKKFPK------------- 6 LSKKFPK------------- 7 QTKFKDFRVQYLEFDDN--- 8 INRFAKNGIKCSW------- 9 INRFAKNGIKCSW------- 10 LSIFEKHGVKCRY------- 11 -------------------- 12 -------------------- 13 -------------------- 14 REVI---------------- 15 -------------------- 16 LE------------------ 17 LALHFPHC------------ 18 LALHFPHC------------ 19 LRDYYENTIRTEVGKGVEVV 20 LRDYYENTIRTEVGKGVEVV 21 KEKFEELGIRAEF------- 22 VEKFEELGIKAEY------- 23 GEKMVEAFWKVGHLKSVAS- 24 GEKMVEAFWKVGHLKSVAS- 25 AAAMEESCICCD-------- 26 -------------------- 27 WENLEGDVLVTS-------- 28 LNRYGKVI------------ 29 LKLRV--------------- 30 -------------------- 31 ADWLGKNYLQNQE------- 32 ADWLGKNYLQNQE------- 33 AAWLQQHYLQNDE------- 34 ARWLEQNYVQNEE------- 35 SSYLESHYLQNNE------- 36 YNGLRGFCDQV---------