; SAM: prettyalign v3.1b (February 24, 1999) compiled 04/18/00_11:44:15 ; (c) 1992-1999 Regents of the University of California, Santa Cruz ; ; Sequence Alignment and Modeling Software System ; http://www.cse.ucsc.edu/research/compbio/sam.html ; ; ------ Citations (HMMs, SAM) ------ ; A. Krogh et al., Hidden Markov models in computational biology: ; ------------- Citations (SAM, SAM-T99, HMMs) ----------------- ; R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: ; Extension and analysis of the basic method, CABIOS 12:95-107, 1996. ; K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting ; remote protein homologies, Bioinformatics 14(10):846-856, 1999. ; A. Krogh et al., Hidden Markov models in computational biology: ; Applications to protein modeling, JMB 235:1501-1531, Feb 1994. ; -------------------------------------------------------------- ; Sequence numbers correspond to the following labels: ; 1 T0087 ; 2 gi|7674163|sp|O68579|PPX1_STRMU_1:309 ; 3 gi|1743856|gb|AAB39104.1|_1:310 ; 4 gi|2496410|sp|P75229|Y371_MYCPN_19:315 ; 5 gi|586817|sp|P37487|YYBQ_BACSU_1:307 ; 6 gi|2496409|sp|Q49428|Y371_MYCGE_19:315 ; 7 gi|2498554|sp|P75144|MGPA_MYCPN_21:311 ; 8 gi|7447473|pir||D69344_5:319 ; 9 gi|1346531|sp|P22746|MGPA_MYCGE_59:346 ; 10 gi|7462295|pir||G72233_11:309 ; 11 gi|7471265|pir||A75258_3:310 ; 12 gi|2496072|sp|Q58025|Y608_METJA_3:306 ; 13 gi|7462213|pir||D72359_23:563 ; 14 gi|6899406|gb|AAF30828.1|AE002139_1_16:311 ; 15 gi|7448980|pir||H71452_161:450 ; 16 gi|7448979|pir||B75216_161:450 ; 17 gi|7445472|pir||F70440_17:311 ; 18 gi|3025197|sp|Q59027|YG33_METJA_165:439 ; 19 gi|7430136|pir||F69999_12:291 ; 20 gi|7430137|pir||B70177_21:309 ; 21 gi|7448978|pir||D69503_157:436 ; 22 gi|453695|gb|AAA65933.1|_31:394 ; 23 gi|6321995|ref|NP_012071.1|PPX1|_31:394 ; 24 gi|7514584|pir||D71323_41:332 ; 25 gi|7462978|pir||B72344_3:292 ; 26 gi|7492568|pir||T38544_32:376 ; 27 gi|7471305|pir||A75470_26:307 ; 28 gi|7470790|pir||S75598_3:318 ; 29 gi|586814|sp|P37484|YYBT_BACSU_338:644 ; 30 gi|7520820|pir||D70337_6:278 ; 31 gi|7477397|pir||H70693_34:320 ; 32 gi|7469901|pir||S74940_64:340 ; 33 gi|3024962|sp|Q58395|Y988_METJA_9:294 ; 34 gi|6175243|gb|AAF04914.1|U67085_1_5:279 ; 35 gi|7516499|pir||D72622_14:273 ; 36 gi|7518694|pir||B71167_18:302 ; 37 gi|7518105|pir||C75065_18:303 ; 38 gi|7494008|pir||T02817_25:296 ; 39 gi|7519590|pir||E71279_37:297 ; 40 gi|406933|gb|AAD12529.1|_4:110 10 20 30 40 | | | | 1 MSKILVFGHQNPDSDAIGSSMAYA.YLK........R........QLGVDAQ..AVALGNPNE.. 2 MSKILVFGHQNPDSDAIGSSMAYA.YLK........R........QLGVDAQ..AVALGNPNE.. 3 MSKILVFGHQNPDSDAIGSSYAFA.YLA........Re.......AYGLDTE..AVALGEPNE.. 4 FDNFSLYVHVNPDFDAFGAAFAFK.AFL........A........VYFPHKK..AYVMGSHN-.. 5 MEKILIFGHQNPDTDTICSAIAYA.DLK........N........KLGFNAE..PVRLGQVNG.. 6 FDKFSLFVHVNPDFDAFGSAFAFK.TFL........N........TFFSEKK..AYVMGSYN-.. 7 HDKIVIFHHIRPDGDCLGAQHGLA.RLI........Q........TNFPHKQ..VFCVGDPKH.. 8 ---VYVVGHKNPDTDSVCSAIAFA.YLW........Nkwk12kmmKIEAEAK..PVIQGDVNP.. 9 FDKIVIFHHVRPDGDCLGAQQGLF.HLI........K........ANFKNKE..VKCVGNNNN.. 10 HDRILVVGHIMPDGDCVSSVLSLT.LGL........E........KLGKEVK..AAVDYKIPY.. 11 ----AVFGHLNPDTDAISAAMVYA.RLL........T........RQGTEAQ..AYRLGEPNF.. 12 ----YVVGHKNPDTDSIASAIVLA.YFL........-........----DCY..PARLGDINP.. 13 LERVYVIGHKNPDTDSVCSAIGYA.HFK........Nnv......EKGKTFI..PARSGDLTN.. 14 FNKITIFVHTNPDCDALGSAFALA.RIL........Kln......TFGTRVK..IVGVNALNP.. 15 TDTLLIVMHDNPDPDCMASASALA.VIA........Q........SIGLKTQ..IVYGGDITHh. 16 TDSLLIVMHDNPDPDSMASASALA.VIA........Q........SVGLRPQ..IVYGGDITHh. 17 -GSILILTHENPDGDSLGSGLALY.KFL........K........KKGKEVY..IGSKDGVPH.. 18 -APLLILTHINPDPDAIASAMALK.TLA........E........RWGVDSD..IAYGGNIGYd. 19 YDTIILHRHVRPDPDAYGSQCGLT.EIL........R........ETYPEKN..IFAVGTPEP.. 20 YNNFVIIGHKDPDFDCIGSSLALS.SFL........S........RIGKNSI..LLNEGPFIRk. 21 -KRLGIFTHDNPDPDSMSSAYALR.EIA........K........QFDVIAD..ILYYGEILHq. 22 ----ICVGNESADMDSIASAITYS.YCQyiy9gtysE........EKKKGSF..IVPIIDIPRed 23 ----ICVGNESADMDSIASAITYS.YCQyiy9gtysE........EKKKGSF..IVPIIDIPRed 24 HRAFAVVGHEKPDGDCVGSSLALA.SFL........R........RIGKEVE..LLSAGPFKRr. 25 ----VITTHRSPDFDAFASCVAAK.KLL........D........----DHI..IVLPSNPAR.. 26 ----FVSGNESADLDSCASSIVYA.YCL........Q........RKQLGRI..VVPFFNIPRke 27 PGPIVVLSHENPDGDALGSVLGLS.RAL........R........TLGKTV-..-LAPMTVPH.. 28 ----LILCHQTADFDVLGAAVGLA.KLH........P........----GSR..IVLTGGSHP.. 29 -SNVIIMGHKFPDMDSIGAAIGIL.KVA........Q........ANNKDGF..IVIDPNQIGss 30 ----VVILSEGADLDSLSAAYGVL.KLY........P........D------..-AYLLKPKH.. 31 -ARVGVVCHVHPDADTIGAGLALA.LVL........D........GCGKRVEvsFAAPATLPE.. 32 GQRLILVIQDFPDPDALSSAWAFQ.LIA........A........QYEIQCD..IVYAGTLSHq. 33 RDEVLFLCHHNADPDAVGSCVALK.YLA........S........QLNPNGK..FRISADSVSk. 34 -------GNEACDLDSTVSALALAfYLA........K........TTEAEEV..FVPVLNIKRse 35 -EKSAVITHRNADPDAVGAALVVR.EVL........R........ALRMNPC..LYSPEGISR.. 36 -EGIILLCHHNADPDSLGSAIAFS.NFL........L........SKGFSR-..-IRIGVAQS.. 37 -EGIILLCHHNADPDSLGSAIAFS.NFL........L........DRGLRN-..-VRIGVAQS.. 38 ----VVQGNEGGDMDSIVGCIYLA.MLF........D........KQPKFGFenPVPALNFPQ.. 39 HGSFLLLGHEHPDEDCIASLVAFA.LLL........T........RCNKRVE..ICCQGPIRV.. 40 ------------------------.---........-........-------..---------.. 50 60 70 | | | 1 .....ETAFVLDYFGIQA....P........PVVK.SAQAE.......G..AKQ........VIL 2 .....ETAFVLDYFGIQA....P........PVVK.SAQAE.......G..AKQ........VIL 3 .....ETAFVLDYFGVAA....P........RVIT.SAKAE.......G..AEQ........VIL 4 .....IKADGKDLFPFEA....A........PI--.DDAFV.......K..NSL........AII 5 .....ETQYALDYFKQES....P........RLV-.ETAAN.......E..VNG........VIL 6 .....INADGRELFPFEQ....T........DI--.NDDFV.......K..ESL........AII 7 .....NFPWLEMVFT---....P........KEQI.TPELM.......Q..QAL........AVI 8 .....ETKYVLEKFGFEV....P........EIMT.NGE--.......-..GKK........VAL 9 .....LFSFINMTFTNQI....-........----.DESFL.......K..EAL........AIV 10 .....VFEKFPYIDKIEE....N........PNF-.-----.......D..PEL........LVV 11 .....ETAYVLRELGLEA....P........PL--.LTELP.......A..GSK........VAL 12 .....ETEFVLRKFGVME....P........ELIE.SA---.......K..GKE........IIL 13 .....ESLFVLKYFGMNP....Plh229rlcGVIT.RTDLL.......KdvRKK........VIL 14 .....N--DFKNFFTFDP....N........EV--.DDEFI.......K..DSI........AFI 15 .....QNRAMVNVLGMEF....R........KVSRgSYEIK.......R..HKA........IAI 16 .....QNRAMVNVLGMEF....K........KVLRgSYEIK.......R..YDA........IAI 17 .....FLDFLPGVED---....-........--VI.NPDGK.......F..YDV........GIV 18 .....ENKAMINLLGIKL....L........NVE-.DIDLD.......N..YCV........IAV 19 .....SLSFLYSLDE---....-........---V.DNETY.......E..GAL........VIV 20 .....EIVPFKDKFLSEW....P........NIEI.SE---.......-..-YS........VII 21 .....KNRAMVNLLEIPM....I........RAP-.EVDLS.......H..YDA........FAI 22 lslrrDVMYVLEKLKIKE....E........ELFF.IEDLK.......S..LKQnvs9telnSYL 23 lslrrDVMYVLEKLKIKE....E........ELFF.IEDLK.......S..LKQnvs9telnSYL 24 .....EIAAYATLFR---....P........SLSA.QIRPS.......D..QTA........VIV 25 .....NLSDFLKVYS-DR....F........EFVW.DHEFE.......Ge.ITE........LVI 26 lrlrpELSYLLNLASISS....D........DIVF.LDDIV.......K..LPKrifsnp..IYL 27 .....YLSFLPQPGELTA....P........----.LESWP.......Q..GAL........AAV 28 .....TVRQFLALHRNEF....P........LIELrSVNPD.......K..IRS........LYI 29 vq...RLIGEIKKYEELW....S........RFIT.PEEAMeisn...D..DTL........LVI 30 .....LSKKAGEVFKKYR....D........KFRV.IEDLP.......D..CFE........LVL 31 .....SLRSLPGCHLLVR....P........EVMR.RD---.......-..VDL........VVT 32 .....ENIALVKLTGLPAkrwgP........QMLK.DIDLS.......D..YQG........CVL 33 .....LSRNILNEIG-ER....V........DIEI.YPKL-.......-..PET........VFI 34 lplrgDIVFFLQKVHIPE....S........ILIF.RDEIDlhalyqaG..QLT........LIL 35 .....QSKRLLEAVGEEF....G........PLCS.PDE--.......N..PLI........AVV 36 .....IASYSRRLVALSR....V........PIEK.DP-VI.......K..ENV........IFI 37 .....IASYSRRLLKFSR....V........PIER.NPKI-.......S..ERV........VFI 38 .....EDFGLRNDVTNLF....K........ELGI.DASLL.......M..SVQrgq17nasVVL 39 .....QISFLIDICLYNG....I........AVHL.DTQTVpr.....M..PDA........LVI 40 .....-------------....-........----.-----.......-..---........--- 80 90 100 110 120 | | | | | 1 TDHNEFQQS.......IADIREV.EVVEVVDHHRVANFET....ANPL.YMRLEPV.GSASSIVY 2 TDHNEFQQS.......IADIREV.EVVEVVDHHRVANFET....ANPL.YMRLEPV.GSASSIVY 3 TDHNEFQQS.......VADIAEV.EVYGVVDHHRVANFET....ANPL.YMRLEPV.GSASSIVY 4 FDTSNQERV.......LTQKHKLaKETVRIDHHPKTESF-....-ADL.EWIDPAF.SAAAEMVG 5 VDHNERQQS.......IKDIEEV.QVLEVIDHHRIANFET....AEPL.YYRAEPV.GCTATILN 6 FDTSNQERV.......LTQKHKLaKETVRIDHHPRTEKF-....-ADM.EWIDSSF.SATAEMIG 7 VDANYKERIecrd...LLDQNQF.KAVLRIDHHPNEDDL-....NTTH.NFVDASY.IAAAEQVV 8 VDHSEKAQT.......VDGIDKA.EVVAIVDHHKIGDVTT....PQPI.LFVNLPV.GCTATVIK 9 VDANYKNRIelre...LLDKNLF.KAVLRIDHHPNEDDL-....NTSF.NFVEESY.VACCEQIV 10 VDASSPDRIgk.....FQDLLDK.VPSVVIDHHSTNTNF-....-GNW.NWVDPSF.AATAQMIF 11 VDHNESAQS.......LPALGEL.DVTRVVDHHKLGDLTT....INPP.YLRFEPV.GCTGTILL 12 VDHSEKSQS.......FDDLEEG.KLIAIIDHHKVGLTT-....TEPI.LYYAKPV.GSTATVIA 13 VDHNEITQA.......PEGVEKA.EILEIIDHHRLGGLST....LNPV.FFYNEPV.GSTSTIVA 14 VDTANQERV.......LSQKHRLaKKTILVDHHVKTITY-....-TDL.EYINDQS.IAACEMIA 15 VDAQPNGNIt......ILDEEDLnKIEIIIDHHQILQNLReklpPNCF.VDIRTDV.NATSSIMA 16 VDAQPNGNIt......ILDDDDLkRVEIIVDHHQILQNLReklsPNCF.VDIRTDV.NATSSIMV 17 VDASGFYRVg......--KEVKV.GKRIRIDHHVGGEFY-....-GMH.DYIDPTA.PATAALVY 18 IDTSTSKQL.......PIELPN-.-IDIIIDHHNNTDLT-....-AKY.MDVRPEV.GATASILT 19 CDTANQERId......DQRYPSG.AKLMKIDHHPNEDPY-....-GDL.LWVDTSA.SSVSEMIY 20 LDCSILDRIgde....FIFYVKN.MPTLVIDHHMSGEKL-....-ECE.GYIDPFA.PSTTFLIE 21 VDSSGPGVN.......NSIPPDI.DISIVIDHHPAEKVV-....-AEF.VDLREDV.GATATILT 22 VDNNDTPKN.......LKNYID-.NVVGIIDHHFDLQKHL....DAEP.-RIVKVS.GSCSSLVF 23 VDNNDTPKN.......LKNYID-.NVVGIIDHHFDLQKHL....DAEP.-RIVKVS.GSCSSLVF 24 VDCSELSRVgae....LASQLAP.FARAFIDHHETCGDH-....-CAH.SFVVKTA.PSTTTLVQ 25 VDAPSLDRIpes....IRKKAQG.AKITVYDHHVDE----....-SPY.DGIVSKV.GATITILV 26 VDHNSLDRK.......DLENFNG.SIAGIIDHHKDEGGSL....HADP.-RIIEEC.GSCCTLVC 27 LDVDNNDPVrva....GADLTQFdGPVVNVDHHGTNLRR-....-ADA.GVVDPSK.PAAAMMVA 28 VDNQQGDRLgkaad..WLTLPHL.RQVAIYDHHLNSPRDI....EADI.-WELEAV.GASTTLIV 29 VDTHKPSLVme.....ERLVNKI.EHIVVIDHHRRGEEFI....RDPL.LVYMEPYaSSTAELVT 30 VDTHFLPEG.......LPRERI-.KRIIVYDHHPIGDV--....-KEF.EGKIEKV.GAATTLVV 31 VDIPSVDRLga.....LGDLTDSgRELLVIDHHASNDLF-....-GTA.NFIDPSA.DSTTTMVA 32 VDSQGTNSQlmp....LVQEANI.PVVVIIDHHSPQDEVEa...EGAF.VDIRPSS.RSTATILT 33 VDTASINQLkv.....NFDELKE.REVILIDHHKKTDLAD....ICKY.YIIKEDY.PSTSEIIA 34 VDHHILSKS.......DTALEE-.AVAEVLDHRPIEPKHC....PPCH.-VSVELV.GSCATLVT 35 VDAANLSQIaga....ESLFRAA.RLKVVIDHHERGSIHE....EADV.ALVDPGA.GSSSELAV 36 FDTSSIEQL.......EPIEVPP.AKIVVIDHHVEKENPI....PADV.AIIDPTR.TSTAEIVW 37 FDTSSLEQLe......PIKIPPN.AKLIVIDHHVEKENPI....PADI.SVIDPKR.TSTAEIVW 38 YDHNKLREN.......QSDLAS-.RVVGVVDHHFDEQQYLk...TASK.LRVLRTV.GSACTLVT 39 LDTPNPGMIyappscrLLLSDST.IRKIELDHHLFANAAC....CGDPgYSLIARA.SSTCEIIA 40 ---------.......-------.-------------KF-....-ADM.EWIDSSF.SATAEMIG 130 140 150 160 170 | | | | | 1 RLY...KENG.VAI........PKEIAGVMLSGLISDTLLL..KS..PTTHASDPAVAEDLAKIA 2 RLY...KENG.VAI........PKEIAGVMLSGLISDTLLL..KS..PTTHASDPAVAEDLAKIA 3 RMF...KEHS.VAV........SKEIAGLMLSGLISDTLLL..KS..PTTHPTDKAIAPELAELA 4 YLI...LQMG.YEL........NAEMAAYIYAGIITDTQRF..SS..SATTPQTFALTAKLLE-T 5 KMY...KENN.VKI........EKEIAGLMLSAIISDSLLF..KS..PTCTDQDVAAAKELAEIA 6 YLI...LQMG.YKL........NDEIASYLYAGIITDTQRF..WG..PTTTPQTFALTAKLME-T 7 DLA...VQAK.WKL........SPPAATALYLGIYTDSNRF..LY..SNTSWRTLYLGSMLYR-A 8 LLF...DKTG.VEI........PKDIAGILLSSILSDTVIF..KS..ATTTELDKEVAEELAKIA 9 EMA...TVAK.WTI........PPVAATLLYIGIYTDSNRF..LY..SNTSYRTLYLAAILYK-A 10 RIN...KALG.VEY........DSNLATLNYLGIATDTGFF..RH..SNADVRVFEDAYKLVK-M 11 KLH...REAG.LSV........EPQDAKLMLSAILSDTLHF..RS..PTTTQDDRDAVAFLAPVA 12 ELY...FKDA.IDLiggkkkelKPDLAGLLLSAIISDTVLF..KS..PTTTDLDKEMAKKLAEIA 13 EFF...LKNG.VKM........EREIAGILLSGIVSDTLFF..KL..STTTEKDRKMANFLADVA 14 YSL...MHTN.LNF........DLKTLNYLLLGITTDSNRL..MY..DKVSDFTYEIMSWFFK-N 15 EYL...KALE.IPI........TETLATALFYGMYIDTKKF..SK..--LSRVDINAIEFLTGKV 16 EYL...KALE.IPI........TDTLATALFYGMYIDTKKF..SK..--LSRVDISAIEFLTGKV 17 EII...KNWDeTAI........DKDIATCIYTGLATDTGFF..RY..SNTNEKTFELAKELVS-Y 18 QYL...MELD.IEP........SRNLATALFYGIQSDTDYF..KR..-ETSKLDFEAAAYLQSYI 19 ELYlegKEHG.WKL........NTKAAELIYAGIVGDTGRF..LF..PNTTEKTLKYAGELIQ-Y 20 KLI...REFG.HDL........TKEEAWYILVGFCTDTGFF..KFi.SRSDPEPFEMVARLVS-K 21 EYI...KELK.ITP........SKILATALFFGIKSETDEF..KR..-NTRTADFLACAFLYPFV 22 NYWyekLQGD.REV........VMNIALLLMGAILIDTSNM..RR..-KVEESDKLAIERCQAVL 23 NYWyekLQGD.REV........VMNIAPLLMGAILIDTSNM..RR..-KVEESDKLAIERCQAVL 24 TLI...ETMA.GSL........EAAEARALFLGLATDTGFF..RHl.DEHSADTFASAARLVR-A 25 ELI...REKN.IPL........DPTEATLFMIALYDDTGNL..LF..SSTTPRDLEIAKFLLE-N 26 RYF...MPVI.RSLyds11hqtATNLAVLALGPILIDTGNL..KN..EKTTDTDVKIVNDLCSFV 27 DVI...DALG.APW........SEAVATPLMLGLNTDTGNF..AF..DSVSAETFECAARLRA-H 28 EKL...QRAD.ISL........SMVEASVMALGIHVDTGSL..TF..TQTTVRDVKALAWLME-Q 29 ELLey.QPKR.LKI........NMIEATALLAGIIVDTKSF..SL..-RTGSRTFDAASYLRA-K 30 EEI...KEKG.IDI........NPRDATLLAFGIYEDTGNF..TY..EGTTPRDALALAFLLE-K 31 EIL...DAWG.KPI........DPRVAHCIYAGLATDTGSF..RW..--ASVRGYRLAARLVE-I 32 EYL...QGGM.LDFnssnpt..HVKCATALMHGLRSDTINL..LQ..--AQEAEFMAAAYLSRIY 33 EIF...KELN.IFP........PKNVRIALLCGIVYDTKHL..KL..--ANSKTFELISYLIK-- 34 ERI...LQGA.PEIl.......DRQTAALLHGTIILDCVNMdlKI..GKATPKDSKYVEKLEALF 35 KTA...VEAG.TPL........RPSVATAALGGIVYDTGRF..LR..--ASKLSFEAAAHLLS-M 36 ELF...KKFN.YS-........DENSAKALLAGIISDTSNF..RY..--ANAKTFKAVYEILELY 37 ELF...KKLG.YK-........DEDSAKVLLAAIISDTSSF..RY..--ANAKTFKTVSEILELY 38 ELY...RECG.E--........DVVCPTLLTAPIVLDTVNF..EPaqKKVTPEDIAAYEWLRA-K 39 YLC...YKLA.RNHaqr12nlySRNVVLSILTGMIGDAKTG..AY..-LISRKDRALYTYFTQRL 40 YLI...LQMG.YKL........NDEIASYLYAGIITDTQRF..WG..PTTTPQTFALTAKLME-T 180 190 200 210 | | | | 1 GV........DL...QEYGLAML........KAGTNLA.SKTAAQLVDIDAKTFELNGSQ....V 2 GV........DL...QEYGLAML........KAGTNLA.SKTAAQLVDIDAKTFELNGSQ....V 3 GV........NL...EEYGLAML........KAGTNLA.SKSAEELIDIDAKTFELNGNN....V 4 GF........NR...NKVHDAVY........-------.---LKPLLEHKYFSYVLNKAK....I 5 GV........DA...EEYGLNML........KAGADLS.KKTVEELISLDAKEFTLGSKK....V 6 GF........NR...NKVHDAVY........-------.---LKPLLEHKYFSYVLSKAK....I 7 QA........NI...AKIHDELN........-------.---HTSLKDIQFKQYVFKNFQ....T 8 GId.......DL...TKFGVEIK........AKLSAVD.DLTAMDIIKRDYKDFDMSGKK....V 9 KA........DI...RIVHDHLN........-------.---HTSLADLKFKKYVYNHFK....T 10 GA........DA...HFVAKEIL........ENKRFEQ.FKLFAEVLERLQL---LENGK....I 11 GVn.......DV...EAYALAMF........AAKSDLG.NTPAETLLRMDYKVFPFGDPVqpqnW 12 GIs.......NI...EEFGMEIL........KAKSVVG.KLKPEEIINMDFKNFDFNGKK....V 13 KL........DL...EKFAKKLL........KEGMKIPeDVDPAELLKRDVKVYEMGEES....F 14 NV........KH...YQIYQQLY........ERNLNDI.-----------LFDNELVKTI....K 15 NY........EL...------LD........KIEFPDI.STETAEILARAILNRKIYKNV....I 16 NY........EL...------LD........KIEFPDI.STETAEILARAILNRKMYKNV....I 17 GA........DP...YYVYTMVM........EREKVNK.MKLIAKVLETLQL-------H....E 18 DA........SI...------LN........MIENPEI.STEVMEVLAKAVMNRRVVKGN....I 19 PF........SS...SELFNQLY........ETKLNVV.----------KLNGFIFQNVS....L 20 GI........SL...KEVYSYIE........-TTKSLK.SIETLKLMLNSLESYWNGKVL....F 21 DQ........DL...------IE........KIESPSI.STETLDILGTAIKNRQVYSSF....L 22 SGavn11gleDS...SEFYKEIK........SRKNDIK.GFSVSDILKKDYKQFNFQGKG....H 23 SGavn11gleDS...SEFYKEIK........SRKNDIK.GFSVSDILKKDYKQFNFQGKG....H 24 GA........NP...KDTFLAMN........-------.--GGRSLASRMLIARVLSRLT....P 25 GA........NL...DEVALYTR........EELTPRQ.MELLDDLIE-NARDYEVNGVP....I 26 PK........DWvr.DEFFDTLK........EKKKSCK.GFSFDDLLRRDLKQYFPDGIV....V 27 GA........RI...GWLNDQMR........QNPQSYY.LLLREVLGKLEFL--------....H 28 GA........NLrliAEYADPGF........PPPLQFL.FAEAMQNLHKEMVRGYWLGSV....L 29 GA........DT...VLVQKFLK........ETVDSYI.-----------KRAKLIQHTV....L 30 GA........NL...REIREVVM........ETYTPEQ.IEAVGKIVQ-SIEKVFINGRQ....I 31 GV........DN...ATVSRTLM........----DSH.PFTWLPLLSRVLGSAQLVSEA....V 32 DA........QL...------LN........AVLQSAR.SKRVMEVIERSLQNRVVQNNF....S 33 DI........SF...QKILYLLSqesdvs..KRTAHLK.ACSRMEIREFDKLRIALSHVS....S 34 PDlp......KR...NDIFDSLQ........KVKFDVS.GLTTEQMLRKDQKTIYRQGVK....V 35 GA........DY...GKVLEAGR........QRGRRDR.GDLSLRLAKLKAFSRLKIGRA....C 36 DF........SI...PEVSQLVApvs10dqsRRIAILK.ACQRMEIHKVKKFVIVTSKVS....A 37 DF........SI...SEVSQLVA........PV-SDEN.VEQSKRIAVLKACQRMEIHKV....R 38 EVadsa....DA...AALFEKLS........KWKDDVL.ALSVPQILRRDYKQFSFKART....Q 39 NTmlre....KT...KPITGNIA........STKQILH.TLESLSAEEHAVYQTMVKNVQ....H 40 GF........NR...NKVHDAVY........-------.---LKPLLEHKYFSYVLSKAK....I 220 230 240 250 260 | | | | | 1 RVAQVNTVDI....NEVLERQNE...IEEAIKASQAANGYSDFVLMI....TDIL..NS...... 2 RVAQVNTVDI....NEVLERQNE...IEEAIKASQAANGYSDFVLMI....TDIL..NS...... 3 RVAQVNTVDI....AEVLERQAE...IEAAIEKAIADNGYSDFVLMI....TDII..NS...... 4 TPNGLAYALL....KKGTYKQFG...VVSPLPMVHALNNIKGVKIWT....TCYF..NE...... 5 EIAQVNTVDI....EDVKKRQAE...LEAVISKVVAEKNLDLFLLVI....TDIL..EN...... 6 TKNGLAYALI....KKGAYKHFG...VVSPLPMVHALNNIKGVKIWT....TVYF..NEsi.... 7 FQNVIYFVAD....KKFQKKLKV...TPLECARVNILANIEQFHIWL....FFIE..EG...... 8 GVGQIELVDL....SLIESRIDE...IYEAMKKMKEEGGYAGIFLML....TDIM..KE...... 9 QGQVIYFICT....KKIQKRLRM...TADQCARVNLLSNIADYKIWL....FFIE..QA...... 10 AYSYI---DY....DTYLRHNCT...DEDSAGFVGELRSIRGVEVAV....LFME..FP...... 11 GIGVIETTNP....AYVFGRQQE...LLAAMDQVKAEDTLSGMLLSV....VDIL..NE...... 12 GIGQVEVIDV....SEVESKKED...IYKLLEEKLKNEGYDLIVFLI....TDIM..KE...... 13 AVSQIMTSDF....STLLKEKER...FMNTLKTLKGEFGVKHFFVLF....TNPV..EE...... 14 TQGQVAYLNI....DPQWNQKYH...FTRWGDKVYLLSNIKNYPIWF....IVYF..DEnt.... 15 ISNVG-----....--FIANRDA...IAEAADFLLRLEGITTVLVFG....IVDD..RIeisa.. 16 ISNVG-----....--FIANRDA...IAEAADFLLRLEGITTVLVFG....IVDD..RIeisa.. 17 DGKVAGITVF....KKFLDETGTt..YEDTEGLVNYPRSIEGVKVAY....ALIE..KP...... 18 ALAYV-----....-GEISNRDA...LPKAADFLLKMEGISTTFVFG....IVGD..EI...... 19 SENGAASVFI....KKDTLEKFGtt.ASEASQLVGTLGNISGIRAWV....FFVE..ED...... 20 TFLSSSSSGK....DGGVSGVNE...L-----FYMILSNVENNEILG....ILKE..ME...... 21 ISFAG-----....--FINDRDA...LPQAADFLLKLEGISTVVVFGvikdTVYV..SA...... 22 KGLEIGFSSIvkrmSWLFNEHGGeadFVNQCRRFQAERGLDVLVLLT....SWRKagDS...... 23 KGLEIGLSSIvkrmSWLFNEHGGeadFVNQCRRFQAERGLDVLVLLT....SWRKagDS...... 24 YYGGALMTSY....ETCEDAVQLgldVRDSDALYQLIQSIQGVEAIV....VVRQ..ES...... 25 TISMIECEDFvgglGLIVSKAWE...MMG-----------KETFIAI....VKMG..KK...... 26 NYASV-GKGL....DWIKKKRLG...WEDELKSFAEVQNSDLVIVGL....SLSK..NDefg18a 27 GGRVVQTRVD....EEMLARAGAt..WEQVENYVSMLRNAEGAQLAV....MAKD..YG...... 28 LTTENFVPGL....SHLTERLLS...LTEC-----------DALLLG....HVYD..KGkdk20n 29 YKDNIAIASL....PENEEEYFD...QVLIAQAADSLLSMSEVEASF....AVAR..RD...... 30 SFATA-----....--VLERYQP...DINTLLYEIKDLKESDAFFVI....IEAE..GK...... 31 GGRGLVYVVVdnr.EWVAARSEE...VESIVDIVRTTQQAE--VAAV....FKEV..EP...... 32 LAGVGYL---....--RYEERDA...IPQAADFLVSEENVHTALVYG....IVHD..RA...... 33 HEASCAKTIV....SIG------...--------------ADVAFVV....AVRK..KE...... 34 AISAI-YMDL....EAFLQRSNL...LADLHAFCQAHS--YDVLVAM....TIFF..NT...... 35 SELLIAVTHI....GSFESDVAKs..LVDE---------AADVAVAV....AERT..SE...... 36 YEALA-----....---------...-----CKVFLQLGAD-VAIVG....S---..EK...... 37 KFII------....--VTSKVSA...YEALACKVFLQLGAD-VAIVG....S---..EK...... 38 KGV-------....--MSAGTSS...VPCACKQLEAHFSVDLIVAEA....AKYV..EQ...... 39 RGSISLLLLN....QTQTQALHE...CC-------------------....----..--...... 40 TKNGLAYA--....---------...---------------------....----..--...... 270 280 290 300 | | | | 1 ..NSEILA...LGNNTDKV.E........AAFNFTL......KNNHAFLA..GAVSRKKQVVPQL 2 ..NSEILA...LGNNTDKV.E........AAFNFTL......KNNHAFLA..GAVSRKKQVVPQL 3 ..NSEILA...IGSNMDKV.E........AAFNFVL......ENNHAFLA..GAVSRKKQVVPQL 4 ..DIKKWIgs.IRSRSIPI.N........NFAQMFG......GGGHKYAA..AFVLDDKRQFMKL 5 ..DSLALA...IGNEAAKV.E........KAFNVTL......ENNTALLK..GVVSRKKQVVPVL 6 ..KKWIGS...IRSRNIPI.N........NFAQMFN......GGGHKYAA..AFVLDEKNQFMKL 7 ..KNHYRVe..FRSNGINV.R........EVALKYG......GGGHIQAS..GAVLKSKRDI--- 8 ..GTELLV...VTDYPEVV.E........KAFGKKL......EGKSVWLD..GVMSRKKQVVPPL 9 ..NNEIRId..LRSNGINV.R........DIAIKYG......GGGHNNAS..GAIITNKKQ---- 10 ..RGKIHVsm.RSKDWFNV.N........EVAFELG......GGGHPRAA..GVTFEGKKIEEVI 11 ..TNRTLV...LGATEAKVlR........EAFGAEA......EGQVADLG..NRISRKKQIVPTL 12 ..GSEALV...VGN-KEMF.E........KAFNVKV......EGNSVFLE..GVMSRKKQVVPPL 13 ..AS-LLM...MDGDQKLV.E........KAFNAEK......KDGLFLLK..GVMSRKKDFVPKI 14 ..NTYKVS...LRSNKYKV.R........LVASQFN......GGGHDLAA..GCSLTNIDQLENL 15 ..RTRDVR...VNIGNVMK.E........AFGEIGS......GGGHPQAG..GA----------- 16 ..RTRDVR...VNIGKVMK.E........AFGEIGS......GGGHAQMG..GA----------- 17 ..EEGVWKvslRAKGNVNV.G........KIAERLG......GGGHKYAS..GAKIKTNSYEEAL 18 ..HISART...KDLRLNLG.E........ILNKAFG......GGGHQTAA..AA----------- 19 ..DQIRVR...FRSKGPVI.N........GLARKYN......GGGHPLAS..GAS---------- 20 ..DGSIIVgl.RSKDSFDV.G........KLAEDFG......GGGHKNAS..GFRIK-------- 21 ..RNKDVR...IHMGEVLR.R........AFGDVGS......AGGHAHAA..GA----------- 22 ..HRELVI...LGDSNVVR.Elie11lqlQLFGGNL......DGGVAMFKqlNVEATRKQVVPYL 23 ..HRELVI...LGDSNVVR.Elie11lqlQLFGGNL......DGGVAMFKqlNVEATRKQVVPYL 24 ..PTHCSVgf.RSRGSIDV.S........VIAARFG......GGGHRCAA..GLRIEG------- 25 ..IYVIGR...TSSPDVDL.G........SLMKDLG......GGGHTRAA..SATITGKEIDEVL 26 glADSFLK...LSKQNLGL.E........IIEEKDN......GDLSMWNQr.NSAASRKKVVPLL 27 ..DRVKFS...LRSRGPVSaQ........NIAVALG......GGGHVPAA..GA----------- 28 qrFSLIGR...TRIPDTDL.T........QLLEPYG......GGGHAQAA..AVNLRDVEPTTVM 29 ..EQTVCIsa.RSLGEVNV.Q........IIMEALE......GGGHLTNA..ATQLSGISVSEAL 30 ..TYVFGR...SQSEDVDV.G........EILSHFG......GGGHREAG..AVKLEN------- 31 ..HRWSVS...MRAKTVNL.A........AVASGFG......GGGHRLAA..GYTT--------- 32 ..NDIELVigsLRTNK---.-........-------......--------..------------- 33 ..KEIRVS...ARCRKHVS.Kyvhlg...NLMEKIGkelggsGGGHSEAG..GL----------- 34 ..HNEPVR...--------.-........-------......--------..------------- 35 ..FRL--S...VRVSPL--.-........-------......--------..------------- 36 ..DGVRIS...ARAKDYLV.Kqglhlg..KIMEKVGpiikgsGGGHAGAA..GA----------- 37 ..DGVRIS...ARAKDYLV.Kkglhlg..KLMEKVGpiiggsGGGHPGAA..GA----------- 38 ..HQ----...--------.-........-------......--------..------------- 39 ..------...--------.-........-------......--------..------------- 40 ..------...--------.-........-------......--------..------------- 310 | 1 TESFNG 2 TESFN- 3 TESFN- 4 VEIMDD 5 TDAM-- 6 VQIMDD 7 ------ 8 EKAF-- 9 ------ 10 PRVINH 11 EKYF-- 12 ERAYN- 13 GEVL-- 14 LSALD- 15 ------ 16 ------ 17 KKLL-- 18 ------ 19 ------ 20 ------ 21 ------ 22 EEAYSN 23 EEAYSN 24 ------ 25 KEVLN- 26 MD---- 27 ------ 28 AEIY-- 29 ER---- 30 ------ 31 ------ 32 ------ 33 ------ 34 ------ 35 ------ 36 ------ 37 ------ 38 ------ 39 ------ 40 ------