; SAM: prettyalign v3.1b (February 24, 1999) compiled 04/18/00_11:44:15
; (c) 1992-1999 Regents of the University of California, Santa Cruz
;
; Sequence Alignment and Modeling Software System
; http://www.cse.ucsc.edu/research/compbio/sam.html
;
; ------ Citations (HMMs, SAM) ------
; A. Krogh et al., Hidden Markov models in computational biology:
; ------------- Citations (SAM, SAM-T99, HMMs) -----------------
; R. Hughey, A. Krogh, Hidden Markov models for sequence analysis:
; Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
; K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting
; remote protein homologies, Bioinformatics 14(10):846-856, 1999.
; A. Krogh et al., Hidden Markov models in computational biology:
; Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
; --------------------------------------------------------------
; Sequence numbers correspond to the following labels:
;  1 gi|7470790|pir||S75598_3:318
;  2 gi|7471265|pir||A75258_3:310
;  3 gi|1743856|gb|AAB39104.1|_1:310
;  4 gi|7674163|sp|O68579|PPX1_STRMU_1:309
;  5 T0087
;  6 gi|586817|sp|P37487|YYBQ_BACSU_1:307
;  7 gi|2496072|sp|Q58025|Y608_METJA_3:306
;  8 gi|7447473|pir||D69344_5:319
;  9 gi|7462213|pir||D72359_23:563
; 10 gi|6175243|gb|AAF04914.1|U67085_1_5:279
; 11 gi|6321995|ref|NP_012071.1|PPX1|_31:394
; 12 gi|453695|gb|AAA65933.1|_31:394
; 13 gi|7492568|pir||T38544_32:376
; 14 gi|7494008|pir||T02817_25:296
; 15 gi|7516499|pir||D72622_14:273
; 16 gi|7462295|pir||G72233_11:309
; 17 gi|7469901|pir||S74940_64:340
; 18 gi|7448980|pir||H71452_161:450
; 19 gi|7448979|pir||B75216_161:450
; 20 gi|7448978|pir||D69503_157:436
; 21 gi|3025197|sp|Q59027|YG33_METJA_165:439
; 22 gi|7520820|pir||D70337_6:278
; 23 gi|7462978|pir||B72344_3:292
; 24 gi|2498554|sp|P75144|MGPA_MYCPN_21:311
; 25 gi|1346531|sp|P22746|MGPA_MYCGE_59:346
; 26 gi|6899406|gb|AAF30828.1|AE002139_1_16:311
; 27 gi|2496409|sp|Q49428|Y371_MYCGE_19:315
; 28 gi|406933|gb|AAD12529.1|_4:110
; 29 gi|2496410|sp|P75229|Y371_MYCPN_19:315
; 30 gi|7430136|pir||F69999_12:291
; 31 gi|7518105|pir||C75065_18:303
; 32 gi|7518694|pir||B71167_18:302
; 33 gi|3024962|sp|Q58395|Y988_METJA_9:294
; 34 gi|7445472|pir||F70440_17:311
; 35 gi|7430137|pir||B70177_21:309
; 36 gi|7514584|pir||D71323_41:332
; 37 gi|7477397|pir||H70693_34:320
; 38 gi|586814|sp|P37484|YYBT_BACSU_338:644
; 39 gi|7471305|pir||A75470_26:307
; 40 gi|7519590|pir||E71279_37:297

10 20 30 40
| | | |
 1 ----LILCHQTADFDVLGAAVGLA.KLH........P........----GSR..IVLTGGSHP..
 2 ----AVFGHLNPDTDAISAAMVYA.RLL........T........RQGTEAQ..AYRLGEPNF..
 3 MSKILVFGHQNPDSDAIGSSYAFA.YLA........Re.......AYGLDTE..AVALGEPNE..
 4 MSKILVFGHQNPDSDAIGSSMAYA.YLK........R........QLGVDAQ..AVALGNPNE..
 5 MSKILVFGHQNPDSDAIGSSMAYA.YLK........R........QLGVDAQ..AVALGNPNE..
 6 MEKILIFGHQNPDTDTICSAIAYA.DLK........N........KLGFNAE..PVRLGQVNG..
 7 ----YVVGHKNPDTDSIASAIVLA.YFL........-........----DCY..PARLGDINP..
 8 ---VYVVGHKNPDTDSVCSAIAFA.YLW........Nkwk12kmmKIEAEAK..PVIQGDVNP..
 9 LERVYVIGHKNPDTDSVCSAIGYA.HFK........Nnv......EKGKTFI..PARSGDLTN..
10 -------GNEACDLDSTVSALALAfYLA........K........TTEAEEV..FVPVLNIKRse
11 ----ICVGNESADMDSIASAITYS.YCQyiy9gtysE........EKKKGSF..IVPIIDIPRed
12 ----ICVGNESADMDSIASAITYS.YCQyiy9gtysE........EKKKGSF..IVPIIDIPRed
13 ----FVSGNESADLDSCASSIVYA.YCL........Q........RKQLGRI..VVPFFNIPRke
14 ----VVQGNEGGDMDSIVGCIYLA.MLF........D........KQPKFGFenPVPALNFPQ..
15 -EKSAVITHRNADPDAVGAALVVR.EVL........R........ALRMNPC..LYSPEGISR..
16 HDRILVVGHIMPDGDCVSSVLSLT.LGL........E........KLGKEVK..AAVDYKIPY..
17 GQRLILVIQDFPDPDALSSAWAFQ.LIA........A........QYEIQCD..IVYAGTLSHq.
18 TDTLLIVMHDNPDPDCMASASALA.VIA........Q........SIGLKTQ..IVYGGDITHh.
19 TDSLLIVMHDNPDPDSMASASALA.VIA........Q........SVGLRPQ..IVYGGDITHh.
20 -KRLGIFTHDNPDPDSMSSAYALR.EIA........K........QFDVIAD..ILYYGEILHq.
21 -APLLILTHINPDPDAIASAMALK.TLA........E........RWGVDSD..IAYGGNIGYd.
22 ----VVILSEGADLDSLSAAYGVL.KLY........P........D------..-AYLLKPKH..
23 ----VITTHRSPDFDAFASCVAAK.KLL........D........----DHI..IVLPSNPAR..
24 HDKIVIFHHIRPDGDCLGAQHGLA.RLI........Q........TNFPHKQ..VFCVGDPKH..
25 FDKIVIFHHVRPDGDCLGAQQGLF.HLI........K........ANFKNKE..VKCVGNNNN..
26 FNKITIFVHTNPDCDALGSAFALA.RIL........Kln......TFGTRVK..IVGVNALNP..
27 FDKFSLFVHVNPDFDAFGSAFAFK.TFL........N........TFFSEKK..AYVMGSYN-..
28 ------------------------.---........-........-------..---------..
29 FDNFSLYVHVNPDFDAFGAAFAFK.AFL........A........VYFPHKK..AYVMGSHN-..
30 YDTIILHRHVRPDPDAYGSQCGLT.EIL........R........ETYPEKN..IFAVGTPEP..
31 -EGIILLCHHNADPDSLGSAIAFS.NFL........L........DRGLRN-..-VRIGVAQS..
32 -EGIILLCHHNADPDSLGSAIAFS.NFL........L........SKGFSR-..-IRIGVAQS..
33 RDEVLFLCHHNADPDAVGSCVALK.YLA........S........QLNPNGK..FRISADSVSk.
34 -GSILILTHENPDGDSLGSGLALY.KFL........K........KKGKEVY..IGSKDGVPH..
35 YNNFVIIGHKDPDFDCIGSSLALS.SFL........S........RIGKNSI..LLNEGPFIRk.
36 HRAFAVVGHEKPDGDCVGSSLALA.SFL........R........RIGKEVE..LLSAGPFKRr.
37 -ARVGVVCHVHPDADTIGAGLALA.LVL........D........GCGKRVEvsFAAPATLPE..
38 -SNVIIMGHKFPDMDSIGAAIGIL.KVA........Q........ANNKDGF..IVIDPNQIGss
39 PGPIVVLSHENPDGDALGSVLGLS.RAL........R........TLGKTV-..-LAPMTVPH..
40 HGSFLLLGHEHPDEDCIASLVAFA.LLL........T........RCNKRVE..ICCQGPIRV..

50 60 70
| | |
 1 .....TVRQFLALHRNEF....P........LIELrSVNPD.......K..IRS........LYI
 2 .....ETAYVLRELGLEA....P........PL--.LTELP.......A..GSK........VAL
 3 .....ETAFVLDYFGVAA....P........RVIT.SAKAE.......G..AEQ........VIL
 4 .....ETAFVLDYFGIQA....P........PVVK.SAQAE.......G..AKQ........VIL
 5 .....ETAFVLDYFGIQA....P........PVVK.SAQAE.......G..AKQ........VIL
 6 .....ETQYALDYFKQES....P........RLV-.ETAAN.......E..VNG........VIL
 7 .....ETEFVLRKFGVME....P........ELIE.SA---.......K..GKE........IIL
 8 .....ETKYVLEKFGFEV....P........EIMT.NGE--.......-..GKK........VAL
 9 .....ESLFVLKYFGMNP....Plh229rlcGVIT.RTDLL.......KdvRKK........VIL
10 lplrgDIVFFLQKVHIPE....S........ILIF.RDEIDlhalyqaG..QLT........LIL
11 lslrrDVMYVLEKLKIKE....E........ELFF.IEDLK.......S..LKQnvs9telnSYL
12 lslrrDVMYVLEKLKIKE....E........ELFF.IEDLK.......S..LKQnvs9telnSYL
13 lrlrpELSYLLNLASISS....D........DIVF.LDDIV.......K..LPKrifsnp..IYL
14 .....EDFGLRNDVTNLF....K........ELGI.DASLL.......M..SVQrgq17nasVVL
15 .....QSKRLLEAVGEEF....G........PLCS.PDE--.......N..PLI........AVV
16 .....VFEKFPYIDKIEE....N........PNF-.-----.......D..PEL........LVV
17 .....ENIALVKLTGLPAkrwgP........QMLK.DIDLS.......D..YQG........CVL
18 .....QNRAMVNVLGMEF....R........KVSRgSYEIK.......R..HKA........IAI
19 .....QNRAMVNVLGMEF....K........KVLRgSYEIK.......R..YDA........IAI
20 .....KNRAMVNLLEIPM....I........RAP-.EVDLS.......H..YDA........FAI
21 .....ENKAMINLLGIKL....L........NVE-.DIDLD.......N..YCV........IAV
22 .....LSKKAGEVFKKYR....D........KFRV.IEDLP.......D..CFE........LVL
23 .....NLSDFLKVYS-DR....F........EFVW.DHEFE.......Ge.ITE........LVI
24 .....NFPWLEMVFT---....P........KEQI.TPELM.......Q..QAL........AVI
25 .....LFSFINMTFTNQI....-........----.DESFL.......K..EAL........AIV
26 .....N--DFKNFFTFDP....N........EV--.DDEFI.......K..DSI........AFI
27 .....INADGRELFPFEQ....T........DI--.NDDFV.......K..ESL........AII
28 .....-------------....-........----.-----.......-..---........---
29 .....IKADGKDLFPFEA....A........PI--.DDAFV.......K..NSL........AII
30 .....SLSFLYSLDE---....-........---V.DNETY.......E..GAL........VIV
31 .....IASYSRRLLKFSR....V........PIER.NPKI-.......S..ERV........VFI
32 .....IASYSRRLVALSR....V........PIEK.DP-VI.......K..ENV........IFI
33 .....LSRNILNEIG-ER....V........DIEI.YPKL-.......-..PET........VFI
34 .....FLDFLPGVED---....-........--VI.NPDGK.......F..YDV........GIV
35 .....EIVPFKDKFLSEW....P........NIEI.SE---.......-..-YS........VII
36 .....EIAAYATLFR---....P........SLSA.QIRPS.......D..QTA........VIV
37 .....SLRSLPGCHLLVR....P........EVMR.RD---.......-..VDL........VVT
38 vq...RLIGEIKKYEELW....S........RFIT.PEEAMeisn...D..DTL........LVI
39 .....YLSFLPQPGELTA....P........----.LESWP.......Q..GAL........AAV
40 .....QISFLIDICLYNG....I........AVHL.DTQTVpr.....M..PDA........LVI

80 90 100 110 120
| | | | |
 1 VDNQQGDRLgkaad..WLTLPHL.RQVAIYDHHLNSPRDI....EADI.-WELEAV.GASTTLIV
 2 VDHNESAQS.......LPALGEL.DVTRVVDHHKLGDLTT....INPP.YLRFEPV.GCTGTILL
 3 TDHNEFQQS.......VADIAEV.EVYGVVDHHRVANFET....ANPL.YMRLEPV.GSASSIVY
 4 TDHNEFQQS.......IADIREV.EVVEVVDHHRVANFET....ANPL.YMRLEPV.GSASSIVY
 5 TDHNEFQQS.......IADIREV.EVVEVVDHHRVANFET....ANPL.YMRLEPV.GSASSIVY
 6 VDHNERQQS.......IKDIEEV.QVLEVIDHHRIANFET....AEPL.YYRAEPV.GCTATILN
 7 VDHSEKSQS.......FDDLEEG.KLIAIIDHHKVGLTT-....TEPI.LYYAKPV.GSTATVIA
 8 VDHSEKAQT.......VDGIDKA.EVVAIVDHHKIGDVTT....PQPI.LFVNLPV.GCTATVIK
 9 VDHNEITQA.......PEGVEKA.EILEIIDHHRLGGLST....LNPV.FFYNEPV.GSTSTIVA
10 VDHHILSKS.......DTALEE-.AVAEVLDHRPIEPKHC....PPCH.-VSVELV.GSCATLVT
11 VDNNDTPKN.......LKNYID-.NVVGIIDHHFDLQKHL....DAEP.-RIVKVS.GSCSSLVF
12 VDNNDTPKN.......LKNYID-.NVVGIIDHHFDLQKHL....DAEP.-RIVKVS.GSCSSLVF
13 VDHNSLDRK.......DLENFNG.SIAGIIDHHKDEGGSL....HADP.-RIIEEC.GSCCTLVC
14 YDHNKLREN.......QSDLAS-.RVVGVVDHHFDEQQYLk...TASK.LRVLRTV.GSACTLVT
15 VDAANLSQIaga....ESLFRAA.RLKVVIDHHERGSIHE....EADV.ALVDPGA.GSSSELAV
16 VDASSPDRIgk.....FQDLLDK.VPSVVIDHHSTNTNF-....-GNW.NWVDPSF.AATAQMIF
17 VDSQGTNSQlmp....LVQEANI.PVVVIIDHHSPQDEVEa...EGAF.VDIRPSS.RSTATILT
18 VDAQPNGNIt......ILDEEDLnKIEIIIDHHQILQNLReklpPNCF.VDIRTDV.NATSSIMA
19 VDAQPNGNIt......ILDDDDLkRVEIIVDHHQILQNLReklsPNCF.VDIRTDV.NATSSIMV
20 VDSSGPGVN.......NSIPPDI.DISIVIDHHPAEKVV-....-AEF.VDLREDV.GATATILT
21 IDTSTSKQL.......PIELPN-.-IDIIIDHHNNTDLT-....-AKY.MDVRPEV.GATASILT
22 VDTHFLPEG.......LPRERI-.KRIIVYDHHPIGDV--....-KEF.EGKIEKV.GAATTLVV
23 VDAPSLDRIpes....IRKKAQG.AKITVYDHHVDE----....-SPY.DGIVSKV.GATITILV
24 VDANYKERIecrd...LLDQNQF.KAVLRIDHHPNEDDL-....NTTH.NFVDASY.IAAAEQVV
25 VDANYKNRIelre...LLDKNLF.KAVLRIDHHPNEDDL-....NTSF.NFVEESY.VACCEQIV
26 VDTANQERV.......LSQKHRLaKKTILVDHHVKTITY-....-TDL.EYINDQS.IAACEMIA
27 FDTSNQERV.......LTQKHKLaKETVRIDHHPRTEKF-....-ADM.EWIDSSF.SATAEMIG
28 ---------.......-------.-------------KF-....-ADM.EWIDSSF.SATAEMIG
29 FDTSNQERV.......LTQKHKLaKETVRIDHHPKTESF-....-ADL.EWIDPAF.SAAAEMVG
30 CDTANQERId......DQRYPSG.AKLMKIDHHPNEDPY-....-GDL.LWVDTSA.SSVSEMIY
31 FDTSSLEQLe......PIKIPPN.AKLIVIDHHVEKENPI....PADI.SVIDPKR.TSTAEIVW
32 FDTSSIEQL.......EPIEVPP.AKIVVIDHHVEKENPI....PADV.AIIDPTR.TSTAEIVW
33 VDTASINQLkv.....NFDELKE.REVILIDHHKKTDLAD....ICKY.YIIKEDY.PSTSEIIA
34 VDASGFYRVg......--KEVKV.GKRIRIDHHVGGEFY-....-GMH.DYIDPTA.PATAALVY
35 LDCSILDRIgde....FIFYVKN.MPTLVIDHHMSGEKL-....-ECE.GYIDPFA.PSTTFLIE
36 VDCSELSRVgae....LASQLAP.FARAFIDHHETCGDH-....-CAH.SFVVKTA.PSTTTLVQ
37 VDIPSVDRLga.....LGDLTDSgRELLVIDHHASNDLF-....-GTA.NFIDPSA.DSTTTMVA
38 VDTHKPSLVme.....ERLVNKI.EHIVVIDHHRRGEEFI....RDPL.LVYMEPYaSSTAELVT
39 LDVDNNDPVrva....GADLTQFdGPVVNVDHHGTNLRR-....-ADA.GVVDPSK.PAAAMMVA
40 LDTPNPGMIyappscrLLLSDST.IRKIELDHHLFANAAC....CGDPgYSLIARA.SSTCEIIA

130 140 150 160 170
| | | | |
 1 EKL...QRAD.ISL........SMVEASVMALGIHVDTGSL..TF..TQTTVRDVKALAWLME-Q
 2 KLH...REAG.LSV........EPQDAKLMLSAILSDTLHF..RS..PTTTQDDRDAVAFLAPVA
 3 RMF...KEHS.VAV........SKEIAGLMLSGLISDTLLL..KS..PTTHPTDKAIAPELAELA
 4 RLY...KENG.VAI........PKEIAGVMLSGLISDTLLL..KS..PTTHASDPAVAEDLAKIA
 5 RLY...KENG.VAI........PKEIAGVMLSGLISDTLLL..KS..PTTHASDPAVAEDLAKIA
 6 KMY...KENN.VKI........EKEIAGLMLSAIISDSLLF..KS..PTCTDQDVAAAKELAEIA
 7 ELY...FKDA.IDLiggkkkelKPDLAGLLLSAIISDTVLF..KS..PTTTDLDKEMAKKLAEIA
 8 LLF...DKTG.VEI........PKDIAGILLSSILSDTVIF..KS..ATTTELDKEVAEELAKIA
 9 EFF...LKNG.VKM........EREIAGILLSGIVSDTLFF..KL..STTTEKDRKMANFLADVA
10 ERI...LQGA.PEIl.......DRQTAALLHGTIILDCVNMdlKI..GKATPKDSKYVEKLEALF
11 NYWyekLQGD.REV........VMNIAPLLMGAILIDTSNM..RR..-KVEESDKLAIERCQAVL
12 NYWyekLQGD.REV........VMNIALLLMGAILIDTSNM..RR..-KVEESDKLAIERCQAVL
13 RYF...MPVI.RSLyds11hqtATNLAVLALGPILIDTGNL..KN..EKTTDTDVKIVNDLCSFV
14 ELY...RECG.E--........DVVCPTLLTAPIVLDTVNF..EPaqKKVTPEDIAAYEWLRA-K
15 KTA...VEAG.TPL........RPSVATAALGGIVYDTGRF..LR..--ASKLSFEAAAHLLS-M
16 RIN...KALG.VEY........DSNLATLNYLGIATDTGFF..RH..SNADVRVFEDAYKLVK-M
17 EYL...QGGM.LDFnssnpt..HVKCATALMHGLRSDTINL..LQ..--AQEAEFMAAAYLSRIY
18 EYL...KALE.IPI........TETLATALFYGMYIDTKKF..SK..--LSRVDINAIEFLTGKV
19 EYL...KALE.IPI........TDTLATALFYGMYIDTKKF..SK..--LSRVDISAIEFLTGKV
20 EYI...KELK.ITP........SKILATALFFGIKSETDEF..KR..-NTRTADFLACAFLYPFV
21 QYL...MELD.IEP........SRNLATALFYGIQSDTDYF..KR..-ETSKLDFEAAAYLQSYI
22 EEI...KEKG.IDI........NPRDATLLAFGIYEDTGNF..TY..EGTTPRDALALAFLLE-K
23 ELI...REKN.IPL........DPTEATLFMIALYDDTGNL..LF..SSTTPRDLEIAKFLLE-N
24 DLA...VQAK.WKL........SPPAATALYLGIYTDSNRF..LY..SNTSWRTLYLGSMLYR-A
25 EMA...TVAK.WTI........PPVAATLLYIGIYTDSNRF..LY..SNTSYRTLYLAAILYK-A
26 YSL...MHTN.LNF........DLKTLNYLLLGITTDSNRL..MY..DKVSDFTYEIMSWFFK-N
27 YLI...LQMG.YKL........NDEIASYLYAGIITDTQRF..WG..PTTTPQTFALTAKLME-T
28 YLI...LQMG.YKL........NDEIASYLYAGIITDTQRF..WG..PTTTPQTFALTAKLME-T
29 YLI...LQMG.YEL........NAEMAAYIYAGIITDTQRF..SS..SATTPQTFALTAKLLE-T
30 ELYlegKEHG.WKL........NTKAAELIYAGIVGDTGRF..LF..PNTTEKTLKYAGELIQ-Y
31 ELF...KKLG.YK-........DEDSAKVLLAAIISDTSSF..RY..--ANAKTFKTVSEILELY
32 ELF...KKFN.YS-........DENSAKALLAGIISDTSNF..RY..--ANAKTFKAVYEILELY
33 EIF...KELN.IFP........PKNVRIALLCGIVYDTKHL..KL..--ANSKTFELISYLIK--
34 EII...KNWDeTAI........DKDIATCIYTGLATDTGFF..RY..SNTNEKTFELAKELVS-Y
35 KLI...REFG.HDL........TKEEAWYILVGFCTDTGFF..KFi.SRSDPEPFEMVARLVS-K
36 TLI...ETMA.GSL........EAAEARALFLGLATDTGFF..RHl.DEHSADTFASAARLVR-A
37 EIL...DAWG.KPI........DPRVAHCIYAGLATDTGSF..RW..--ASVRGYRLAARLVE-I
38 ELLey.QPKR.LKI........NMIEATALLAGIIVDTKSF..SL..-RTGSRTFDAASYLRA-K
39 DVI...DALG.APW........SEAVATPLMLGLNTDTGNF..AF..DSVSAETFECAARLRA-H
40 YLC...YKLA.RNHaqr12nlySRNVVLSILTGMIGDAKTG..AY..-LISRKDRALYTYFTQRL

180 190 200 210
| | | |
 1 GA........NLrliAEYADPGF........PPPLQFL.FAEAMQNLHKEMVRGYWLGSV....L
 2 GVn.......DV...EAYALAMF........AAKSDLG.NTPAETLLRMDYKVFPFGDPVqpqnW
 3 GV........NL...EEYGLAML........KAGTNLA.SKSAEELIDIDAKTFELNGNN....V
 4 GV........DL...QEYGLAML........KAGTNLA.SKTAAQLVDIDAKTFELNGSQ....V
 5 GV........DL...QEYGLAML........KAGTNLA.SKTAAQLVDIDAKTFELNGSQ....V
 6 GV........DA...EEYGLNML........KAGADLS.KKTVEELISLDAKEFTLGSKK....V
 7 GIs.......NI...EEFGMEIL........KAKSVVG.KLKPEEIINMDFKNFDFNGKK....V
 8 GId.......DL...TKFGVEIK........AKLSAVD.DLTAMDIIKRDYKDFDMSGKK....V
 9 KL........DL...EKFAKKLL........KEGMKIPeDVDPAELLKRDVKVYEMGEES....F
10 PDlp......KR...NDIFDSLQ........KVKFDVS.GLTTEQMLRKDQKTIYRQGVK....V
11 SGavn11gleDS...SEFYKEIK........SRKNDIK.GFSVSDILKKDYKQFNFQGKG....H
12 SGavn11gleDS...SEFYKEIK........SRKNDIK.GFSVSDILKKDYKQFNFQGKG....H
13 PK........DWvr.DEFFDTLK........EKKKSCK.GFSFDDLLRRDLKQYFPDGIV....V
14 EVadsa....DA...AALFEKLS........KWKDDVL.ALSVPQILRRDYKQFSFKART....Q
15 GA........DY...GKVLEAGR........QRGRRDR.GDLSLRLAKLKAFSRLKIGRA....C
16 GA........DA...HFVAKEIL........ENKRFEQ.FKLFAEVLERLQL---LENGK....I
17 DA........QL...------LN........AVLQSAR.SKRVMEVIERSLQNRVVQNNF....S
18 NY........EL...------LD........KIEFPDI.STETAEILARAILNRKIYKNV....I
19 NY........EL...------LD........KIEFPDI.STETAEILARAILNRKMYKNV....I
20 DQ........DL...------IE........KIESPSI.STETLDILGTAIKNRQVYSSF....L
21 DA........SI...------LN........MIENPEI.STEVMEVLAKAVMNRRVVKGN....I
22 GA........NL...REIREVVM........ETYTPEQ.IEAVGKIVQ-SIEKVFINGRQ....I
23 GA........NL...DEVALYTR........EELTPRQ.MELLDDLIE-NARDYEVNGVP....I
24 QA........NI...AKIHDELN........-------.---HTSLKDIQFKQYVFKNFQ....T
25 KA........DI...RIVHDHLN........-------.---HTSLADLKFKKYVYNHFK....T
26 NV........KH...YQIYQQLY........ERNLNDI.-----------LFDNELVKTI....K
27 GF........NR...NKVHDAVY........-------.---LKPLLEHKYFSYVLSKAK....I
28 GF........NR...NKVHDAVY........-------.---LKPLLEHKYFSYVLSKAK....I
29 GF........NR...NKVHDAVY........-------.---LKPLLEHKYFSYVLNKAK....I
30 PF........SS...SELFNQLY........ETKLNVV.----------KLNGFIFQNVS....L
31 DF........SI...SEVSQLVA........PV-SDEN.VEQSKRIAVLKACQRMEIHKV....R
32 DF........SI...PEVSQLVApvs10dqsRRIAILK.ACQRMEIHKVKKFVIVTSKVS....A
33 DI........SF...QKILYLLSqesdvs..KRTAHLK.ACSRMEIREFDKLRIALSHVS....S
34 GA........DP...YYVYTMVM........EREKVNK.MKLIAKVLETLQL-------H....E
35 GI........SL...KEVYSYIE........-TTKSLK.SIETLKLMLNSLESYWNGKVL....F
36 GA........NP...KDTFLAMN........-------.--GGRSLASRMLIARVLSRLT....P
37 GV........DN...ATVSRTLM........----DSH.PFTWLPLLSRVLGSAQLVSEA....V
38 GA........DT...VLVQKFLK........ETVDSYI.-----------KRAKLIQHTV....L
39 GA........RI...GWLNDQMR........QNPQSYY.LLLREVLGKLEFL--------....H
40 NTmlre....KT...KPITGNIA........STKQILH.TLESLSAEEHAVYQTMVKNVQ....H

220 230 240 250 260
| | | | |
 1 LTTENFVPGL....SHLTERLLS...LTEC-----------DALLLG....HVYD..KGkdk20n
 2 GIGVIETTNP....AYVFGRQQE...LLAAMDQVKAEDTLSGMLLSV....VDIL..NE......
 3 RVAQVNTVDI....AEVLERQAE...IEAAIEKAIADNGYSDFVLMI....TDII..NS......
 4 RVAQVNTVDI....NEVLERQNE...IEEAIKASQAANGYSDFVLMI....TDIL..NS......
 5 RVAQVNTVDI....NEVLERQNE...IEEAIKASQAANGYSDFVLMI....TDIL..NS......
 6 EIAQVNTVDI....EDVKKRQAE...LEAVISKVVAEKNLDLFLLVI....TDIL..EN......
 7 GIGQVEVIDV....SEVESKKED...IYKLLEEKLKNEGYDLIVFLI....TDIM..KE......
 8 GVGQIELVDL....SLIESRIDE...IYEAMKKMKEEGGYAGIFLML....TDIM..KE......
 9 AVSQIMTSDF....STLLKEKER...FMNTLKTLKGEFGVKHFFVLF....TNPV..EE......
10 AISAI-YMDL....EAFLQRSNL...LADLHAFCQAHS--YDVLVAM....TIFF..NT......
11 KGLEIGLSSIvkrmSWLFNEHGGeadFVNQCRRFQAERGLDVLVLLT....SWRKagDS......
12 KGLEIGFSSIvkrmSWLFNEHGGeadFVNQCRRFQAERGLDVLVLLT....SWRKagDS......
13 NYASV-GKGL....DWIKKKRLG...WEDELKSFAEVQNSDLVIVGL....SLSK..NDefg18a
14 KGV-------....--MSAGTSS...VPCACKQLEAHFSVDLIVAEA....AKYV..EQ......
15 SELLIAVTHI....GSFESDVAKs..LVDE---------AADVAVAV....AERT..SE......
16 AYSYI---DY....DTYLRHNCT...DEDSAGFVGELRSIRGVEVAV....LFME..FP......
17 LAGVGYL---....--RYEERDA...IPQAADFLVSEENVHTALVYG....IVHD..RA......
18 ISNVG-----....--FIANRDA...IAEAADFLLRLEGITTVLVFG....IVDD..RIeisa..
19 ISNVG-----....--FIANRDA...IAEAADFLLRLEGITTVLVFG....IVDD..RIeisa..
20 ISFAG-----....--FINDRDA...LPQAADFLLKLEGISTVVVFGvikdTVYV..SA......
21 ALAYV-----....-GEISNRDA...LPKAADFLLKMEGISTTFVFG....IVGD..EI......
22 SFATA-----....--VLERYQP...DINTLLYEIKDLKESDAFFVI....IEAE..GK......
23 TISMIECEDFvgglGLIVSKAWE...MMG-----------KETFIAI....VKMG..KK......
24 FQNVIYFVAD....KKFQKKLKV...TPLECARVNILANIEQFHIWL....FFIE..EG......
25 QGQVIYFICT....KKIQKRLRM...TADQCARVNLLSNIADYKIWL....FFIE..QA......
26 TQGQVAYLNI....DPQWNQKYH...FTRWGDKVYLLSNIKNYPIWF....IVYF..DEnt....
27 TKNGLAYALI....KKGAYKHFG...VVSPLPMVHALNNIKGVKIWT....TVYF..NEsi....
28 TKNGLAYA--....---------...---------------------....----..--......
29 TPNGLAYALL....KKGTYKQFG...VVSPLPMVHALNNIKGVKIWT....TCYF..NE......
30 SENGAASVFI....KKDTLEKFGtt.ASEASQLVGTLGNISGIRAWV....FFVE..ED......
31 KFII------....--VTSKVSA...YEALACKVFLQLGAD-VAIVG....S---..EK......
32 YEALA-----....---------...-----CKVFLQLGAD-VAIVG....S---..EK......
33 HEASCAKTIV....SIG------...--------------ADVAFVV....AVRK..KE......
34 DGKVAGITVF....KKFLDETGTt..YEDTEGLVNYPRSIEGVKVAY....ALIE..KP......
35 TFLSSSSSGK....DGGVSGVNE...L-----FYMILSNVENNEILG....ILKE..ME......
36 YYGGALMTSY....ETCEDAVQLgldVRDSDALYQLIQSIQGVEAIV....VVRQ..ES......
37 GGRGLVYVVVdnr.EWVAARSEE...VESIVDIVRTTQQAE--VAAV....FKEV..EP......
38 YKDNIAIASL....PENEEEYFD...QVLIAQAADSLLSMSEVEASF....AVAR..RD......
39 GGRVVQTRVD....EEMLARAGAt..WEQVENYVSMLRNAEGAQLAV....MAKD..YG......
40 RGSISLLLLN....QTQTQALHE...CC-------------------....----..--......

270 280 290 300
| | | |
 1 qrFSLIGR...TRIPDTDL.T........QLLEPYG......GGGHAQAA..AVNLRDVEPTTVM
 2 ..TNRTLV...LGATEAKVlR........EAFGAEA......EGQVADLG..NRISRKKQIVPTL
 3 ..NSEILA...IGSNMDKV.E........AAFNFVL......ENNHAFLA..GAVSRKKQVVPQL
 4 ..NSEILA...LGNNTDKV.E........AAFNFTL......KNNHAFLA..GAVSRKKQVVPQL
 5 ..NSEILA...LGNNTDKV.E........AAFNFTL......KNNHAFLA..GAVSRKKQVVPQL
 6 ..DSLALA...IGNEAAKV.E........KAFNVTL......ENNTALLK..GVVSRKKQVVPVL
 7 ..GSEALV...VGN-KEMF.E........KAFNVKV......EGNSVFLE..GVMSRKKQVVPPL
 8 ..GTELLV...VTDYPEVV.E........KAFGKKL......EGKSVWLD..GVMSRKKQVVPPL
 9 ..AS-LLM...MDGDQKLV.E........KAFNAEK......KDGLFLLK..GVMSRKKDFVPKI
10 ..HNEPVR...--------.-........-------......--------..-------------
11 ..HRELVI...LGDSNVVR.Elie11lqlQLFGGNL......DGGVAMFKqlNVEATRKQVVPYL
12 ..HRELVI...LGDSNVVR.Elie11lqlQLFGGNL......DGGVAMFKqlNVEATRKQVVPYL
13 glADSFLK...LSKQNLGL.E........IIEEKDN......GDLSMWNQr.NSAASRKKVVPLL
14 ..HQ----...--------.-........-------......--------..-------------
15 ..FRL--S...VRVSPL--.-........-------......--------..-------------
16 ..RGKIHVsm.RSKDWFNV.N........EVAFELG......GGGHPRAA..GVTFEGKKIEEVI
17 ..NDIELVigsLRTNK---.-........-------......--------..-------------
18 ..RTRDVR...VNIGNVMK.E........AFGEIGS......GGGHPQAG..GA-----------
19 ..RTRDVR...VNIGKVMK.E........AFGEIGS......GGGHAQMG..GA-----------
20 ..RNKDVR...IHMGEVLR.R........AFGDVGS......AGGHAHAA..GA-----------
21 ..HISART...KDLRLNLG.E........ILNKAFG......GGGHQTAA..AA-----------
22 ..TYVFGR...SQSEDVDV.G........EILSHFG......GGGHREAG..AVKLEN-------
23 ..IYVIGR...TSSPDVDL.G........SLMKDLG......GGGHTRAA..SATITGKEIDEVL
24 ..KNHYRVe..FRSNGINV.R........EVALKYG......GGGHIQAS..GAVLKSKRDI---
25 ..NNEIRId..LRSNGINV.R........DIAIKYG......GGGHNNAS..GAIITNKKQ----
26 ..NTYKVS...LRSNKYKV.R........LVASQFN......GGGHDLAA..GCSLTNIDQLENL
27 ..KKWIGS...IRSRNIPI.N........NFAQMFN......GGGHKYAA..AFVLDEKNQFMKL
28 ..------...--------.-........-------......--------..-------------
29 ..DIKKWIgs.IRSRSIPI.N........NFAQMFG......GGGHKYAA..AFVLDDKRQFMKL
30 ..DQIRVR...FRSKGPVI.N........GLARKYN......GGGHPLAS..GAS----------
31 ..DGVRIS...ARAKDYLV.Kkglhlg..KLMEKVGpiiggsGGGHPGAA..GA-----------
32 ..DGVRIS...ARAKDYLV.Kqglhlg..KIMEKVGpiikgsGGGHAGAA..GA-----------
33 ..KEIRVS...ARCRKHVS.Kyvhlg...NLMEKIGkelggsGGGHSEAG..GL-----------
34 ..EEGVWKvslRAKGNVNV.G........KIAERLG......GGGHKYAS..GAKIKTNSYEEAL
35 ..DGSIIVgl.RSKDSFDV.G........KLAEDFG......GGGHKNAS..GFRIK--------
36 ..PTHCSVgf.RSRGSIDV.S........VIAARFG......GGGHRCAA..GLRIEG-------
37 ..HRWSVS...MRAKTVNL.A........AVASGFG......GGGHRLAA..GYTT---------
38 ..EQTVCIsa.RSLGEVNV.Q........IIMEALE......GGGHLTNA..ATQLSGISVSEAL
39 ..DRVKFS...LRSRGPVSaQ........NIAVALG......GGGHVPAA..GA-----------
40 ..------...--------.-........-------......--------..-------------

310
|
 1 AEIY--
 2 EKYF--
 3 TESFN-
 4 TESFN-
 5 TESFNG
 6 TDAM--
 7 ERAYN-
 8 EKAF--
 9 GEVL--
10 ------
11 EEAYSN
12 EEAYSN
13 MD----
14 ------
15 ------
16 PRVINH
17 ------
18 ------
19 ------
20 ------
21 ------
22 ------
23 KEVLN-
24 ------
25 ------
26 LSALD-
27 VQIMDD
28 ------
29 VEIMDD
30 ------
31 ------
32 ------
33 ------
34 KKLL--
35 ------
36 ------
37 ------
38 ER----
39 ------
40 ------