; SAM: prettyalign v3.1b (February 24, 1999) compiled 04/18/00_11:44:15
; (c) 1992-1999 Regents of the University of California, Santa Cruz
;
; Sequence Alignment and Modeling Software System
; http://www.cse.ucsc.edu/research/compbio/sam.html
;
; ------ Citations (HMMs, SAM) ------
; A. Krogh et al., Hidden Markov models in computational biology:
; ------------- Citations (SAM, SAM-T99, HMMs) -----------------
; R. Hughey, A. Krogh, Hidden Markov models for sequence analysis:
; Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
; K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting
; remote protein homologies, Bioinformatics 14(10):846-856, 1999.
; A. Krogh et al., Hidden Markov models in computational biology:
; Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
; --------------------------------------------------------------
; Sequence numbers correspond to the following labels:
; 1 gi|7484206|pir||S73083_240:593
; 2 gi|480411|pir||S36823_223:474
; 3 gi|542504|pir||S40449_223:474
; 4 gi|453663|gb|AAA27769.1|_223:474
; 5 gi|283589|pir||S27270_223:473
; 6 gi|125348|sp|P09231|KEX1_KLULA_198:444
; 7 gi|3334242|sp|O13359|KEX2_CANAL_247:491
; 8 gi|3646417|emb|CAA07702.1|_113:359
; 9 gi|4115628|dbj|BAA36466.1|_113:359
; 10 gi|7595970|gb|AAF64521.1|AF253471_1_129:445
; 11 gi|2127379|pir||JC6032_274:682
; 12 gi|135018|sp|P28842|SUBT_BACS9_168:416
; 13 gi|282432|pir||S25835_170:415
; 14 gi|5302815|emb|CAB46075.1|_177:427
; 15 gi|2731626|gb|AAB93489.1|_180:430
; 16 gi|7473900|pir||F75625_232:478
; 17 gi|5918763|gb|AAD56147.1|AF154675_7_173:429
; 18 gi|2293484|gb|AAB65414.1|_1:200
; 19 gi|7473898|pir||D75393_205:429
; 20 gi|1168687|sp|P42779|BPRV_BACNO_221:466
; 21 gi|576815|gb|AAA80562.1|_221:466
; 22 gi|130965|sp|P23314|PROA_XANCP_230:465
; 23 gi|7542318|gb|AAF63395.1|AF083618_2_199:466
; 24 gi|135023|sp|P29141|SUBV_BACSU_211:572
; 25 gi|7414560|emb|CAB86111.1|_251:481
; 26 gi|7473895|pir||A75589_239:445
; 27
gi|7481372|pir||T36842_94:323
; 28 gi|7481387|pir||T10926_89:322
; 29 gi|7435653|pir||H72784_186:432
; 30 gi|2239272|gb|AAB62277.1|_225:438
; 31 gi|113482|sp|P09230|AEP_YARLI_225:431
; 32 gi|3549630|emb|CAA75805.1|_123:310
; 33 gi|129234|sp|P28296|ORYZ_ASPFU_184:372
; 34 gi|384177|prf||1905286A_184:372
; 35 gi|470731|gb|AAA67705.1|_185:379
; 36 gi|545060|gb|AAC60459.1|_181:392
; 37 gi|4835715|gb|AAD30204.1|AF064522_1_163:264
; 38 gi|7473901|pir||D75286_10:170
; 39 gi|131088|sp|P20015|PRTT_TRIAL_75:261
; 40 gi|6634475|emb|CAB64346.1|_171:368
; 41 gi|460032|gb|AAA91584.1|_148:358
; 42 gi|117631|sp|P29138|CUDP_METAN_172:383
; 43 gi|6624958|emb|CAB63911.1|_172:383
; 44 gi|4761119|gb|AAD29255.1|AF104385_1_163:372
; 45 gi|628051|pir||JC2142_163:372
; 46 gi|742825|prf||2011184A_163:372
; 47 gi|6624950|emb|CAB63907.1|_212:450
; 48 gi|6624962|emb|CAB63913.1|_212:450
; 49 gi|7473893|pir||A75474_217:583
; 50 gi|732988|emb|CAA82213.1|_181:534
; 51 gi|2921857|gb|AAC04871.1|_204:571
; 52 gi|2118106|pir||I39974_181:391
; 53 gi|6573500|pdb|1DBI|A_60:270
; 54 gi|1890101|gb|AAB49694.1|_176:387
; 55 gi|135738|sp|P04072|THET_THEVU_59:269
; 56 gi|494645|pdb|1THM|_59:269
; 57 gi|5726590|gb|AAD48483.1|AF170567_1_165:381
; 58 gi|7435650|pir||JW0075_165:381
; 59 gi|5002190|gb|AAD37351.1|_5:357
; 60 gi|5002192|gb|AAD37352.1|AF142415_1_256:482
; 61 gi|1332716|gb|AAB36054.1|_1:369
; 62 gi|1172505|sp|P42790|PICP_PSESR_216:587
; 63 T0103
; 64 gi|2145514|pir||JC4900_235:625
; 65 gi|1086249|pir||S52769_202:603
; 66 gi|323087|pir||A44869_33:350
; 67 gi|881428|gb|AAA70103.1|_165:560
; 68 gi|6094507|sp|O89023|TPP1_MOUSE_220:527
; 69 gi|6753448|ref|NP_034036.1|_249:555
; 70 gi|4583351|gb|AAD25043.1|AF114167_1_249:556
; 71 gi|5729770|ref|NP_000382.3|_249:556
; 72 gi|6175068|sp|O14773|TPP1_HUMAN_249:556

10 20 30 40
| | | |
1 ------------FNF...TLAYERGYTGG......GSNIAIEGVPESFVNVSDIYSF........
2 -----------TWFN...SHGTRCAGEVSa.....AKDNGVC---------------........
3 -----------TWFN...SHGTRCAGEVSa.....AKDNGVC---------------........ 4 -----------TWFN...SHGTRCAGEVSa.....AKDNGVC---------------........ 5 -----------TWFN...SHGTRCAGEVSa.....AKDNGVC---------------........ 6 ------------KDD...YHGTRCAGEIAafr...NDICGV----------------........ 7 ------------FDD...YHGTRCAGEIAavk...NDVCGI----------------........ 8 ------------NND...SHGTHVTGTMGaa....RDGVGMH---------------........ 9 ------------NND...SHGTHVTGTMGaa....RDGVGMH---------------........ 10 ------DPTPTDPDT...GHGTSVSGIIAav....DNAIGTK---------------........ 11 ------------NDQ...IVDNGCGEMHG......QHVAGIAGANGQVK--------........ 12 --GTTYTNNSCTDRQ...GHGTHVAGSALadg...GTGNGVY---------------........ 13 -----FTDNSCTDRQ...GHGTHVAGSALang...GTGSGVY---------------........ 14 -GATTPINNSCTDRN...GHGTHVAGTALadgg..SDQAGIY---------------........ 15 -GATTPINNSCTDRN...GHGTHVAGTALadgg..SDQAGIY---------------........ 16 -------DNQVDENI...EHGTAVTSTIAaa....RDGRGVV---------------........ 17 ----TGSINDIDDKK...GHGTAVAGQIAan....----------GQIF--------........ 18 ---------------...-HGTHVAGIAE......ANMPGWK---------------........ 19 -----------HDTT...DHGTHTAGLLV......GSKVGVA---------------........ 20 CGGYPDPRREKKFST...WHGSHVAGTIAavt...NNGVGVA---------------........ 21 CGGYPDPRREKRFST...WHGSHVAGTIAavt...NNGVGVA---------------........ 22 ---------PAASSS...WHGTHVAGTVAavt...NNTTGVA---------------........ 23 --GECGIFSAARDSS...WHGTHVAGTIAeat...GNAIGGA---------------........ 24 --FVDNDYDPKETPT...GDPRGEATDHG......THVAGTVAANGTIK--------........ 25 --------EEVADRH...GHGTHVTSTVG......GSGAASD--------------G........ 26 ------------GGV...GHGTAVAGIAT......----------------------........ 27 -----------RSWA...RHGTAMAGIIAghghgsGDAEGVM---------------........ 28 -----QPGDERTDHE...GHGTGMAALIAgtgkh.GSKSGAY---------------........ 29 --------SNYQDRN...GHGTHVTGTVAai....DNDIGVI---------------........ 30 ---------SNVDDN...GHGTHVAGTIG......SRTYGVA---------------........ 31 ----------NADLL...GHGTHVAGTVG......GKTYGVD---------------........ 
32 -------GGSHVDSI...GHGTHVAGTIG......GKTYGVA---------------........ 33 -------GGSHVDSI...GHGTHVAGTIG......GKTYGVA---------------........ 34 -------GGSHVDSI...GHGTHVAGTIG......GKTTGVA---------------........ 35 ---------QHVDSV...GHGTHVAGTIG......GETYGVS---------------........ 36 ---FVDNDNDATDCN...GHGTHVAGTIG......GGEYGVA---------------........ 37 ---------------...-----------......----------------------........ 38 ---------------...-----------......----------------------........ 39 --------GQDTDGN...GHGTHVAGTVG......GTTYGVA---------------........ 40 -----------SDRN...GHGTHVAGTIG......SKKYGVA---------------........ 41 ---------QNTDGN...GHGTHCAGTIG......SKTYGVA---------------........ 42 ---------QNTDGH...GHGTHCAGTIG......SKTYGVA---------------........ 43 ---------QNRDGH...GHGTHCAGTIG......SRSYGVA---------------........ 44 ---------GTTDGH...GHGTHCAGTIG......SKTYGVA---------------........ 45 ---------TARDGN...GHGTHCSGTIG......SKTYGVA---------------........ 46 ---------TARDGN...GHGTHCSGTIG......SKTYGVA---------------........ 47 WGKTIPAGDADEDGN...GHGTHCSGTIA......GKKYGVA---------------........ 48 WGKTIPAGDADEDGN...GHGTHCSGTIA......GKKYGVA---------------........ 49 --------YQLNDVS...HHGTHVAGTVFaqy...GAGTGASGL----QSGMDA--N........ 50 -------------NN...AHGTHVAGTIAai....ANNEGVK---------------........ 51 ------------NNN...AHGTHVAGTIAai....ANNEGVV---------------........ 52 ----VDNDYDPMDLN...NHGTHVAGIAAaet...NNATGIA---------------........ 53 ----VDNDYDPMDLN...NHGTHVAGIAAaet...NNATGIA---------------........ 54 ---FIDRDNNPMDLN...GHGTHVAGTVAadt...NNGIGVA---------------........ 55 ----VDNDSTPQNGN...GHGTHCAGIAAavt...NNSTGIA---------------........ 56 ----VDNDSTPQNGN...GHGTHCAGIAAavt...NNSTGIA---------------........ 57 --DYVDNDNTSDDGN...GHGTHCAGITGalt...NNSVGIA---------------........ 58 --DYVDNDNTSDDGN...GHGTHCAGITGalt...NNSVGIA---------------........ 59 ---------------...-------PVTG......NSSVGVIEFEDQNFAPSDLSDF........ 
60 ---------------...-----------......----------------------........ 61 AAGTAKGHNPTEFPT...IYDASSAPTAA......NTTVGIITIGGVSQTLQDLQQF........ 62 AAGTAKGHNPTEFPT...IYDASSAPTAA......NTTVGIITIGGVSQTLQDLQQF........ 63 AAGTAKGHNPTEFPT...IYDASSAPTAA......NTTVGIITIGGVSQTLQDLQQF........ 64 AAAAVAAHHPQDFAA...IYGGSSLPAAT......NTAVGIITWGSITQTVTDLNSF........ 65 --NATFSMNSARDTL...GHGTHTASTAA......GNYVNGA-------SYFGYGKG........ 66 PAVTTQPFNPTVDVLplyGIDTSDSNRGA......GQVIGIIDAYGASTAESDLAAF........ 67 WATAIQQYNPIPKIA...SISYGWAEVEQceitnsCSTLGIDSVVYVARSNVELQKVglr10fvs 68 -------------SF...THQASVAKVVGkqg...RGRAGIE-------ASLDVEYL........ 69 -------------SF...THQASVAKVVGkqg...RGRAGIE-------ASLDVEYL........ 70 ------------GNF...AHQASVARVVGqqg...RGRAGIE-------ASLDVEYL........ 71 ------------GNF...AHQASVARVVGqqg...RGRAGIE-------ASLDVQYL........ 72 ------------GNF...AHQASVARVVGqqg...RGRAGIE-------ASLDVQYL........ 50 60 70 80 90 100 | | | | | | 1 WQLYGIPR..TGHLNviyfgnVTTGGQSGE.NELDAEW..SGAFAPAANVTIVFSNGYV..---- 2 -GVGVAFG..SKVAG......LRMLDQPFM.TD-----..-------------------..---- 3 -GVGVAFG..SKVAG......LRMLDQPFM.TD-----..-------------------..---- 4 -GVGVAFG..SKVAG......LRMLDQPFM.TD-----..-------------------..---- 5 -GVGVAYG..SKVAG......LRMLDQPFM.TD-----..-------------------..---- 6 ---GVAYN..SKVSG......IRILSGQIT.AE-----..-------------------..---- 7 ---GVAWK..SQVSG......IRILSGPIT.SS-DEAE..AMVYGLD------------..---- 8 ---GVAYN..AQLYV......GNTNANDSF.LFG----..-------------------..---- 9 ---GVAYN..AQIYV......GNTNANDSF.LFGP---..-------------------..---- 10 ---GIAPR..AQLQG......FNLLDDNSQ.QL-----..------------------Q..KDWL 11 ---GVAPD..AQLLA......MKVFSNNAK.NSGAYDD..DIISAIEDSVKLG------..---- 12 ---GVAPD..ADLWA......YKVLGDDGSgYADDIAA..AIRHAGDQATAL-------..---- 13 ---GVAPE..ADLWA......YKVLGDDGSgYADDIAE..AIRHAGDQATALN------..---- 14 ---GVAPD..ADLWA......YKVLLDSGSgYSDDIAA..AIRHAADQATATGT-----..---- 15 ---GVAPD..ADLWA......YKVLLDSGSgYSDDIAA..AIRHAADQATATGT-----..---- 16 
---GVAPD..AKYLT......AAMFQPGSV.GS-----..-------------------..---- 17 ---GVSPG..TNLLV......YRVFGKSKS.KE-----..-------------------..---- 18 -MQGAAPG..AKIVS......AKACVYAGG.CT-----..-------------------..---- 19 ------PG..AKVIS......ALVLPNNEG.TF-----..-------------------..---- 20 ---GVAYG..AKVIP......VRVLGKCGG.YDSDITD..GMYWSAGGHIDGVPDN---..---- 21 ---GVAYG..AKVIP......VRVLGKCGG.YDSDITD..GMYWSAGGHIDGVPDN---..---- 22 ---GTAYG..AKVVP......VRVLGKCGG.SLSDIAD..AIVWASGGTVSGIPA----..---- 23 ---GVAYK..AKVLP......VRVLGHCGG.SFSDITD..AIVWASGGHVEGVPDN---..---- 24 ---GVAPD..ATLLA......YRVLGPGGS.GT-----..-------------------..---- 25 KEKGVAPG..ATLAV......GKVLDDEGF.GS-----..-------------------..---- 26 ---QVAPM..VQIMP......VRALGTDGS.GD-----..-------------------..---- 27 ---GIAPE..AKILP......VRVILEDGD.PS-----..-------------------..---- 28 ---GLAPG..VEILP......IRMPEKIEG.-------..-------------------..---- 29 ---GVAHS..VEIYA......VKALGNGGY.GS-----..----WSDLIIAIDLAVKGP..DGVI 30 ------KR..VTIFG......VKVLPARGT.SP-----..-------------------..---- 31 ------AN..TKLVA......VKVFAGRSA.ALSVINQ..GFTWALNDYISKR------..---- 32 ------KK..TNLLS......VKVFQGESS.STSIILD..GFNWAVNDIVSKGRTKKA-..---- 33 ------KK..TNLLS......VKVFQGESS.STSIILD..GFNWAVNDIVSKGRTKKA-..---- 34 ------KK..TNLLS......VKVFQGESS.STSIILD..GFNWAVNDIVSKGRTKKA-..---- 35 ------KK..ANLLS......VKVFQGESS.STSIILD..GFNWAANDIVSKGR-----..---- 36 ------KN..VNIVG......VRVLGCNGS.GS-----..-------------------..---- 37 --------..-----......---------.-------..-------------------..---- 38 --------..-----......---------.-------..-------------------..---- 39 ------KK..TSLFA......VKVLDANGQ.GS-----..-------------------..---- 40 ------KK..TKILG......IKVLSDQGS.GD-----..-------------------..---- 41 ------KK..TKIYG......VKVLDNSGS.GS-----..-------------------..---- 42 ------KK..AKLYG......VKVLDNQGS.GS-----..-------------------..---- 43 ------KN..AKLFA......VKVLDDQGS.GS-----..-------------------..---- 44 ------KK..ASILG......VKVLEDSGS.GS-----..-------------------..---- 
45 ------KK..VSIFG......VKVLDDNGS.GS-----..-------------------..---- 46 ------KK..VSIFG......VKVLDDNGS.GS-----..-------------------..---- 47 ------KK..ANVYA......VKVLRSNGS.GT-----..----MADVVKGVEFAATSH..VEQV 48 ------KK..ANVYA......VKVLRSNGS.GT-----..----MADVVKGVEFAATSH..VEQV 49 GVGGVASG..VNLYM......ARVLGDDGS.GS-----..-------------------..---- 50 ---GLLPNqnVNLHI......VKVFNESGW.GY-----..-------------------..---- 51 ---GVMPNqnANIHV......INVFNEAGW.GY-----..-------------------..---- 52 ---GMAPN..TRILA......VRALDRNGS.GT-----..-------------------..---- 53 ---GMAPN..TRILA......VRALDRNGS.GT-----..-------------------..---- 54 ---GMAPD..TKILA......VRVLDANGS.GS-----..-------------------..---- 55 ---GTAPK..ASILA......VRVLDNSGS.GT-----..-------------------..---- 56 ---GTAPK..ASILA......VRVLDNSGS.GT-----..-------------------..---- 57 ---GVAPQ..TSIYA......VRVLDNQGS.GT-----..-------------------..---- 58 ---GVAPQ..TSIYA......VRVLDNQGS.GT-----..-------------------..---- 59 ATSFSVPI..TPLTD......NHIIGSNDP.TS----P..QIEATLDIQYILG------..VLLV 60 --------..-----......---------.-------..-------------------..---- 61 TSANGLAS..VNTQT......IQTGSSNGD.YSDDQQG..QGEWDLDSQSIVGSAGGAV..QQLL 62 TSANGLAS..VNTQT......IQTGSSNGD.YSDDQQG..QGEWDLDSQSIVGSAGGAV..QQLL 63 TSANGLAS..VNTQT......IQTGSSNGD.YSDDQQG..QGEWDLDSQSIVGSAGGAV..QQLL 64 TSGAGLAT..VNSTI......TKVGS--GT.FANDPDS..NGEWSLDSQDIVGIAGGVK..QLIF 65 TARGIAPR..ARVAV......YKVTWPEGR.YT-----..-------------------..---- 66 SRANGLPA..ANFQK......VDQNGGTNY.PKDDPDDasGDGWGVETALDLQIAHAVApaAKLI 67 SGDDGAPS..FG-AA......SGNCPIDGT.KQYCPLG..GCNHKSSQCPMITIMESNG..TQCF 68 MSAGANIS..TW---......--VYSSPGR.HE-----..--------------AQEPF..LQWL 69 MSAGANIS..TW---......--VYSSPGR.HE-----..--------------AQEPF..LQWL 70 MSAGANIS..TW---......--VYSSPGR.HE-----..--------------SQEPF..LQWL 71 MSAGANIS..TW---......--VYSSPGR.HE-----..--------------GQEPF..LQWL 72 MSAGANIS..TW---......--VYSSPGR.HE-----..--------------GQEPF..LQWL 110 120 130 140 | | | | 1 
--GG.....PQLVGNLLNYYYEYYYMVNYL........NPNVISISVTV........PESFLAAY 2 ----.....---------LIEANAMGHMPN........VIDIYSASWGP........TDDGKTVD 3 ----.....---------LIEANAMGHMPN........VIDIYSASWGP........TDDGKTVD 4 ----.....---------LIERNAMGHMPN........VIDIYSASWGP........TDDGKTVD 5 ----.....---------LIEANAMGHMPN........VIDIYSASWGP........TDDGKTVD 6 ----.....DEAASLIYGL----------D........VNDIYSCSWGP........SDDGKTMQ 7 ----.....---------------------........TNDIYSCSWGPt.......DNGKVLSE 8 ----.....---PTPDPKYFKAVYTALVDS........GVRAINNSWGSqpp12lagLHAAYAQH 9 ----.....----TPDPQYFKAVYSALVDS........GVRAINNSWGSqpk12lgdLHAAYAQH 10 YALG.....DSNASRDNRVFNHSYRMSVVD........PRSANSL----........DQSQL--- 11 ----.....---------------------........-ADVINMSLGS........VSSDV--- 12 ----.....--------------------N........TKVVINMSLGS........SGESS--- 13 ----.....---------------------........TKVVINMSLGS........SGESS--- 14 ----.....---------------------........-KTIISMSLGS........SANNS--- 15 ----.....---------------------........-KTIISMSLGS........SANNS--- 16 ----.....--------AGVAKAILWMVDN........GAKVLNNSWGG........AGFDP--- 17 ----.....--------CWILKAIIDATNN........GANVINLSLGQyiki....PNGDIWES 18 ----.....------SVALTEGMIELVGNQ........RVDIVNMSIGG........--LPALND 19 ----.....AQVIAGMQYVLDPDNNADTDD........GADVVNMSLGI........PGTWN--- 20 ----.....-------------------QN........PAQVVNMSLGG........GG------ 21 ----.....-------------------QN........PAQVINMSLGG........DGDCS--- 22 ----.....------------------NAN........PAEVINMSLGG........GGSCS--- 23 ----.....---------------------re......PAEIINISLGG........FGPCD--- 24 ----.....-------TENVIAGVERAVQD........GADVMNLSLGNsl......NNPDW--- 25 ----.....-------ESEIIAGMEWAARDv.......DADIVSMSLGS........TEPSDGTD 26 ----.....-------ISAVVQAIVWAVDH........GANIINLSLGS........NEASE--- 27 ---R.....AKARKTRGNALAEGIRWAADH........GADIINLSLGD........DSASA--- 28 --LD.....FTSGHNAARDFSKAIRFAADS........DAKVINISMGQ........AESGTKGG 29 DADG.....DGVVAGDPD----------DD........APEVISMSLGG........SSPPP--- 30 
----.....NSVIIKGMDFVHAMPSGVNAP........TDVVVNMSLGG........-------- 31 ----.....---------------------dtl.....PRGVLNFSGGG........-------- 32 ----.....---------------------........---AINMSLGG........-------- 33 ----.....---------------------........---AINMSLGG........-------- 34 ----.....---------------------........---AINMSLGG........-------- 35 ----.....--------------------T........GKSAINMSLGG........-------- 36 ----.....-------YSGVISGIDWVKNNas......GPSVANMSLGG........-------- 37 ----.....---------------------........-----------........-------- 38 ----.....------------SSINWAVGNkgs.....AAAVANMSLGG........-------- 39 ----.....-------NSGVIAGMDFVTKDassqncp.KGVVVNMSLGG........PSSSA--- 40 ----.....-------YSGILAGMDFAIQDsrtrgcp.KGVVANMSLGG........-------- 41 ----.....-------YSGIISGMDFAVQDsksrscp.KGVVANMSLGG........-------- 42 ----.....-------YSGIISGMDYVAQDsktrgcp.NGAIASMSLGG........-------- 43 ----.....-------YSGIISGMDFVAQDsksrncp.NGHIASMSLGG........-------- 44 ----.....-------LSGVIAGMDFVATDrksrpcs.KGTVASMSLGG........-------- 45 ----.....-------LSNVIAGMDFVASDyrsrncp.RGVVASMSLGG........-------- 46 ----.....-------LSNVIAGMDFVASDyrsrncp.RGVVASMSLGG........-------- 47 LRAK.....DGKRKGFKGS-----------........---VANMSLGG........-------- 48 LRAK.....DGKRKGFKGS-----------........---VANMSLGG........-------- 49 ----.....-------SSGIINGVNWCAAQlksqggteSKVVISLSLGG........GRASQ--- 50 ----.....-------SSTLVRAIQTCADN........GAKIVNMSLGG........SQSSR--- 51 ----.....------SSSLVAAIDTCVTSG........GANVVTMSLGG........SGSTT--- 52 ----.....-------LSDIADAIIYAADS........GAEVINLSLGC........DCHTT--- 53 ----.....-------LSDIADAIIYAADS........GAEVINLSLGC........DCHTT--- 54 ----.....-------LDSIASGIRYAADQ........GAKVLNLSLGC........ECNST--- 55 ----.....-------WTAVANGITYAADQ........GAKVISLSLGG........TVGNS--- 56 ----.....-------WTAVANGITYAADQ........GAKVISLSLGG........TVGNS--- 57 ----.....-------LDAVAQGIREAADS........GAKVISLSLGA........PNGGT--- 58 ----.....-------LDAVAQGIREAADS........GAKVISLSLGA........PNGGT--- 
59 QLDG.....SGLKVIVYRLYGFHHLFATKD........VPLVNSISYGW........NEEDQCEN 60 ----.....---------------------........-----------........----YLGS 61 FYMA.....DQSASGNTGLTQAFNQAVSDN........VAKVINVSLGW........CEADANAD 62 FYMA.....DQSASGNTGLTQAFNQAVSDN........VAKVINVSLGW........CEADANAD 63 FYMA.....DQSASGNTGLTQAFNQAVSDN........VAKVINVSLGW........CEADANAD 64 YTSAngds.SSSGITDAGITASYNRAVTDN........IAKLINVSLGE........DETAAQQS 65 ----.....--------SDVLAGIDQAIAD........GVDVISISLGY........DGVPLYED 66 LCTA.....KSASDTNLN---ACIKTLTNL........HVNHISMSYGG........SEGDT--- 67 FPMGsesntCQSMLQNQNIVNGINEFVSSN........SKCQVAL----........-EQDTQQN 68 LLLS.....NESSLPH--------------........---VHTVSYGD........DEDSL-SS 69 LLLS.....NESSLPH--------------........---VHTVSYGD........DEDSL-SS 70 LLLS.....NESALPH--------------........---VHTVSYGD........DEDSL-SS 71 MLLS.....NESALPH--------------........---VHTVSYGD........DEDSL-SS 72 MLLS.....NESALPH--------------........---VHTVSYGD........DEDSL-SS 150 160 170 180 | | | | 1 Ypa......MLDMIHNIMLQAAAQGISVLAASGDWGYESDHPPPN........FHiGTY.....N 2 Gprnlt...MRAIVNGVNNGRNGLGNVYVWASGDGGPN-------........--.-DD.....C 3 Gprnlt...MRAIVNGVNNGRNGLGNVYVWASGDGGPN-------........--.-DD.....C 4 Gprnlt...MRAIVNGVNNGRNGLGNVYVWASGDGGPN-------........--.-DD.....C 5 Gprnlt...MRAIVNGVNNGRNGLGNVYVWASGDGGPN-------........--.-DD.....C 6 Apdtlv...KKAIIKGVTEGRDAKGALYVFASGNGGMF-------........--.GDS.....C 7 Pdviv....KKAMIKGIQEGRDKKGAIYVFASGNGGRF-------........--.GDS.....C 8 Y........NQGTWLDAAADVAKAGVINVFSAGNSGYANASVR--........--.-SA.....L 9 Y........NQNTWLDAAADVAKAGVINVFSAGNSGYANASVR--........--.-SA.....L 10 -........---DRLFEQQTLKAQGAAYIKAAGN-GFNKIAAGGYvln10gpkLP.FEN.....S 11 G........PSDPQQQAVAKASEAGVINVISAGNSGVAGSTADGN........PV.NNTgtselS 12 -........---LITNAVNYSYNKGVLIIAAAGNSGPY-------........--.--Q.....G 13 -........---LITNAVDYAYDKGVLIIAAAGNSGPK-------........--.--P.....G 14 -........---LISSAVNYAYSKGVLIVAAAGNSGYS-------........--.--Q.....G 15 
-........---LISSAVNYAYSKGVLIVAAAGNSGYA-------........--.--Q.....G 16 -........---LIKAAMDYALERNVTVVVSAGNESRE-------........--.---.....- 17 A........EALRYKFAIDYATRHNVIVVAATGNDGLSDDNGEVKtyy11gqdMS.QND.....T 18 G........NNTRSALYNRIIDEYGVQIFISAANSGAG-------........--.--T.....N 19 -........---EFIVPVNNMLKAGVVPVFAIGNFGPA-------........--.--A.....G 20 G........CSQNSQRMIDKTTNLGALIVIAAGNENQD-------........--.---.....A 21 -........--QSSQRIIDKTTNLGALIVIAAGNENQD-------........--.---.....A 22 -........--TTMQNAINGAVSRGTTVVVAAGNDASN-------........--.---.....V 23 -........--SAMQAAINGAVSRGTTVVVAAGNGGSD-------........--.---.....V 24 -........---ATSTALDWAMSEGVVAVTSNGNSGPN-------........--.--G.....W 25 P........MAEAVNTLSR---ETGALFVIAAGNTGAP-------........--.---.....S 26 -........---ALQSAVRYADEKGVAVVAAAGNAGNN-------........--.ELT.....Y 27 H........PEPGEDEAIQYALKKGVVVVASAGNGGEL-------........--.--G.....D 28 V........DTSELDAAVKYAVDKGKLIFAAAGNEGDG-------........--.--A.....N 29 -........---ELHDVIKAAYNLGITIVAAAGNDGAD-------........--.---.....- 30 G........YSKATNQAAARLVRAKYFVAVASGNNNRDARNYS--........--.---.....- 31 P........KSASQDALWSRATQEGLLVAIAAGNDAVDACNDS--........--.---.....- 32 G........YSYAFNNAVENAFDEGVLSVVAAGNENSDASNTS--........--.---.....- 33 G........YSYAFNNAVENAFDEGVLSVVAAGNENSDASNTS--........--.---.....- 34 G........YSYAFNNAVENAFDEGVLSVVAAGNENSDASNTS--........--.---.....- 35 G........YSYAFNQAVEDAYDEGVLSVVAAGNDNIDASDSS--........--.---.....- 36 G........VSQAVDDAVNNAVASGVSFVVAAGNDNSNACNYS--........--.---.....- 37 -........------------------------------------........--.---.....- 38 G........ASQAVDDAVNNAASKNLVMAVAAGNENQNACNVS--........--.---.....- 39 -........----VNRAAAEITSAGLFLAVAAGNEATDASSSS--........--.---.....- 40 G........YSAAINQAAAKMIQSNVFLAVAAGNDAKDASQTS--........--.---.....- 41 G........KAQSVNDGAAAMIRAGVFLAVAAGNDNANAANYS--........--.---.....- 42 G........YSASVNQGAAALVNSGVFLAVAAGNDNRDAQNTS--........--.---.....- 43 G........YSASVNQGAAALVRSGVFLAVAAGNDNRDAQNTS--........--.---.....- 
44 G........YSATVNQAAARLQASGVFVAVAAGNDNRDAAQTS--........--.---.....- 45 G........YSATVNQAAARLQSSGVFVAVAAGNDNRDAANTS--........--.---.....- 46 G........YSATVNQAAARLQSSGVFVAVAAGNDNRDAANTS--........--.---.....- 47 G........KTQALDAAVNAAVKAGIHFAVAAGNDNADACNYS--........--.---.....- 48 G........KTQALDAAVNAAVKAGIHFAVAAGNDNADACNYS--........--.---.....- 49 -........---TEQRAYTSVYNKGVLTVAATGNDGAA-------........--.---.....- 50 -........---TEQNAMDALYERGVLMIAAAGNSGNT-------........--.---.....- 51 -........---TERNALNTHYNGGVLLIAAAGNAGDS-------........--.---.....- 52 -........---TLENAVNYAWNKGSVVVAAAGNNGSS-------........--.---.....- 53 -........---TLENAVNYAWNKGSVVVAAAGNNGSS-------........--.---.....- 54 -........---TLKSAVDYAWNKGAVVVAAAGNDNVS-------........--.---.....- 55 -........---GLQQAVNYAWNKGSVVVAAAGNAGNT-------........--.---.....- 56 -........---GLQQAVNYAWNKGSVVVAAAGNAGNT-------........--.---.....- 57 -........---ALQQAVQYAWNKGSVIVAAAGNAGNT-------........--.---.....- 58 -........---ALQQAVQYAWNKGSVIVAAAGNAGNT-------........--.---.....- 59 Gigg15saqYVARVNTEFQKIGLRGITLFAASGDSGAN-GRTDPD........CS.ESN.....L 60 G........YLRRSDVEFQKLALMGITIIIADGDNGAGDLGAPPMlt......PD.CSTr....L 61 G........TLQAEDRIFATAAAQGQTFSVSSGDEGVYECNNRGY........PD.GST.....Y 62 G........TLQAEDRIFATAAAQGQTFSVSSGDEGVYECNNRGY........PD.GST.....Y 63 G........TLQAEDRIFATAAAQGQTFSVSSGDEGVYECNNRGY........PD.GST.....Y 64 G........TQAADDAIFQQAVAQGQTFSIASGDAGVYQWSTDPTsgs15tvkID.LTH.....Y 65 P........IAIASFAAM----EKGVVVSTSAGNAGPFFGNMH--........--.---.....- 66 -........---SSDSYFQQAQEAGISLFASAGDSGAE-------........--.---.....- 67 Y........HIYSSCTCDK-------LKPYSDSDAGFKIVGYSYD........QDaGTL.....F 68 I........YIQRVNTEFMKAAARGLTLLFASGDTGAGCWSVSGR........H-.--K.....F 69 I........YIQRVNTEFMKAAARGLTLLFASGDTGAGCWSVSGR........H-.--K.....F 70 A........YIQRVNTEFMKAAARGLTLLFASGDSGAGCWSVSRR........H-.--Q.....F 71 A........YIQRVNTELMKAAARGLTLLFASGDSGAGCWSVSGR........H-.--Q.....F 72 
A........YIQRVNTELMKAAARGLTLLFASGDSGAGCWSVSGR........H-.--Q.....F 190 200 210 220 230 | | | | | 1 TIWYPESDPYVTSVGGIFLNASSN.GSI.....VEI........-------........--SGWD 2 NCDGYAASMWTVSINSATNDGQTA.GYD.....ESC........-------........------ 3 NCDGYAASMWTVSINSATNDGQTA.GYD.....ESC........-------........------ 4 NCDGYAASMWTVSINSATNDGQTA.GYD.....ESC........-------........------ 5 NCDGYAASMWTISINSARNDGQTA.GYD.....ESC........-------........-SSTLA 6 NFDGYTNSIFSITVGAIDWKGLHP.PYS.....ESC........-------........------ 7 NFDGYTNSIYSITVGAIDYKGLHP.QYS.....EAC........-------........------ 8 PYFQPELEGHWLAVSGLDKTNNQK.---.....---........-------........------ 9 PYFQPGLEGHWLAVSGLDKANNQK.---.....---........-------........------ 10 NLDPSNSNFWNLVVSALNADGVRS.SYS.....SVG........-------........-SNIFL 11 TVGTPGVTPDALTVASAENSKVTT.DTV.....KDElg116kfwLKQQKKV........RASRLK 12 SIGYPGALVNAVAVAALENKVENG.TYR.....VAD........-------........FSSRGY 13 SIGYPGALVNAVAVAALENTIQNG.TYR.....VAD........-------........FSSRGH 14 TIGYPGALPNAIAVAALENVQQNG.TYR.....VAD........-------........YSSRGY 15 TIGYPGALPNAIAVAALENVQQNG.TYR.....VAD........-------........YSSRGY 16 YYQRPALFAGVIPSAALAVNNTKA.SFS.....SFG........-------........---RHI 17 VEDYPSVLPNAIAVGSSDNNNQRS.SFS.....NYY........-----NQ........YQDNFI 18 TIADPSVATDAVSVAAGASKETW-.-LA.....NYG........AKAKEEYwaqn....YSSRGP 19 STGSPGNLPQAIGVGAVDSNGQVA.SFS.....SRG........-------........------ 20 SRTWPSSCNNVLSVGATTPKGKRA.PFS.....NYG........-------........------ 21 SRTWPSSCNNVLSVGATTPKGKRA.PFS.....NYG........-------........------ 22 SGSLPANCANVIAVAATTSAGAKA.SYS.....NFG........-------........------ 23 SSAVPANCANVVSVAATRLTGGLA.YYS.....NFG........-------........-SLIDL 24 TVGSPGTSREAISVGATQLPLNE-.YAV.....TFGsy117tfkLTVSKAL........GEQVAD 25 SIGSPGAADAALTVGAVDSADQAA.WFT.....SAG........-------........------ 26 PAAYARTSAGLLSVGSVSDSDVKS.GFS.....NYA........-------........------ 27 HISYPAAYPGVIAATAVDRYGTRA.AFS.....TRR........-------........------ 28 
RPRFPASTPGVVAVGSINEKVKRS.SFS.....EWG........-------........------ 29 SPSYPAAYPEVIAVGAIDENGNVP.SWS.....NRN........-------........------ 30 ----PASEPSVCTVGGTDKFDSV-.YMS.....NWG........-------........---PAV 31 PGNIGGSTSGIITVGSIDSSDKIS.VWSggqgsNYG........-------........------ 32 ----PASAPNALTVAAINKSNARA.SFS.....NYG........-------........------ 33 ----PASAPNALTVAAINKSNARA.SFS.....NYG........-------........------ 34 ----PASAPNALTVAAINKSNARA.SFS.....NYG........-------........------ 35 ----PASAPNALTVAASTKSNTRA.SFS.....NYG........-------........------ 36 ----PARAANAITVGSTTSSDARS.SFS.....NYG........-------........------ 37 -----GNGYTAITRGLTTNTDARW.SFS.....NYG........-------........------ 38 ----PARAVNAITVGATTKTDSRDtGYS.....NYG........-------........------ 39 ----PASEESACTVGATDKTDTLA.EYS.....NFG........-------........------ 40 ----PASEPSVCTVGATDSSDRLS.SFS.....NYG........-------........------ 41 ----PASEPTVCTVGATTSSDARS.SFS.....NYG........-------........------ 42 ----PASEPSACTVGASAENDSRS.SFS.....NYG........-------........------ 43 ----PASEPTACTVGATASDDSRS.TFS.....NYG........-------........------ 44 ----PASEPSVCTVGATDSSDRRS.TFS.....NFG........-------........------ 45 ----PASEPSVCTVGATDSSDRRS.SFS.....NYG........-------........------ 46 ----PASEPSVCTVGATDSSDRRS.SFS.....NYG........-------........------ 47 ----PAAAELPVTVGASAFDDSRA.YFS.....NYG........-------........------ 48 ----PAAAELPVTVGASAFDDSRA.YFS.....NYG........-------........------ 49 -VSYPAAYTNVVGVGAIDSAEARA.SFS.....NFG........-------........---SQV 50 AHSYPASYDSVMSVAAVDSNYDHA.SFS.....QATnq109nnaGALAAMV........YSNQQ- 51 TYTYPPSYDIVMSVAAVDSNLDHA.AFS.....QYTdq108ktaGAKGIIV........YSNTAL 52 TTFEPASYENVIAVGAVDQYDRLA.SFS.....NYG........-------........------ 53 TTFEPASYENVIAVGAVDQYDRLA.SFS.....NYG........-------........------ 54 RTFQPASYPNAIAVGAIDSNDRKA.SFS.....NYG........-------........------ 55 APNYPAYYSNAIAVASTDQNDNKS.SFS.....TYG........-------........------ 56 APNYPAYYSNAIAVASTDQNDNKS.SFS.....TYG........-------........------ 
57 KANYPAYYSEVIAVASTDQSDRKS.SFS.....TYG........-------........------ 58 KANYPAYYSEVIAVASTDQSDRKS.SFS.....TYG........-------........------ 59 NPAYPAASPYITSVGATQISQSSG.VAK.....LPNppp27pilHQVEDFL........WLPQHC 60 NPDWP--SQRLTSRLGLYIHHTLA.EPI.....CYT........DIDCRLDnpe9vgvsLDNGLF 61 SVSWPASSPNVIAVGGTTLYTTSA.GAY.....SNE........TVWNEGL........DSNGKL 62 SVSWPASSPNVIAVGGTTLYTTSA.GAY.....SNE........TVWNEGL........DSNGKL 63 SVSWPASSPNVIAVGGTTLYTTSA.GAY.....SNE........TVWNEGL........DSNGKL 64 SVSEPASSPYVIQVGGTTLSTS-G.TTW.....SGE........TVWNEGLsaiapsqgDNNQRL 65 -----NGIPWVLTVAAGNIDRSFA.GTL.....TLGndq64rsnVAGAILI........SNHTKL 66 -AEYPAASQYVVAVGGTTLHTNSD.GSF.....NSE........-------........---TAW 67 QPDYPASSPFITSVGATQITDVTK.PEI.....VCS........-------........VATGAI 68 RPSFPASSPYVTTVGGTSFKNPF-.---.....---........-----LI........TDEVVD 69 RPSFPASSPYVTTVGGTSFKNPF-.---.....---........-----LI........TDEVVD 70 RPSFPASSPYVTTVGGTSFQNPFR.---.....---........------V........TTEIVD 71 RPTFPASSPYVTTVGGTSFQEPF-.---.....---........-----LI........TNEIVD 72 RPTFPASSPYVTTVGGTSFQEPF-.---.....---........-----LI........TNEIVD 240 250 260 270 | | | | 1 YSTGGNSVVYPAQIYEITSLIPFtp...VI........VRTYPDIAFVSAGGynipefgFGLPLV 2 ---------------------SS.....TL........ASTFSNGKSSSRDA.......GVATTD 3 ---------------------SS.....TL........ASTFSNGKSSSRDA.......GVATTD 4 ---------------------SS.....TL........ASTFSNGKSSSRDA.......GVATTD 5 STFSNGKSN--------------.....--........----------SRDA.......GVATTD 6 ---------------------SA.....VM........VVTYSSGS----GN.......YIKTTD 7 ---------------------SA.....VM........VVTYSSGS----GE.......HIHTTD 8 --------------------YNK.....CG........IAKYWCIS--TPGA.......LINSTV 9 --------------------YNQ.....CG........IAKYWCIS--TPGA.......LINSTI 10 SATGGEYGTDTP-----------.....--........----AMVTTDLPGC.......DMGYNR 11 FGTALIDNSRAGKMSDFTSWGPT.....PE........LDFKPEIT--APGG.......KIYSLA 12 SWTDGDYAIQK------------.....--........----GDVEISAPGA.......AIYSTW 13 
KRTAGDYVIQK------------.....--........----GDVEISAPGA.......AVYSTW 14 ISTAGDYVIQE------------.....--........----GDIEISAPGS.......SVYSTW 15 ISTAGDYVIQE------------.....--........----GDIEISAPGS.......SVYSTW 16 SVAAPGTDILMASPLFINDDGTR.....KL........GATPPDGS------.......------ 17 LAPGGGTPLLDQ--YGQEEWYNQ.....KL........----------FMKE.......QVLSTS 18 REDGGFKPNIMAPGSAISAVPMF.....MD........PEDIPQVSYKLPV-.......------ 19 ------------PVAWQGEISGV.....F-........--TKPDIA--APGV.......NITSTV 20 -----------------------.....--........----ARVHLAAPGT.......NILSTI 21 -----------------------.....--........----ARVHLAAPGT.......NILSTI 22 -----------------------.....--........----TGIDVSAPGS.......SILSTL 23 AAPGGGARDLETDTLYDGPIG--.....--........----SWIW--QTGY.......TGATTP 24 FSSRGPVMDTWM-----------.....--........--IKPDIS--APGV.......NIVSTI 25 --------------------PRY.....GD........NALKPDLS--APGV.......GILAAR 26 -----------------------.....--........----ASLEVLAPGE.......RIATYA 27 -----------------------.....--........--WYATVS--APGV.......DVIIAD 28 -----------------------.....--........----PEIDVTAPGE.......DLVHAC 29 -----------------------.....--........----PEVA--APGV.......NILSTY 30 DINGPGVDVLSTLPNRRTVCFF-.....--........----FLIKIPAWRA.......EELTCM 31 -----------------------.....--........----TCVDVFAPGS.......DIISAS 32 -----------------------.....--........----SVVDIFAPGQ.......DILSAW 33 -----------------------.....--........----SVVDIFAPGQ.......DILSAW 34 -----------------------.....--........----SVVDIFAPGQ.......DILSAW 35 -----------------------.....--........----SVVDIFAPGQ.......DILSAW 36 -----------------------.....--........----NCLDIYAPGS.......SITSAW 37 -----------------------.....--........----SCLDIFAPGS.......SITSAW 38 -----------------------.....--........----SCLDIFAPGT.......NITSTW 39 -----------------------.....--........----SVVDLLAPGT.......DIKSTW 40 -----------------------.....--........----AAVDILAPGS.......DILSTW 41 -----------------------.....--........----NLVDIFAPGS.......NILSTW 
42 -----------------------.....--........----RVVDIFAPGS.......NVLSTW 43 -----------------------.....--........----RIVDIFAPGT.......GILSTW 44 -----------------------.....--........----KAVDIFAPGT.......GILSTW 45 -----------------------.....--........----RALDIFAPGT.......DITSTW 46 -----------------------.....--........----RALDIFAPGT.......DITSTW 47 -----------------------.....--........----KCTDIFAPGL.......NILSTW 48 -----------------------.....--........----KCTDIFAPGL.......NILSTW 49 DLVGPGVSVLSSIPLGQGTRASAs....GG........GVTFTDVS--AADK.......SGKATF 50 -RPGLQNPFVLDQFNTYPLLSVS.....VN........RNVGQELAALVGQD.......ITVSTR 51 PRLQNPFVVDAD--SEILIPSMS.....VD........RTTGLALKAKLGQS.......TTVSNQ 52 -----------------------.....--........----TWVDVVAPGV.......DIVSTI 53 -----------------------.....--........----TWVDVVAPGV.......DIVSTI 54 -----------------------.....--........----TWVDVTAPGV.......NIASTV 55 -----------------------.....--........----SVVDVAAPGS.......WIYSTY 56 -----------------------.....--........----SWVDVAAPGS.......SIYSTY 57 -----------------------.....--........----SWVDVAAPGS.......NIYSTY 58 -----------------------.....--........----SWVDVAAPGS.......NIYSTY 59 LSEGSHCSILQVRCNLASKLLLQ.....CC........CKRISDVS---ALG.......SAILIE 60 WTTGGGFADYPPRPQYQEAIISQ.....YLqsn15nsgGRAYPDIS---TVG.......HNLMTV 61 WATGGGYSVYESKPSWQSVVSGT.....PG........RRLLPDISFDAAQG.......TGALIY 62 WATGGGYSVYESKPSWQSVVSGT.....PG........RRLLPDISFDAAQG.......TGALIY 63 WATGGGYSVYESKPSWQSVVSGT.....PG........RRLLPDISFDAAQG.......TGALIY 64 WATGGGVSLYEAAPSWQSSVSSS.....T-........KRVGPDLAFDAASS.......SGALIV 65 FELGGGVSCPCLVISPKDAAALI.....KYakt39sypGILKPDVM--APGS.......LVLASW 66 SGSGGGCSKYTKVIPEQNADPGYaslgcKG........KKALPDLAALADPN.......SGITII 67 ITGGGGVAITQAQPSYQADAVAT.....YIks......GTLPPSYSYMPPID.......SIQILL 68 YISGGGFSNVFPRPPYQEEAVAQ.....FLkss15nasGRAYPDVAALSDGY.......WVVSNM 69 YISGGGFSNVFPRPPYQEEAVAQ.....FLkss15nasGRAYPDVAALSDGY.......WVVSNM 70 
YISGGGFSNVFPQPSYQEEAVVQ.....FLsss15nasGRAYPDVAALSDGY.......WVVSNS 71 YISGGGFSNVFPRPSYQEEAVTK.....FLsss15nasGRAYPDVAALSDGY.......WVVSNR 72 YISGGGFSNVFPRPSYQEEAVTK.....FLsss15nasGRAYPDVAALSDGY.......WVVSNR 280 290 300 310 | | | | 1 ........FQGQ........LFVWY....GTSGAAPMTAAMVALAGTR--......--LGALNFA 2 ........LYNNc.......TASHS....GTSAAAPEAAGVFALALEANK......-NLTWRDMQ 3 ........LYNNc.......TASHS....GTSAAAPEAAGVFALALEANK......-NLTWRDMQ 4 ........LYNNc.......TASHS....GTSAAAPEAAGVFALALEANK......-NLTWRDMQ 5 ........LYNNc.......TASHS....GTSAAAPEAAGVFALALEANR......-NLTWRDMQ 6 ........LDEKc.......SNTHG....GTSAAAPLAAGIYTLVLEANP......-NLTWRDVQ 7 ........IKKKc.......SATHG....GTSAAAPLASGIYSLILSANP......-NLTWRDVQ 8 ........PGGG........YGVKS....GTSMSAPHATGALALVMERYP......---YMTNEQ 9 ........PEGG........YGVKS....GTSMSAPHATGALALVMERYP......---YLNNEQ 10 ........TDDPstn17cdyNGVMN....GTSSATPSTSGAMALLMSAYP......---DLSVRD 11 ........NDNK........YQQMS....GTSMASPFVAGSEALILQGIKk.....QGLNLSGEE 12 ........FDGG........YATIS....GTSMASPHAAGLAAKIWAQYP......---SASNVD 13 ........FDGG........YATIS....GTSMASPHAAGLAAKIWAQSP......---AASNVD 14 ........YNGG........YNTIS....GTSMATPHVSGLAAKIWAENP......---SLSNTQ 15 ........YNGG........YNTIS....GTSMATPHVSGLAAKIWAENP......---SLSNTQ 16 ........---G........YVLMS....GTSFSGPYTAATAALILGAHP......---ELDPYQ 17 ........NNGN........YDYAD....GTSISTGKVSGELAEIISNYH......--LQGDSSK 18 ........---G........LSMFN....GTSMARP--------------......--------- 19 ........RNGG........YQAMS....GSSQASPITAGAVAVLLSAKP......---GASVDA 20 ........DVGQagpvrss.YGMKA....GTSMAAPHVSGVAALVISAANs.....IGKTLTPSE 21 ........DVGQagpvrss.YGMKA....GTSMAAPHVSGVAALVISAANs.....IGKTLTPSE 22 ........NSGTttpgsas.YASYN....GTSMASPHVAGVVALVQSVAP......--TALTPAA 23 ........TSGQ........FTYIGpgfaGTSMASPHVAGTAALVQSALIad....GKAPLTPAA 24 ........PTHDpdhpyg..YGSKQ....GTSMASPHIAGAVAVIKQAKP......---KWSVEQ 25 srlae...GSGD........YTSMD....GTSMATPHIAGVAALLAEEHP......---DWSGAR 26 
........PNNK........LALWT....GTSMSAPVVAGLLALMQGERQ......-AAGGENAV 27 ........PDHR........YYEGW....GTSAASAFVSGAAALVKAAHP......---DLTPAQ 28 ........IGGTg.......VCRTS....GTSDATAIASASAALVWSKHP......---TWTNNQ 29 ........PDDT........YEELS....GTSMATPHVSGTVALIQAARL......-AAGLPLLP 30 ........MQGR........LT---....GTSMATPHIAGLGAYLAAKNG......---RRAGPG 31 ........YQSDsg......TLVYS....GTSMACPHVAGLASYYLSIND......--EVLTPAQ 32 ........IGSTta......TNTIS....GTSMATPHIVGLSVYLMGLEN......---LSGP-- 33 ........IGSTta......TNTIS....GTSMATPHIVGLSVYLMGLENl.....----SGPA- 34 ........IGSTta......TNTIS....GTSMATPHIVGLSVYLMGLENl.....----SGPA- 35 ........IGSTta......TNTIS....GTSMATPHVVGLSLYLIALEG......---LSSASA 36 ........YNSDts......TNTIS....GTSMAAPHVAGAVALYLDENP......---SLSPSQ 37 ........YTSSta......TNTIS....GTSMASPHVAGAAALYLALNP......---SATPAQ 38 ........IGSTsa......TNTIS....GTSMATPHVTGAVALLIAEG-......---NTTTSA 39 ........NDGR........TKIIS....GTSMASPHVAGLGAYFLGLGQ......KVQGL---- 40 ........IGGI........TKSIS....GTSMATPHIVGLGAYLSSLEGf.....PGAQALCER 41 ........IGGT........TNTIS....GTSMATPHIVGLGAYLAGLE-......---GFPGAQ 42 ........IVGR........TNSIS....GTSMATPHIAGLAAYLSALQG......---KTTPAA 43 ........INGR........TNTIS....GTSMATPHIAGLAAYFSALSG......---KTSPAA 44 ........NNGG........TNTIS....GTSMATPHIAGLGAYLLALG-......---KGTAGN 45 ........IGGR........TNTIS....GTSMATPHIAGLGAYLLALE-......---GGSAST 46 ........IGGR........TNTIS....GTSMATPHIAGLGAYLLALE-......---GGSAST 47 ........IGSPta......VNTIS....GTSMASPHICGLLAYYLSLQPagdsefSVASITPKQ 48 ........IGSPta......VNTIS....GTSMASPHICGLLAYYLSLQPagdsefSVASITPKQ 49 sg103vavTGAD........YDFFD....GTSMATPHVSAAAAVVWAAKP......---TLTNTQ 50 ........TGED........YQYYN....GTSMATPHVSGVAGLVWSYHP......---QCSAKQ 51 ........GNRD........YEYYN....GTSMATPHVSGVATLVWSYHP......---ECSASQ 52 ........TGNR........YAYMS....GTSMASPHVAGLAALLASQ--......---GRNNIE 53 ........TGNR........YAYMS....GTSMASPHVAGLAALLASQ--......---GRNNIE 54 ........PNNG........YSYMS....GTSMASPHVAGLAALLASQ--......---GKNNVQ 
55 ........PTST........YASLS....GTSMATPHVAGVAGLLASQ--......---GRSASN 56 ........PTST........YASLS....GTSMATPHVAGVAGLLASQ--......---GRSASN 57 ........KGST........YQSLS....GTSMATPHVAGVAALLANQ--......---GYSNTQ 58 ........KGST........YQSLS....GTSMATPHVAGVAVLLANQ--......---GYSNTQ 59 ........TGGN........IQTVG....GTSASSPIFAGVVGLLNDYVNsktg..KPLGFVSPL 60 ........ISGS........MTPVD....GTSPSATIFAGIVSLLTDARLragk..PALGFLNPL 61 ........NYGQ........LQQIG....GTSLASPIFVGLWARLQSANS......NSLGFPAAS 62 ........NYGQ........LQQIG....GTSLASPIFVGLWARLQSANS......NSLGFPAAS 63 ........NYGQ........LQQIG....GTSLASPIFVGLWARLQSANS......NSLGFPAAS 64 ........VNGS........TEQVG....GTSLASPLFVGAFARIESAAN......NAIGFPASK 65 ........IPNEata14sshYNMVS....GTSMACPHASGVAALLKAAHP......---EWSPAA 66 ........FHGQe.......LEGIG....GTSLAAPLTAGRAAIRGDQVT......--------- 67 llv17sntCPCA........LESVD....GTSCSSPTLAGMISLINDKLIgagk..PTLGFLNPL 68 ........V--P........IPWVS....GTSASTPVFGGILSLINEHRIlngr..PPLGFLNPR 69 ........V--P........IPWVS....GTSASTPVFGGILSLINEHRIlngr..PPLGFLNPR 70 ........V--P........IPWVS....GTSASTPVFGGILSLINEHRLlsgl..PPLGFLNPR 71 ........V--P........IPWVS....GTSASTPVFGGILSLINEHRIlsgr..PPLGFLNPR 72 ........V--P........IPWVS....GTSASTPVFGGILSLINEHRIlsgr..PPLGFLNPR 320 330 340 350 360 | | | | | 1 LYHISY...QGIIesp11gkvAWIPITSGNNP.......L-----PAHYGWNYVTGPGTYNAYAM 2 HLTVLT...SKR-........--NSLYDSNGI.......HHWKLNGAHLLFNHLFGYGVLDAASM 3 HLTVLT...SKR-........--NSLYDSNGI.......HHWKLNGAHLLFNHLFGYGVLDAASM 4 HLTVLT...SKR-........--NSLYDSNGI.......HHWKLNGAHLLFNHLFGYGVLDAASM 5 HLTVLT...SKR-........--NSLYDSNGI.......HHWKLNGAHLLFNHLFGYGVLDAAS- 6 YLSILS...SEEI........NPHDGKWQDTA.......M-------GKRYSHTYGFGKLDAYNI 7 YISVLS...ATPI........NEEDGNYQTT-.......------ALNRKYSHKYGYGKTDAYKM 8 ALQVLL...TTAT........QLDG-------.......------SITQAPNTVVGWGVPDLG-- 9 ALQVLL...TTAT........QLNGAVTDAPT.......T-------------QVGWGVPDLG-- 10 LRDLLA...RSATrvdakhqpVMVSYTSSTGKvrdvkglEGWERNAAGMWFSPTYGFGLIDVNK- 11 
LVQFAK...NSA-........-----MNTSHP.......VYDTEHTKEIISPRRQGSGEIN---- 12 VRGELQ...YRA-........YENDILSG---.......-----YYAGYGDDFASGFGF------ 13 VRGELQ...TRA-........SVNDILSGN--.......------SAGSGDDIASGFGF------ 14 LRSNLQ...ERA-........KSVDIKGG---.......-----YGAAIGDDYASGFGF------ 15 LRSNLQ...ERA-........KSVDIKGG---.......-----YGAAIGDDYASGFGF------ 16 VRRLME...ETA-........---DGSVGENP.......N---------GFDRGTGYGRIQLGEL 17 ARSILL...NQVN........-----------.......-------------------------- 18 ------...----........-----------.......-------------------------- 19 IKNALF...TSA-........-----------.......------SNASAKNNNVGFGQISI--- 20 LSDILV...RTTS........RFNG-------.......----------RLDRGLGSGIVDANA- 21 LSDILV...RTTS........RFNG-------.......----------RLDRGLGSGIVDANA- 22 VETLLK...NTAR........ALPGACSG---.......--------------GCGAGIVNADAA 23 LERLLK...RSAR........AF---------.......------PVQIPLATPAGSGIVDAGA- 24 IKAAIM...NTAV........TLKD-------.......-------------------------- 25 LKDALM...STSK........-ELDVS-----.......------------AYQLGAGRVS---- 26 TAHAHD...ISGI........ALNQPFTG---.......--------------GLGYGRIDAA-- 27 VKSVLE...DTA-........--RNAPAG---.......----------GRDDSRGFGFVDPA-- 28 VLRVLI...NTM-........------KGNEE.......EWTHNESFGYG--------------- 29 PGSESD...TTPD........TVRGVLHTTAT.......D-----AGDPGYDSLYGYGIIDAYD- 30 LCRTIK...----........-----------.......-------------------------- 31 VEALIT...ESN-........-----------.......-------------------------- 32 ------...----........-----------.......-------------------------- 33 ------...----........-----------.......-------------------------- 34 ------...----........-----------.......-------------------------- 35 VVSRIK...E---........-----------.......-------------------------- 36 IDSLLS...QRSSkg......KVSNPQSGSP-.......-------------------------- 37 VTTAIIna.STPN........KVTGAQTGSPN.......-------------------------- 38 VTSALL...NN--........-----------.......-------ATTGKLSSIGTGSPN---- 39 ------...----........-----------.......-------------------------- 
40 IRSLAI...RNT-........-----------.......-------------------------- 41 ALCKRIqtlSTKN........VLTGIPSGTVN.......Y------------------------- 42 LCKKIQ...DTATkn......VLTGVPSGTVN.......YL------------------------ 43 LCQKIQdt.STKN........VIRNVPAGTVN.......FL------------------------ 44 LCQTIQtl.STKN........VLTGVPSGTVN.......Y------------------------- 45 ICARIQtl.STKN........AISGVPSGTVN.......Y------------------------- 46 ICARIQtl.STKN........AISGVPSGTVN.......Y------------------------- 47 LKDTLIei.STQG........VLTDIPNDTPN.......-------------------------- 48 LKDTLIei.STQG........VLTDIPNDTPN.......-------------------------- 49 LLNLLT...STAK........DL---------.......-------GAAGKDNDFGSGLVN---- 50 IRQALT...QTA-........-----------.......--------------------LDLDVL 51 VRAALN...ATAD........DL---------.......-------SVAGRDNQTGYGMVNATTA 52 IRQAIE...QTA-........---DKISGTGT.......YFKYG--------------------- 53 IRQAIE...QTA-........---DKISGTGT.......YFKYG--------------------- 54 IRQAIE...QTA-........---DKISGTGT.......NFKYG--------------------- 55 IRAAIE...NTA-........---DKISGTGT.......YWAKG--------------------- 56 IRAAIE...NTA-........---DKISGTGT.......YWAKG--------------------- 57 IRQIIE...STT-........---DKISGTGT.......YWKN--------------GRVNA--- 58 IRQIIE...STT-........---DKISGTGT.......YWKN--------------GRVNA--- 59 LYKMAA...ERPA........AFFDVIKGDNI.......CTEDGCSS------------------ 60 LYQIAA...EA--........LMHSVCSG---.......------------------GRKQMQKL 61 FYSAIS...STPS........LVHDVKSGNNG.......YGGYGYNAGTGWDYPTGWGSLDIAKL 62 FYSAIS...STPS........LVHDVKSGNNG.......YGGYGYNAGTGWDYPTGWGSLDIAKL 63 FYSAIS...STPS........LVHDVKSGNNG.......YGGYGYNAGTGWDYPTGWGSLDIAKL 64 FYQAFP...TQTS........LLHDVTSGNNG.......YQSHGYTAATGFDEATGFGSFDIGKL 65 IRSAMM...TTA-........--NPLDNTLNP.......IHENGKKFHLASPLAMGAGHIDPN-- 66 -PAYVY...TGGI........QFRDITQGSNG.......H-----SAKAGLDLVTGEG------- 67 LYQAAK...EQPN........VFNDITTGANNcnray..CCQYGYTATTGYDAASGLGSINFKNF 68 
LYQQHG...TGL-........--FDVTHGCHEsclnee.VEGQGFCSGPGWDPVTGWGTPNFPAL 69 LYQQHG...TGL-........--FDVTHGCHEsclnee.VEGQGFCSGPGWDPVTGWGTPNFPAL 70 LYQQRG...AGL-........--FDVTRGCHEsclnee.VQGQGFCSGPGWDPVTGWGTPNFPAL 71 LYQQHG...AGL-........--FDVTRGCHEscldee.VEGQGFCSGPGWDPVTGWGTPNFPAL 72 LYQQHG...AGL-........--FDVTRGCHEscldee.VEGQGFCSGPGWDPVTGWGTPNFPAL 370 | 1 ----------- 2 ----------- 3 ----------- 4 ----------- 5 ----------- 6 VHMAK------ 7 VHF-------- 8 ----------- 9 ----------- 10 ----------- 11 ----------- 12 ----------- 13 ----------- 14 ----------- 15 ----------- 16 AQRLQSGPMP- 17 ----------- 18 ----------- 19 ----------- 20 ----------- 21 ----------- 22 V---------- 23 ----------- 24 ----------- 25 ----------- 26 ----------- 27 ----------- 28 ----------- 29 ----------- 30 ----------- 31 ----------- 32 ----------- 33 ----------- 34 ----------- 35 ----------- 36 ----------- 37 ----------- 38 ----------- 39 ----------- 40 ----------- 41 ----------- 42 ----------- 43 ----------- 44 ----------- 45 ----------- 46 ----------- 47 ----------- 48 ----------- 49 ----------- 50 ARYIVLD---- 51 KAYL------- 52 ----------- 53 ----------- 54 ----------- 55 ----------- 56 ----------- 57 ----------- 58 ----------- 59 ----------- 60 SIH-------- 61 SAYIRSNG--- 62 SAYIRSNGFGH 63 SAYIRSNGFGH 64 NTYAQAN---- 65 ----------- 66 ----------- 67 EQYVL------ 68 L---------- 69 ----------- 70 ----------- 71 ----------- 72 -----------