(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0116-400-600 MutS, Thermus Aquaticus
    • gi|2506881|sp|P54276|MSH6_MOUSE_927:1150 DNA MISMATCH REPAIR PROTEIN MSH6 (MUTS-ALPHA 160 KDA SUBUNIT) (G/T MISMATCH BINDING PROTEIN) (GTBP) (GTMBP) (P160) (Mus musculus)
    • gi|1817725|gb|AAC53034.1| (U42190) G/T-mismatch binding protein [Mus musculus] (Mus musculus)
    • gi|387849|gb|AAB60711.1|_666:868 (L10319) MutS homologue; major mRNA product contains exon 1a and exon 9b; alternative protein produced from exon 1b and exon 9b [Mus musculus] (Mus musculus)
    • gi|400971|sp|P13705|MSH3_MOUSE_666:868 DNA MISMATCH REPAIR PROTEIN MSH3 (REPAIR-3 PROTEIN) (REP-1) (Mus musculus)
    • gi|200706|gb|AAA40052.1| (M80360) Citations 2 and 3 contain revisions to the original sequence in Citation 1. The name of the gene was changed after Citation 1 from Rep-1 to Rep-3 to avoid naming conflict with an unrelated gene.; complete cds of major mRNA [Mus musculus] (Mus musculus)
    • gi|7012942|gb|AAF35250.1|AF227632_1_185:402 (AF227632) mismatch binding protein Mus3 [Zea mays] (Zea mays)
    • gi|7287834|gb|AAF44872.1|AE003406_77_469:685 (AE003416) symbol=spel1; synonym=BG:DS01068.9; cDNA=method:''sim4'', score:''1000.0'', desc:''LD33650 LD Drosophila melanogaster embryo pOT2 Drosophila melanogaster cDNA clone LD33650 5prime, mRNA sequence:AA979275''; match=method:''sim4'', score:''100> (Drosophila melanogaster)
    • gi|2135744|pir||I37550_471:687 mismatch repair protein MSH2 - human (Homo sapiens)
    • gi|1079805|gb|AAA82080.1| (U41221) similar to S. cerevisiae Msh2p (Swiss-Prot accession number P25847) and bacterial MutS proteins (Swiss-Prot accession numbers P23909, P10339, and P27345) [Homo sapiens] (Homo sapiens)
    • gi|9294568|dbj|BAB02831.1|_280:482 (AB024036) contains similarity to mismatch repair protein MutS~gene_id:MQC12.27 [Arabidopsis thaliana] (Arabidopsis thaliana)
    • gi|3886083|gb|AAC78226.1|_448:657 (AF106587) contains similarity to DNA mismatch repair proteins, mutS (Pfam: PF00488, score=376.5, E=2.8e-109, N=1) [Caenorhabditis elegans] (Caenorhabditis elegans)
    • gi|5815430|gb|AAD52669.1|AF178755_1_404:606 (AF178755) HIM-14 protein [Caenorhabditis elegans] (Caenorhabditis elegans)
    • gi|5921981|gb|AAB93433.2| (U58758) C. elegans HIM-5 (GB:AF178755); contains similarity to Pfam domain PF00488 (mutS), Score=204.6, E-value=4.9e-58, N=1 [Caenorhabditis elegans] (Caenorhabditis elegans)
    • gi|7331984|gb|AAF60672.1|_1368:1582 (AC024791) contains similarity to Pfam family PF00488 (DNA mismatch repair proteins, mutS family), score=296.7, E=2.9e-85, N=1 [Caenorhabditis elegans] (Caenorhabditis elegans)
    • gi|3914061|sp|O63852|MSHM_SARGL_498:716 MITOCHONDRIAL MUTS PROTEIN HOMOLOG ()
    • gi|3132797|gb|AAC16386.1| (AF063191) MSH [Sarcophyton glaucum] ()
    • gi|6227005|gb|AAF06041.1|AC009360_6_198:420 (AC009360) Contains similarity to gb|D90908 DNA mismatch repair protein MutS2 from Synechocystis sp. and is a member of PF|00488 Muts family of mismatch repair proteins. [Arabidopsis thaliana] (Arabidopsis thaliana)
              10        20        30                  40        50     
              |         |         |                   |         |     
   1 REGYDPDLDALRAAHREGVAYFLELEERERERT.....GIP.....TLKVGYNAVFGYYLEVTRP
   2 REGYDPDLDALRAAHREGVAYFLELEERERERT.....GIP.....TLKVGYNAVFGYYLEVTRP
   3 KQGYDAELDELLSLSENAGQFLMDLEAREKART.....GLP.....NLKVGYNRIHGYYIELPRV
   4 -PGYHEELDEWRALADGATDYLDRLEIRERERT.....GLD.....TLKVGYNAVHGYYIQISRG
   5 KRGFSSELDEYRDLLEHAEERLKEFEEKERERT.....GIQ.....KLRVGYNQVFGYYIEVTKA
   6 KRGFSSELDEYRDLLEHAEERLKEFEEKERERT.....GIQ.....KLRVGYNQVFGYYIEVTKA
   7 -SGYNEELDEWRALADGATDYLERLEVRERERT.....GLD.....TLKVGFNAVHGYYIQISRG
   8 -SGYNEELDEWRALADGATDYLERLEVRERERT.....GLD.....TLKVGFNAVHGYYIQISRG
   9 KAGYDNELDEMLAISENAGQFLIDLEAREKART.....GLA.....NLKVGYNRVHGYFIELPTK
  10 -SGYNEELDEWRALADGATDYLERLEVRERERT.....GLD.....TLKVGFNAVHGYYIQISRG
  11 KTGYDAELDELQALSENAGQFLMDLEAREKART.....GLP.....NLKVGYNRIHGYFIELPRV
  12 KTGYDAELDELQALSENAGQFLMDLEAREKART.....GLP.....NLKVGYNRIHGYFIELPRV
  13 -PGYHEELDEWRALADGATDYLDRLEIRERERT.....GLD.....TLKVGYNAVHGYYIQISRG
  14 REGYDPDLDALRRAHAEGVAYFLDLEAREKERT.....GIP.....TLKVGYNAVFGYYLEVTRP
  15 REGYDPDLDALRRAHAEGVAYFLDLEAREKERT.....GIP.....TLKVGYNAVFGYYLEVTRP
  16 REGYDPDLDALRRAHAEGVAYFLDLEAREKERT.....GIP.....TLKVGYNAVFGYYLEVTRP
  17 -DGYSAELDEWRDLANGATEFLERLEAEERDRH.....GID.....TLKVGYNNVHGFYIQVSRG
  18 NHGFHPELDELRRIQNHGDEFLLDLEAKERERT.....GLS.....TLKVEFNRVHGFYIELSKT
  19 KAGFDSDYDQALADIRENEQSLLEYLDKQRSRL.....GCK.....SIVYWGIGRNRYQLEIPEN
  20 KAGFDSDYDQALADIRENEQSLLEYLDKQRSRL.....GCK.....SIVYWGIGRNRYQLEIPEN
  21 REGVNAYLDELRFIRDNAETYLREYEKKLRQET.....GIQ.....SLKIGYNKVMGYYIEVTKP
  22 NHGFHPELDELRRIQNHGDEFLLDLEAKERERT.....GLS.....TLKVEFNRVHGFYIELSKT
  23 REGYDPDLDALRRAHADPVAYFLDLEVREKEST.....GIP.....TLKVGYNAVFGYYLEVTRP
  24 ------LIKKRKNEIQEVIHSIQMRLQEFRKIL.....KLP.....SLQYVTVSGQEFMIEIKNS
  25 ------LIKKRKNEIQEVIHSIQMRLQEFRKIL.....KLP.....SLQYVTVSGQEFMIEIKNS
  26 KPGVNAYLDELRFIRENAEKLLKEYEKKLKKET.....GIQ.....SLKIGYNKVMGYYIEVTKA
  27 -SEYDVELDELRKLSNNADQFLIDLETRERESS.....GIS.....TLKVGYNRVHGYYIEISKG
  28 ------LIKKRKNEIQEVIHSIQMRLQEFRKIL.....KLP.....SLQYVTVSGQEFMIEIKNS
  29 ------LIKKRKDEIQGVIDEIRMHLQEIRKIL.....KNP.....SAQYVTVSGQEFMIEIKNS
  30 ------LIKKRKDEIQGVIDEIRMHLQEIRKIL.....KNP.....SAQYVTVSGQEFMIEIKNS
  31 RSGVNPTLDEMRQSLEDDNQWLANLEVTEREKT.....GIA.....NLKVGYNKAFGYYLSLPRS
  32 RSGVNPTLDEMRQSLEDDNQWLANLEVTEREKT.....GIA.....NLKVGYNKAFGYYLSLPRS
  33 KRGFSASLDELHRVRDNANEILKQYLAEERERT.....GIG.....TLKMKYNRMLGHFLEVSKG
  34 KASFDPNLTELREKMDELEKNMQGALGGAAREL.....GLDagk..SIKLESNSQIGHYFRVTCK
  35 -DHYHQDLLRLRNIKTNSKSWILEYQERIRQET.....GIK.....KLKVCYAQALGYYIEVASS
  36 -NHYHPDLLRLRNIKENSKSWILEYQERIRNET.....GIK.....KLKVCYAQALGYYIEVASN
  37 KDGYNQKLDEYRDASRNGKDWIARLEQQEREYT.....GIR.....SLKVGFNKVFGYYIEVTKA
  38 KDGYNQKLDEYRDASRNGKDWIARLEQQEREYT.....GIR.....SLKVGFNKVFGYYIEVTKA
  39 -DEFHNDLKRLRHNQEHSQEWIWEYQERIRKET.....GIK.....KLKICFAQALGYYIEVSSE
  40 -DEFHNDLKRLRHNQEHSQEWIWEYQERIRKET.....GIK.....KLKICFAQALGYYIEVSSE
  41 -EGYNAELDEWRMLSDGATQYLENLEKREREST.....GID.....TLKIGFNAVHGYYIQISQG
  42 -EGCDPEFDATCNAIEEIKSSLKEYLKEQRKLL.....RPA.....SVNYVNVGKDMYLIEVPES
  43 -EGADEEYDCACKTVEEFESSLKKHLKEQRKLL.....GDA.....SINYVTVGKDEYLLEVPES
  44 -EGADEEYDCACKTVEEFESSLKKHLKEQRKLL.....GDA.....SINYVTVGKDEYLLEVPES
  45 ----AQEIRELKMSILMVRTEMDFHLQELRDYL.....EYP.....NLEFSIWGNVKFCIEVSKG
  46 KYEFHPKIAQLNDLINNKKLHVEKLKDQYRKET.....RIE.....SLKISHNNVLGFFIDITPK
  47 ----AQEIRELKMSILMVRTEMDFHLQELRDYL.....EYP.....NLEFSIWGNVKFCIEVSKG
  48 -PNIDPEIDEKKRRLMGLPSFLTEVARKELENLds...RIP.....SCSVIYIPLIGFLLSIPRL
  49 --DQFPELAEARQAVLVIREKLDSSIASFRKKL.....AIR.....NLEFLQVSGITHLIELPVD
  50 --DQFPELAEARQAVLVIREKLDSSIASFRKKL.....AIR.....NLEFLQVSGITHLIELPVD
  51 --DQFPELAEARQAVLVIREKLDSSIASFRKKL.....AIR.....NLEFLQVSGITHLIELPVD
  52 KASFDSRLMELQQMMTELYSKMEELQFKCSQELnld..GKN.....QVKLESVAKLGHHFRITVK
  53 KAGFDSDYDQALADIRENEQSLLEYLEKQRNRI.....GCR.....TIVYWGIGRNRYQLEIPEN
  54 KAGFDSDYDQALADIRENEQSLLEYLEKQRNRI.....GCR.....TIVYWGIGRNRYQLEIPEN
  55 KAGFDSDYDQALADIRENEQSLLEYLEKQRNRI.....GCR.....TIVYWGIGRNRYQLEIPEN
  56 KASFDSRLMELQQMMTELYSKMEELQFKCSQELnld..GKN.....QVKLESVAKLGHHFRITVK
  57 KAGFDSDYDQALADIRENEQSLLEYLEKQRNRI.....GCR.....TIVYWGIGRNRYQLEIPEN
  58 KASFDSRLMELQQMMTELYSKMEELQFKCSQELnld..GKN.....QVKLESVAKLGHHFRITVK
  59 -PNIDPEIDEKKRRLMGLPSFLTEVARKELENLds...RIP.....SCSVIYIPLIGFLLSIPRL
  60 RSNINEFLDIARRTYTEIVDDIAGMISQLGEKY.....SLP.....-LRTSFSSARGFFIQMTTD
  61 RSNINEFLDIARRTYTEIVDDIAGMISQLGEKY.....SLP.....-LRTSLSSVRGFFIQMTTD
  62 -PNIDPEIDEKKRRLMGLPSFLTEVARKELENLds...RIP.....SCSVIYIPLIGFLLSIPRL
  63 -PNIDPDIDAKKRRLIGLPSFLTEVAQKELENLds...RIP.....SCSVIYIPLIGFLLSIPRL
  64 -PNIDPDIDAKKRRLIGLPSFLTEVAQKELENLds...RIP.....SCSVIYIPLIGFLLSIPRL
  65 KPSFDPNLSELREVMDGLEKKMQSTLISAARGL.....GLDpgk..QIKLDSSAQFGYYFRVTCK
  66 KRGYDLKLDNLKDLKINANKYIDQYLESERLLS.....KIN.....NLKIRKTNNRGLFFEVTKS
  67 KPSFDPNLSELREVMDGLEKKMQSTLINAARGL.....GLDpgk..QIKLDSSAQFGYYFRVTCK
  68 KPSFDPNLSELREVMDGLEKKMQSTLINAARGL.....GLDpgk..QIKLDSSAQFGYYFRVTCK
  69 KPSFDPNLSELREIMNDLEKKMQSTLISAARDL.....GLDpgk..QIKLDSSAQFGYYFRVTCK
  70 KPSFDPNLSELREIMNDLEKKMQSTLISAARDL.....GLDpgk..QIKLDSSAQFGYYFRVTCK
  71 KPSFDPNLSELREIMNDLEKKMQSTLISAARDL.....GLDpgk..QIKLDSSAQFGYYFRVTCK
  72 -SSYDTKLASLKDQKELLEQQIHELHKKTAIEL.....DLQvdk..ALKLDKAAQFGHVFRITKK
  73 -SNYDTKLASLKDQKELLEQQIHELHKKTAIEL.....DLQvdk..ALKLDKAAQFGHVFRITKK
  74 KSDSNGLLDVSRQIYKEVKEEFFREVEDLTAKN.....KIN.....-LDHNYDSARGFYLRIKRQ
  75 -NGIDEELDEIRDTYENMPMVLTAIAKQEEARL.....GLPpys..NVACVYIPLVGFVLSVPRD
  76 KSDSNGLLDVSRQIYKEVKEEFFREVEDLTAKN.....KIN.....-LDHNYDSARGFYLRIKRQ
  77 -NGIDEELDEIRDTYENMPMVLTAIAKQEEARL.....GLPpys..NVACVYIPLVGFVLSVPRD
  78 -NGIDEELDEIRDTYENMPMVLTAIAKQEEARL.....GLPpys..NVACVYIPLVGFVLSVPRD
  79 ---YSSDLGVLKDELSVVENHINNLHVDTASDL.....DLSvd...KQLKLEKGSLGHVFRMSKK
  80 RTGFDETLDKYRCVLREGTSWIAEIEAKERENS.....GIS.....TLKIDYNKKDGYYFHVTNS
  81 -YDCSEGIIKIQRESESVRSQLKEELAEIRKYL.....KRP.....--YLNFRDEVDYLIEVKNS
  82 -YDCSEGIIKIQRESESVRSQLKEELAEIRKYL.....KRP.....--YLNFRDEVDYLIEVKNS
  83 ------QLDELRQIYEELPEFLQEVSAMELEHF.....PHL.....-HKEKLPPCIVYIQQIGGS
  84 -AGMDAEYDAAMDSIGEVEKRLKTYLVEQERHF.....GCR.....-ITYFGSDKKRYQLDVPES
  85 REGASAELDEMRALRDQSRRVIAGLQLQYCEET.....GIK.....SLKIKHNNVLGYFIEVTAG
  86 KPEFDDSLRIIRKKLDRLRTDMDNEFAEAAEDL.....GQErek..KIFLENHKVHGWCMRLTRT
  87 -AEFDEELLDLRQRLDELQHSIFEEHKRVGSDL.....HQDtek..KLHLEQHHLYGWCLRLTRT
  88 NPSFNTKLRKLHDTYQGVWQKKTEYNALLKGFFvgdl.GAK.....TFTLKERQNGEYALHVTGT
  89 NPSFNTKLRKLHDTYQGVWQKKTEYNALLKGFFvgdl.GAK.....TFTLKERQNGEYALHVTGT
  90 ---------------------------------.....---.....-------KVFGYFIEITRA
  91 --DFVPEIQEISEKLEQMERVAEKLRKKYSAKF.....ECD.....NLKLDKNSQYGFYFRVTLK
  92 KSNFNNNLRKLHEKLQSLFASYDKLQEDLSKRL.....GKK.....-ATLRKSPAKLYYVHLKLS
  93 KSNFNNNLRKLHEKLQSLFASYDKLQEDLSKRL.....GKK.....-ATLRKSPAKLYYVHLKLS
  94 -DHASETLRGIRTQLRTLESRVRDRLESMLRSSsas..KML.....SDTIVTIRNDRFVIPVKQE
  95 KSGVNGLLDVSRRIRETLLEEVSELVAKLSEEL.....EIF.....-MEYRFEISRGFFIKIKGN
  96 ------QLDLARQTYEEIIRNVEETGAREIAEYfh...GNS.....SVRLSFSQSRGFHYTFVTR
  97 --RASPKLGEIRQKLKAVREQIQQKLQKIIQRQs....NAL.....QEAVITQRGDRFVLPIKAG
  98 -----PLIKKRKNEIQEVIHSIQMRLQEFRKIL.....KLP.....SLQYVTVSGQEFMIEIKNS
  99 KDEASEELLRVRKSIRAVEEEIKKRLDNLINRPdsa..KFL.....SDRIVTIRNGRYVIPVKTS
 100 -AGCDEEYDEALNRVKEALNELNDYKDSVAKKY.....SCS.....-IKFVDSGKVKYLLEMPEN
 101 ---------EIRVELRKIRSQITQKLQNIIQAKs....GAI.....QEQLITQRSDRFVIPVKAP
 102 -RGFDIEFDKSMDRIQELEDELMEILMTYRKQF.....KCS.....NIQYKDSGKEIYTIEIPIS
 103 -RGFDIEFDKSMDRIQELEDELMEILMTYRKQF.....KCS.....NIQYKDSGKEIYTIEIPIS
 104 --GFEAEYDTSQKYQSELKNELYALLEQYKKQL.....RCS.....SLNFKNIGKEVYQVEVPSD
 105 NTGVDNRLDECRNIYNHLEGILLDVARETQIFLln...7qe.....DCKtt7ekLVNAVYIPQLG
 106 -VEFNEELGKIRSKLDTLRDEIHSIHLDSAEDL.....GFDpdk..KLKLENHHLHGWCMRLTRN
 107 NTGVDNRLDECRNIYNHLEGILLDVARETQIFLln...7qe.....DCKtt7ekLVNAVYIPQLG
 108 -VEFNEELGKIRSKLDTLRDEIHSIHLDSAEDL.....GFDpdk..KLKLENHHLHGWCMRLTRN
 109 ---------------------------------.....---.....-------------------
 110 ---------------------------------.....---.....-------------------
 111 -DRASPRLREIRTEKKRLSSEIKRKADDFVRTHs....QIL.....QEQMYVYRDGRYLFPVKAS
 112 ------NLPNLFEQRQTLRAQLTEWAEQISNIVfq...DTT.....SIKAEYFNKEGYAFSISSK
 113 ------NLPNLFEQRQTLRAQLTEWAEQISNIVfq...DTT.....SIKAEYFNKEGYAFSISSK
 114 ---------------------------------.....---.....-------------------
 115 NPEFDDELMELEEQRKSVVKAIESEHQRVMKVY.....GWTek...QLKCEYHTTYGYVFRVTRK
 116 -DRASEDLEIIRSERRRNMENLDSLLKKISTKIfla..GGI.....NKPLITQRRSRMCVAIRAT
 117 RDDASPRLRDLRKRIEPLRGRIREKLTQTLEKWs....DVL.....QEHLVTIRRDRYVLPVLAS
 118 ---------------------------------.....---.....-------------------
 119 -DNASYELQGIRSKISSTNQRIRQNLDRIVKSQanq..KKL.....SDAIVTVRNERNVIPVKAE
 120 KSGVVKEYDSIEFEIKNLNRRVENQIKKIISLNa....EYL.....TSNFVCYKSNKYTLALKSN
 121 ------DIDEV---VNSVKAWADMKLRDLIKDI.....DLS.....GDEVLTLLNQGMTAKLERI
 122 ---------------------------------.....---.....-------------------
 123 --------------INGLRNGIDLLNDLQRADH.....GILa....LYKIVDIPSLSYLPELIHK
 124 --------------INGLRNGIDLLNDLQRADH.....GILa....LYKIVDIPSLSYLPELIHK
 125 ---------------------------------.....---.....-------------------
 126 ---------------------------------.....---.....-------------------
 127 KDSASSALRQSRERVQTLERKLQQLLDAIIRSQ.....KDDesvmiKFQLAAEIDGRWCIQMSSN
 128 KAGFDSDYDQALADIRENEQSLLEYLEKQRNRI.....GCR.....TIVYWGIGRNRYQLEIPEN


         60                  70           80        90        100      
         |                   |            |         |          |      
   1 YYERV.....PKE.....YRPVQTLKD...RQRYTLPEMKEKEREVYRLEAL.IRRREEEVF.LE
   2 YYERV.....PKE.....YRPVQTLKD...RQRYTLPEMKEKEREVYRLEAL.IRRREEEVF.LE
   3 QAEQA.....PAD.....YIRRQTLKG...AERFITPELKAFEDKALSAKSR.ALAREKALY.EE
   4 QSHLA.....PIN.....YVRRQTLKN...AERYIIPELKEYEDKVLTSKGK.ALALEKQLY.DE
   5 NLDKI.....PDD.....YERKQTLVN...SERFITPELKEFETKIMAAKER.IEELEKELF.TS
   6 NLDKI.....PDD.....YERKQTLVN...SERFITPELKEFETKIMAAKER.IEELEKELF.KS
   7 QSHLA.....PIN.....YMRRQTLKN...AERYIIPELKEYEDKVLTSKGK.ALALEKQLY.EE
   8 QSHLA.....PIN.....YMRRQTLKN...AERYIIPELKEYEDKVLTSKGK.ALALEKQLY.EE
   9 QAEQA.....PGD.....YIRRQTLKG...AERFITPELKAFEDKALSAKSR.ALAREKMLY.DA
  10 QSHLA.....PIN.....YMRRQTLKN...AERYIIPELKEYEDKVLTSKGK.ALALEKQLY.EE
  11 QAEQA.....PAD.....YIRRQTLKG...AERFITPELKAFEDKALSAQSR.ALAREKALY.EE
  12 QAEQA.....PAD.....YIRRQTLKG...AERFITPELKAFEDKALSAQSR.ALAREKALY.EE
  13 QSHLA.....PIN.....YVRRQTLKN...AERYIIPELKEYEDKVLTSKGK.ALALEKQLY.DE
  14 YYEKV.....PQE.....YRPVQTLKD...RQRYTLPEMKERERELYRLEAL.IKRREEEVF.LA
  15 YYEKV.....PQE.....YRPVQTLKD...RQRYTLPEMKERERELYRLEAL.IKRREEEVF.LA
  16 YYEKV.....PQE.....YRPVQTLKD...RQRYTLPEMKERERELYRLEAL.IKRREEEVF.LA
  17 QSHLV.....PPH.....YVRRQTLKN...AERYIIEELKQHEDKVLNSKSR.ALALEKQLW.EE
  18 QAEQA.....PAD.....YQRRQTLKN...AERFITPELKAFEDKVLTAQEQ.ALALEKQLF.DG
  19 FATRNl....PEE.....YELKSTKKG...CKRYWTKTIEKKLANLINAEER.RDTSLKDCM.RR
  20 FATRNl....PEE.....YELKSTKKG...CKRYWTKTIEKKLANLINAEER.RDTSLKDCM.RR
  21 NLKYV.....PSY.....FRRRQTLSN...SERFTTEELQRLEEKILSAQTR.INDLEYELY.KE
  22 QAEQA.....PAD.....YQRRQTLKN...AERFITPELKAFEDKVLTAQEQ.ALALEKQLF.DG
  23 YYEKV.....PQE.....YRPVQTLKD...RQRYTLPEMKERERELYRLEAL.IKRREEEVF.IA
  24 AVSCI.....PAD.....WVKVGSTKA...VSRFHPPFIVESYRRLNQLREQ.LVLDCNAEW.LG
  25 AVSCI.....PAD.....WVKVGSTKA...VSRFHPPFIVESYRRLNQLREQ.LVLDCNAEW.LG
  26 NVKYV.....PEH.....FRRRQTLSN...AERYTTEELQRLEEKILSAQTR.INELEYELY.RE
  27 QADKA.....PVH.....YTRRQTLTN...AERYITEELKAFEDKVLSARDR.ALVREKLLY.EQ
  28 AVSCI.....PAD.....WVKVGSTKA...VSRFHPPFIVESYRRLNQLREQ.LVLDCNAEW.LG
  29 AVSCI.....PTD.....WVKVGSTKA...VSRFHSPFIVENYRHLNQLREQ.LVLDCSAEW.LD
  30 AVSCI.....PTD.....WVKVGSTKA...VSRFHSPFIVENYRHLNQLREQ.LVLDCSAEW.LD
  31 KSEQA.....PDN.....YIRKQTLVN...EERYITPELKERETRILTAQAD.LNQLEYEIF.TE
  32 KSEQA.....PDN.....YIRKQTLVN...EERYITPELKERETRILTAQAD.LNQLEYEIF.TE
  33 HLSAV.....PAH.....FIRRRSLSN...ADRFTTEQLSELEAKLARAREG.LVSFEQELF.AD
  34 EEKALrn...NKK.....FTTIDIQKN...GVRFTNSKLSSLSEEYMRNREE.YEEAQNAIV.KE
  35 FAPQL.....PKE.....FIRRQSRLH...AERFTTQELQQFQDEVFSVEDK.LQTLETKLF.KE
  36 LAPQL.....PKE.....FIRRQSRLH...AERFTTQELQQFQDEVFSVEDK.LQTLETKLF.KE
  37 NLHLLe....EGR.....YERNETLTN...AERYITPELKEKEALILEAENN.ICELEYELF.TE
  38 NLHLLe....EGR.....YERNETLTN...AERYITPELKEKEALILEAENN.ICELEYELF.TE
  39 FAPQL.....PKD.....FIRRQSRLH...AERFTTIELQQFQDDMSNISEK.LQTLETQFF.KD
  40 FAPQL.....PKD.....FIRRQSRLH...AERFTTIELQQFQDDMSNISEK.LQTLETQFF.KD
  41 QAHKA.....PIH.....YVRRQTLKN...AERYIIPELKEYEDKVLKSKGA.ALALEKQLY.DE
  42 LGGSV.....PRN.....YELQSTKKG...FYRYWTPELKELILELSKAESE.KESKLKAIL.QN
  43 LSGSV.....PHD.....YELCSSKKG...VSRYWTPTIKKLLKELSQAKSE.KESALKSIS.QR
  44 LSGSV.....PHD.....YELCSSKKG...VSRYWTPTIKKLLKELSQAKSE.KESALKSIS.QR
  45 C-KKI.....PPD.....WIKLSSTRS...LFRFHTPKIQSLLIELSSHEEN.LTISSEKIY.RS
  46 NVNKIl....DPK.....FIHRQTTIN...SVRYTTYELQNLENELVNAQTL.VIRLEKELY.TD
  47 C-KKI.....PPD.....WIKLSSTRS...LFRFHTPKIQSLLIELSSHEEN.LTISSEKIY.RS
  48 PSMVE.....ASDfeingLDFMFLSEE...KLHYRSARTKELDALLGDLHCE.IRDQETLLM.YQ
  49 --SKV.....PMN.....WVKVNSTKK...TIRYHPPEIVAGLDELALATEH.LAIVNRASW.DS
  50 --SKV.....PMN.....WVKVNSTKK...TIRYHPPEIVAGLDELALATEH.LAIVNRASW.DS
  51 --SKV.....PMN.....WVKVNSTKK...TIRYHPPEIVAGLDELALATEH.LAIVNRASW.DS
  52 DDSVLrk...NKN.....YRIVDVIKG...GVRFTSDKLEGYADEFASCRTR.YEEQQLSIV.EE
  53 FTTRNl....PEE.....YELKSTKKG...CKRYWTKTIEKKLANLINAEER.RDVSLKDCM.RR
  54 FTTRNl....PEE.....YELKSTKKG...CKRYWTKTIEKKLANLINAEER.RDVSLKDCM.RR
  55 FTTRNl....PEE.....YELKSTKKG...CKRYWTKTIEKKLANLINAEER.RDVSLKDCM.RR
  56 DDSVLrk...NKN.....YRIVDVIKG...GVRFTSDKLEGYADEFASCRTR.YEEQQLSIV.EE
  57 FTTRNl....PEE.....YELKSTKKG...CKRYWTKTIEKKLANLINAEER.RDVSLKDCM.RR
  58 DDSVLrk...NKN.....YRIVDVIKG...GVRFTSDKLEGYADEFASCRTR.YEEQQLSIV.EE
  59 PSMVE.....ASDfeingLDFMFLSEE...KLHYRSARTKELDALLGDLHCE.IRDQETLLM.YQ
  60 CIALPsdql.PSE.....FIKISKVKN...SYSFTSADLIKMNERCQESLRE.IYHMTYMIV.CK
  61 CIALPsdql.PSE.....FIKISKVKN...SYSFTSADLIKMNERCQESLRE.IYHMTYMIV.CK
  62 PSMVE.....ASDfeingLDFMFLSEE...KLHYRSARTKELDALLGDLHCE.IRDQETLLM.YQ
  63 PFMVE.....ASDfeiegLDFMFLSED...KLHYRSARTKELDTLLGDLHCE.IRDQETLLM.YQ
  64 PFMVE.....ASDfeiegLDFMFLSED...KLHYRSARTKELDTLLGDLHCE.IRDQETLLM.YQ
  65 EEKVLrn...NKN.....FSTVDIQKN...GVKFTNSELSSLNEEYTKNKGE.YEEAQDAIV.KE
  66 NYAQV.....PPH.....FMESQALNS...SKRYKTEKLISLEVDINNAEDN.VVAFEQEIF.DE
  67 EEKVLrn...NKN.....FSTVDIQKN...GVKFTNSELSSLNEEYTKNKGE.YEEAQDAIV.KE
  68 EEKVLrn...NKN.....FSTVDIQKN...GVKFTNSELSSLNEEYTKNKGE.YEEAQDAIV.KE
  69 EEKVLrn...NKN.....FSTVDIQKN...GVKFTNSKLTSLNEEYTKNKTE.YEEAQDAIV.KE
  70 EEKVLrn...NKN.....FSTVDIQKN...GVKFTNSKLTSLNEEYTKNKTE.YEEAQDAIV.KE
  71 EEKVLrn...NKN.....FSTVDIQKN...GVKFTNSKLTSLNEEYTKNKTE.YEEAQDAIV.KE
  72 EEPKIr....-KKlttq.FIVLETRKD...GVKFTNTKLKKLGDQYQSVVDD.YRSCQKELV.DR
  73 EEPKIr....-KKlttq.FIVLETRKD...GVKFTNTKLKKLGDQYQSVVDD.YRSCQKELV.DR
  74 EFTDDvatl.PDV.....FISRTIKKN...YIECTTLNIIKKNARLKEVMEE.ILLLSEETV.DE
  75 YGVES.....QPD.....MTLLYSTHE...DLRVRNATTSRLDDEFGDILMR.LIDSQTAII.LT
  76 EFTDDvatl.PDV.....FISRTIKKN...YIECTTLNIIKKNARLKEVMEE.ILLLSEETV.DE
  77 YGVES.....QPD.....MTLLYSTHE...DLRVRNATTSRLDDEFGDILMR.LIDSQTAII.LT
  78 YGVES.....QPD.....MTLLYSTHE...DLRVRNATTSRLDDEFGDILMR.LIDSQTAII.LT
  79 EEQKVrkkl.TGS.....YLIIETRKD...GVKFTNSKLKNLSDQYQALFGE.YTSCQKKVV.GD
  80 QLGNV.....PAH.....FFRKATLKN...SERFGTEELARIEGDMLEAREK.SANLEYEIF.MR
  81 QIKDL.....PDD.....WIKVNNTKM...VSRFTTPRTQKLTQKLEYYKDL.LIRESELQY.KE
  82 QIKDL.....PDD.....WIKVNNTKM...VSRFTTPRTQKLTQKLEYYKDL.LIRESELQY.KE
  83 EVSRFkfdi.SSI.....QCYILKETQ...RFFYHTSKTRELDNLLGDIYHK.ILDMERAII.RD
  84 HASKA.....NKS.....YTLEGQTKGkkpSRRYTTAETRALLKDMQHAEDT.RNMVLKDLA.RR
  85 NAGSMtd...6ag.....RARFIHRQT...MANAMRFTTTELAELETKIANA.ADRALAIEL.ET
  86 EAGCIrn...NSR.....YLECSTQKN...GVYFTTKTLQALRREFDQLSQN.YNRTQSSLV.NE
  87 EAGCLrgr..SSH.....YTELSTQKN...GVYFTTKRLHSLNNSYMDHQKS.YRYHQNGLA.RE
  88 ASSLKk....13yh....GSCFHILQK...SSQTRWLSHKIWTDLGHELELL.NLKIRNEEA.NI
  89 ASSLKk....13yh....GSCFHILQK...SSQTRWLSHKIWTDLGHELELL.NLKIRNEEA.NI
  90 NLQNFe....-PSefg..YMRKQTLSN...AERFITDELKEKEDIILGAEDK.AIELEYQLF.VQ
  91 EEKSIrk...KDV.....HILETTKGS...GVKFSVGELSDINDEFLEFHLK.YTRAEEEVI.SM
  92 GNETI.....ERFik...6tqAVLFQS...TKSTASFQLPGWTSLGMDLENT.KLHIHQEEQ.RV
  93 GNETI.....ERFik...6tqAVLFQS...TKSTASFQLPGWTSLGMDLENT.KLHIHQEEQ.RV
  94 YRSSY.....GGI.....VHDTSSSGA...TLFIEPQAIVDMNNSLQQAKVK.EKQEIERIL.RV
  95 NTDINsl...PEV.....LINRVKKRK...TIECTTIELMKQSSRYNDIVSE.ITTLNSTII.HD
  96 QAESVti...PRY.....FLDVFRNRT...TVTFNSRKVIAYNDRLEQVVAE.MFLASDVIV.CD
  97 YKEQM.....PGI.....VHDSSASGN...TLYVEPQAIVELGNKLRQARRQ.EQTEEERIL.RQ
  98 AVSCI.....PAD.....WVKVGSTKA...VSRFHPPFIVESYRRLNQLREQ.LVLDCNAEW.LG
  99 HVKKI.....FGI.....VHGTSSSGY...TTYVEPQFVIHLNNKLTELKQK.EEEEVRKVL.QR
 100 --TKV.....SSS.....FELKSRRKG...FIRYSTPDSEQLVAALDAVEKE.KSKLGDDAT.RR
 101 QKDAI.....PGI.....VHDTSTSGA...TLYIEPSSVVPLGNQLRQIFRK.EQTEAEAIR.RT
 102 ATKNV.....PSN.....WVQMAANKT...YKRYYSDEVRALARSMAEAKEI.HKTLEEDLK.NR
 103 ATKNV.....PSN.....WVQMAANKT...YKRYYSDEVRALARSMAEAKEI.HKTLEEDLK.NR
 104 V--KV.....PVN.....WCKMSGTKK...TNRYYNDELRKKIKKLLEAEEL.HLAIMSRMQ.EK
 105 YLVTI.....SVL.....MEPLLDGIpnlqWEEIFRSSENIYFKNGRVLELD.ETYGDIYGA.IS
 106 DAKELrk...HKK.....YIELSTVKA...GIFFSTKQLKSIANETNILQKE.YDKQQSALV.RE
 107 YLVTI.....SVL.....MEPLLDGIpnlqWEEIFRSSENIYFKNGRVLELD.ETYGDIYGA.IS
 108 DAKELrk...HKK.....YIELSTVKA...GIFFSTKQLKSIANETNILQKE.YDKQQSALV.RE
 109 -----.....--H.....YTELSTQKN...GVYFTTKRLHSLNNSYMDHQKS.YRYHQNGLA.RE
 110 -----.....---.....--------D...KLHYRSARTKELDTLLGDLHCE.IRDQETLLM.YQ
 111 MKNAV.....RGI.....VHHLSSSGA...TVFLEPDEFVELNNRVRLLEEE.ERLEISRIL.RQ
 112 KLTKLe....10is....NNSIIVLGK...RGSHHIITSPTIHKVSIELNEL.EEQINIYVK.QT
 113 KLTKLe....10is....NNSIIVLGK...RGSHHIITSPTIHKVSIELNEL.EEQINIYVK.QT
 114 -----.....---.....---------...----RSARTKELDALLGDLHCE.IRDQETLLM.YQ
 115 EDQQVrt...SKE.....LITVSTSKD...GVRFVSERLSSLSEQYKGIRKV.YDVRQQDLK.QK
 116 HKSLLp....GGV.....VLSVSSSRA...TCFIEPKEAVELNNMEVRHANS.EKAEEMAIL.SI
 117 RVGSV.....QGI.....IVDASATGQ...TYFVEPAAVTQLNNELTRLILD.EEAEVRRIL.TE
 118 -----.....---.....-------KS...TASFQLPGWTSLGMDLENTKLH.IHQEEQRVL.KS
 119 YRQDF.....NGI.....VHDQSASGQ...TLYIEPSSVVEMNNQISRLRHD.EAIEKERIL.TQ
 120 FKGKI.....KGN.....IISISSSGE...TFYIEPNDIVNANNRLNYLSLE.KERIILKIL.RN
 121 FDDVL.....SEA.....RSMIKKRSG...VEF--DPFIKSYPLKIDQREVE.RVKRLEAAS.RN
 122 -----.....---.....---------...----------------------.--------F.EA
 123 FEERM.....QNE.....FPCGQVSD-...----------------------.VNANGANDL.AA
 124 FEERM.....QNE.....FPCGQVSD-...----------------------.VNANGANDL.AA
 125 -----.....---.....-------SG...LELFLSQFEAAIDSDFPNYQNQdVTDENAETL.TI
 126 -----.....---.....-------SG...LELFLSQFEAAIDSDFPNYQNQdVTDENAETL.TI
 127 QLTSV.....NGLl....14edNMCFS...GSGGGTAAEPIAAVSMNDDLQS.ARASVAKAE.AE
 128 FTTRNl....PEE.....YELKSTKKG...CKRYWTKTIEKKLANLINAEER.RDVSLKDCM.RR


             110        120       130         140       150            
              |          |         |           |         |            
   1 VRE.....RAKRQAE.ALREAARILAELDVYAALAEVA..VRYGYVRPRFGDR.....LQIRAGR
   2 VRE.....RAKRQAE.ALREAARILAELDVYAALAEVA..VRYGYVRPRFGDR.....LQIRAGR
   3 LLE.....ILIAQLA.PLQETATALAELDVLANLAERA..LNLDFNRPRFVEEpc...LRIRQGR
   4 LFD.....LLLPHLA.DLQQSANALAELDVLVNLAERA..WTLNYTCPTFTDKpg...IRITEGR
   5 VCE.....EVKKHKE.VLLEISEDLAKIDALSTLAYDA..IMYNYTKPVFSEDr....LEIKGGR
   6 VCE.....EVKKHKE.VLLEISEDLAKIDALSTLAYDA..IMYNYTKPVFSEDr....LEIKGGR
   7 LFD.....LLLPHLE.ALQQSASALAELDVLVNLAERA..YTLNYTCPTFIDKpg...IRITEGR
   8 LFD.....LLLPHLE.ALQQSASALAELDVLVNLAERA..YTLNYTCPTFIDKpg...IRITEGR
   9 LLE.....TLISHLA.PLQDSAAALAELDVLINLAERA..LNLDLNCPRFVDEpc...LRIEQGR
  10 LFD.....LLLPHLE.ALQQSASALAELDVLVNLAERA..YTLNYTCPTFIDKpg...IRITEGR
  11 LLE.....RLIGHLA.PLQDSASALAELDVLANLAERA..LNLDLNRPRFVEHtc...LHIEQGR
  12 LLE.....RLIGHLA.PLQDSASALAELDVLANLAERA..LNLDLNRPRFVEHtc...LHIEQGR
  13 LFD.....LLLPHLA.DLQQSANALAELDVLVNLAERA..WTLNYTCPTFTDKpg...IRITEGR
  14 LRE.....RARKEAE.ALREAARILAELDVYAALAEVA..VRHGYTRPRFGER.....LRIRAGR
  15 LRE.....RARKEAE.ALREAARILAELDVYAALAEVA..VRHGYTRPRFGER.....LRIRAGR
  16 LRE.....RARKEAE.ALREAARILAELDVYAALAEVA..VRHGYTRPRFGER.....LRIRAGR
  17 LFD.....LLMPHLE.QLQQLAASVAQLDVLQNLAERA..ENLEYCRPTLVQEag...IHIQGGR
  18 VLK.....NLQTALP.QLQKAAKAAAALDVLSTFSALA..KERNFVRPEFADYpv...IHIENGR
  19 LFC.....NFDKNHK.DWQSAVECIAVLDVLLCLANYSqgGDGPMCRPEIVLPge...7pfLEFK
  20 LFC.....NFDKNHK.DWQSAVECIAVLDVLLCLANYSqgGDGPMCRPEIVLPge...7pfLEFK
  21 LRE.....RVVKELD.KVGNNASAVAEVDFIQSLAQIA..YEKDWAKPQIHEGye...LIIEEGR
  22 VLK.....NLQTALP.QLQKAAKAAAALDVLSTFSALA..KERNFVRPEFADYpa...IHIENGR
  23 LRE.....RARKEGE.ALREAARILAELDVYAALAEVA..VRHGYTRPRFGER.....LRIRAGR
  24 FLE.....NFGEHYH.TLCKAVDHLATVDCIFSLAKVA..KQGNYCRPTLQEEkk...IIIKNGR
  25 FLE.....NFGEHYH.TLCKAVDHLATVDCIFSLAKVA..KQGNYCRPTLQEEkk...IIIKNGR
  26 LRE.....EVVKELD.KVGNNATLIGEVDYIQSLAWLA..LEKGWVKPEVHEGye...LIIEEGK
  27 LLD.....TVGAQLE.PLKRCAAALSELDVLVCFAERA..QTLDWVRPELEHTsc...LHIEGGR
  28 FLE.....NFGEHYH.TLCKAVDHLATVDCIFSLAKVA..KQGNYCRPTLQEEkk...IIIKNGR
  29 FLE.....KFSEHYH.SLCKAVHHLATVDCIFSLAKVA..KQGDYCRPTVQEErk...IVIKNGR
  30 FLE.....KFSEHYH.SLCKAVHHLATVDCIFSLAKVA..KQGDYCRPTVQEErk...IVIKNGR
  31 VRA.....TVAEKAQ.PIRDVAKAVAAIDVLAGLAEVA..VYQGYCRPIMQMEpgl..IDIEAGR
  32 VRA.....TVAEKAQ.PIRDVAKAVAAIDVLAGLAEVA..VYQGYCRPIMQMEpgl..IDIEAGR
  33 IRR.....TVCSHTQ.LLRTNAARVAQLDVLQSFAHAA..LQHGWSQPVFIKDga...LRITGGR
  34 IIT.....ISAGYVD.PIQTLNDVIAQLDAVVSFAHVSnsAPVPYVRPVILEKgqgr.IVLHSAR
  35 LCC.....YILQHRD.LILELSTTIADLDYVISLAELA..AEYDYRRPLVDHSda...LSITKGM
  36 LCF.....YIVEHRD.LILKLSTAVADLDYVVSLAELA..AEYDYRRPLVDHSda...LSITKGM
  37 LRE.....KVKQYIP.RLQQLAKQMSELDALQCFATIS..ENRHYTKPEFSKDe....VEVIEGR
  38 LRE.....KVKQYIP.RLQQLAKQMSELDALQCFATIS..ENRHYTKPEFSKDe....VEVIEGR
  39 LCS.....HILQLRT.EILALSQSLADLDYIISLADLA..HAQGYCRPHVDMSdt...LCIYRGC
  40 LCS.....HILQLRT.EILALSQSLADLDYIISLADLA..HAQGYCRPRVDMSdt...LCIYRGC
  41 LFD.....LLLPHLG.SLQLASLALSELDVLVNLAERA..DTLNYVMPTFCDEvs...VKIKNGR
  42 LIQ.....LFVEHHT.EWRQLVSVVAELDVLTSLAIASgyFEGPSCCPTIKESng...8ptFHAR
  43 LIG.....RFCEHQE.KWRQLVSATAELDVLISLAFASdsYEGVRCRPVISGSts...7phLSAT
  44 LIG.....RFCEHQE.KWRQLVSATAELDVLISLAFASdsYEGVRCRPVISGSts...7phLSAT
  45 FLS.....RISEHYN.ELRNVTTVLGTLDCLISFARIS..SQSGYTRPEFSDKe....LLIHESR
  46 ICR.....KVIEKSS.YLKILANSLSGLDVFCNFAYIA..DEYDYTKPEFTNDls...FDIVKGR
  47 FLS.....RISEHYN.ELRNVTTVLGTLDCLISFARIS..SQSGYTRPEFSDKe....LLIHESR
  48 LQC.....QVLARAA.VLTRVLDLASRLDVLLALASAA..RDYGYSRPRYSPQvlg..VRIQNGR
  49 FLK.....SFSRYYT.DFKAAVQALAALDCLHSLSTLS..RNKNYVRPEFVDDcepveINIQSGR
  50 FLK.....SFSRYYT.DFKAAVQALAALDCLHSLSTLS..RNKNYVRPEFVDDcepveINIQSGR
  51 FLK.....SFSRYYT.DFKAAVQALAALDCLHSLSTLS..RNKNYVRPEFVDDcepveINIQSGR
  52 IIH.....VAVGYAA.PLTLLNNELAQLDCLVSFAIAArsAPTPYVRPKMLEEgare.LVLEDVR
  53 LFY.....NFDKNYK.DWQSAVECIAVLDVLLCLANYSrgGDGPMCRPVILLPed...6pfLELK
  54 LFY.....NFDKNYK.DWQSAVECIAVLDVLLCLANYSrgGDGPMCRPVILLPed...6pfLELK
  55 LFY.....NFDKNYK.DWQSAVECIAVLDVLLCLANYSrgGDGPMCRPVILLPed...6pfLELK
  56 IIH.....VAVGYAA.PLTLLNNELAQLDCLVSFAIAArsAPTPYVRPKMLEEgare.LVLEDVR
  57 LFY.....NFDKNYK.DWQSAVECIAVLDVLLCLANYSrgGDGPMCRPVILLPed...6pfLELK
  58 IIH.....VAVGYAA.PLTLLNNELAQLDCLVSFAIAArsAPTPYVRPKMLEEgare.LVLEDVR
  59 LQC.....QVLARAA.VLTRVLDLASRLDVLLALASAA..RDYGYSRPRYSPQvlg..VRIQNGR
  60 LLS.....EIYEHIH.CLYKLSDTVSMLDMLLSFAHAC..TLSDYVRPEFTDT.....LAIKQGW
  61 LLS.....EIYEHIH.CLYKLSDTVSMLDMLLSFAHAC..TLSDYVRPEFTDT.....LAIKQGW
  62 LQC.....QVLARAA.VLTRVLDLASRLDVLLALASAA..RDYGYSRPRYSPQvlg..VRIQNGR
  63 LQC.....QVLARAS.VLTRVLDLASRLDVLLALASAA..RDYGYSRPHYSPCihg..VRIRNGR
  64 LQC.....QVLARAS.VLTRVLDLASRLDVLLALASAA..RDYGYSRPHYSPCihg..VRIRNGR
  65 IVN.....ISSGYVE.PMQTVNDVLAHLDAVVSFAHVSnaAPVPYVRPVILEKgkgr.IIVKASR
  66 IAS.....NVVMHNK.VLKKVAEFFAYIDLVVNFGYLA..KKNEYKRPVLTSGke...ILLEKSR
  67 IVN.....ISSGYVE.PMQTLNDVLAHLDAIVSFAHVSnaAPVPYVRPVILEKgkgr.IILKASR
  68 IVN.....ISSGYVE.PMQTLNDVLAHLDAIVSFAHVSnaAPVPYVRPVILEKgkgr.IILKASR
  69 IVN.....ISSGYVE.PMQTLNDVLAQLDAVVSFAHVSngAPVPYVRPAILEKgqgr.IILKASR
  70 IVN.....ISSGYVE.PMQTLNDVLAQLDAVVSFAHVSngAPVPYVRPAILEKgqgr.IILKASR
  71 IVN.....ISSGYVE.PMQTLNDVLAQLDAVVSFAHVSngAPVPYVRPAILEKgqgr.IILKASR
  72 VVE.....TVTSFSE.VFEDLAGLLSEMDVLLSFADLAasCPTPYCRPEITSSdagd.IVLEGSR
  73 VVE.....TVTSFSE.VFEDLAGLLSEMDVLLSFADLAasCPTPYCRPEITSLdagd.IVLEGSR
  74 LLD.....KIATHIS.ELFMIAEAVAILDLVCSFTYNL..KENNYTIPIFTNN.....LLIRDSR
  75 LKT.....RVMKKKR.SIIKLLSIASRIDVLISFGLIA..AQNGWNCPALVDEpv...IEAVELY
  76 LLD.....KIATHIS.ELFMIAEAVAILDLVCSFTYNL..KENNYTIPIFTNN.....LLIRDSR
  77 LKT.....RVMKKKR.SIIKLLSIASRIDVLISFGLIA..AQNGWNCPALVDEpv...IEAVELY
  78 LKT.....RVMKKKR.SIIKLLSIASRIDVLISFGLIA..AQNGWNCPALVDEpv...IEAVELY
  79 VVR.....VSGTFSE.VFENFAAVLSELDVLQSFADLAtsCPVPYVRPDITASdegd.IVLLGSR
  80 IRE.....EVGKYIQ.RLQALAQGIATVDVLQSLAVVA..ETQHLIRPEFGDDsq...IDIRKGR
  81 FLN.....KITAEYT.ELRKITLNLAQYDCILSLAATS..CNVNYVRPTFVNGqqa..IIAKNAR
  82 FLN.....KITAEYT.ELRKITLNLAQYDCILSLAATS..CNVNYVRPTFVNGqqa..IIAKNAR
  83 LLS.....HTLLFSA.HLLKAVNFVAELDCILSLACVA..HQNNYVRPVLTVEsl...LDIRNGR
  84 LFE.....KFSNHYD.QWKQCIDCVANLDVLGSLAEYAg.QQMVICVPELVSDadqpfIQLEEGY
  85 FEA.....MVREVVA.EAEAIKAAALALATIDVSAGLA..VLAEEQNYTRPTV.....DRSrmFA
  86 VVG.....VAASYCP.VLERLAAVLAHLDVIVSFAHCSvhAPISYVRPKIHPRgtgr.TVLTEAR
  87 VIK.....IAATYGP.PLEAIGQVIAHLDVILSFAHAStvAVIPYVRPNIVDSs....39arLYL
  88 IDL.....FKRKFID.RSNEVRQVATTLGYLDTLSSFA..VLANERNLVCPKV.....DESnkLE
  89 IDL.....FKRKFID.RSNVVRQVATTLGYLDTLSSFA..VLANERNLVCPKV.....DESnkLE
  90 LRE.....EVKKYTE.RLQQQAKIISELDCLQSFAEIA..QKYNYTRPSFSENkt...LELVESR
  91 LCK.....KAEEFIP.LIPAMAQLIATLDVFVSLSTFAatSSGIYTRPNLLPLgskr.LELKQCR
  92 LKS.....ITDEIVS.HHKTLRSLANALDELDISTSLA..TLAQEQDFVRPVV.....DDShaHT
  93 LKS.....ITDEIVS.HHKTLRSLANALDELDISTSLA..TLAQEQDFVRPVV.....DDShaHT
  94 LTE.....KTAEYTE.ELFLDLQVLQTLDFIFAKARYA..KAVKATKPIMNDTgf...IRLKKAR
  95 MYT.....SINSYTP.ILLMVSEAIGTLDLLCSFAYFTslQKDSYTCPEFAKE.....VTIMRSL
  96 MIE.....EMQPMIP.VLYYAMDALSSIDFLCGLATYS..DLRDTCKPTFGPS.....FSISQGR
  97 LSD.....QVLEVLL.DLEHLLAIATRLDLATARVRYS..FWLGAHPPQWLTPgdekpITLRQLR
  98 FLE.....NFGEHYH.TLCKAVDHLATVDCIFSLAKVA..KQGNYCRPTLQEEkk...IIIKNGR
  99 ITE.....YIGDYAK.ELLESFEACVEVDFQQCKYRFS..KLVEGSFPDFGEW.....VELYEAR
 100 VFE.....QFGHKNP.IWLETVKLVSSFDVLTSLALFAksSPFEMCMPEFDFNatdpyLIVDKGV
 101 LTE.....KVAAVTP.DLERLLAIVTTLDLAVAKARYS..LWIGSNPPRFINRqdneiITLRNLR
 102 LCQ.....KFDAHYNtIWMPTIQAISNIDCLLAITRTSeyLGAPSCRPTIVDEv....12gfLKF
 103 LCQ.....KFDAHYNtIWMPTIQAISNIDCLLAITRTSeyLGAPSCRPTIVDEv....12gfLKF
 104 FYI.....RFDSNYE.QWLALIKYTASIDCFFSLSQAAaaLGEPYCRPEIIEQkdgh.LYFEELR
 105 DFE.....IEILFSL.QEQILRRKTQLTAYNILLSELE..ILLSFAQVSAERN.....YAEPQLV
 106 IIN.....ITLTYTP.VFEKLSLVLAHLDVIASFAHTSsyAPIPYIRPKLHPMdserrTHLISSR
 107 DFE.....IEILFSL.QEQILRRKTQLTAYNILLSELE..ILLSFAQVSAERN.....YAEPQLV
 108 IIN.....ITLTYTP.VFEKLSLVLAHLDVIASFAHTSsyAPIPYIRPKLHPMdserrTHLISSR
 109 VIK.....IAATYGP.PLEAIGQVIAHLDVILSFAHAStvAVIPYVRPNIVDSs....39arLYL
 110 LQC.....QVLARAS.VLTRVLDLASRLDVLLALASAA..RDYGYSRPHYSPCihg..VRIRNGR
 111 LTN.....ILLSRLN.DLERNVELIARFDSLYARVKFA..REFNGTVVKPSSR.....IRLVNAR
 112 YhQ.....ELKRLYF.SYSELFLPLVNMISRLDVALSG..AITAIKFNYVEPC.....LTLAkt8
 113 YhQ.....ELKRLYF.SYSELFLPLVNMISRLDVALSG..AITAIKFNYVEPC.....LTLAkt8
 114 LQC.....QVLARAA.VLTRVLDLASRLDVLLALASAA..RDYGYSRPRYSPQvlg..VRIQNGR
 115 LVS.....TVVTYLP.VLDDAKELIAALDVFVAWATVVrdSPHPMVRPTIRTPe....12slITL
 116 LTS.....EVVMAQR.EILHLLDRILELDIAFARASHA..NWINGVYPNVTSEht...8laVDID
 117 LSA.....LLAQD-S.AVPMTLATIGELDLIASKAKLA..RDWRLNRPEAAPDgl...YDLHEAR
 118 ITD.....EIVSHHK.TLRSLANALDELDISTSLATLA..QEQDFVRPVVDDSha...HTVIQGR
 119 LTG.....YVAADKD.ALLVAEQVMGQLDFLIAKARYS..RSIKGTKPIFKEErt...VYLPKAY
 120 LSA.....KVHSNIV.LLDNLYNNFLYYDSLKARAIYG..IKTKGVFPEISNV.....LNIFDAH
 121 LREfe...9atRLGE.LRGAVEGEIRDIMEFDYRLALG..LFAHEYELTEPEF.....GDEISLR
 122 MVR.....EVVAEAE.AIKAAALALATIDVSAGLAVLA..EEQNYTRPTVDRSrm...FAIDGGR
 123 LMD.....VFIGKAS.EWSLVINAVSTIDVLRSFAAMTlsSFGAMCRPQVLLKddvpvLRMKGLW
 124 LMD.....VFIGKAS.EWSLVINAVSTIDVLRSFAAMTlsSFGAMCRPQVLLKddvpvLRMKGLW
 125 LIE.....LFIERAT.QWSEVIHTISCLDVLRSFAIAAslSAGSMARPVIFPEs....14piLKI
 126 LIE.....LFIERAT.QWSEVIHTISCLDVLRSFAIAAslSAGSMARPVIFPEs....14piLKI
 127 ILS.....MLTEKIN.ARATYSRAYGGAHPDIYLPPED..EVESLSAGENSPD.....INLPs11
 128 LFY.....NFDKNYK.DWQSAVECIAVL----------..-------------.....-------


      160            170            180       190       200 
       |              |              |         |         | 
   1 HPVVERR.....TEFVPNDLEMAHE.....LVLITGPNMAGKSTFLRQTALIAL
   2 HPVVERR.....TEFVPNDLEMAHE.....LVLITGPNMAGKSTFLRQTALIAL
   3 HPVVEQVld...TPFVANDLELDDNtr...MLIITGPNMGGKSTYMRQTALIVL
   4 HPVVEQVln...EPFIANPLNLSPQrr...MLIITGPNMGGKSTYMRQTALIAL
   5 HPVVERFt....QNFVENDIYMDNEkr...FVVITGPNMSGKSTFIRQVGLISL
   6 HPVVERFt....QNFVENDIYMDNEkr...FVVITGPNMSGKSTFIRQVGLISL
   7 HPVVEQVln...EPFIANPLNLSPQrr...MLIITGPNMGGKSTYMRQTALIAL
   8 HPVVEQVln...EPFIANPLNLSPQrr...MLIITGPNMGGKSTYMRQTALIAL
   9 HPVVEQVlt...TPFVANDLGLDNStr...MLIITGPNMGGKSTYMRQTALIVL
  10 HPVVEQVln...EPFIANPLNLSPQrr...MLIITGPNMGGKSTYMRQTALIAL
  11 HPVVEQVle...TPFVANDLALDADtr...MLVITGPNMGGKSTYMRQTALIVL
  12 HPVVEQVle...TPFVANDLALDADtr...MLVITGPNMGGKSTYMRQTALIVL
  13 HPVVEQVln...EPFIANPLNLSPQrr...MLIITGPNMGGKSTYMRQTALIAL
  14 HPVVERR.....TAFVPNDLEMAHE.....LVLVTGPNMAGKSTFLRQTALIAL
  15 HPVVERR.....TAFVPNDLEMAHE.....LVLVTGPNMAGKSTFLRQTALIAL
  16 HPVVERR.....TAFVPNDLEMAHE.....LVLVTGPNMAGKSTFLRQTALIAL
  17 HPVVERVmn...EPFIANPIELNPQrr...MLIITGPNMGGKSTYMRQTALIAL
  18 HPVVEQQv....RHFTANHTDLDHKhr...LMLLTGPNMGGKSTYMRQVALIVL
  19 GSRHPCI.....TKTffgDDFIPND.....ILIGCEe10ayCVLVTGPNMGGKS
  20 GSRHPCI.....TKTffgDDFIPND.....ILIGCEe10ayCVLVTGPNMGGKS
  21 HPVIEEFv....ENYVPNDTKLDRDsf...IHVITGPNMAGKSSYIRQVGVLTL
  22 HPVVEQQv....RHFTANHTNLDHKhr...LMLLTGPNMGGKSTYMRQVALIVL
  23 HPVVRRR.....TAFVPNDLEMAHE.....LVRVTGPNMAGKSTFLRQTALIAL
  24 HPMIDVLlgeq.DQFVPNSTSLSDSer...VMIITGPNMGGKSSYIKQVALVTI
  25 HPMIDVLlgeq.DQFVPNSTSLSDSer...VMIITGPNMGGKSSYIKQVALVTI
  26 HPVIEEFt....KNYVPNDTKLTEEef...IHVITGPNMAGKSSYIRQVGVLTL
  27 HPVVEAVre...QRFEPNDLDLHPErr...MLVITGPNMGGKSTYMRQNALIVL
  28 HPMIDVLlgeq.DQFVPNSTSLSDSer...VMIITGPNMGGKSSYIKQVTLVTI
  29 HPVIDVLlgeq.DQYVPNNTDLSEDser..VMIITGPNMGGKSSYIKQVALITI
  30 HPVIDVLlgeq.DQYVPNNTDLSEDser..VMIITGPNMGGKSSYIKQVALITI
  31 HPVVEQSlga..GFFVANDTQLGHDhwhpdLVILTGPNASGKSCYLRQVGLIQL
  32 HPVVEQSlga..GFFVANDTQLGHDhwhpdLVILTGPNASGKSCYLRQVGLIQL
  33 HPVVELHlps..GEFVPNDLTLSSSeh...7prFALITGPNMAGKSTFLRQTAL
  34 HPCIEMQdd...VAFIPNDITFEKEkqm..FHIITGPNMGGKSTYIRQTGVIVL
  35 HPVALTLldr..GTFIPNDTIMHSAqtr..MILLTGPNMAGKSTYIRQIALLVI
  36 HPVALTLldk..GTFIPNDTVMHSAqtr..MILLTGPNMAGKSTYIRQIALLVI
  37 HPVVEKVmds..QEYVPNNCMMGDNrq...MLLITGPNMSGKSTYMRQIALISI
  38 HPVVEKVmds..QEYVPNNCMMGDNrq...MLLITGPNMSGKSTYMRQIALISI
  39 HPVAKTLvdt..GKFIPNDTEMRGSqtr..MILLTGPNMAGKSTYIRQIALLVI
  40 HPVAKTLvdt..GKFIPNDTEMRGSqtr..MILLTGPNMAGKSTYIRQIALLVI
  41 HPVVEQVlk...DPFIANPVELNHNrh...LLVITGPNMGGKSTYMRQTALITL
  42 NLGHPIL.....RSDslgkGSFVPN.....DVKIGGPgnasFIVLTGPNMGGKS
  43 GLGHPVL.....RGDslgrGSFVPN.....NVKIGGAekasFILLTGPNMGGKS
  44 GLGHPVL.....RGDslgrGSFVPN.....NVKIGGAekasFILLTGPNMGGKS
  45 HPMIELLsd...KSFVPNHIHLSSDgvr..CLLITGPNMGGKSSFVKQLALSAI
  46 HPVVEAAlrktsKSFVYNDCHLSEAer...IWLITGPNMAGKSTYLRQNAIITI
  47 HPMIELLsd...KSFVPNHIHLSSDgvr..CLLITGPNMGGKSSFVKQLALSAI
  48 HPLMELCa....RTFVPNSTECGGDkgr..VKVITGPNSSGKSIYLKQVGLITF
  49 HPVLETIlq...DNFVPNDTILHAEgey..CQIITGPNMGGKSCYIRQVALISI
  50 HPVLETIlq...DNFVPNDTILHAEgey..CQIITGPNMGGKSCYIRQVALISI
  51 HPVLETIlq...DNFVPNDTILHAEgey..CQIITGPNMGGKSCYIRQVALISI
  52 HPCLELQeh...VNFIANSVDFKKEecn..MFIITGPNMGGKSTYIRSVGTAVL
  53 GSRHPCI.....TKTffgDDFIPND.....ILIGCEe10ayCVLVTGPNMGGKS
  54 GSRHPCI.....TKTffgDDFIPND.....ILIGCEe10ayCVLVTGPNMGGKS
  55 GSRHPCI.....TKTffgDDFIPND.....ILIGCEe10ayCVLVTGPNMGGKS
  56 HPCLELQeh...VNFIANSVDFKKEecn..MFIITGPNMGGKSTYIRSVGTAVL
  57 GSRHPCI.....TKTffgDDFIPND.....ILIGCEe10ayCVLVTGPNMGGKS
  58 HPCLELQeh...VNFIANSVDFKKEecn..MFIITGPNMGGKSTYIRSVGTAVL
  59 HPLMELCa....RTFVPNSTECGGDkgr..VKVITGPNSSGKSIYLKQVGLITF
  60 HPILEKIsa...EKPIANNTYVTEGsn...FLIITGPNMSGKSTYLKQIALCQI
  61 HPILEKIsa...EKPIANNTYVTEGsn...FLIITGPNMSGKSTYLKQIALCQI
  62 HPLMELCa....RTFVPNSTECGGDkgr..VKVITGPNSSGKSIYLKQVGLITF
  63 HPLMELCa....RTFVPNSTDCGGDqgr..VKVITGPNSSGKSIYLKQVGLITF
  64 HPLMELCa....RTFVPNSTDCGGDqgr..VKVITGPNSSGKSIYLKQVGLITF
  65 HACVEVQhd...VAFIPNDVHFEKDkqm..FHIITGPNMGGKSTYIRQTGVIVL
  66 HPVVEHYtknt.EIFTENFVRINKEky...FCLITGPNMAGKSTYLRQVALITL
  67 HACVEVQde...VAFIPNDVHFEKDkqm..FHIITGPNMGGKSTYIRQTGVIVL
  68 HACVEVQde...VAFIPNDVHFEKDkqm..FHIITGPNMGGKSTYIRQTGVIVL
  69 HACVEVQde...IAFIPNDVYFEKDkqm..FHIITGPNMGGKSTYIRQTGVIVL
  70 HACVEVQde...IAFIPNDVYFEKDkqm..FHIITGPNMGGKSTYIRQTGVIVL
  71 HACVEVQde...IAFIPNDVYFEKDkqm..FHIITGPNMGGKSTYIRQTGVIVL
  72 HPCVEAQdw...VNFIPNDCRLMRGksw..FQIVTGPNMGGKSTFIRQVGVIVL
  73 HPCVEAQdw...VNFIPNDCRLMRGksw..FQIVTGPNMGGKSTFIRQVGVIVL
  74 HPLLEKVl....KNFVPNTISSTKHsss..LQIITGCNMSGKSVYLKQVALICI
  75 HPISVLVvk...KSFVPNQVSSGRDgik..ASIITGPNACGKSVYMKSIGIMVF
  76 HPLLEKVl....KNFVPNTISSTKHsss..LQIITGCNMSGKSVYLKQVALICI
  77 HPISVLVvk...KSFVPNQVSSGRDgik..ASIITGPNACGKSVYMKSIGIMVF
  78 HPISVLVvk...KSFVPNQVSSGRDgik..ASIITGPNACGKSVYMKSIGIMVF
  79 HPCLEAQdg...VNFIPNDCTLVRGksw..FQIITGPNMGGKSTFIRQVGVNVL
  80 HAVVEKVmga..QTYIPNTIQMAEDts...IQLVTGPNMSGKSTYMRQLAMT--
  81 NPIIESLd....VHYVPNDIMMSPEngk..INIITGPNMGGKSSYIRQVALLTI
  82 NPIIESLd....VHYVPNDIMMSPEngk..INIITGPNMGGKSSYIRQVALLTI
  83 HVLQEMAv....DTFIPNDTEINDNgr...IHIITGPNYSGKSIYVKQVALIVF
  84 HPCANA-.....STYIPNGLELGTAseap.LSLLTGPNMGGKSTLMREVGLLVI
  85 IDGGRHP.....VVEQAlr6aaNPF.....VANGCDLSPPng7gaIWLLTGPNM
  86 HPCMEVQdd...VTFITNDVTLTREdss..FLIITGPNMGGKSTYIRQIGVIAL
  87 KQARHPC.....LEAQddVKFIPND.....VNLEHGsseLLIITGPNMGGKSTY
  88 VVNGRHL.....MVEEGls6slETF.....TANNCELAKDnLWVITGPNMGGKS
  89 VVNGRHL.....MVEEGls6slETF.....TANNCELAKDnLWVITGPNMGGKS
  90 HPVVERVmdy..NDYVPNNCRLDNEtf...IYLITGPNMSGKSTYMRQVAIISI
  91 HPVIEGNse...KPFIPNDVVLDKCr....LIILTGANMGGKSTYLRSAALSIL
  92 VIQGRHP.....IVEKGlshklIPF.....TPNDCFVGNGnvnIWLITGPNMAG
  93 VIQGRHP.....IVEKGlshklIPF.....TPNDCFVGNGnvnIWLITGPNMAG
  94 HPLLPP-.....DQVVANDIELGRDfs...TIVITGPNTGGKTVTLKTLGLLTL
  95 HPILGGNn....SNFVANNYSCNHElsr..IHVITGANMSGKSVYLRQIAYLVI
  96 HPILDWDds...EKTITNDTCLTRDrr...FGIITGPNMAGKSTYLKQTAQLAI
  97 HPLLHWQae...6ggPAVVPITLTI.....DSQirVIAITGPNTGGKTVTLKTL
  98 HPMIDVLlgeq.DQFVPNSTSLSD-.....------------------------
  99 HPVLVLVk....EDVVPVGILLKEKk....GLILTGPNTGGKTVALKTLGLSVL
 100 HPCLALQsr...8qtTSFIANSTTM.....GASeaaVMLLTGPNMGGKSTLMRQ
 101 HPLLVWQqq...6qgNPVIPVDLLI.....SPQirVVTITGPNTGGKTVTLKTL
 102 KSLRHPC.....FNLGattaKDFIP.....NDIELGKEqprLGLLTGANAAGKS
 103 KSLRHPC.....FNLGattaKDFIP.....NDIELGKEqprLGLLTGANAAGKS
 104 HPCINASaa...STFVPNDVVLGGEspn..MIVLTGPNMAGKSTLLRQVCIAVI
 105 EDeciLE.....IINGRHALYETFl.....DNYIPNSTMIDGGl14grIIVVTG
 106 HPVLEMQdd...ISFISNDVTLESGkgd..FLIITGPNMGGKSTYIRQVGVISL
 107 EDeciLE.....IINGRHALYETFl.....DNYIPNSTMIDGGl14grIIVVTG
 108 HPVLEMQdd...ISFISNDVTLESGkgd..FLIITGPNMGGKSTYIRQVGVISL
 109 KQARHPC.....LEAQddVKFIPND.....VNLEHGsseLLIITGPNMGGKSTY
 110 HPLMELCa....RTFVPNSTDCGGDqgr..VKVITGPNSSGKSIYLKQVGLITF
 111 HPLIPK-.....ERVVPINLELPPNkr...GFIITGPNMGGKTVTVKTVGL---
 112 gfIEAAN.....LRHPLVEQLntqE.....ECIAHNISLEDKGMLVFSVNGAGK
 113 gfIERAN.....LRHPLVEQLntqE.....ECIAHNISLEDKGMLVFSVNGAGK
 114 HPLMELCa....RTFVPNSTECGGDkgr..VKVITGPNSSGKSIYLKQVGLITF
 115 LNVRHPL.....VELRqPVYTPNTL.....RLTDDanALIITGPNMGGKSTFMR
 116 SAQHPLL.....LGSvl9gdIFPVP.....VDIKVESSakVVVISGPNTGGKTA
 117 HPLIE--.....-NPVANDIQLGETk....LLLITGPNMGGKTATLKTLGLAVL
 118 HPIVEKGlshklIPFTPNDCFVGNGnvn..IWLITGPNMAGKSTFLRQNAIISI
 119 HPLLNR-.....ETVVANTIEFMEDie...TVIITGPNTGGKTVTLKTLGLIIV
 120 HPLLKDSka...ITFTPA----EN-r....VVIITGPNAGGKTVTLKTIGL---
 121 GALHLNL.....VGSRKPQRVDYRI.....GDPdnVVLLTGANSGGKTTLLETL
 122 HPVVEQAlr...6aaNPFVANGCDL.....SPPng7gaIWLLTGPNMGGKSTFL
 123 HPYAFAGna...NSLVPNDLTLGQDls...7rfALLLTGPNMGGKSTIMRATCL
 124 HPYAFAGna...NSLVPNDLTLGQDls...7rfALLLTGPNMGGKSTIMRATCL
 125 QGLWHPF.....AVAAdgQLPVPND.....ILLGEAr10prSLLLTGPNMGGKS
 126 QGLWHPF.....AVAAdgQLPVPND.....ILLGEAr10prSLLLTGPNMGGKS
 127 wlLYLPR.....CYHPLLLYQh24sg....APPIPADFQISKGtrVLVITGPNT
 128 -------.....-------------.....------------------------