(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html
Citations (SAM, SAM-T99, HMMs)
- R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
- K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting remote protein homologies, Bioinformatics 14(10):846-856, 1998.
- A. Krogh et al., Hidden Markov models in computational biology: Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
Sequence numbers correspond to the following labels:
-
-
- gi|15803488|ref|NP_289521.1|_2:138 (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
- gi|15833079|ref|NP_311852.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
- gi|12517495|gb|AAG58080.1|AE005525_6 (AE005525) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
- gi|13363297|dbj|BAB37248.1| (AP002563) hypothetical protein [Escherichia coli O157:H7]
-
- gi|16130850|ref|NP_417424.1|_2:138 (NC_000913) orf, hypothetical protein [Escherichia coli K12]
- gi|1731018sp|P52050|YQGF_ECOLI Hypothetical protein yqgF
- gi|7430082|pir||D65080 hypothetical protein b2949 - Escherichia coli (strain K-12)
- gi|1789318|gb|AAC75986.1| (AE000377) orf, hypothetical protein [Escherichia coli K12]
-
- gi|16761872|ref|NP_457489.1|_3:138 (NC_003198) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
- gi|16504174|emb|CAD02921.1| (AL627277) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
-
- gi|882478|gb|AAA69116.1|_44:180 (U28377) ORF_o180; was also ORF_o62p before splice [Escherichia coli]
-
- gi|16766398|ref|NP_462013.1|_2:138 (NC_003197) putative endonuclease involved in recombination [Salmonella typhimurium LT2]
- gi|16421650|gb|AAL21972.1| (AE008842) putative endonuclease involved in recombination [Salmonella typhimurium LT2]
-
- gi|21402430|ref|NP_658415.1|_1:135 (NC_003995) hypothetical protein predicted by GeneMark [Bacillus anthracis A2012] [Bacillus anthracis str. A2012]
-
- gi|20807699|ref|NP_622870.1|_1:135 (NC_003869) predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) [Thermoanaerobacter tengcongensis]
- gi|20516249|gb|AAM24474.1| (AE013087) predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) [Thermoanaerobacter tengcongensis]
-
- gi|15613832|ref|NP_242135.1|_1:136 (NC_002570) BH1269~unknown conserved protein [Bacillus halodurans]
- gi|10173885|dbj|BAB04988.1| (AP001511) BH1269~unknown conserved protein [Bacillus halodurans]
-
- gi|15894957|ref|NP_348306.1|_1:133 (NC_003030) Predicted endonuclease involved in recombination [Clostridium acetobutylicum]
- gi|15024643|gb|AAK79646.1|AE007678_2 (AE007678) Predicted endonuclease involved in recombination [Clostridium acetobutylicum]
-
- gi|16800605|ref|NP_470873.1|_1:135 (NC_003212) similar to unknown proteins [Listeria innocua]
- gi|16414010|emb|CAC96768.1| (AL596168) similar to unknown proteins [Listeria innocua]
-
- gi|16079793|ref|NP_390617.1|_1:135 (NC_000964) similar to hypothetical proteins [Bacillus subtilis]
- gi|6226496sp|O34634|YRRK_BACSU Hypothetical protein yrrK
- gi|7430078|pir||D69979 conserved hypothetical protein yrrK - Bacillus subtilis
- gi|2635185|emb|CAB14681.1| (Z99117) similar to hypothetical proteins [Bacillus subtilis]
- gi|2635203|emb|CAB14698.1| (Z99118) similar to hypothetical proteins [Bacillus subtilis]
-
- gi|16803542|ref|NP_465027.1|_1:135 (NC_003210) similar to unknown proteins [Listeria monocytogenes EGD-e]
- gi|16410931|emb|CAC99580.1| (AL591979) similar to unknown proteins [Listeria monocytogenes]
-
- gi|16121241|ref|NP_404554.1|_4:137 (NC_003143) conserved hypothetical protein [Yersinia pestis]
- gi|15979007|emb|CAC89780.1| (AJ414145) conserved hypothetical protein [Yersinia pestis]
-
- gi|18310760|ref|NP_562694.1|_1:133 (NC_003366) conserved hypothetical protein [Clostridium perfringens]
- gi|18145441|dbj|BAB81484.1| (AP003191) conserved hypothetical protein [Clostridium perfringens str. 13]
-
- gi|19746980|ref|NP_608116.1|_1:138 (NC_003485) conserved hypothetical protein [Streptococcus pyogenes MGAS8232]
- gi|19749234|gb|AAL98615.1| (AE010118) conserved hypothetical protein [Streptococcus pyogenes MGAS8232]
-
- gi|15675863|ref|NP_270037.1|_1:138 (NC_002737) conserved hypothetical protein [Streptococcus pyogenes] [Streptococcus pyogenes M1 GAS]
- gi|13623097|gb|AAK34758.1| (AE006631) conserved hypothetical protein [Streptococcus pyogenes M1 GAS]
-
- gi|15603735|ref|NP_246809.1|_1:136 (NC_002663) unknown [Pasteurella multocida]
- gi|12722298|gb|AAK03954.1| (AE006224) unknown [Pasteurella multocida]
-
- gi|15902220|ref|NP_357770.1|_1:138 (NC_003098) Conserved hypothetical protein [Streptococcus pneumoniae R6]
- gi|15457719|gb|AAK98980.1| (AE008400) Conserved hypothetical protein [Streptococcus pneumoniae R6]
-
- gi|15617141|ref|NP_240354.1|_1:133 (NC_002528) hypothetical protein [Buchnera sp. APS]
- gi|11387309sp|P57613|Y548_BUCAI Hypothetical protein BU548
- gi|10039206|dbj|BAB13240.1| (AP001119) hypothetical protein [Buchnera sp. APS]
-
-
- gi|17046115|dbj|BAB72159.1|_4:139 (AB061794) hypothetical protein [Escherichia coli]
-
- gi|10442817|gb|AAG15556.1|_4:137 (AY007523) hypothetical protein [Pseudomonas fluorescens]
-
- gi|21283295|ref|NP_646383.1|_4:140 (NC_003923) conserved hypothetical protein [Staphylococcus aureus subsp. aureus MW2]
- gi|21204735|dbj|BAB95431.1| (AP004827) conserved hypothetical protein [Staphylococcus aureus subsp. aureus MW2]
-
- gi|7208640|emb|CAB76923.1|_4:139 (AJ276030) hypothetical protein [Aeromonas hydrophila]
-
- gi|15595601|ref|NP_249095.1|_7:139 (NC_002516) conserved hypothetical protein [Pseudomonas aeruginosa]
- gi|11347651|pir||B83596 conserved hypothetical protein PA0404 [imported] - Pseudomonas aeruginosa (strain PAO1)
- gi|9946258|gb|AAG03793.1|AE004477_10 (AE004477) conserved hypothetical protein [Pseudomonas aeruginosa]
-
- gi|13470868|ref|NP_102437.1|_9:141 (NC_002678) hypothetical protein [Mesorhizobium loti]
- gi|14021611|dbj|BAB48223.1| (AP002995) hypothetical protein [Mesorhizobium loti]
-
- gi|16272260|ref|NP_438472.1|_2:135 (NC_000907) conserved hypothetical protein [Haemophilus influenzae Rd]
- gi|1175215sp|P43981|YQGF_HAEIN Hypothetical protein HI0305
- gi|1074342|pir||H64005 conserved hypothetical protein HI0305 - Haemophilus influenzae (strain Rd KW20)
- gi|1573274|gb|AAC21970.1| (U32716) conserved hypothetical protein [Haemophilus influenzae Rd]
-
- gi|17230592|ref|NP_487140.1|_5:137 (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
- gi|17132194|dbj|BAB74799.1| (AP003591) ORF_ID:alr3100~hypothetical protein [Nostoc sp. PCC 7120]
-
- gi|21672793|ref|NP_660860.1|_1:134 (NC_004061) hypothetical 15.2 kD protein in gshB-ansB [Buchnera aphidicola str. Sg (Schizaphis graminum)]
- gi|21623442|gb|AAM68071.1| (AE014127) hypothetical 15.2 kD protein in gshB-ansB [Buchnera aphidicola str. Sg (Schizaphis graminum)]
-
- gi|17989017|ref|NP_541650.1|_16:150 (NC_003318) DNA integration/recombination/invertion protein [Brucella melitensis]
- gi|17984856|gb|AAL53914.1| (AE009702) DNA integration/recombination/invertion protein [Brucella melitensis]
-
- gi|15888641|ref|NP_354322.1|_17:149 (NC_003062) AGR_C_2423p [Agrobacterium tumefaciens] [Agrobacterium tumefaciens str. C58 (Cereon)]
- gi|17935216|ref|NP_532006.1| (NC_003304) conserved hypothetical protein [Agrobacterium tumefaciens str. C58 (U. Washington)]
- gi|15156369|gb|AAK87107.1| (AE008058) AGR_C_2423p [Agrobacterium tumefaciens str. C58 (Cereon)]
- gi|17739725|gb|AAL42322.1| (AE009093) conserved hypothetical protein [Agrobacterium tumefaciens str. C58 (U. Washington)]
-
- gi|15965062|ref|NP_385415.1|_16:149 (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
- gi|15074241|emb|CAC45888.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
-
-
- gi|15892378|ref|NP_360092.1|_16:149 (NC_003103) unknown [Rickettsia conorii]
- gi|15619527|gb|AAL02993.1| (AE008609) unknown [Rickettsia conorii]
-
- gi|15677203|ref|NP_274356.1|_6:140 (NC_003112) conserved hypothetical protein [Neisseria meningitidis MC58]
- gi|11278849|pir||A81094 conserved hypothetical protein NMB1337 [imported] - Neisseria meningitidis (group B strain MD58)
- gi|7226581|gb|AAF41712.1| (AE002482) conserved hypothetical protein [Neisseria meningitidis MC58]
-
- gi|15794444|ref|NP_284266.1|_6:140 (NC_003116) conserved hypothetical protein [Neisseria meningitidis Z2491]
- gi|11278848|pir||B81847 conserved hypothetical protein NMA1551 [imported] - Neisseria meningitidis (group A strain Z2491)
- gi|7380192|emb|CAB84778.1| (AL162756) conserved hypothetical protein [Neisseria meningitidis Z2491]
-
- gi|15640493|ref|NP_230120.1|_2:138 (NC_002505) conserved hypothetical protein [Vibrio cholerae]
- gi|11278847|pir||G82318 conserved hypothetical protein VC0466 [imported] - Vibrio cholerae (group O1 strain N16961)
- gi|9654892|gb|AAF93639.1| (AE004133) conserved hypothetical protein [Vibrio cholerae]
-
- gi|15827175|ref|NP_301438.1|_26:160 (NC_002677) conserved hypothetical protein [Mycobacterium leprae]
- gi|13092723|emb|CAC30021.1| (AL583918) conserved hypothetical protein [Mycobacterium leprae]
-
- gi|21220001|ref|NP_625780.1|_17:151 (NC_003888) conserved hypothetical protein [Streptomyces coelicolor A3(2)]
- gi|8249959|emb|CAB93380.1| (AL357523) conserved hypothetical protein [Streptomyces coelicolor A3(2)]
-
- gi|16332240|ref|NP_442968.1|_5:137 (NC_000911) hypothetical protein [Synechocystis sp. PCC 6803]
- gi|6226451sp|P74662|YF47_SYNY3 Hypothetical protein sll1547
- gi|7430077|pir||S76868 hypothetical protein - Synechocystis sp. (strain PCC 6803)
- gi|1653870|dbj|BAA18780.1| (D90917) ORF_ID:sll1547~hypothetical protein [Synechocystis sp. PCC 6803]
-
- gi|16126678|ref|NP_421242.1|_17:151 (NC_002696) conserved hypothetical protein [Caulobacter crescentus CB15]
- gi|13423984|gb|AAK24410.1| (AE005913) conserved hypothetical protein [Caulobacter crescentus CB15]
-
-
-
- gi|15618161|ref|NP_224446.1|_7:144 (NC_000922) YggF Family hypothetical protein [Chlamydophila pneumoniae CWL029]
- gi|15835772|ref|NP_300296.1| (NC_002491) YggF family hypothetical protein [Chlamydophila pneumoniae J138]
- gi|6226490sp|Q9Z8U7|Y237_CHLPN Hypothetical protein CPn0237/CP0525/CPj0237
- gi|7445450|pir||B72103 yggf family hypothetical protein - Chlamydophila pneumoniae (strain CWL029)
- gi|4376511|gb|AAD18390.1| (AE001609) YggF Family hypothetical protein [Chlamydophila pneumoniae CWL029]
- gi|8978610|dbj|BAA98447.1| (AP002545) YggF family hypothetical protein [Chlamydophila pneumoniae J138]
-
- gi|16752801|ref|NP_445069.1|_10:147 (NC_002179) conserved hypothetical protein [Chlamydophila pneumoniae AR39]
- gi|11278850|pir||D81568 conserved hypothetical protein CP0525 [imported] - Chlamydophila pneumoniae (strain AR39)
- gi|7189439|gb|AAF38349.1| (AE002211) conserved hypothetical protein [Chlamydophila pneumoniae AR39]
-
- gi|15835074|ref|NP_296833.1|_8:144 (NC_002620) conserved hypothetical protein [Chlamydia muridarum]
- gi|14195475sp|Q9PKK9|Y456_CHLMU Hypothetical protein TC0456
- gi|11278851|pir||E81700 conserved hypothetical protein TC0456 [imported] - Chlamydia muridarum (strain Nigg)
- gi|7190501|gb|AAF39309.1| (AE002314) conserved hypothetical protein [Chlamydia muridarum]
-
- gi|17545395|ref|NP_518797.1|_15:142 (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
- gi|17427687|emb|CAD14206.1| (AL646060) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
-
- gi|15609691|ref|NP_217070.1|_22:156 (NC_000962) hypothetical protein Rv2554c [Mycobacterium tuberculosis H37Rv]
- gi|15842092|ref|NP_337129.1| (NC_002755) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
- gi|6226486sp|P94999|YP54_MYCTU Hypothetical protein Rv2554c
- gi|7430079|pir||F70660 hypothetical protein Rv2554c - Mycobacterium tuberculosis (strain H37RV)
- gi|1781048|emb|CAB06184.1| (Z83863) hypothetical protein Rv2554c [Mycobacterium tuberculosis H37Rv]
- gi|13882373|gb|AAK46943.1| (AE007097) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
-
- gi|21243644|ref|NP_643226.1|_9:145 (NC_003919) conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306]
- gi|21109221|gb|AAM37762.1| (AE011934) conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306]
-
- gi|15838818|ref|NP_299506.1|_9:146 (NC_002488) conserved hypothetical protein [Xylella fastidiosa 9a5c]
- gi|11135885sp|Q9PBB7|YM27_XYLFA Hypothetical protein XF2227
- gi|11278846|pir||A82584 conserved hypothetical protein XF2227 [imported] - Xylella fastidiosa (strain 9a5c)
- gi|9107376|gb|AAF85026.1|AE004035_5 (AE004035) conserved hypothetical protein [Xylella fastidiosa 9a5c]
-
- gi|15924606|ref|NP_372140.1|_2:121 (NC_002758) conserved hypothetical protein [Staphylococcus aureus subsp. aureus Mu50]
- gi|15927196|ref|NP_374729.1| (NC_002745) conserved hypothetical protein [Staphylococcus aureus subsp. aureus N315]
- gi|13701414|dbj|BAB42708.1| (AP003134) conserved hypothetical protein [Staphylococcus aureus subsp. aureus N315]
- gi|14247387|dbj|BAB57778.1| (AP003362) conserved hypothetical protein [Staphylococcus aureus subsp. aureus Mu50]
-
- gi|19552843|ref|NP_600845.1|_15:158 (NC_003450) COG0816:Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) [Corynebacterium glutamicum]
- gi|21324400|dbj|BAB99024.1| (AP005279) Predicted endonuclease involved in recombination (possible Holliday junction resolvase in Mycoplasmas and B. subtilis) [Corynebacterium glutamicum ATCC 13032]
-
- gi|21672923|ref|NP_660988.1|_6:140 (NC_002932) conserved hypothetical protein [Chlorobium tepidum TLS]
- gi|21645979|gb|AAM71330.1| (AE012788) conserved hypothetical protein [Chlorobium tepidum TLS]
-
- gi|21232178|ref|NP_638095.1|_10:145 (NC_003902) conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913]
- gi|21113933|gb|AAM42019.1| (AE012388) conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913]
-
-
- gi|13508157|ref|NP_110106.1|_2:136 (NC_000912) conserved hypothetical protein [Mycoplasma pneumoniae]
- gi|11135617sp|P57114|Y29A_MYCPN Hypothetical protein MG291.1 homolog
- gi|11379549|gb|AAG34748.1|AE000041_6 (AE000041) conserved hypothetical protein [Mycoplasma pneumoniae]
-
- gi|15829027|ref|NP_326387.1|_4:141 (NC_002771) conserved hypothetical protein [Mycoplasma pulmonis]
- gi|14089971|emb|CAC13729.1| (AL445565) conserved hypothetical protein [Mycoplasma pulmonis]
-
- gi|12045147|ref|NP_072958.1|_3:136 (NC_000908) conserved hypothetical protein [Mycoplasma genitalium]
- gi|6226357sp|Q9ZB76|Y29A_MYCGE Hypothetical protein MG291.1
- gi|3844876|gb|AAC71518.1| (U39709) conserved hypothetical protein [Mycoplasma genitalium]
-
- gi|19704033|ref|NP_603595.1|_2:135 (NC_003454) DNA integration/recombination/invertion protein [Fusobacterium nucleatum subsp. nucleatum ATCC 25586]
- gi|19714225|gb|AAL94894.1| (AE010580) DNA integration/recombination/invertion protein [Fusobacterium nucleatum subsp. nucleatum ATCC 25586]
-
- gi|15231708|ref|NP_190859.1|_399:536 (NM_115151) putative protein [Arabidopsis thaliana]
- gi|11357882|pir||T47548 hypothetical protein F8J2.80 - Arabidopsis thaliana
- gi|7529715|emb|CAB86895.1| (AL132969) putative protein [Arabidopsis thaliana]
-
- gi|13357930|ref|NP_078204.1|_1:144 (NC_002162) conserved hypothetical [Ureaplasma urealyticum]
- gi|11356807|pir||E82899 conserved hypothetical UU370 [imported] - Ureaplasma urealyticum
- gi|6899352|gb|AAF30779.1|AE002134_3 (AE002134) conserved hypothetical [Ureaplasma urealyticum]
-
- gi|15672125|ref|NP_266299.1|_2:113 (NC_002662) HYPOTHETICAL PROTEIN [Lactococcus lactis subsp. lactis]
- gi|12722992|gb|AAK04241.1|AE006252_3 (AE006252) HYPOTHETICAL PROTEIN [Lactococcus lactis subsp. lactis]
-
- gi|15791995|ref|NP_281818.1|_1:125 (NC_002163) hypothetical protein Cj0635 [Campylobacter jejuni]
- gi|11135891sp|Q9PHN2|Y635_CAMJE Hypothetical protein Cj0635
- gi|11346670|pir||B81412 hypothetical protein Cj0635 [imported] - Campylobacter jejuni (strain NCTC 11168)
- gi|6968102|emb|CAB75271.1| (AL139075) hypothetical protein Cj0635 [Campylobacter jejuni subsp. jejuni NCTC 11168]
-
- gi|15611385|ref|NP_223036.1|_2:128 (NC_000921) putative [Helicobacter pylori J99]
- gi|7464680|pir||E71945 hypothetical protein jhp0317 - Helicobacter pylori (strain J99)
- gi|4154855|gb|AAD05908.1| (AE001468) putative [Helicobacter pylori J99]
-
- gi|15644962|ref|NP_207132.1|_2:128 (NC_000915) conserved hypothetical protein [Helicobacter pylori 26695]
- gi|7445451|pir||F64561 conserved hypothetical protein HP0334 - Helicobacter pylori (strain 26695)
- gi|2313435|gb|AAD07403.1| (AE000551) conserved hypothetical protein [Helicobacter pylori 26695]
-
- gi|15644293|ref|NP_229345.1|_12:122 (NC_000853) conserved hypothetical protein [Thermotoga maritima]
- gi|7462233|pir||E72243 conserved hypothetical protein - Thermotoga maritima (strain MSB8)
- gi|4982112|gb|AAD36612.1|AE001800_22 (AE001800) conserved hypothetical protein [Thermotoga maritima]
-
- gi|13194228|gb|AAK15446.1|AC037426_8_73:207 (AC037426) hypothetical protein [Oryza sativa (japonica cultivar-group)]
-
- gi|15807494|ref|NP_296229.1|_16:134 (NC_001263) conserved hypothetical protein [Deinococcus radiodurans]
- gi|11136112sp|Q9RRI2|YP09_DEIRA Hypothetical protein DR2509
- gi|7471460|pir||D75265 conserved hypothetical protein - Deinococcus radiodurans (strain R1)
- gi|6460331|gb|AAF12050.1|AE002080_4 (AE002080) conserved hypothetical protein [Deinococcus radiodurans]
10 20 30 40 50
| | | | |
1 MSGTLLAFDFGTKSIGVAVGQRI.TGTARPLPAIKAQ.....DGTPDWNIIERLLKE..WQPDEI
2 -SGTLLAFDFGTKSIGVAVGQRI.TGTARPLPAIKAQ.....DGTPDWNLIERLLKE..WQPDEI
3 -SGTLLAFDFGTKSIGVAVGQRI.TGTARPLPAIKAQ.....DGTPDWNIIERLLKE..WQPDEI
4 --DTLLAFDFGTKSIGVAIGQRI.TGTARPLPAIKAQ.....DGTPDWTLIERLLKE..WQPDEI
5 -SGTLLAFDFGTKSIGVAVGQRI.TGTARPLPAIKAQ.....DGTPDWNIIERLLKE..WQPDEI
6 -SGTLLAFDFGTKSIGVAIGQRI.TGTARPLPAIKAQ.....DGTPDWTLIERLLKE..WQPDEI
7 --MRILGLDVGTKTVGVAISDEM.GWTAQGLETIKINee...RGQFGFDRISELVKQ..YDVDKI
8 --MRVLGLDVGDKTIGVAISDVS.STIAQGITTIRRK.....SFVEDVKAIEEIVKK..YSVEKV
9 --MRTLGLDVGTKTIGIAVSDAL.GWTAQGLETWRRSda...NEQADFEHIASLVKE..HEVTTI
10 --MRILGIDVGNKTIGVALSDPL.GFTAQGITTIRRK.....NEEEDIKELKELCEK..YEVDTI
11 --MRIMGLDVGSKTVGVAISDPL.GWTAQGVETIQIDes...RKQFGYDRVKELVLE..YEVEKV
12 --MRILGLDLGTKTLGVALSDEM.GWTAQGIETIKINea...EGDYGLSRLSELIKD..YTIDKI
13 --MRIMGLDVGSKTVGVAISDPL.GWTAQGVETIQIDen...RKQFGYDRVKELVLE..YEVEKV
14 --RTIIAFDFGTKSIGVAIGQEV.TGTARALTAFKAQ.....DGTPDWQQVEKLLKE..WQPNLV
15 --MRILGLDIGSKTIGVAVSDPL.GWTAQGVTTIKRD.....CYTKDVEAVMKICKE..YGVETI
16 --MRIMGLDVGSKTVGVAISDPL.GFTAQGLEIIKIDee...KAEFGFTRLEELVKQ..YQVEQF
17 --MRIMGLDVGSKTVGVAISDPL.GFTAQGLEIIKIDee...KAEFGFTRLEELVKQ..YQVEQF
18 MGMTVLAFDFGTKSIGCAVGQSI.TGTAQALPAFKAQ.....DGIPNWDAIGKCLAE..WQPDRV
19 --MRIMGLDVGSKTVGVAISDPL.GFTAQGLEIIQINee...QGQFGFDRVKELVDT..YKVERF
20 --MIVIAFDFGIKKIGVAVGENI.TKKGRPLSVLNAQ.....NGCPNWQLVKNLIQY..WQPQFI
21 --GALAGLDLGTKTIGVAVSDTL.RGIATPLRTIRRE.....KFTLDVADLMKTVAE..RQIAGF
22 --RSIMGFDYGTKSIGVAIGQEI.TGSARPLRSLKAN.....DGIPNWDEIEKLLKE..WQPDLL
23 --RLLLGFDYGTKQIGVAVGQVI.TGQARELCTLKAQ.....NGVPDWNQVEALIKE..WKPDAV
24 --HKILGLDVGSRTVGIAISDIM.GWTAQGLDTLRINee...NNELGIDQLVDIIKK..HNVGTV
25 --RSIMGFDYGTKSIGVAIGQEL.TGTGQPLRAIKAN.....DGIPNWDDIDKLLKE..WQPDLL
26 -LRLLLGFDYGTRQIGVAVGQAV.TGQARELCVLKAQ.....NGVPDWNRVEALIKE..WQPDAI
27 --RTLAGLDLGDKTIGVAVSDRG.FAFAHPRPVIMRR.....KFSLDAAVLLALLKK..ENVGAV
28 -GITALAFDFGTKSIGCAIGQSI.TGTAQALPAFKAQ.....DGIPNWEAIEKCLKE..WKPDVV
29 --ISALGLDVGRKRVGVAGCDRT.GLIATGITTVERT.....SFDRDVQQIQNIVNE..RQVQVL
30 --MIVIAFDFGLKNIGVAVGENI.LKKGRALNKLSAK.....NGSPDWNNIKNLLKI..WQPKFL
31 -GQTVAGLDLGTKTIGLAVSDLG.LSFAHPRPVIKRV.....KFTIDAQVLLKALET..DKVGVI
32 --QAIAGLDLGTKTIGLAMSDLS.RRFATPRPVIKRV.....KFTQDAQVLLAFAEK..EKVAAF
33 -YQSVAGLDLGTKTIGISVSDLG.RRFATPREVIRRV.....KFGADAQALLSFAEK..EKIAAF
34 PNKPLIAIDYGSKKIGVALSDQE.LSIAMPFNTITAV.....NKKVVITSLLNIIKK..YKVCGI
35 -NVPLIAIDYGNKKLGIALSNQE.RSIAMPLNTITEI.....NKKIVITSLLNIIEK..YKVCGV
36 -KGTALAFDFGEARIGVAQGDAE.LGLSHPLSTVTGG.....SNDEKFAAIAKLVQE..WQPRYF
37 -KGTALAFDFGEARIGVAQGDAE.LGLSHPLATVTGG.....SNDEKFAAIAKLVQE..WQPRYF
38 -SRTVMAFDYGTKSIGSAIGQEI.TGTASPLKAFKAN.....DGIPNWDEIEKQIKE..WQPNLL
39 --GRRLGIDVGSVRIGVAFSDPD.GILATPVETVRRY.....RSAKHLRRLAELVVE..LQVVEV
40 --GRRLAVDVGDARIGVASCDPD.GILATPVETVPGR.....DVPAAHRRLRQLVAE..YEPIEV
41 --VAALGLDVGRKRIGVAGCDGT.GLIATGITTIVRS.....SYDQDIAQIKQLVEE..RNVNLL
42 --AAVVGLDPGEKTIGVAVSDVT.RTVASPLALIEKT.....KFSKDAEQLFKLMDS..RGAVAI
43 --QAFLGIDYGKKRIGLAFASSP.LLIPLPIGNVEARs....SLTLTAQALVSIIKE..RAVTTV
44 --QAFLGIDYGKKRIGLAFASSP.LLIPLPIGNVEARs....SLTLTAQALVSIIKE..RAVTTV
45 -CKAYLGIDYGKKRIGLAYAAEP.LLLTLPIGNIEAGk....NLKLSAEALHKIILS..RNITCV
46 -CKAYLGIDYGKKRIGLAYAAEP.LLLTLPIGNIEAGk....NLKLSAEALHKIILS..RNITCV
47 --EAFLGVDYGKKRIGLAFASAP.LLITLPIGSINTCs....SLALTAQALITIIKE..RAVTTV
48 -HGTLLAFDYGEKRIGVALGNSI.TRSARALEVIPNR.....SVEYRFTQITRLVNA..WQPVGF
49 --GRRLGIDVGAARIGVACSDPD.AILATPVETVRRD.....RSGKHLRRLAALAAE..LEAVEV
50 PDGTVLGFDVGSRRIGVAVGSAL.GAGARAVAVINVH.....ANGPDWVALDRVHKQ..WRPDGL
51 PDGIVLGFDVGTRRIGVAVGSAW.GAGARAVAVIDVH.....GVAVDWNALDRVKRN..WLPVGL
52 -------------------SDIM.GWTAQGLDTLRINee...NNELGIDQLVDIIKK..HNVGTV
53 -PGRRLGLDVGTVRIGVAASDRD.AKLAMPVETVPREtgfkgPDLADIDRLVAIVEE..YNAVEV
54 -HKRIIGIDFGTKRIGVALSDPL.RMFAQPLGTFDME.....GL---VRVLSRVRDD..EGIELV
55 -DATVLGFDVGSRRIGVAVGTAL.GAGARAVAVINVH.....ANGPDWVALDRVHKE..WRPAGL
56 --MKVLAVDYGTKRVGLAIGDEE.LKIVSPKGTVSSK.....E---AVKKIKEIVEK..SRVGKV
57 --QYILGIDFGLKRIGTALVNTI.DRFPSPFRVFAVQn....NLQQAVNTLFKDLKQagYELVQI
58 -VMRKIALDLGTKSCGFAISDPIsNSFAIPLENFFFEen...NFKKVINKLKHYEKK..YEFDTI
59 ---YILAIDFGLKKIGTAIANTL.DKYPSAFHVFEVKn....NFKTAVNNLFLRIKNdgYELEKI
60 --KRYIALDIGDVRIGVARSDIM.GIIASPLETINRK.....KVK-SVKRIAEICKE..NDTNLV
61 -PGRFLGLDVGDKYVGLAISDPS.NMVASPLSVLLRKks...NIDLMATDFQNLVKA..FSVSGL
62 --MRKLALDLGTKSCGFAISDLL.GIIASGLDNFIYEen...DFTAVLAKIDEIMINyhHEIDTI
63 -----------------------.--TAQPVETIKIDse...AGELGFERLGVLIKE..YKPEKV
64 --MRALALDVGLKRIGVALC-ID.KKIALPLDAVLRK.....NRNQAANEIKNLLKI..HEISLL
65 ----ILACDVGLKRIGIAA--LL.NGVILPLEAILRH.....NRNQASRDLSDLLRK..KDIQVL
66 ----ILACDVGLKRIGIAA--LL.NGVILPLEAILRH.....NRNQASRDLSDLLRE..KNIQVL
67 --KVIVAVDYGERKCGVAFG---.-------EILPQK.....SLVIPTKNLKEFIRK..LKPDKI
68 -CGFSLGVDLGEARTGVAVGRGI.T-LPRPLTVLKLR.....GQKLEL-MLLDIAQQ..QEADEL
69 --PTVLALDVSKSRIGFAVS--A.GRLAFGRGSVDRK.....RLPLDLKAVRLKVEE..TGAERL
60 70 80 90 100 1
| | | | |
1 IVGLPLN.M.DGTE.QPL..TARARKFANRIHGRF.....GVEVKLHDERLSTVEAR.SGLF...
2 IVGLPLN.M.DGTE.QPL..TARARKFANRIHGRF.....GVEVKLHDERLSTVEAR.SGLF...
3 IVGLPLN.M.DGTE.QPL..TARARKFANRIHGRF.....GVEVKLHDERLSTVEAR.SGLF...
4 IVGLPLN.M.DGTE.QPL..TARARKFANRIHGRF.....GVTVTLHDERLSTVEAR.SGLF...
5 IVGLPLN.M.DGTE.QPL..TARARKFANRIHGRF.....GVEVKLHDERLSTVEAR.SGLF...
6 IVGLPLN.M.DGTE.QPL..TARARKFANRIHGRF.....GVTVTLHDERLSTVEAR.SGLF...
7 VVGLPKN.M.NGTI.GPR..GEACQQFAENLRELL.....QLDVVMWDERLSTMAAE.RLLI...
8 VVGLPKN.M.NGSI.GPQ..GEKVIKFGEKLREVL.....RIPVVFWDERLTTLQAE.RFLIe..
9 VIGLPKN.M.DGSI.GPS..GERSETFAAELRRYV.....PCEIVMWDERLTTTAAE.RMLI...
10 VCGLPKN.M.NGTI.GFQ..SEKVLGFCEVIKQNI.....NVPIKMWDERLTTVTAN.RAML...
11 VVGLPKN.M.NNTI.GPR..AESSKIYAEVLESRI.....GLPVVLWDERLTTSAAE.RTLI...
12 VLGFPKN.M.NGTV.GPR..GEASQTFAKVLETTY.....NVPVVLWDERLTTMAAE.KMLI...
13 VVGLPKN.M.NNTI.GPR..AESSKIYAEVLEARI.....GLPVVLWDERLTTSAAE.RTLI...
14 VVGLPLN.M.DGTE.QPL..TARARRFANRLHGRF.....GVQVALQDERLSTVEAR.ANLF...
15 VAGMPKN.M.NGTI.GPS..GEMVKNLCEQIEKSF.....DGKIEFWDERLTTVAAH.RAML...
16 VIGLPKN.M.NNTN.GPR..VDASITYGNHIEHLF.....GLPVHYQDERLTTVEAE.RMLIe..
17 VIGLPKN.M.NNTN.GPR..VDASITYGNHIEHLF.....GLPVHYQDERLTTVEAK.RMLIe..
18 VVGLPLN.M.DGTE.QDL..TVRARKFAHRLHGRF.....GVAVELQDERLTTTEAR.AEIF...
19 VVGLPKN.M.NNTS.GPR..VEASQAYGAKLEEFF.....GLPVDYQDERLTTVAAE.RMLIe..
20 VVGLPLN.I.NGTK.QKI..TNKSEKFANLLKYKF.....NIVVKMHDERLTTVEAK.SIIF...
21 VLGLPVN.M.DGSE.GPR..AQSTRAFARNLEKLT.....PLPITFWDERLSTVAAE.RAML...
22 VVGLPLN.M.DGTD.QEI..TVRARKFGNRLHGRF.....GKPVEFKDERLTTTDAR.ARLF...
23 VVGLPLN.M.DGTP.SDM..CLRAEKFARRLNGRY.....NLPFYTHDERLTTFEAKgERLV...
24 VIGLPKN.M.NNSI.GFR..GEASLTYKEKLLEAYp....SIEIVMWDERLSTMAAE.RSLL...
25 VVGLPLN.M.DGTE.QEI..TVRARKFGNRLHGRF.....WQAVEFKDERLTTTDAR.ARLF...
26 VVGLPLN.M.DGSP.SEM..SERAEKFGRRLNGRF.....NLPVFTHDERLTTYAAK.GERL...
27 AIGLPIN.M.DGSE.GPR..AQKSRAFVRNMAQLS.....DLPFVFWDERLSTVAAE.RTLI...
28 IVGLPLN.M.DGTE.QDL..TLRARKFANRLQGRF.....GVNVHLQDERLTTTQAR.SEIF...
29 VVGLPYS.M.DGSL.GFQ..ARQVQKFTSRLAKAL.....QLPVEYVDERLTSFQAE.QMLI...
30 VVGLPLN.I.DGTR.QDI..TKKAEKFAFLLKYKF.....NIFVYLHDERLSTKEAK.SLIF...
31 VIGLPMN.M.DGTA.GPR..VQATRAFVRTMQPLT.....DLPFVFWDERLSTVAVE.RALI...
32 VIGLPIN.M.DGSA.GPR..AQATRAFVRTMGEKT.....ALPFIYWDERLSTVAAE.RALL...
33 IIGLPVN.M.DGSE.GPR..CQATRAFVRNMGEKT.....DIPFVLWDERLSTVAAE.RVLI...
34 VIGLPID.M.SGGV.TQQ..TNIVMKFAEKLKQSL.....GLPIYLQDERLTTKSAN.NFLK...
35 IIGLPID.M.SGAV.TEQ..TNIVMKFAEELAKSI.....NLPIYLQDERLTTKAAN.NLLK...
36 VVGLPVH.T.DGTK.HEM..THLSRKFGRRLNGRF.....NLPVYWVDERLSSVYAE.SLLS...
37 VVGLPVH.T.DGTK.HEM..THLSRKFGRRLNGRF.....NLPVYWVDERLSSVYAE.SLLS...
38 IVGLPTD.L.HGKDlDTI..TPRAKKFAQRLHGRF.....GLPVELHDERLSTTEAR.AELF...
39 VVGLPWT.L.TDRT.GSS..AKDAIDTAEALARRVa....PVPVRLVDERLTTVSAQ.RLLR...
40 VVGLPRS.L.KGGE.GPA..AAKVRRFTQELAKGIa....PVPVRLVDERMTTVTAS.QGLR...
41 VVGLPYT.M.AGEI.GSQ..AKQVQKFARRVAEQL.....HLPLEYMDERLSSVEAE.NQLK...
42 VIGLPMN.M.DGTE.GVR..CQSNRALGRNLLRLKp....DLPITFWDERLSTAAVT.RVLId..
43 VFGNPLP.M.QKAYaSSV..QSEIQELAALIQEMT.....AIEVILWDERLSSAQAE.RMLKs..
44 VFGNPLP.M.QKAYaSSV..QSEIQELAALIQEMT.....AIEVILWDERLSSAQAE.RMLKs..
45 VLGNPLP.MqKGLY.SSL..QEEVSLLAEELKKLS.....TVEIILWDERLSSVQAE.RMLKq..
46 VLGNPLP.MqKGLY.SSL..QEEVSLLAEELKKLS.....TVEIILWDERLSSVQAE.RMLKq..
47 VFGNPLP.M.QKSYaSSV..QSEIQELAALVRDMT.....SLEVILWDERLSSAQAE.RMLKn..
48 VVGMPVH.P.EGED.QPM..IKLAKRFGNQLHGRY.....GLPVTWVDERYSSIAAQ.DA--...
49 IVGLPRT.L.ADRI.GRS..AQDAIELAEALARRVs....PTPVRLADERLTTVSAQ.RSLR...
50 VVGDPLT.L.DDKD.QPA..RKRAHAFARQLRERY.....ALPVVLIDERSSSVEAA.QRFArer
51 VVGDPLT.L.EGHD.QPI..RKQAQAFACQLRERY.....RLPVVLVDERSSSVEAA.SRFAgar
52 VIGLPKN.M.NNSI.GFR..GEASLTYKEKLLEAYp....SIEIVMWDERLSTMAAE.RSLL...
53 IVGLPTD.L.QGNG.SAS..VKHAKEIAFRVRRRLtnagkNIPVRLGDERLTTVVAT.QALR...
54 VVGYPMS.D.KGEE.NRM..TGVIDRFVAELRESFp....GTLIETFDEHRSSRTAM.KILA...
55 VVGDPLT.L.DDKD.QPA..RKRAHAFARELRERY.....ALPVVLIDERSSSVEAA.QRFArer
56 IVGLPLT.P.SGKE.GQR..AKLVKDFVEKLREELp....NTEIILWDERWTTAEAM.RRLE...
57 VIGFPHF.-.-HYQ.SSI..QVSIHKFVELIKTRF.....NVPVTLIDESGTTSEVK.ANLQ...
58 ILGYPLR.M.TGTR.SER..SIMVEEFEKLLRKTF.....SQRIVLVDERLSTVKAK.AMLK...
59 VIGFPKF.-.-HYY.SDI..QKAIKSFKQLLEKRF.....NLPIILVDESNTTSAVK.DKLI...
60 VVGIPKS.L.DGEE.KRQ..AEKVREYIEKLKKEIe....NLEIIEVDERFSTVIAD.NILK...
61 VVGYPFG.K.LNNV.EDVvtVNLFIEELRKTEKLK.....DVKYTYWDERLSSKTVE.LMLK...
62 VLGYPTNvY.DGSK.NER..TYLIESFYALLKQHFlnhe.KIKIVYEDERFSTKIAT.QRLK...
63 VLGLPKH.M.NGDE.GIR..AEASRDYGTKLANEF.....GLEVAYQDERLTTAQAE.KVLI...
64 IVGIPKG.-.GSSE.EEM..TRRIKHFVSLLE--F.....DKEICFVDESGTSKEAL.GYGV...
65 VVGKP--.-.NESY.ADT..HARIEHFIKLVD--F.....KGEIVFINEDNSSVEAY.ENLE...
66 VVGKP--.-.HESY.ADT..NARIEHFIKLVD--F.....KGEIVFINEDRSSIEAY.ENLE...
67 IFGLPLS.M.SGKY.TQQ..TFKTVAVAFKFS--K.....EYETYLCDERLTTKIGE.RI--...
68 IVGLPVS.A.DGSE.TPQ..SNKVRSVVGRLAVQAadr..GLRVYLQDEYGTTIDAL.EFMI...
69 VLGLPLR.T.DGKP.SPT..ADRVRAFGRVLMD-K.....GYTVEYQDERFTTQRAR.AL--...
10 120 130
| | |
1 EQGGYRAL...N..KGKVDSASAVIILESYFEQGY
2 EQGGYRAL...N..KGKIDSASAVIILESYFEQGY
3 EQGGYRAL...N..KGKVDSASAVIILESYFEQGY
4 ERGGYRAL...N..KGKVDSASAVIILESYFEQGY
5 EQGGYRAL...N..KGKVDSASAVIILESYFEQGY
6 ERGGYRAL...N..KGKVDSASAVIILESYFEQGY
7 SADVSRKK...R..KQVIDKMAAVVILQGFLD---
8 GVDMSRGK...R..KKVIDKLAATIILQSYLDS--
9 SADVSRKK...R..KSVIDKMAAVMILQGYLDR--
10 EADLSRKK...R..KKLVDKVAATYILQGYLN---
11 EADVSRKK...R..KEVIDKLAAVMILQSYLD---
12 AADVSRQK...R..KKVIDKMAAVMILQGYLD---
13 EADVSRKK...R..KEVIDKLAAVMILQSYLD---
14 DRGGYRAL...D..KGSVDAASAVIILESWFDE--
15 EADLSRAK...R..KKIVDKIAATYILQGYLD---
16 QADISRGK...R..KKVIDKLAAQLILQNYLNRN-
17 QADISRGK...R..KKVIDKLAAQLILQNYLNRN-
18 GRGGYKAL...N..KSKVDGISACLILESWFEN--
19 QADISRNK...R..KKVIDKLAAQLILQNYLDRK-
20 KKNGFKGL...K..EEKIHSCAAVIILESWFN---
21 EADLSRKR...R..AELVDHVAAGFILQGALD---
22 ERGGYKAL...D..KGSVDGVSAQLIVEAWMEEQY
23 RGGQKGSY...R..DNPVDAIAAALLLQGWLD---
24 EADVSRQK...R..KQVIDKMAAVFILQGYLDS--
25 ERGGYRAL...E..KGSVDGVSAQLILEAWMEEQY
26 AQGQRDGY...R..ERPVDALAAALLLEGWL----
27 EMDFSRAK...R..AGKIDSAAAAFILQGVLD---
28 ERGGFKAL...K..KGKIDGVSACLILESWFE---
29 AENVSPSR...N..KGLIDRKAAALILQQWLD---
30 KKNGFKVL...K..KEKIHSVAAVIILESWFNQ--
31 GMDVSRGK...R..ADRIDSAAAAFILQGALDR--
32 EMDVSRAK...R..AERIDSAAASFILQGALD---
33 EMDVSRKK...R..AERIDSAAASFILQGALD---
34 SFGIKRKD...R..NHNDDAVAASMILEIVL----
35 SFGVKRKD...R..NNNDDAVAASMILETVLD---
36 EAQVFGKK...R..KSVLDQVAAQAILHGFFEG--
37 EAQVFGKK...R..KSVLDQVAAQAILHGFFEG--
38 AMGGYKAL...S..KGNVDCQSAVIILESWFESQ-
39 AAGVRAKD...Q..RAVIDQAAAVVILQNWLDQ--
40 ASGVKSKK...G..RSVIDQAAAVIILQQALES--
41 ARKRFSSY...D..KGLIDQQAAEIILQQWLD---
42 EHDISRKR...R..DEVVDKMAAGWILQGALE---
43 DCGLNRKQ...R..KNPSDSLAATLILSSFLDS--
44 DCGLNRKQ...R..KNSSDSLAATLILSSFLDS--
45 DCGLSRKD...R..KGKTDSLAATLILTSFLDS--
46 DCGLSRKD...R..KGKTDSLAATLILTSFLDS--
47 DCGLSRKQ...R..KNSSDSLAATLILSSFLDS--
48 ------GA...T..DDVLDAEAARIILQQFFDES-
49 QAGVRASE...Q..RAVIDQAAAVAILQSWLDE--
50 ADGRKRRR...D..AEALDAMAAAVIVERWL----
51 AAGYKRRR...D..ADTLDAIAAAVILERWLA---
52 EADVSRQK...R..KQVIDKMAAVFILQGYLDS--
53 ASGVSEKA...G..RKVIDQAAAVEILQTWLD---
54 ASGSSRKK...RneKGRLDTAAACLILQGYLDS--
55 ADGRKRRR...D..ADTLDAMAAAVIVERWL----
56 --GLPPKK...K..KELKDVISAMIILEEYLN---
57 ELGLKNRT...F..KKAKDTLAATLILERFLNQ--
58 ETNISQSK...I..KEKKDSMAAALLLNYYLNN--
59 TMDLKHKD...F..KKAKDTLAAVLILERFFQN--
60 ELNKNGAIe..K..RKVVDKVAASIILQTYLD---
61 PLNLHPVQ...E..KTMLDKLAAVVILQEYLD---
62 NSCVKAAK...I..KKVKDKMSAVVILESYLSKN-
63 DGGVRRKE...R..KKSIDKLAAVLILQNYLD---
64 A---NTRK...K..DGKLDSLSAFIMIKDYF----
65 HLGKKNKRiatK..DGRLDSLSACRILERYCQ---
66 HLGKKNKRlaiK..DGRLDSLSACRILERYCQ---
67 --------...-..SKRDDAVSAALIFQSFFEN--
68 SRGVKRSA...R..DVKSDAYSAMMILERYFS---
69 --------...G..AADEDEAAAVQILELWL----