(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0399
              10        20        30        40        50        60     
              |         |         |         |         |         |     
   1 MSMKKTKNWLLVFLTVTFCFLMLGCQSKEDKKGGTKPSNEAALTKTENLDFRLSFNKIKVTTDQN
   2 ---KKHLLTLLLISFFTSFLVACSTTKDKEPQPSDSEIITPRLHQAAHQDKRANFEKIKLATVDS
   3 ---KKHLLTLLLISFFTSFLVACSTTKDKEPQPSDSENITPRLHQAAHQDKRANFEKIKLATVDS
   4 ---KKHLLTLLLISFFTSFLVACSTTKDKEPQPSDSEIITPRLHQAAHQDKRANFEKIKLATVDS
   5 ---KKHLLTLLLISFFTSFLVACSTTKDKKPQPSDSEIITPRLHQAAHQDKRANFEKIKLATVDS
   6 MIPLKKITVGTIICLSFLGLTACSSSNTQQTSTSKSNVSQHKNIKADHEELRLKFNKVKLGVKAN
   7 MIPLKKITVGTIVCLSFLGLTACSSSNTQQTSTSKSNVSQHKNIKADHEELRLKFNKVKLGVKAN
   8 ---KKHLLTLLLISFFTSFLVACSTTKDKEPQPSDSEIITPRLHQAAHQDKRANFEKIKLATVDS
   9 ---KKHLLTLLLISFFTSFLVACSTTKDKEPQPSDSEIITPRLHQAAHQDKRANFEKIKLATVDS
  10 -----------IVCLSFLGLTACSSSNTQQTXTSKSNVSQHKNIKADHEELRLKFNKVKLGVKAN
  11 MSMKKTKNWLLVFLTVTFCFLMLGCQSKEDKKGGTKPSNEAALTKTENLDFRLSFNKIKVTTDQN
  12 MLEKKEEPFYKKIWFWIIAIVIVCGVASYGDSKKINITDTKNSAKT---ETKKDDTRKVTYEKFC
  13 MLEKKEEPFYKKIWFWIIAIVIICGVASYGDSKKINITDTKNSDKT---EIKKDDTRKVTYEKFC
  14 MLEKKEEPFYKKIWFWIIAIVIVCGVASYGDSKKINITDTKNSAKT---DTKKDDTRKVTYEKFC
  15 MLEEKDEPFYKKIWFWIIAIVIVFGVASYGGSKKINITDTKNSPKT---EIKKDDTRKVTYEKFC
  16 MLEKKEEPFYKKIWFWIIAMVIVFGVANYGGSKKINIIDTKNSSET---GIKKDDTRKVTYEKFC
  17 MSKKAKKPFYKKFWIWIIAIIIIGGVIGSNGSKKTGNTSNNSSTKK---EAKKEDTRKITYEKFS
  18 MSKRVKKPFYKKFWFWIIAIIIIGGVIGSNGSKKTGNTSNNNSTKK---EAKKEDTRKITYEKFS
  19 MKKKYLVAATGLALLGLSMSACSSNSGSKSDSATTSQQKKKNETISQNKELREKFDQVKVGELSK
  20 MKKKYLVAATGLALLGLSMSACSSNSGSKSDSAKTSQKKKKNETISQNKELREKFDQIKVGELSN
  21 ------------------------------------------------------FNKVKLGVKAN
  22 MKKKYLVAATGLALLGLSLSACSSNSSKGNSEKTTEQSKKKNKVISQNKELREKFDQIKVGNFLS
  23 MIPLKKITVGTIVCLSFLGLTACSSSNTQQTSTSKSNVSQHKNIKADHEELRLKFNKVKLGVKAN
  24 MKSKMKIIFSITMFFALVSVGVIKLNKENAAAYAAKGKALSHHMKNSVKDKKYSMKK------FL
  25 ----------LLIAFAALALTACSSKKATTDSESDKKAASSISSSNVVKQTSNKINLTNYKKIKV
  26 ----------LLIAFAALALTACSSKKATTDSESDKKAASSISSSNVVKQNSNKINLTNYKKIKV
  27 --------------------------SSKNDTKKESSEKKSEDKSKDNSDLKATYDKINVGDIMN
  28 --------------------------SSKNDTKKESSEKKSEDKSKDNSDLKATYDKINVGDIMN
  29 -----FLKIFGIFSTLFIVFFTCKTF--------------ANDLSNSYYTFNNCSPKQITYDNFV
  30 -----VLKIFGIFTTLFIVFFTCRTF--------------ATDLSTSYCTFNNCSPKQITYDNFL
  31 -----------------------------------------------------------------
  32 -----------------------------------------------------------------
  33 -----------------------------------------------------------------
  34 -----------------------------------------------------------------
  35 -----------------------------------------------------------------
  36 -----------------------------------------------------------------
  37 -----------------------------------------------------------------
  38 -----------------------------------------------------------------
  39 MIPLKKITVGTIVCLSFLGLTACSSSNTQQTSTSKSNVSQHKNIKADHEELRLKFNKVKLGVKAN
  40 -----------------------------------------------------------------
  41 -----------------------------------------------------------------
  42 -----------------------------------------------------------------
  43 -----------------------------------------------------------------
  44 -----------------------------------------------------------------
  45 -----------------------------------------------------------------
  46 -----------------VALSLVACGDSTKEASNEKKEXPKQEAKKENKENKESKKKITAADV-E
  47 ---------------VALSLVACGD--STKEASNEKKEEPKQEAKKENKENKESKKKITAADV-E
  48 ---------------------------------DTKKYEEKNDRMPWYKDVNNDSAEV-DYDDYL
  49 ---------------------------------------------------KKDDKK--------


         70        80         90       100                 110        1
         |         |          |         |                   |         
   1 HFSGGTSIEQLKQWFG.DPNKSEQRNAGNITLDSYTW.....VKDG.....AVINAQ.LYKNSTV
   2 SFTGGTSLEELISLFG.EPNQHDPKTAGEVTIDAYTW.....QFDQ.....VTLTVN.LYQNSSI
   3 SFTGGTSLEELISLFG.EPNQHDPKTAGEVTIDAYTW.....QFDQ.....VTLTVN.LYQNSSI
   4 SFTGGTSLEELISLFG.EPSQHDPKTAGEVTIDAYTW.....QFDQ.....VTLTVN.LYQNSSI
   5 SFTGGTSLEELISLFG.EPSQHDPKTAGEVTIDAYTW.....QFDQ.....VTLTVN.LYQNSSI
   6 NFKGGTSLAELKQLFGgEPNEKFDTPAGNVTLKGYRW.....NVDD.....ISITIQ.LLNDSSI
   7 NFKGGTSLAELKQLFGgEPNEKFDTPAGNVTLKGYRW.....NVDD.....ISITIQ.LLNDSSI
   8 SFTGGTSLEELISLFG.EPSQHDPKTAGEVTINAYTW.....QFDQ.....VTLTVN.LYQNSSI
   9 SFTGGTSLEELISLFG.EPSQHDPKTAGEVTIDAYTW.....QFDQ.....VTLTVN.LYQNSSI
  10 NFKGGTSLAELKQLFGgEPNEKFDTPAGNVTLKGYRW.....NVDD.....ISITIQ.LLNDSSI
  11 HFSGGTSIEQLKQWFG.DPNKSEQRNAGNITLDSYTW.....VKDG.....AVINAQ.LYKNSTV
  12 KIKIGSTYEEVKSILG.EYKESKESEINGMKVVIYSW.....YNDDn....SNMEVI.VKNNKVI
  13 KIKIGSTYEEVKSILG.EYKESKESEINGMKVVIYSW.....YNDDn....SNMEVI.VKNNKVI
  14 KIKIGSTYEEVKSILG.EYKESKESEINDMKVVIYSW.....YNDDn....SNMEVI.VKNNKVI
  15 KIKIGSTYEEVKSILG.EYKESKESEIDGMKVVIYTW.....YNDDn....SNMEVI.VKNNKVI
  16 KIKIGSTYEEVKSILG.EYKESKESEINGMKVVIYTW.....YNDDn....SNMEVI.VKNNKVI
  17 KVKMGSTYEDVKNMLG.EAKESTSSEMGGIKTVIYTWd....NGDG.....SNMNVT.FQNNKAL
  18 KVKMGSTYEDVKNMLG.EGKESTSSEMGGIKTVIYTWd....NGDG.....SNMNVT.FQNNKAL
  19 HGEGGSTIQDVEKLLG.KANTTDTTTVESYKTKSYIW.....NKGA.....VTVTVQ.FEPDKVV
  20 HGEGGSTIQDVEKLLG.KANTTDTTTVDSYKTKSYIW.....NKGA.....VTVTVQ.FEPDKAV
  21 NFKGGTSLAELKQLFGgEPNEKFDTPAGNVTLKGYRW.....NVDD.....ISITIQ.LLNDSSI
  22 QGEGGSTIDEVKQLLG.SPTSTTTTSSNGVKLKQLTW.....TKGA.....VTVAIQtLDSNKVV
  23 NFKGGTSLAELKQLFGgEPNEKFDTPAGNVTLKGYRW.....NVDD.....ISITIQ.LLNDSSI
  24 RINMGMNYKSVENILG.VSEKTGQINDKRELMNHQWK.....NSDD.....SYIVVI.TKNNYVV
  25 VEKTGSTPSQVTDLLGrEADAKSKAKTSKLKATIYTWegvqnGSAG.....ANLAVE.FANGVAI
  26 VEKTGSTPSQVTDLLGrEADAKSKAKTSKLKATIYTWegvqnGSAG.....ANLAVE.FANGVAI
  27 SSEGGSTEDEVKAILG.EPASSSTTDIQGISTTTLSW.....TNVKggdllASITVS.FSDGKAA
  28 SSEGGSTEDEVKAILG.EPASSSTTDIQGISTTTLSW.....TNVKggdllASITVS.FSDGKAA
  29 KIEIGSNYNDVIKILG.KESYKDSTIINGLKNTTYIW.....KVNGte...KNIFIT.FRNGIVI
  30 KVEIGSNYNDVVKILG.KESYMDSAIINGLKNTTYIW.....KINGte...KNIFIT.FRNGIVI
  31 ----GSTYEEVKNMLG.EGKESTSSEMGGIKTVIYTWd....NGDG.....SNMNVT.FQNNKAL
  32 ----------------.--------------------.....----.....------.------V
  33 ----------------.--------------------.....----.....------.-------
  34 ----------------.--------------------.....----.....------.-------
  35 ----------------.--------------------.....----.....------.-------
  36 ----------------.--------------------.....----.....------.-------
  37 ----------------.--------------------.....----.....------.YLVKLAI
  38 ----------------.--------------------.....----.....------.-------
  39 NFKGGTSLAELT----.----------------IIWW.....RTNE.....X-----.-------
  40 ----------------.--------------------.....----.....------.-------
  41 ----------------.--------------------.....----.....------.-------
  42 ----------------.--------------------.....----.....------.-------
  43 ----------------.--------------------.....----.....------.-------
  44 ----------------.--------------------.....----.....------.-------
  45 ----------------.--------------------.....----.....------.-------
  46 AIKVGDSL--------.--------------------.....----.....------.-------
  47 AIKVGDSL--------.--------------------.....----.....------.-------
  48 KIKKGMSANQVKEAIG.KPRMIE--KYKSYSVYGYLG.....HGDK.....GSLRII.FDPNGTV
  49 ----------------.--------------------.....----.....------.-------


      20            130       140              150       160       170 
      |              |         |                |         |         | 
   1 ARSISNFSFS.....REAKIGKEDYDELKIG.....ESYKK..VVEKLGEPDVLSQSMSSDKEEM
   2 VKTISNFTFA.....RELGLSQKEYQQLQKG.....MSYED..VKKILTEPDNYSQASSSDHQTL
   3 VKTISNFTFA.....RELGLSQKEYQQLQKG.....MSYED..VKKILTEPDNYSQASSSDHQTL
   4 VKTISNFTFA.....RELGLSQKEYQQLQKG.....MSYED..VKKILTEPDNYSQASSSDHQTL
   5 VKTISNFTFA.....RELGLSQKEYQQLQKG.....MSYED..VKKILTEPDNYSQASSSDHQTL
   6 VRSISNFKFI.....RDANITTKDYNSLKNG.....MSYNK..VKELLGEPDDISQAVSSDKEEL
   7 VRSISNFKFI.....RDANITTKDYNSLKNG.....MSYNK..VKELLGEPDDISQAVSSDKEEL
   8 VKTISNFTFA.....RELGLSQKEYQQLQKG.....MSYED..AKKILTEPDNYSQASSSDHQTL
   9 VKTISNFTFA.....RELGLSQKEYQQLQKR.....MSYED..VKKILTEPDNYSQASSSDHQTL
  10 VRSISNFKFI.....RDANITTKDYNSLKNG.....MSYNK..VKELLGEPDDISQAVSSDKEEL
  11 ARSISNFSFS.....REAKIGKEDYDELKIG.....ESYKK..VVEKLGEPDVLSQSMSSDKEEM
  12 GKAQAGLSMG.....K-ADVNLIKYAKITTG.....MDYSK..VEEILGDGKLMSISKTNGSTKS
  13 GKAQAGLSMG.....K-ADVNLIKYAKITTG.....MDYSK..VEEILGDGKLMSISKTNGSTKS
  14 GKAQAGLSMG.....K-ADVNLIKYAKITTG.....MDYSK..VEEILGDGKLMSISKTNGSTKS
  15 GKAQAGLSMG.....K-ADVNLIKYTKIRTG.....MDYSK..VKEILGDGKLMSISKTNGSTKS
  16 GKAQADLSMG.....K-ADVNLIKYAKITTG.....MDYSK..VEEILGDGKLMSISKTNGSTKS
  17 AKAQAGLSRK.....R-ADVNMEKYNKIQTG.....MDYNK..VKEILGDGELMSISEVGGSNTS
  18 AKAQAGLSRE.....K-ADVNMEKYNTIQTG.....MDYNK..IKEILGAGELMSISEVGGSNTS
  19 TKDITGFKWGk....RDEKLDLAAFNSIQDG.....ASYDD..IVKKYGEPDSLNESLLLGTKTV
  20 TKDITGFKWGk....RDEKLDLAAFNSIQDG.....ATYDD..IVKKYGEPDSLNESLLLGTKTV
  21 VRSISNFKFI.....RDANITTKDYNSLKNG.....MSYNK..VKELLGEPDDISQAVSSDKEEL
  22 SKEITGFKWGk....RDEKLTLGEFNNIADG.....STYQS..LVDKYGEPDGLHEANVAGTKIT
  23 VRSISNFKFI.....RDANITTKDYNSLKNX.....XSYNK..VKELLGEPDDISQA--------
  24 SKIEIGLE-K.....INANISRVKYKKISRG.....MSYEK..VKNLVGKGELLSETNLMGSNSK
  25 AKQISGLVVN.....RSKQITPKDYKSLKKG.....MTQEE..VEAIVGKPNGYSESDYSNSEVV
  26 AKQISGLVVN.....RSKQITPKDYKSLKKG.....MTQEE..VEAIVGKPNGYSESDYSNSEVV
  27 SKSVSGLKVA.....KHDKVTADQVNNIATD.....GSYSEeqARKDLGDPTGITSTNINGEKND
  28 SKSVSGLKVA.....KHDKVTADQVNNIATD.....GSYSEeqARKDLGDPTGITSTNINGEKND
  29 TKEQYKLN-D.....NFSVLSSEKISSLKKG.....MSYDE..AKNILGEGILRSEEKGIKVFTW
  30 TKEQYKLN-D.....KFSILSNKKINSLKKG.....MSYNE..VKNILGEGILRSEEK----ETQ
  31 AKAQAGLSRE.....R-ADVNAEKYNRIQTG.....MDYNK..VKEILGDGE-------------
  32 SPEAPNTSGD.....NETVITKENYDKVKNG.....MSYEE..VVKIIGSEGEIVTETGEKGDDM
  33 ----------.....GCSKLTMENYSKIKTG.....IGYTE..VVKILGKPDNCSEA--------
  34 ------LPLA.....GCNKLTPENYDKLRMG.....MHYAE..VKSILGEPTRCSD---------
  35 ---------L.....GCSKLTQENYAKLKMG.....LGYGE..VVTILGKPDSCSEA--------
  36 ----------.....GCSKLTTENYEKIKMG.....MDYGE..VAGILGKPDSCSEA--------
  37 AIVLSAFLFG.....CFSKINQENYAKIENG.....MTMEQ..VKDILGEPTESQTAGIGSLSGT
  38 ----------.....--SKVTVENYDKIRVG.....MTYDE..VKQLLGAPNRCSDV--------
  39 ----------.....----------------.....-----..----------------------
  40 ----------.....---------YEDLKNG.....TSYDN..AVKILGVPDVYSIAVSSDATMT
  41 ----------.....---------YEDLKNG.....TSYDN..AVKTLGVPDVYSIAVSSDATMT
  42 ----------.....---------YEDLKNG.....TSYDN..AVQTLGVPDVYSIAVSSDATMT
  43 ----------a....CENKVTRENYDKLAIG.....MEYSK..VVELLGEPENCQSV--------
  44 ----------.....NKELVSQEKYKQLKLD.....MSYED..VKKIMGSPGKVKKFPGQPSTET
  45 ----------.....-----YMEEYDKIEVG.....ISANK..VKELIGEPSDIEKYKSYSLYRY
  46 -------TG-.....-------------AGG.....EKYED..VVAKFGEPDNKAESQAGDIKMI
  47 -------TG-.....-------------AGG.....EKYED..VVAKFGEPDNKAESQAGDIKMI
  48 TKIEENGLRHq14nqQSHPVSLDMFESIKIG.....MTADE..VTEIVGTSPR------------
  49 ----------.....---KITVADVDTIKTGdp8ggDKYED..LVAKYGEPDIKSDSTSKNVKTY


            180        190        200      
             |          |          |      
   1 QTVWSSGIKTKSSSA.TIELYF.ENGLLKNKTQKDLE
   2 QAIWVSGLKTDTSGA.NISLVF.ENNQLTEMSQVGLE
   3 QAIWVSGLKTDTSGA.NISLVF.ENNQLTEMSQVGLE
   4 QAIWVSGLKTDTSGA.NISLVF.ENNQLTEMSQVGLE
   5 QAIWVSGLKTDTSGA.NISLVF.ENNQLTEMSQVGLE
   6 QAAWISGIQSSDSDP.GINLTF.ENDKLTNKQQHGLK
   7 QAAWISGIQSSDSDP.GINLTF.ENDKLTNKQQHGLK
   8 QAIWVSGLKTDTSGA.NISLVF.ENNQLTEMSQVGLE
   9 QAIWVSGLKTDTSGA.NISLVF.ENNQLTEMSQVGLE
  10 QAAWISGIQSSDSDP.GINLTF.ENDKLTNKQQHGLK
  11 QTVWSSGIKTKSSSA.TIELYF.ENGLLKNKTQKDLE
  12 IYLWA-----NPNGT.NMNVTF.QNGKVTAKNRLGL-
  13 IYLWA-----NPNGT.NMNVTF.QNGKVTAKNRLGL-
  14 IYLWA-----NPNGT.NMNVTF.QNGKVTAKNRLGL-
  15 IYLWA-----NPNGT.NMNVTF.QNGKVTTKNRLGL-
  16 IYLWA-----NPNGT.NMNVTF.QNGKVTAKNRLGL-
  17 IYSWV-----NSNGT.NINVTF.QDGKSAAKAQFGLK
  18 IYSWV-----NSNGT.NMNVTF.QDGKSAAKAQFGLK
  19 TALWYTGIKGKGAGA.NASLTF.ENGALTSKTQTDLK
  20 TGLWYTGIKGKADGA.FASLTF.ENGALTSKTQTDLK
  21 QAAWISGIQSSDSDP.GINLTF.ENDKLTNKQQHGLK
  22 NAVWLTGIK-GDDGA.SATFSF.ENDKLSSKTQTKLK
  23 ---------------.------.--------------
  24 MYQWV-----NSDGS.SMNITF.TNGIVDYKTEKKLK
  25 LWLYTSGLK-SDDQA.NFYVTF.QKNKLVAKKAQN--
  26 LWLYTSGLK-SDDQA.NFYVTF.QKNKLVAKKAQN--
  27 TLIWMKNLD-GDLGA.TVTVSF.SNGSAISKSSSGLK
  28 TLIWMKNLD-GDLGA.TVTVSF.SNGNAISKSSSGLK
  29 F---------DNNSK.YLNCTF.KNDKLTI-------
  30 VYSWF-----NNDSS.YINCTF.KNDKL---------
  31 ---------------.------.--------------
  32 YGIAVLYENKGSSLS.NATFIF.LGDKLQSKSQYGLE
  33 --LFVRNCVWGNEQK.NITVSF.VGDKAILFSSKNIK
  34 -LLAVKACTWGDDAR.YINVNF.VADQAVLLNSSNLR
  35 --LFAKSCVWGDEQK.NITVNF.AGDRVVLFTSTNIR
  36 --LFAKSCIWGNEQK.NITVNF.VGDKTILFTSKNIR
  37 NAIW------KSDSV.TIKLTF.VNSKVQLK------
  38 --MTVKSCTWGDEKR.HVQVSF.VADQVVLFSSENLR
  39 ---------------.------.--------------
  40 QALWSSNLVTKKGKTgSLTLNF.KNGTLENKSQENL-
  41 QALWSSNLVTKKGKTgSLTLNF.KNGTLENKSQENL-
  42 QALWSSNLVTKKGKTgSLTLNF.KNGTLENKSQENL-
  43 --VSVKSCVWGKTPK.TISVQF.VGEKILFYSNTG--
  44 TYVWK----QKNSSY.RILVTL.KDGKV---------
  45 G--------GWDSKG.EMTVYFdKNGTVTKKEQNGLR
  46 IASWTKNVN-GDLGA.NFNVTF.T-------------
  47 IASWTKNVN-GDLGA.NFNVTF.T-------------
  48 ---------------.------.--------------
  49 TASWSKNAK-GGTGA.NFTVSF.I-------------