; SAM: prettyalign v3.2 (June, 2000) compiled 07/10/00_09:40:44 ; (c) 1992-1999 Regents of the University of California, Santa Cruz ; ; Sequence Alignment and Modeling Software System ; http://www.cse.ucsc.edu/research/compbio/sam.html ; ; ----------------- Citations (SAM, SAM-T99, HMMs) -------------------- ; R. Hughey, A. Krogh, Hidden Markov models for sequence analysis: ; Extension and analysis of the basic method, CABIOS 12:95-107, 1996. ; K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting ; remote protein homologies, Bioinformatics 14(10):846-856, 1999. ; A. Krogh et al., Hidden Markov models in computational biology: ; Applications to protein modeling, JMB 235:1501-1531, Feb 1994. ; --------------------------------------------------------------------- ; Sequence numbers correspond to the following labels: ; 1 T0109 ; 2 gi|1176352|sp|P45340|ORN_HAEIN_1:181 ; 3 gi|1730261|sp|P39287|ORN_ECOLI_1:179 ; 4 gi|1361151|pir||S56390_24:202 ; 5 gi|7227908|sp|Q9Y3B8|ORN_HUMAN_5:184 ; 6 gi|7379329|emb|CAB83884.1|_4:178 ; 7 gi|7227120|gb|AAF42197.1|_4:178 ; 8 gi|7661646|ref|NP_056338.1|_37:216 ; 9 gi|7227907|sp|Q17819|ORN_CAEEL_8:186 ; 10 gi|6323088|ref|NP_013160.1|_50:238 ; 11 gi|7227904|sp|O06174|ORN_MYCTU_2:181 ; 12 gi|7227905|sp|O07708|ORN_MYCLE_2:181 ; 13 gi|7227909|sp|Q9ZVE0|ORN_ARATH_21:200 ; 14 gi|7301049|gb|AAF56185.1|_32:209 ; 15 gi|7227906|sp|O94626|ORN_SCHPO_1:177 ; 16 gi|6277197|dbj|BAA86266.1|_2:180 ; 17 gi|1019712|gb|AAA98633.1|_49:221 ; 18 gi|7468910|pir||G71502_6:174 ; 19 gi|8163328|gb|AAF73606.1|_6:174 ; 20 gi|7468222|pir||A72083_2:165 ; 21 gi|6014995|sp|O67074|DP3E_AQUAE_13:178 ; 22 gi|7468909|pir||B71536_3:159 ; 23 gi|8163241|gb|AAF73567.1|_2:159 ; 24 gi|7483286|pir||D69371_3:174 ; 25 gi|6225287|sp|Q9ZHF6|DPO3_THEMA_352:518 ; 26 gi|7468223|pir||H72051_5:173 ; 27 gi|6014996|sp|O83649|DP3E_TREPA_15:175 ; 28 gi|6822230|emb|CAB70936.1|_2:186 ; 29 gi|1706437|sp|P54394|DING_BACSU_3:162 ; 30 gi|1794167|dbj|BAA11217.1|_42:210 ; 31 gi|7477983|pir||H70794_10:171 ; 32 gi|6685391|sp|Q9ZCJ9|DP3E_RICPR_7:164 ; 33 gi|4336092|gb|AAD17623.1|_2:170 10 20 30 40 50 | | | | | 1 MSFDKQNLIWIDLEMTGLDPEK.ERIIEIATIVTDKNLNILA.......EGPVLAVHQSDELLNK 2 MSFDKQNLIWIDLEMTGLDPEK.ERIIEIATIVTDKNLNILA.......EGPVLAVHQSDELLNK 3 MSANENNLIWIDLEMTGLDPER.DRIIEIATLVTDANLNILA.......EGPTIAVHQSDEQLAL 4 MSANENNLIWIDLEMTGLDPER.DRIIEIATLVTDANLNILA.......EGPTIAVHQSDEQLAL 5 -ESMAQRMVWVDLEMTGLDIEK.DQIIEMACLITDSDLNILA.......EGPNLIIKQPDELLDS 6 ----KNNLCWLDMEMTGLNPET.DRIIEVAMIITDSDLNVLA.......QSEVYAIHQSDELLDN 7 ----KNNLCWLDMEMTGLNPET.DRIIEVAMIITDSDLNVLA.......QSEVYAVHQSDDVLNK 8 -ESMAQRMVWVDLEMTGLDIEK.DQIIEMACLITDSDLNILA.......EGPNLIIKQPDELLDS 9 CDKIEQRIIWIDCEMTGLDVEK.QTLCEIALIVTDSELNTIA.......TGPDIVIHQPKEVLDN 10 --KLFKPLVWIDCEMTGLDHVN.DRIIEICCIITDGHLAPVKaadgqgdSHYESVIHYGPEVMNK 11 ----QDELVWIDCEMTGLDLGS.DKLIEIAALVTDADLNILG.......DGVDVVMHADDAALSG 12 ----HDELVWIDCEMTGLDLGS.DQLIEIAALVTDADLNILG.......DGVDVVIHIDSTALSS 13 -GDYKQPLVWIDLEMTGLNVEV.DRILEIACIITNGDLTQSV.......EGPDLVVRQTKDCLDK 14 -CGLDTDIVWMDLEMTGLDIEK.DKILEVACIITDQDLNVKS.......EGPCFAINHPQEVYDS 15 MSNLKQPLVWIDCEMTGLEVGK.HVLMEVAAIITDGNLRPVE.......EKFDAVIKLDEKQLSE 16 ----NDRMVWIDCEMTGLSLAD.DALIEVAALVTDSELNVLG.......EGVDIVIRPPDAALET 17 -TKLFKPLVWIDCEMTGLDHVN.DRIIEICCIITDGHLAPVKaadgqgdSHYETVIHYGPEVMNK 18 ----DVEFVCLDCETTGLDVKK.DRVIEFAAIRF--TFDEII.......DSVEFLIHPERAV--- 19 ----DIEFVCLDCETTGLDVKK.DRVIEFAAIRF--TFDEVI.......DSVEFLIHPERAV--- 20 SSQTMDVLIFYDTETTGTQIER.DRIIEIAAYNS-----VTD.......ESFLTYVNPEIPI--- 21 -NLLDGTFVVIDLEATGFDVEK.SEVIDLAAVRVEGG--IIT.......EKFSTLVYPGYFI--- 22 ------ALIFYDTETTGTQIDK.DRIVELAAYNG-----TTS.......ESFQTLVNPEIPI--- 23 -----PDLIFYDTETTGTQIDK.DRIVEIAAYNG-----TTG.......ESFQTLVNPEIPI--- 24 GSLRKVQFLSIDLETTGLNQKK.DEIIAIGAVPIIGTRILAG.......ESYYRLLRPEKFK--- 25 -TFGDATFVVLDFETTGLDPQV.DEIIEIGAVKIQGG--QIV.......DEYHTLIKPSREI--- 26 ---KDTVFTCLDCEMTGLDVKK.DRIIEIAAVRF--TFDSVI.......SSIEFLINPERVV--- 27 -------FTAFDTETTGLKAEE.DRIIEIGAVTFDRK--GII.......ARFSTLIFPDRAI--- 28 TSWFEGPLAAFDTETTGVDTET.DRIVSAALVVQDAP-GLRP.......RVTRWLVNPGVPV--- 29 ----KQRFVVIDVETTGNSPKKgDKIIQIAAVVIENG--QIT.......ERFSKYINPNKSI--- 30 -DARELDTLVLDFETTGFNPEV.DRVISIGWVEIRNSNIRLN.......SARHVFINHAIDI--- 31 -SHQDRGWAVIDVETSGFRPGQ.ARIISLAVLGLDAA-GRLE.......QSVVSLLNPKVDP--- 32 --------IILDTETTGLDPQQgHRIVEIGAIEMVNKV-LTG.......KHFHFYINPERDM--- 33 ----TRQLVVVDCETTGL-HDG.AAILEVAAVNIDTG---AE.......LHFVPFVTREQLAQAQ 60 70 80 90 100 110 12 | | | | | | 1 MNDWCQKTHSENGLIERIK.ASKLTERA.AELQTLDFLKKWVP.KGASPICGNSIAQDKRFLVKY 2 MNDWCQKTHSENGLIERIK.ASKLTERA.AELQTLDFLKKWVP.KGASPICGNSIAQDKRFLVKY 3 MDDWNVRTHTASGLVERVK.ASTMGDRE.AELATLEFLKQWVP.AGKSPICGNSIGQDRRFLFKY 4 MDDWNVRTHTASGLVERVK.ASTMGDRE.AELATLEFLKQWVP.AGKSPICGNSIGQDRRFLFKY 5 MSDWCKEHHGKSGLTKAVK.ESTITLQQ.AEYEFLSFVRQQTP.PGLCPLAGNSVHEDKKFLDKY 6 MDEWNTATHGRTGLTQRVR.ESSHTEAE.VEQKLLDFMSEWVP.RRATPMCGNSIHQDRRFMVKY 7 MDEWNTATHGRTGLTQRVR.ESSHTEAE.VEQKLLDFMSEWVP.GRATPMCGNSIHQDRRFMVKY 8 MSDWCKEHHGRSGLTKAVK.ESTITLQQ.AEYEFLSFVRQQTP.PGLCPLAGNSVHEDKKFLDKY 9 MEEWPRNTFHENGLMEKII.ASKYSMAD.AENEVIDFLKLHAL.PGKSPIAGNSIYMDRLFIKKY 10 MNEWCIEHHGNSGLTAKVL.ASEKTLAQ.VEDELLEYIQRYIPdKNVGVLAGNSVHMDRLFMVRE 11 MIDVVAEMHSRSGLIDEVK.ASTVDLAT.AEAMVLDYINEHVKqPKTAPLAGNSIATDRAFIARD 12 MIDVVAEMHSRSGLINEVE.SSIVDLVT.AESIVLDYINNHVKqPKTAPLAGNSIATDRSFIARD 13 MDDWCQTHHGASGLTKKVL.LSAITERE.AEQKVIEFVKKHVG.SGNPLLAGNSVYVDFLFLKKY 14 MNEWCMKHHYNSGLIDRCK.SSDVNLEE.ASNLVLSYLEKNIP.KRACPLGGNSVYTDRLFIMKF 15 MNDWCIEQHGKSGLTERCR.QSNLTVKD.VENQLLAYIKKYIPkKREALIAGNSVHADVRFLSVE 16 MPEVVRQMHTASGLLDE-L.AGGTTLAD.AEEQVLAYVREHVKePGKAPLCGNSVGTDRGFLARD 17 MNEWCIEHHGNSESHPKVL.ASEKTLAQ.VEDELLEYIQRYIPdKNVGVLAGNSVHMDRLFMVRE 18 ----SAESQKIHKISDAML.RDKPKFGE.VFSRIKGFFKE---.--RDHIVGHHVGFDLQVLSQE 19 ----SAESQKIHKISDAML.KDKPKFSE.VFSTIKGFFKE---.--RDYIVGHHVGFDLQVLSQE 20 ----PDEASKIHGITTDAV.LSAPKFPE.AYEGFRKFCGE---.-DSILVAHNNDGFDFPLLGKE 21 ----PERIKKLTGITNAML.VGQPTIEE.VLPEFLEFVGD---.---NIVVGHFVEQDIKFINKY 22 ----PAEATKIHGITTAEV.ADAPRFPE.AYQKFIEFCGT---.-DNILVAHNNNAFDYPLLVRE 23 ----PAEATKIHGITTSEV.ANAPKFPE.AYQQFSDFCGT---.-DNILVAHNNNAFDYPLLLRE 24 -----HESMKFHGLDPARL.KTAHDFSE.IAEEVADLLRG---.---KVLVGYAIELDYGFLKRA 25 ----SRKSSEITGITQEML.ENKRSIEE.VLPEFLGFLED---.---SIIVAHNANFDYRFLRLW 26 ----SAESQRVHHISNAML.RDQPKIAE.VFPQIKAFFKE---.--GDYIVGHSVGFDLQVLAQE 27 ----PPDVSKINHITDDML.VNKPRFCE.IVSDFSRFIKG---.---TVLVAHNANFDVEFLNAE 28 ----PESATAVHGLTEEYV.QRHGRWPApVMYEMAEALTEQAR.AG-RPLVVMNAPFDLTLLDRE 29 ----PAFIEQLTGISNQMV.ENEQPFEA.VAEEVFQLLDG---.---AYFVAHNIHFDLGFVKYE 30 ----CHESVKVHHIRPETLhVSGISEQA.AFTQLLDVIAG---.---KILVAHGCIMEQRFLEQY 31 ------GPTHVHGLTAAML.DGQPQFAD.IAGEVVDVLRG---.---RTLVAHNVAFDYAFLAAE 32 ----PFEAYKIHGISGEFL.KDKPLFKT.IANDFLKFIAD---.---STLIIHNAPFDIKFLNHE 33 PMAMQMNRYYERGIWQRRL.SPDST-DA.AYWKLANMLAG---.---NTFGGSNPAFDSRLLAAA 0 130 140 150 160 170 | | | | | | 1 MPDLADY...FHYRHLDVSTLKELAARWKPEI........LEG.FKKENTHLALDDIRESIKELA 2 MPDLADY...FHYRHLDVSTLKELAARWKPEI........LEG.FKKENTHLALDDIRESIKELA 3 MPELEAY...FHYRYLDVSTLKELARRWKPEI........LDG.FTKQGTHQAMDDIRESVAELA 4 MPELEAY...FHYRYLDVSTLKELARRWKPEI........LDG.FTKQGTHQAMDDIRESVAELA 5 MPQFMKH...LHYRIIDVSTVKELCRRWYPEEy.......EFA.PKKAASHRALDDISESIKELQ 6 MPKLENY...FHYRNLDVSTLKELAKRWNPPV........AKS.VVKRGSHKALDDILESIEEMR 7 MPKLENY...FHYRNLDVSTLKELAKRWNPPV........AKS.VVKRGSHKALDDILESIEEMR 8 MPQFMKH...LHYRIIDVSTVKELCRRWYPEEy.......EFA.PKKAASHRALDDISESIKELQ 9 MPKLDKF...AHYRCIDVSTIKGLVQRWYPDY........-KH.PKKQCTHRAFDDIMESIAELK 10 FPKVIDH...LFYRIVDVSSIMEVARRHNPALq.......ARN.PKKEAAHTAYSDIKESIAQLQ 11 MPTLDSF...LHYRMIDVSSIKELCRRWYPRI........YFGqPPKGLTHRALADIHESIRELR 12 MPTLDSF...LHYRMIDVSSIKELCRRWYPRI........YFGqPAKGLTHRALADIHESIRELR 13 MPELAAL...FPHILVDVSSVKALCARWFPIEr.......RKA.PAKKNNHRAMDDIRESIKELK 14 MPLVDAY...LHYRIVDVSTIKELAKRWHPAIl.......DSA.PKKSFTHRSLDDIRESIKELA 15 MPKIIEH...LHYRIIDVSTIKELAKRWCPDI........-PA.YDKKGDHRALSDILESIGELQ 16 MRELEGY...LHYRIVDVSSVKELARRWYPRA........YFNsPAKNGNHRALADIRDSITELR 17 FPKVIDH...LFYRIVDVSSIMEVARRHNPALq.......ARN.PKKEAAHTAYSDIK------- 18 SERLGET...LLPKHHYVIDTLRLAKGYGDSPnns9alarHFN.VPHQGNHRAMKDVEMNVKVFK 19 SERLGET...LLPKQHYVIDTLRLAKEYGDSPnns9alarHFN.VPHQGNHRAMKDVEMNVKVFK 20 CRRH-SL...EPLTNRTIDSLK-WAQKYRPDLpkh10lrqVYG.FAENQAHRALDDVVILHKVFT 21 TKQY-RG...KKFRNPSLCTLK-LARKVFPGLkky10iaeNFG.FETNGVHRALKDATLTAEIFI 22 CRRH-GL...SEPQLRTIDSLK-WAKKYRTDLpqh10lrqVYG.FEENQAHRALDDVITLYRVF- 23 CRRH-GL...PEPQLRTIDSLK-WAKKYRTDLpqh10lrqVYG.FEENNAHRALDDVITLHRVF- 24 LKREGYK...VENKRIDVIDFEKAVCYILGERpvg12lakKYR.VEVSYRHNALADAFITAQIFQ 25 IKKVMG-...LDWERPYIDTLA-LAKSLLKLRsys9svveKLG.LGPFRHHRALDDARVTAQVFL 26 MERIGET...FLSKYTIIDTLR-LAKEYGDSPnns9slavHFN.VPYDGNHRAMKDVEININIFK 27 LSLC-KK...QPLSHKVVDTYA-MAQAVFPGLgrh12lalQFG.LTVHAAHRAEDDARVCMELFT 28 LRRHRAS...SLGRWLERTPLHVLDPHVLDKHldr16lcaHYG.VELAGAHDAAADAQAALEVVR 29 LHKA-GF...QLPDCEVLDTVE-LSRIVFPGFegy10lseELQ.LRHDQPHRADSDAEVTGLI-- 30 IKMKYQN...LKLPLIWLDTLK--IEQYRTQLrpt14irkELN.LPTYQAHNALNDAIATAEL-- 31 AEIAEAE...LPVDF--VMCTVELARRLQLGVdnl10laaHWG.VPQQRPHDAFDDVRVLTGIL- 32 LSLLKRTeikFLELTNTIDTLV-MARNMFPGArys11krfKVD.NSGRQLHGALKDAAL------ 33 MPDGAPE...WHHRLADLAAFT--AGKLNLDPvel11vceRLG.VTVSDRHSALADAHATATCFT 180 | 1 YYREHFMKLD 2 YYREHFMKL- 3 YYREHFI--- 4 YYREHFI--- 5 FYRNNIFK-- 6 HYREHFL--- 7 HYREHFL--- 8 FYRNNIFK-- 9 NYRESIFV-- 10 WYMDNYLKPP 11 FYRRTAFVPQ 12 FYRRTAFVPP 13 YYKKTIFK-- 14 YYKANL---- 15 HYRSY----- 16 YYREAVFVPQ 17 ---------- 18 HLTKRF---- 19 HLTKRF---- 20 ---------- 21 KI-------- 22 ---------- 23 ---------- 24 VQ-------- 25 RFVE------ 26 HLCKRF---- 27 T--------- 28 AVGRR----- 29 ---------- 30 ---------- 31 ---------- 32 ---------- 33 ILR-------