; SAM: /projects/compbio/bin/i686/prettyalign v3.3.2 (February, 2001) compiled 06/24/02_10:51:09
; (c) 1992-2001 Regents of the University of California, Santa Cruz
;
; Sequence Alignment and Modeling Software System
; http://www.cse.ucsc.edu/research/compbio/sam.html
;
; ----------------- Citations (SAM, SAM-T99, HMMs) --------------------
; R. Hughey, A. Krogh, Hidden Markov models for sequence analysis:
; Extension and analysis of the basic method, CABIOS 12:95-107, 1996.
; K. Karplus, C. Barrett, R. Hughey, Hidden Markov models for detecting
; remote protein homologies, Bioinformatics 14(10):846-856, 1998.
; A. Krogh et al., Hidden Markov models in computational biology:
; Applications to protein modeling, JMB 235:1501-1531, Feb 1994.
; ---------------------------------------------------------------------
; Sequences correspond to the following labels:
; 1 T0149
; 2 1cp2A
; 3 1dar
; 4 1egaA
; 5 1f60A
; 6 1fp6A
; 7 1fts
; 8 1g3qA
; 9 1g5pA
; 10 1g7tA
; 11 1i2mA
; 12 1j8mF
; 13 1jpnA
; 14 1ng1
; 15 1nipA
; 16 1nksA
; 17 2ng1
; 18 2reb

                   10        20          30                 40
                    |         |           |                  |
 1 ........MNPIAVTLLTGFLGAGKTTLLRHILNEQ..HGYKIAVIE........NE.FGEVSVD
 2 ........---MRQVAIYGKGGIGKSTTTQNLTSGLhaMGKTIMVVG........CD.P-KADST
 3 mav9eydl-KRLRNIGIAAHIDAGKTTTTERILYYT..GRXXXXXXX........XX.XXXXXXX
 4 xxxdk...-SYCGFIAIVGRPNVGKSTLLNKLLGQK..----I----........--.-SITSRK
 5 xgkek...-SHINV-VVIGHVDSGKSTTTGHLIYKC..---------........--.---GGID
 6 a.......---MRQCAIYGKGGIGKSTTTQNLVAALaeMGKKVMIVG........CD.P-KADST
 7 rsl93kap----FVILMVGVNGVGKTTTIGKLARQFeqQGKSVMLAA........GD.T-FRAAA
 8 mgr.....----IISIVSGKGGTGKTTVTANLSVALgdRGRKVLAVD........GD.LTMANLS
 9 a.......---MRQCAIYGKGGIGKSTTTQNLVAALaeMGKKVMIVG........CD.P-KADST
10 mki.....--RSPIVSVLGHVDHGKTTLLDHIRGSAv.ASRXXXXIT........QH.IGATEIP
11 xxx11vqf-----KLVLVGDGGTGKTTFVKRHLXXXx.XKKYVATLG........VE.V------
12 xxl98kip----YVIMLVGVQGTGKTTTAGKLAYFYkkKGFKVGLVG........AD.V-YRPAA
13 mfq95pvl-KDRNLWFLVGLQGSGKTTTAAKLALYYkgKGRRPLLVA........AD.T-QRPAA
14 mfq95pvl-KDRNLWFLVGLQGSGKTTTAAKLALYYkgKGRRPLLVA........AD.T-QRPAA
15 a.......---MRQCAIYGKGGIGKSTTTQNLVAALaeMGKKVMIVG........CD.P-KADST
16 ........---MKIGIVTGIPGVGKSTVLAKVKEILdnQGINNKIINygd10tal--kL-GYAKD
17 fqq94pvl-KDRNLWFLVGLQGSGKTTTAAKLALYYkgKGRRPLLVA........AD.T-QRPAA
18 xxd59pmg----RIVEIYGPESSGKTTLTLQVIAAAqrEGKTCAFID........AE.H---ALD

     50                    60        70            80        90
      |                     |         |             |         |
 1 DQLIG....DRA........TQIKTLTNGCICCSRSNELEDALL....DLLDNLDKGNIQFDRLV
 2 RLLLG....GLAqks27yggIRCVESGGPEPGVGCAGRGIITSI....NMLEQLGAYTDDLDYVF
 3 XXXXX....XXX........XXXXAAVTTCFWK-----------....-----------DHRINI
 4 AQTTR....HRI........VGIHTE------------------....----------GAYQAIY
 5 KRTIE....KFE........KEAAELGKGSFKY------AWVLD....KLKAE-RERGITID---
 6 RLILH....SKAqnt29yggVKCVESGGPEPGVGCAGRGVITAI....NFLEEEGAYEDDLDFVF
 7 VEQLQ....VWGqrnn....IPVIAQHTGA-------DSASVIF....DAIQA--AKARNIDVLI
 8 L--VL....GVDdpd25tqfDNVYVL------PG-AVDWEHVLKadprKLPEVIKSLKDKFDFIL
 9 RLILH....SKAqnt29yggVKCVESGGPEPGVGCAGRGVITAI....NFLEEEGAYEDDLDFVF
10 MDVIE....GICgd......------------------------....-FLKKFSIRETLPGLFF
11 -----....---........HPLVFHTNR---------------....----------GPIKFNV
12 LEQLQ....QLG........QQIGVPVYGEPGEKDVVGIAKRGV....EKFLS-----EKMEIII
13 REQLRllgeKVG........VPVLEVMDGESPE----SIRRRVE....EKARL-----EARDLIL
14 REQLRllgeKVG........VPVLEVMDGESPE----SIRRRVE....EKARL-----EARDLIL
15 RLILH....SKAqnt29yggVKCVESGGPEPGVGCAGRGVITAI....NFLEEEGAYEDDLDFVF
16 RDEMR....KLSvekqk...-------------KLQIDAAKGIA....EEARA-----GGEGYLF
17 REQLRllgeKVG........VPVLEVMDGESPE----SIRRRVE....EKARL-----EARDLIL
18 PIYAR....KLG........VDIDNL---LCSQPDTGEQALEIC....DALAR----SGAVDVIV

      100                110       120       130               140
        |                  |         |         |                 |
 1 IEC.TGM........ADP.GPIIQTFFSHEVLCQRYLLDGVIALVDAVHADEQM........NQF
 2 YDV.LGD........VVC.GGFAMPIRE-------GK-AQEIYIVASGEMMALY........AAN
 3 IDT.PGH........VDF.T--------IEVERSMRVLDGAIVVFDSSQGVEPQs.......ETV
 4 VDT.PGL........HMEeKRAINRLMNKAASSSIGDVELVIFVVEGTRWTPDD........EMV
 5 ---.---........---.----------IALWKFETPKYQVTVIDAPGHRDFIknm32kdg--Q
 6 YDV.LGD........VVC.GGFAMPIREN-------KAQE-IYIVCSGEMMAMY........AAN
 7 ADT.AGRlqnk....SHL.MEELKKIVRVMKKLDVEAPHEVMLTIDASTGQNAV........SQA
 8 IDCpAGL........Q--.---------LDAMSAMLSGEEALLVTNP--EISCL........TDT
 9 YDV.LGD........VVC.GGFAMPIREN-------KAQ-EIYIVCSGEMMAMY........AAN
10 IDT.PGH........E--.-----AFTT-LRKRGGALADLAILIVDI---NEGFkpqt....QEA
11 WDT.AG-........---.---QEKFGG-LRDGYYIQAQCAIIMFDVTSRVTYK........NVP
12 VDT.AGR........HGY.GEEAALLEEMKNIYEAIKPDEVTLVIDASIGQKAY........DLA
13 VDT.AGR........LQIdEPLMGEL---ARLKEVLGPDEVLLVLDAMTGQEAL........SVA
14 VDT.AGR........LQIdEPLMGEL---ARLKEVLGPDEVLLVLDAMTGQEAL........SVA
15 YDV.LGD........VVC.GGFAMPIRENK--------AQEIYIVCSGEMMAMY........AAN
16 IDT.HAVirt9gylpGLP.SYVITEINP-----------SVIFLLEA-------........---
17 VDT.AGR........LQIdEPLMGEL---ARLKEVLGPDEVLLVLDAMTGQEAL........SVA
18 VDS.VAA........LTP.KAEIEXXXXXX--------------XXGLAARMMS........QAM

        150                         160            170
          |                           |              |
 1 TIAQSQVGYA.......D.......R....ILLTKTDVAGEAE.....KLHERLARI........
 2 NISKGIQKYAksggvrlG.......G....IICNSRKVANEYE.....LLDAFAKEL........
 3 WRQAEKYKVP.......R.......I....AFANKMDKTGADLwl...VIRTMQERL........
 4 LNKLREGKAP.......V.......I....LAVNKVDNVQEKAdll..PHLQFLASQ........
 5 TREHALLAFTlgv....-rql....I....VAVNKMDSVKWDEs....RFQEIVKETsnf10gyn
 6 NISKGIVKYA.......NsgsvrlgG....LICNSRNTDREDE.....LIIALANKL........
 7 KLFHEAVGLT.......-.......G....ITLTKLDGTAKGG.....VIFSVADQF........
 8 MKVGIVLKKA.......G.......LailgFVLNRYGRSDRDIpp...EAAEDVMEV........
 9 NISKGIVKYAnsgsvrlG.......G....LICNSRNTDREDE.....LIIALANKL........
10 LNILRMYRTP.......F.......V....VAANKIDRIHGWRvhegr---------........
11 NWHRDLVRVC.......Enipi...V....LCGNKVDIKDRKV.....KAKSIVFHR........
12 SKFNQASKIG.......-.......T....IIITKMDGTAKGG.....GALSAVAAT........
13 RAFDEKVGVT.......-.......G....LVLTKLDGDARGGa....ALSARHVT-........
14 RAFDEKVGVT.......-.......G....LVLTKLDGDARGGa....ALSARHVT-........
15 NISKGIVKYA.......NsgsvrlgG....LICNSRNTDREDE.....LIIALANKL........
16 ----------.......-.......-....-------------.....---------........
17 RAFDEKVGVT.......-.......G....LVLTKLDGDARGGa....ALSARHVT-........
18 RKLAGNLKQS.......Ntl.....L....IFINQXXXXXXXXx....---------........

    180       190       200               210       220       230
      |         |         |                 |         |         |
 1 NARAPVYTVTHGDIDLGLLFNTNGFMLE........ENVVSTKPRFHFIADKQNDISSIVVELDY
 2 GSQLIHFVPRSPMVTKAEINKQTVIEYDptceq...-----------------------------
 3 GARPVVMQLP------------------........-------------IGREDTFSGIID----
 4 MNFLDIVPISAE---------TGLNVD-ti......AAIVRKHLPEATHHFPEDYITDR------
 5 PKTVPFVPISGWNGDNMIEATTNAPWYKgwe29srp-----------------------------
 6 GTQMIHFVPRDNVVQRAEIRRMTVIEYDpkakqad.-----------------------------
 7 GIPIRYIGVGER----------------........-----------------------------
 8 PLLAVIP---------------------........-----------------------------
 9 GTQMIHFVPRDNVVQRAEIRRMTVI---eyd10qad-----------------------------
10 --------------PFMETFSKQDIQVQqkl48gip-----------------------------
11 KKNLQYYDISAKS----------N----........-----------------------------
12 GATIKFIGTG-EKIDELE----------........-----------------------------
13 GKP--IYFA-------------------........-----------------------------
14 GKP--IYFA-------------------........-----------------------------
15 GTQMIHFVPRDNVVQRAEIRRMTVIEYDpka18vvd---------------------NKLLVIPN
16 ----------------------------........-----------------------------
17 --GKPIYFAGVSEKPE------------........-----------------------------
18 ----------------------------........-----------------------------

       240       250       260       270       280       290
         |         |         |         |         |         |
 1 PVDISEVSRVMENLLLESADKLLRYKGMLWIDGEPNRLLFQGVQRLYSADWDRPWGD........
 2 ---AEEYRELARKVD------------------------------------------........
 3 ---------------------VLRMKAYTYGNDLG--------TDIREIPIPEEYLDqar14vaa
 4 ---------------------------------------------------------........
 5 -------------------------------TDKPLRLPLQDVYKI-----------........
 6 -----EYRALARKVV------------------------------------------........
 7 ---------------------------------------------------------........
 8 ---------------------------------------------------------........
 9 -----EYRALARKVV------------------------------------------........
10 -----ELLTMLMGLAQQYLRE------------------------------------........
11 -YNFEKPFLWLARKLIGD---------------------------------------........
12 VFNPRRFVA------------------------------------------------........
13 ---------------------------------------------------------........
14 ---------------------------------------------------------........
15 PITMDELEELLMEF-------------------------------------------........
16 ---------------------------------------------------------........
17 ---------------------------------------------------------........
18 ---------------------------------------------------------........

          300       310
            |         |
 1 EKPHSTMVFIGIQLPEEEIRAAFAGLRK.........
 2 ---ANELFVIPKPMTQERLEEILMQ---yg.......
 3 ----------------------------df470kxx.
 4 ----------------------------sq113xxx.
 5 ----------------------------gg204xxx.
 6 ---DNKLLVIPNPITMDELEELLMEF--gim18eev.
 7 ----------------------------ied19far.
 8 ----------------------------edp39kla.
 9 ---DNKLLVIPNPITMDELEELLMEF--gim18xxx.
10 ----------------------------ql372xxx.
11 ----------------------------pnl45xxx.
12 ----------------------------rlhhhh...
13 ----------------------------gvs26mgd.
14 ----------------------------gvs24lgm.
15 ----------------------------gim18xxx.
16 ----------------------------dpk70smk.
17 ----------------------------gle17lgm.
18 ----------------------------xx149xxx.