(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0272 1wj9, Thermus thermophilus
                            10                  20                  30 
                            |                   |                   | 
   1 ...M...WLTKLVL...N.....PA..S..R.AAR..R..DL.A....N..P.YE...MHRTLSK
   2 ...M...YLSRITL...H.....TSelS..P.AQL..L..HL.Ver..G..E.YV...MHQWLWD
   3 ...M...YLSRITL...H.....TGqlS..P.AQL..L..HL.V....DrgE.YV...MHQWLWD
   4 ...M...YLSKIVL...R.....QS..S..Q.TAN..IlaKLsA....N..GvYT...SHQLLWK
   5 ...M...YLSRIQLrfnN.....LR..P..EmLAK..W..NS.A....R..P.YA...SHQWLWQ
   6 ...M...YLTRLTL...D.....PR..S..A.QAR..R..DL.A....D..A.YD...MHRTLVR
   7 ...Ms..HITRAELsr.D.....TG..A..R.KAL..T..AL.LrdqgG..T.DG...GHRLVWT
   8 ...Msv.WLTRIVP...D.....PR..S..R.DAR..R..DL.G....G..N.DSamgLHRRLMS
   9 ...M...YLSKVII...-.....--..A..R.AWS..R..DL.-....-..-.YQ...LHQGLWH
  10 ...M...YLSKVLI...N.....--..G..T.ACR..-..--.-....N..P.YE...IHRVLWK
  11 ntkI...HLVKLPV...HialsvPA..K..N.KPRkgW..EL.T....D..P.SF...RHRAVMA
  12 ...M...WMSKLVL...D.....PR..-..R.AVG..K..NL.-....-..-.YD...THRLLWN
  13 ...-...-------...-.....--..-..-.---..-..--.-....-..-.--...MHAAVMS
  14 m..M...YISMVAL...-.....--..-..-.---..-..SV.I....D..T.YE...QHQAIWE
  15 ...M...IATMLTL...S.....RK..D..V.KAL..R..-I.T....D..S.YS...LHRVIYS
  16 ...MtplYLISLPL...D.....MV..SfhR.WAG..Q..RG.I....G..T.DE...G-RALHH
  17 ...-...-------...-.....--..-..-.---..-..--.-....-..-.--...-------
  18 ...-...-------...-.....--..-..-.---..-..--.-....-..-.--...-------
  19 ...-...-------...-.....--..-..-.---..-..--.-....-..-.--...-------
  20 ...-...-------...-.....--..-..-.---..-..--.-....-..-.--...-------
  21 ...-...-------...-.....--..-..-.---..-..--.-....-..-.--...-------


                 40                  50             60           70    
                 |                   |              |            |    
   1 AVS.R...ALEEGRER.....L..L..WRLE.PARGLEPPVV.....LVQTLTE...PDWSVLD.
   2 LFP.-...--GSKERQ.....F..L..YRRE.ELQGAFRFFV.....L--SQEQ...PAASAIF.
   3 LFP.-...--GGKERQ.....F..L..YRRE.ELQGAFRFFV.....L--SQER...PAESETF.
   4 LFS.T...---DEKRQ.....F..L..FREEiGIMGL--PVF.....YVLSKTS...PQ---TE.
   5 LFP.-...--EQELRQ.....F..L..FREE.AHGG----FF.....ML-SAIP...P---LLQ.
   6 AFV.R...DERDAPGR.....F..L..WRLEpGADAWASPTL.....LVQSCES...GDWDVLQg
   7 LFA.D...DPKASRD-.....F..V..FR--.---EAEPGRY.....LIVSARP...PGDG---.
   8 LYPcD...AGPDPRAR.....FgvL..FRIE.D--TPAGAHI.....LLQSAHE...PDLTRLP.
   9 LFP.N...RPDAARD-.....F..L..FHVE.KRNTPEGCHV.....LLQSAQM...P----VS.
  10 LFP.E...DADAERD-.....F..L..FRVE.R-SGQQSVEV.....LLQSRRE...PTMAASR.
  11 LFP.DtdsPLPRKSVD.....I..L..FRFE.QLAG-QPPFF.....LIQSTVA...PKQ--VD.
  12 LFA.D...APDRTRD-.....F..L..FREQ.D----EPYTF.....LTVSRRQ...PEDT---.
  13 SFP.Tll.PSDTDGPR.....V..L..WRID.RTSRAEVFLY.....IV-SPPK...PDLTHLV.
  14 LFP.N...SSERKRDH.....-..L..FRVE.NINTSGQVMV.....LLQSASE...PQSNV--.
  15 LFE.Dv..RSEAEKRSsvpsgF..L..F-AD.KGGDAKGRKI.....LILSDRP...P----LQ.
  16 LLG.E...SFGKG---.....V..LqpFRLMvAPNGATGTLYay9taLVQTTRDfglPDALAVC.
  17 ---.-...--------.....-..-..----.----------.....----HLC...PELHEHD.
  18 ---.-...--------.....-..-..----.----------.....----HLC...PTLHEHD.
  19 ---.-...--------.....-..-..----.----------.....--MSENE...P-IDELN.
  20 ---.-...--------.....-..-..----.---------F.....SAIAHFE...PKLHTLD.
  21 ---.-...--------.....-..-..----.----------.....-------...-------.


                   80               90       100               110     
                   |                |         |                 |     
   1 .E..GY.....AQVF...P..PKPFHPA..LKPGQRLRFRLRANP...AK.RL.AA...TG.K..
   2 .D..--.....---V...Q..TRPFAPT..LSAGQTLRFNLRANP...TV.CK.--...NG.K..
   3 .T..--.....---I...E..CRSFVPE..LRTGQQLCFNLRANP...TI.CK.--...AG.K..
   4 .S..PL.....FEV-...E..TKAFYPQ..LKEGQRLAFKLRVNP...TI.CItDP...SG.Krq
   5 .H..SL.....F-LI...E..TKLFNPQ..LTNGLELDFQLRANP...V-.-I.TR...NG.K..
   6 lP..GY.....LQRPa..E..CKALDLEalIRPQWRYRFRLLANP...T-.-V.TR...AG.K..
   7 .Q..GL.....WR-L...E..TKPYAPA..FREGQRFGFTLRANPataVK.QA.GEt..RG.K..
   8 .D..GY.....GQAItr.P..LDPLLDA..LKPGLTIRYRCTASP...VR.KP.GA...NT.Ral
   9 .T..AV.....ATVI...K..TKQVEFQ..LQVGVPLYFRLRANP...IK.TI.LD...NQ.Kr1
  10 .E..VL.....L--M...G..SKPYLLS..LQQDQQLRFMLVANP...IK.TI.ND...ES.Ar1
  11 .N..LD.....SEVQ...H..RTVSLRR..FSPKSAVRFRISING...IR.RQ.TTeh.NGrK..
  12 .T..GW.....WS-I...Q..IKPYAPK..LQAGDAVAFSLRVNA...VV.KR.NE...NG.Kqr
  13 .EqaGWptqptWESY...D..YTPFLSR..LAKGDVWAFRLTANPvhsIR.RK.AG...EP.T..
  14 .-..-K.....ATVL...Q..SKLFDPQ..ISQGEYYKFKLLANP...TK.CL.SQ...GK.K..
  15 .P..AH.....GELV...-..SRPVPEE..FLQHRFYKFEVTLNP...TR.KE.NK...SG.K..
  16 .D..PA.....R--L...A..AKAMPED..WREGRRLAFDLRARP...VR.RLlKP...AG.Vf1
  17 .S..LS.....IQTIsglP..DRPGILR..LTANSRLRIRIPVEQ...IP.LV.YPl..AG.K..
  18 .S..LS.....IQTIsglP..DRPGILR..LTANSQLRIRIPVEQ...IP.LV.YPl..AG.K..
  19 .-..-V.....WD-I...D..VKQYDPI..LKSGQKLAFSLRANP...IVsKR.DE...ND.Kqh
  20 .T..LG.....IQTI...AgiPKDGVIN..LTQNSRLRIRIPVNQ...VR.LV.YPl..AG.K..
  21 .-..--.....--MR...N..LDPFLAR..LDKGSRVRYRIVASP...TK.RL.GRsenNT.Q..


                   120                   130           140         150 
                    |                     |             |           | 
   1 ...RVA.L....KTPA.....EKV...A...WLE.RRL.E...EGGFRLLEGERGPWVQI..LQD
   2 ...RHD.L....LMEAkr9gdSQD...I...WSY.QQQ.A...ALTWLARQGEQNGFTLR..--E
   3 ...RHD.L....LMEAkr9aeGSD...V...WLH.QQQ.A...ALDWLAAQGERSGFTLL..--D
   4 ...RHD.V....LMHAk12eqGKI...K...AMM.EN-.-...AARNWLLNHRRMQQWGI..QFD
   5 ...RSD.V....MMNAk20qqAAQ...A...WLE.QQG.Q...QHGFRLIAPEPDDFAMW..AGD
   6 ...RRG.L....LGEA.....EQL...A...WLQ.RQG.E...RHGFAV------KAVLV..SAS
   7 ...RVD.A....IMHA.....KTRsatP...LTV.EDR.E...RVALDWLLDRQQGFGVL..FER
   8 y..NLP.A....VVPL.....NGA...AadeWWT.RQA.D...AAGLKPLALHPHPL---..--D
   9 1rcRVP.L....IKEA.....EQI...A...WLQ.RKL.-...--GNAARVEDVHPISER..PQ-
  10 1kcRVP.L....IREE.....DLR...A...WLK.RKL.E...--GVAVIE-----AVEV..EKR
  11 ...RIT.TspvpFDSD.....EKA...P...SHI.TRM.T...PWVQKKLNGALRN-VEI..LNH
  12 ...RFD.I....VQDAcl7elNQN...A...QMP.TRA.EiaqEAGTRWLLARQQALGLS..IES
  13 ...KLTaH....LTQR.....YQK...K...WLL.QRQ.D...AAGFRVVEKPAEKRRLP..EGD
  14 ...VIE.I....KDEN.....EQI...Q...WLQ.RKL.R...GANVTV-----------..--T
  15 ...RVP.I....KTRE.....EVA...A...WFGgKSQ.T...SWGFSV-DPARLDVRML..---
  16 6lrRFP.E....GRPA.....EGV...A...GAP.ITR.E...GVYFQWLAERLSGAARV..--E
  17 ...SLT.I....GIHKi17vlKSR...I...TVI.KGFsE...PENFLEAAQKQLNAREI..SGR
  18 ...SLT.I....GIHKi17vlKSR...I...TVI.KGFsE...PETFLEAAQKQLNAREI..SGR
  19 ...RHD.V....VMDEk12diEPN...M...PDI.VQR.K...GSEWLLRKGDMNGFSIN..AEQ
  20 ...SLR.I....GKHTi17klRSR...I...VVI.RGYeE...PESFLVVAQRQLEQLGI..--Q
  21 ...RLG.L....KEPP.....KKP...ReytWAL.RGA.A...AEEWWHSRAAANGLELLstYAQ


                        160                  170        180           1
                         |                    |          |            
   1 TF.....LEV..R.....RKK..D.....GEEAGKL.L...Q.VQAVLFEGRLE..VV..DPERA
   2 AS.....VDA..Y.....RQQqiR.....RGKDRQM.I...Q.FSSVDYTGVLV..IN..EPALF
   3 TS.....VDA..Y.....RQQqlR.....RENSRQL.I...Q.FSSVDYTGMLT..VT..DPGLF
   4 NL.....LDIegY.....TQH..Rs....VKKQGQK.I...Q.FSSVDFQGLLT..IT..NGELY
   5 EYse9gcVQA..Y.....QQH..Rfv...RKDQETP.I...T.FSSVDFSGALC..IT..DAALF
   6 DL.....LDS..R.....RKG..-.....----GAP.I...V.LQRVCFEGLLQ..VV..EADAL
   7 AL.....CSAggY.....RQV..R.....VPRGGKA.I...T.FSVIDYEGVFT..VR..DPGLL
   8 AA.....QGV..-.....RAG..N.....GDK--QR.I...R.HNRIRFDGSAT..IT..DPDLL
   9 --.....---..Y.....FSG..D.....G-KSGK-.-...-.IQTVCFEGVLT..IN..DAPAL
  10 PA.....MNF..-.....RKA..R.....EKRVGK-.-...-.VQAVSFHGVLS..VT..DPVGL
  11 QR.....EVI..G.....TKH..R.....GGKAASMtI...Q.IDTV--DGFGI..VE..DPELL
  12 AA.....ILV..Egc9vkRAT..R.....DTRSG-V.V...S.LGIMDLQGTAE..VK..DPQLL
  13 EH.....ELI..V.....HNR..Rdw6skGARKGRP.V...S.LVTVTFDGRLE..VT..DPDAL
  14 AL.....ESV..L.....VKS..R.....-----KS.F...T.SRFVCFEGILQ..VD..KPDQI
  15 PV.....MQF..S.....KQG..D.....-----RT.V...T.HGAARVSGMLR..VE..NRDLF
  16 EV.....RLA..R.....FER..Ra....VLRGSKS.I...EgPDAV-FHGELV..IL..DGAKF
  17 VT.....LLK..D.....PEG..N.....PKRNTIK.Ik..R.FTIVGF-GLKVegLN..EKDSL
  18 VI.....LLQ..D.....PKG..N.....PKRNTIK.Ik..R.FTIVGF-GLKVegLN..EKDSL
  19 IR.....VDA..Y.....QNH..Klf...KPKGKHH.V...S.FSTVDIVGTLT..VT..DPDIF
  20 AI.....ASI..P.....TKA..N.....GKPIRRT.IkikR.FTVVGF-GLEV..INlsDEDSL
  21 TL.....DDV..R.....DPG..T.....ADRSRK-.I...R.HPAVRFDGEAV..IS..DVDAV


      90       200           210     
      |         |             |     
   1 LATLRRGVGPGKALGLGL.LS...VAP....
   2 LQRLAQGFGKSRAFGCGM.MM...IKPgdd.
   3 LQRLSQGYGKSRAFGCGL.ML...IKPgae.
   4 LEQYAKGFGRAKAMGCGL.ML...IRTv...
   5 KQALFSGLGKSKALGCGM.LM...VKRkr..
   6 RRALASGIGPAKAFGCGL.LS...VARc...
   7 GQALVRGIGKAKAYGCGL.ML...LRRlae.
   8 RQKITEGIGRGKAYGCGL.LS...IAPtre.
   9 IDLVQQGIGPAKSMGCGL.LS...LAPl...
  10 ISLINTGIGPAKAFGCGL.LS...LARt...
  11 NELILHGVGRAKAYGCGL.LS...VSEi...
  12 LQALFQGVGPAKGFGCGL.LL...IRRv...
  13 RRALISGIGRAKAYGCGL.MT...LAPvg..
  14 YSVLVMGIGRKKHAGAGL.LS...LAKas..
  15 IESFNKGIGRGRAFGFGL.LQ...IEPlkd.
  16 AGHLASGLGRHTAYGYGM.ML...LRPar..
  17 -ELQATGLGGKRRMGCGV.FY...GDA....
  18 -ELQVTGLGGKRRMGCGV.FY...GDS....
  19 RDALFKGIGPAKGFGCGM.LL...VRPlr..
  20 R-LQIHGVGGKQKMGCGLfMP...IKErq..
  21 RHAVLNGIGRGKSYGCGL.LSlalIEE....