(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.soe.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0367 FH7577A, Archaeoglobus fulgidus DSM 4304, 125 res
              10        20         30        40          50        60  
              |         |          |         |           |         |  
   1 MDELELRIRKAEKLVQDAKKEFEMGLY.ERCCSTAYYAMFHAAKA.MLLGY.GRDSKTHRGTIYL
   2 MDELELRIRKAEKLVQDAKKEFEMGLY.ERCCSTAYYAMFHAAKA.MLLGY.GRDSKTHRGTIYL
   3 -EEIEKHIKIAEEELSSAYLLLENGKL.RDSISRAYYSMFHAAKA.LLLLK.GINPRKHSGVIRM
   4 --ELEKLIEKAEKSLEASENLYNSEFY.DFAVSRIYYSMFYCVKA.LLLTK.EINPKKHSGVLKM
   5 --ELEKLIEKAEKSLEASENLYNSEFY.DFAVSRIYYSMFYCVKA.LLLTK.EINPKKHSGVLKM
   6 MKRVEDWIKQAERDLEEARYAKSGGYY.ELACFLSQQCAEKAVKG.LLQFQ.GIEKRGHSISHLL
   7 IDLCRWRLEKAERTFKEGEQLLDVGFY.NGAINRFYYAAFHAVRA.LLALK.KLDSAKHSGVISL
   8 --LALWRIEKADTTFEEGEISFENHFY.EGAINRFYYASFHAVRS.LLATK.GLDSPKHKGVISL
   9 -EEIQALIEIAEENLSAAKILFENKLY.RDAVARAYYAIFHSAKA.LLLTK.NLNPKKHAGVIKM
  10 -DEIRALLRKAEERLAASRELFERGHY.AFAISSAYYVMFYCARA.LLLSK.GITPKSHAGVHAQ
  11 --LARLSIRKARSFLESSRKNLEMGIY.DGALVMAYLAMFHAARA.LLFKD.GWREKSHACISAY
  12 --EYERCITMAERTLSSARLDASHGEY.NWACFKAHQAAEFALKA.LLYGV.GRPARGHSLTHLL
  13 -NEFDRWFKSAKLTLESAKHDVSGGFY.NWACFKSQQAAEFAVKA.YFYGI.GQPKTGHTISSLL
  14 --IIEYWRRRARECLDDAKLLLKNERL.HSAVNRIYYALFYQVSA.LLLAK.GLSFSKHSGVLAA
  15 -DPTEALLQKANQALDDASYLLEDDRV.EAAMNRAYYAAFHAARA.ALLTE.GEEPTSHAGILSR
  16 MNRARDWMRQAERDLEHAQKSLELGHW.EWACFAAHQAAEKALKA.VYFAL.GAEAFGHALTHLL
  17 -------IKRAEEWLEEAKRNREFGSY.RTSLMASYLAMFHAARA.VLFRD.GWREKSHYCVARY
  18 MSRYADWFAQALRDLEKAHLDLQHEYW.EWACFTAQQAAEKAVKA.LLMSR.GMDAWGHAITPML
  19 --RYYDWLRQAERNLKSAELNFENGIY.EETCYEAQQVAEKSVKS.LLSYF.HKEMRGHSITFLL
  20 --LILYRMERAKEAIAEAELLFSEGHI.RTSVNRLYYACFYAVSA.ILLAK.GYSSAKHSGIRSL
  21 IDLISTLLLEAEEKLEDAQKALEAGKW.AASIYYAYSAQIHAAKA.LLTSE.DVAVNSHAAILKD
  22 -NPISFLLQRGNDSLKSAEALIELDLT.LDGLSRAYYAALHYARA.LLLSI.DVVPKSHKGALTL
  23 IDLIATLLYESEEKVQNASSSYGQGKW.AASIYHSYAAMVNSAKA.LLTAE.KTKTNTHSSIIKD
  24 IDLIATLFLESEEKIDKAKESFVNSVY.SSSIYHAYSSLVNSAKA.MLLAE.NIKTNSQANIIKQ
  25 -DLILRTLTPGEDIPYNLLLLADETNY.EDAISRSYYALYYAAKA.LLSSK.GIITKTHKGLITQ
  26 -EEVEKHIKITEEELSSAYLLLENGKL.RDSISRAYYSMFHAAKA.LLLLK.GIDPRKHSGVIRM
  27 -SEYERWIKQAERNLRSALRDLEGGDY.EWASFKAQQAAELAVKA.LLRGM.GSAPIGHSITRLL
  28 -EEISLFIRRAQETYTVACELHQNHHY.NDAVSRAYYSMFYAAKA.VLLTK.DIITRTHRGTISQ
  29 -DLVLWRLEKAEKTFEDAELLLERDSF.ASAINRYYYAAFYAIRA.LLATR.ELDSPKHSGVISL
  30 -ERVPVSLSIAERFLHSAQKNLEIEEY.EMVQLAAYNSAFHSARA.LLFSK.GYTERSHSCLGIA
  31 -SEYERWIKEAKRTLESAYSDLKEGYY.EWASFKSQQAAELAVKA.VLRGL.GLAPVGHSITRLL
  32 --EQQDLMDRAEQSLQAARILADEALF.DVSVSRSYYAMFYCARA.SLLAL.NLSSKSHSGTISL
  33 -EEAEKWLRQALEDLATAKDTITTGHY.YASAFWAEQAAEKALKA.LLIAG.GKIERTHDLNELL
  34 ----ELWLRQAERDLIKAENDLKTGDW.DSAAFWSQQCAEKALKA.LLLNA.GKAYRGHELLELA
  35 -EEAQKWFRQALEDLATAKDTITTGHY.YASAFWAEQAAEKALKA.LLIEN.GKIERTHDLNQLL
  36 ---------------------------.-----------------.-----.-IIPKSHKGTLTL
  37 -----AWLEKASRYREYAKRNFESGAY.DLACFLAQQSAEFLLKA.LLIREtGARPLTHSLYEMA
  38 -------LDKAKHNLAFVNQNIKSGNFqDWSIVGLYYAVYHAALA.LVAKK.GFISRSHNATMIF
  39 ---------------------------.-----------FYCASA.MLLQL.GARPKSHAGTISE
  40 -DDVSRLLRRAEKFRKDAINAYNEGYY.DISCFYAEQAVQLRIKAyMLRNL.GFIPRIHGIRDLL


                      70         80             90        100       110
                      |          |              |          |         |
   1 .....IW.....ECREELGLSD.DDCSKLS.....RAFDLREESDYGI.YKEVSKDLAIKILKDA
   2 .....IW.....ECREELGLSD.DDCSKLS.....RAFDLREESDYGI.YKEVSKDLAIKILKDA
   3 .....FG.....LHFVDSGFIE.RPYAKYLt....YAFSLRSKADYDV.YYEPTHEEAENVVETA
   4 .....FA.....KEFIKTNELD.VELFEYIn....EAYNYRQTADYDA.TIEIKKEEAEYLLHKG
   5 .....FA.....KEFIKTNELD.VELFEYIn....EAYNYRQTADYDA.TIEIKKEEAEYLLHKG
   6 .....TNppa..DILQCATFLD.KQYTPSR.....YPDVYYEGAPYEY.YTERDADECINCAIRI
   7 .....FN.....REYVKTGVIS.KEASKTLs....TIFAMRSEADYDD.FKSFSLQEAADARKAV
   8 .....FN.....REFVKPGLMS.IQSSKTIr....RLFDMRAVADYRD.FATFTEEQVREARDDV
   9 .....FG.....LYFVNEGYIE.EIYGRII.....TKSYNLRWKADYT.TDKPTEEEAESIIYEA
  10 .....LG.....KEFVKTGEMPaRLYTGYS.....KALNMRHTADYDA.FVEYTERDAREVLRYA
  11 .....LR.....EFYVKPGLLD.VKWVRYLd....YVRNLRHQTQYDV.GFSPDPEEITDILPKI
  12 .....GEva9seEVAELCRLLD.KFYVPTR.....CVDAWSEGIPYEY.FSKSDAETAIKAAEGV
  13 nllnaPQ.....DLIDKAKYLD.KLYVPTR.....YPDVWEEETPAYY.YTKKEAEEAIKYAEEI
  14 .....FN.....REFVKTGKVN.KELGKFYn....RMFEHRKTGDYGE.LVEFEEENVKDWIRKA
  15 .....FS.....YHFVRTGRIS.EEVGKVLa....RAETDRNRADYDA.FSVFEIQAAEDLVGDV
  16 .....AGlh9tpELRRCALVLD.RLYIPTR.....YPDSWEAGAPLDY.YGEEEARDALLCAGEI
  17 .....LE.....EFYVKTGKLE.GYWVELLd....RMRELRHEDQYDV.SYTPEPDEVKDALKVA
  18 .....RAlk8plHLIEYAQLLD.VLYIPTR.....YPNGFSVGKPADY.FSATKAQEALDAATAI
  19 qf7siPD.....WILKCAQELD.KNYIPSR.....YPDVYDQGAPLDY.YSKDDATSCLECARKI
  20 .....FH.....QKIVKAGLVN.PSAGTLYn....RLFDARQKADYAD.LVKFEADIVAPWFDEV
  21 .....FD.....THFIDQPRGF.GNDSFRQ.....AVLRMNRHAPDAG.FAREYLEQARNFLSEA
  22 .....FS.....LHFIKSGIIP.KEIGVIFs....ILQKIREDSDYEI.GTYYDKNEALRLLDDT
  23 .....FD.....KIFVDSKRIE.LKSEFSE.....LVLQIKKNEPTSE.FAKSYLEDAKYFLESV
  24 .....FD.....EYYITSNKIS.LEGSFAD.....LIYQINKYAPKEE.FAKKYIKDAEAFLEKV
  25 .....IS.....DHYVNNGLVD.HQIWHTLa....YTESLRESADYST.GEQITEEISLDVIEE-
  26 .....FG.....LHFVNSGFIE.RVYAKYL.....T------------.----------------
  27 .....RN.....LVGEGIDVPK.KLFYIAM.....KLDRNYMASRYP-.----------------
  28 .....LN.....SNFVRVGEFE.EMVWKYL.....PLSETLREKP---.----------------
  29 .....FN.....REFIKTNLLS.KKASKTIt....KVFDLRSN-----.----------------
  30 .....LN.....HLYKEEFDLL.KLINIFD.....KMRVSSHNVQYGG.TFVTFEEAS-------
  31 .....RE.....LKGMGFNVP-.-------.....-------------.----------------
  32 .....FG.....QHLAAAGRLP.IEIHRQLi....DAE----------.----------------
  33 ei8igLS.....VEEIRSEVIK.LTLHYTIs....RYPDAANTIPYSL.YSKEDAEELVKKAEKV
  34 r10vdVS.....PIEEDLRELT.IHYVISR.....YPN-AANAVPYEL.YDEKKARELVERARRV
  35 y10lpVE.....EIRSEVNKLT.LHYTISR.....YPDAA-NTIPYSL.YTKEDAEELVKKAEKV
  36 .....FS.....LHFIKTGIIP.KEIGVIFs....ILQKIREDSDYEI.GTFYERDEAERLLEDT
  37 .....RR.....LATLKNFELG.DEVAKCAka6hhYVQSRDPDARVGE.YERWEAESCIECMEAL
  38 likn.--y10elQLIDDLAITK.KDATFYT.....DLKSERQKASYST.DAMFNESKVLELQKKS
  39 .....FG.....RRCVMAGIAS.IEERDELi....LAERRRNAADYGTpQAEIDAHIAERLLETA
  40 si...--.....----------.-------.....-------------.----------------


             120     
              |     
   1 EIFVQKAKNAVNKNR
   2 EIFVQKAKNAVNKN-
   3 ERFLERIKSVLE---
   4 HIFLNKTKKYL----
   5 HIFLNKTKKYL----
   6 LNWVKG---------
   7 RSLIDEVSAY-----
   8 RALIDEAISFLQK--
   9 EMFVDRIKKAL----
  10 EEFLAFTKSYLEGK-
  11 EEFIGVVEK------
  12 LEFVRGVWMSLSGG-
  13 IKYVEELWRMLSQK-
  14 EGFLDAIEKLIE---
  15 SQFIEAVGQVIKNH-
  16 LRFGQS---------
  17 EKFIEVIKSLIGEE-
  18 IQYCQ----------
  19 LAWVKEI--------
  20 KSLVHQIETLVV---
  21 SAFRQ----------
  22 RLFCTTVAEVL----
  23 DAY------------
  24 RLFRE----------
  25 ---------------
  26 ---------------
  27 ---------------
  28 ---------------
  29 ---------------
  30 ---------------
  31 ---------------
  32 ---------------
  33 IEWV-----------
  34 LEW------------
  35 IEWV-----------
  36 RLFCRTVVAVL----
  37 ---------------
  38 IDFVNKVEDII----
  39 GRWVERT--------
  40 ---------------