(c) 1992-2000 Regents of the University of California, Santa Cruz
Sequence Alignment and Modeling Software System
http://www.cse.ucsc.edu/research/compbio/sam.html

Citations (SAM, SAM-T99, HMMs)

Sequence numbers correspond to the following labels:

    • T0176 Hypothetical protein yggU, E. coli
    • gi|16130854|ref|NP_417428.1|_1:99 (NC_000913) orf, hypothetical protein [Escherichia coli K12]
    • gi|7466484|pir||H65080 hypothetical protein b2953 - Escherichia coli (strain K-12)
    • gi|882482|gb|AAA69120.1| (U28377) ORF_o100 [Escherichia coli]
    • gi|1789323|gb|AAC75990.1| (AE000378) orf, hypothetical protein [Escherichia coli K12]
    • gi|15803492|ref|NP_289525.1|_1:99 (NC_002655) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
    • gi|15833083|ref|NP_311856.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
    • gi|12517499|gb|AAG58084.1|AE005525_10 (AE005525) orf, hypothetical protein [Escherichia coli O157:H7 EDL933]
    • gi|13363301|dbj|BAB37252.1| (AP002563) hypothetical protein [Escherichia coli O157:H7]
    • gi|21960273|gb|AAM86880.1|AE013934_3_4:99 (AE013934) hypothetical protein [Yersinia pestis KIM]
    • gi|16121248|ref|NP_404561.1|_3:95 (NC_003143) conserved hypothetical protein [Yersinia pestis]
    • gi|15979014|emb|CAC89787.1| (AJ414145) conserved hypothetical protein [Yersinia pestis]
    • gi|16766403|ref|NP_462018.1|_2:93 (NC_003197) putative cytoplasmic protein [Salmonella typhimurium LT2]
    • gi|16421655|gb|AAL21977.1| (AE008842) putative cytoplasmic protein [Salmonella typhimurium LT2]
    • gi|16761877|ref|NP_457494.1|_2:93 (NC_003198) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
    • gi|16504179|emb|CAD02926.1| (AL627277) conserved hypothetical protein [Salmonella enterica subsp. enterica serovar Typhi]
    • gi|15603178|ref|NP_246251.1|_5:98 (NC_002663) unknown [Pasteurella multocida]
    • gi|12721676|gb|AAK03397.1| (AE006170) unknown [Pasteurella multocida]
    • gi|15640485|ref|NP_230112.1|_4:95 (NC_002505) conserved hypothetical protein [Vibrio cholerae]
    • gi|11354528|pir||B82321 conserved hypothetical protein VC0458 [imported] - Vibrio cholerae (group O1 strain N16961)
    • gi|9654883|gb|AAF93631.1| (AE004132) conserved hypothetical protein [Vibrio cholerae]
    • gi|7292328|gb|AAF47735.1|_40:134 (AE003477) CG14966 gene product [Drosophila melanogaster]
    • gi|17936535|ref|NP_533325.1|_78:180 (NC_003304) conserved hypothetical protein [Agrobacterium tumefaciens str. C58 (U. Washington)]
    • gi|17741163|gb|AAL43641.1| (AE009212) conserved hypothetical protein [Agrobacterium tumefaciens str. C58 (U. Washington)]
    • gi|15889916|ref|NP_355597.1|_7:109 (NC_003062) AGR_C_4821p [Agrobacterium tumefaciens] [Agrobacterium tumefaciens str. C58 (Cereon)]
    • gi|15157869|gb|AAK88382.1| (AE008179) AGR_C_4821p [Agrobacterium tumefaciens str. C58 (Cereon)]
    • gi|20381446|gb|AAH27500.1|_26:119 (BC027500) RIKEN cDNA 3110040N11 gene [Mus musculus]
    • gi|13385576|ref|NP_080353.1|_26:119 (NM_026077) RIKEN cDNA 3110040N11 [Mus musculus]
    • gi|12851848|dbj|BAB29184.1| (AK014163) Uncharacterized ACR, YggU family COG1872 containing protein~data source:Pfam, source key:PF02594, evidence:ISS~putative [Mus musculus]
    • gi|12857526|dbj|BAB31031.1| (AK018003) Uncharacterized ACR, YggU family COG1872 containing protein~data source:Pfam, source key:PF02594, evidence:ISS~putative [Mus musculus]
    • gi|12859380|dbj|BAB31634.1| (AK019261) Uncharacterized ACR, YggU family COG1872 containing protein~data source:Pfam, source key:PF02594, evidence:ISS~putative [Mus musculus]
    • gi|20552556|ref|XP_058687.5|_24:119 (XM_058687) similar to RIKEN cDNA 3110040N11 [Homo sapiens]
    • gi|21389393|ref|NP_653198.1| (NM_144597) hypothetical protein MGC29937 [Homo sapiens]
    • gi|18043732|gb|AAH19820.1|AAH19820 (BC019820) Unknown (protein for MGC:29937) [Homo sapiens]
    • gi|17509435|ref|NP_491998.1|_27:117 (NM_059597) W01A8.2.p [Caenorhabditis elegans]
    • gi|7508740|pir||T26031 hypothetical protein W01A8.2 - Caenorhabditis elegans
    • gi|3880399|emb|CAA95853.1| (Z71267) predicted using Genefinder~cDNA EST yk275h2.3 comes from this gene~cDNA EST yk309g11.3 comes from this gene~cDNA EST yk275h2.5 comes from this gene~cDNA EST yk309g11.5 comes from this gene [Caenorhabditis elegans]
    • gi|1388023|gb|AAB88056.1|_6:101 (L46591) putative [Bartonella bacilliformis]
    • gi|15678665|ref|NP_275780.1|_3:85 (NC_000916) conserved protein [Methanothermobacter thermautotrophicus] [Methanothermobacter thermautotrophicus str. Delta H]
    • gi|7429675|pir||G69184 conserved hypothetical protein MTH637 - Methanobacterium thermoautotrophicum (strain Delta H)
    • gi|2621719|gb|AAB85143.1| (AE000844) conserved protein [Methanothermobacter thermautotrophicus str. Delta H]
    • gi|20150521|pdb|1JRM|A_3:85 Chain A, Nmr Structure Of Mth0637
    • gi|15618408|ref|NP_224693.1|_2:78 (NC_000922) CT388 hypothetical protein [Chlamydophila pneumoniae CWL029]
    • gi|15836028|ref|NP_300552.1| (NC_002491) CT388 hypothetical protein [Chlamydophila pneumoniae J138]
    • gi|16752546|ref|NP_444808.1| (NC_002179) conserved hypothetical protein [Chlamydophila pneumoniae AR39]
    • gi|7468099|pir||H72072 conserved hypothetical protein CP0257 [imported] - Chlamydophila pneumoniae (strains CWL029 and AR39)
    • gi|4376783|gb|AAD18637.1| (AE001634) CT388 hypothetical protein [Chlamydophila pneumoniae CWL029]
    • gi|7189184|gb|AAF38120.1| (AE002186) conserved hypothetical protein [Chlamydophila pneumoniae AR39]
    • gi|8978867|dbj|BAA98703.1| (AP002546) CT388 hypothetical protein [Chlamydophila pneumoniae J138]
    • gi|14520717|ref|NP_126192.1|_3:80 (NC_000868) hypothetical protein [Pyrococcus abyssi]
    • gi|7518409|pir||H75167 hypothetical protein PAB7122 - Pyrococcus abyssi (strain Orsay)
    • gi|5457933|emb|CAB49423.1| (AJ248284) hypothetical protein [Pyrococcus abyssi]
    • gi|21289029|gb|EAA01322.1|_36:111 (AAAB01008987) agCP12513 [Anopheles gambiae str. PEST]
    • gi|18978137|ref|NP_579494.1|_3:80 (NC_003413) hypothetical protein [Pyrococcus furiosus DSM 3638]
    • gi|18893938|gb|AAL81889.1| (AE010274) hypothetical protein [Pyrococcus furiosus DSM 3638]
    • gi|21674646|ref|NP_662711.1|_2:83 (NC_002932) conserved hypothetical protein [Chlorobium tepidum TLS]
    • gi|21647849|gb|AAM73053.1| (AE012935) conserved hypothetical protein [Chlorobium tepidum TLS]
    • gi|9758285|dbj|BAB08809.1|_111:197 (AB007649) gene_id:MLE2.7~unknown protein [Arabidopsis thaliana]
    • gi|15605113|ref|NP_219898.1|_17:91 (NC_000117) hypothetical protein [Chlamydia trachomatis]
    • gi|7468717|pir||B71520 hypothetical protein CT388 - Chlamydia trachomatis (serotype D, strain UW3/Cx)
    • gi|3328814|gb|AAC67985.1| (AE001312) hypothetical protein [Chlamydia trachomatis]
    • gi|15835282|ref|NP_297041.1|_3:76 (NC_002620) conserved hypothetical protein [Chlamydia muridarum]
    • gi|11360652|pir||F81677 conserved hypothetical protein TC0667 [imported] - Chlamydia muridarum (strain Nigg)
    • gi|7190702|gb|AAF39489.1| (AE002334) conserved hypothetical protein [Chlamydia muridarum]
    • gi|15893224|ref|NP_360938.1|_8:91 (NC_003103) unknown [Rickettsia conorii]
    • gi|15620440|gb|AAL03839.1| (AE008676) unknown [Rickettsia conorii]
    • gi|14600512|ref|NP_147028.1|_11:95 (NC_000854) hypothetical protein [Aeropyrum pernix]
    • gi|7515610|pir||D72774 hypothetical protein APE0182 - Aeropyrum pernix (strain K1)
    • gi|5103573|dbj|BAA79094.1| (AP000058) 113aa long hypothetical protein [Aeropyrum pernix]
    • gi|15668799|ref|NP_247602.1|_4:84 (NC_000909) conserved hypothetical protein [Methanococcus jannaschii] [Methanocaldococcus jannaschii]
    • gi|2496078|sp|Q58035|Y618_METJA Hypothetical protein MJ0618
    • gi|2128409|pir||B64377 conserved hypothetical protein MJ0618 - Methanococcus jannaschii
    • gi|1591329|gb|AAB98613.1| (U67510) conserved hypothetical protein [Methanococcus jannaschii] [Methanocaldococcus jannaschii]
    • gi|20092890|ref|NP_618965.1|_18:107 (NC_003552) conserved hypothetical protein [Methanosarcina acetivorans str. C2A] [Methanosarcina acetivorans C2A]
    • gi|19918198|gb|AAM07445.1| (AE011122) conserved hypothetical protein [Methanosarcina acetivorans str. C2A] [Methanosarcina acetivorans C2A]
    • gi|17230688|ref|NP_487236.1|_5:72 (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
    • gi|17132291|dbj|BAB74895.1| (AP003592) ORF_ID:asl3196~hypothetical protein [Nostoc sp. PCC 7120]
    • gi|13473522|ref|NP_105090.1|_4:92 (NC_002678) hypothetical protein [Mesorhizobium loti]
    • gi|14024272|dbj|BAB50876.1| (AP003003) hypothetical protein [Mesorhizobium loti]
    • gi|20093713|ref|NP_613560.1|_3:81 (NC_003551) Uncharacterized conserved protein [Methanopyrus kandleri AV19]
    • gi|19886604|gb|AAM01490.1| (AE010325) Uncharacterized conserved protein [Methanopyrus kandleri AV19]
    • gi|21226924|ref|NP_632846.1|_4:92 (NC_003901) hypothetical protein [Methanosarcina mazei Goe1]
    • gi|20905233|gb|AAM30518.1| (AE013307) hypothetical protein [Methanosarcina mazei Goe1]
    • gi|16127852|ref|NP_422416.1|_7:84 (NC_002696) conserved hypothetical protein [Caulobacter crescentus CB15]
    • gi|13425372|gb|AAK25584.1| (AE006020) conserved hypothetical protein [Caulobacter crescentus CB15]
    • gi|6503188|gb|AAF14630.1|AF200362_6_1:59 (AF200362) unknown [Haemophilus ducreyi]
    • gi|15841462|ref|NP_336499.1|_5:73 (NC_002755) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
    • gi|13881702|gb|AAK46313.1| (AE007056) conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
    • gi|11499654|ref|NP_070896.1|_3:73 (NC_000917) conserved hypothetical protein [Archaeoglobus fulgidus]
    • gi|7429676|pir||G69508 conserved hypothetical protein AF2072 - Archaeoglobus fulgidus
    • gi|2648454|gb|AAB89177.1| (AE000960) conserved hypothetical protein [Archaeoglobus fulgidus]
    • gi|18424708|ref|NP_568972.1|_129:190 (NM_125739) expressed protein [Arabidopsis thaliana]
    • gi|16226478|gb|AAL16178.1|AF428410_1 (AF428410) AT5g63440/MLE2_7 [Arabidopsis thaliana]
              10         20        30             40        50        6
              |          |         |              |         |         
   1 MDGVMSAVTVNDDGLV.LRLYIQPKASRDSIVGL.....HGDEVKVAITAPPVDGQANSHLVKFL
   2 MDGVMNAVTVNDDGLV.LRLYIQPKASRDSIVGL.....HGDEVKVAITAPPVDGQANSHLVKFL
   3 MDGVMSAVTVNDDGLV.LRLYIQPKASRDSIVGL.....HGDEVKVAITAPPVDGQANSHLVKFL
   4 -----NAVTVNDDGLV.LRLYIQPKASRDSIVGL.....HGDEVKVAITAPPVDGQANSHLVKFL
   5 ---AVSAVLSTENGLI.LKLYIQPKASRDQIVGL.....HGDELKVAITAPPVDGQANTHLVKFI
   6 ------AVLSTENGLI.LKLYIQPKASRDQIVGL.....HGDELKVAITAPPVDGQANTHLVKFI
   7 -----SAVTRCEDGLV.LRLYIQPKASRDSIVGL.....HGDEVKIAITAPPVDGQANSHLTKFL
   8 -----SAVTRCEDGLV.LRLYIQPKASRDSIVGL.....HGDEVKVAITAPPVDGQANSHLTKFL
   9 -----PAVEKQEEHLR.LRIFLQPKASKDQIVGL.....HDNELKITITAPPIDGQANAHLLKFL
  10 -------AWREGDDLL.LRLYIQPKASRDSIVGL.....HGEELKVAITAPPIDGKANAHLSKYL
  11 ----SPISVDKSGNIC.IQILAKPGAKQNGITGI.....GFEGVGVQIAAPPSEGEANAELVKFL
  12 DDCLSGLWRKHDDHVR.LSVRLTPNGGRDAIDGVeqdadGNAHLKARVSAVPEGGKANKALIVLL
  13 DDCLSGLWRKHDDHVR.LSVRLTPNGGRDAIDGVeqdadGNAHLKARVSAVPEGGKANKALIVLL
  14 ----GPVATDPKGFVT.IAIHAKPGSRQNAVTDL.....STEAVGVAIAAPPSQGEANAELCRYL
  15 ----GPVATDPKGFVT.IAIHAKPGSRQNAVTDL.....STEAVGVAIAAPPSEGEANAELCRYL
  16 --PLGPVAVDPKGCVT.IAIHAKPGSKQNAVTDL.....TAEAVNVAIAAPPSEGEANAELCRYL
  17 ----SAIFSDTEGRIG.LHIHAKPGAKKSCVVAI.....GDSEVDVAIGAAPREGAANEELISYL
  18 --------RVDVDSLI.LFVRLTPKASMNNIVGVesrddGKQYLIIRLCAVPEDGKANKALIKFL
  19 ---TMDCLREVGDDLL.VNIEVSPASGKFGIPSYne...WRKRIEVKIHSPPQKGKANREIIKEF
  20 ---TMDCLREVGDDLL.VNIEVSPASGKFGIPSYne...WRKRIEVKIHSPPQKGKANREIIKEF
  21 -----------DDSWI.LEVKVTPKAKENKIVGF.....DGQALKVRVTEPPEKGKANDAVISLL
  22 --------KEVREGVI.LRVIVKPNARENSIEGIde...WRGRIKVNIKAQPVKGKANRELIKFL
  23 ----------KTGNLI.VKILAKPGAKTSGITDV.....SEEGIGCQIAAPPIDGEANTELIKYL
  24 --------RETSEGVI.LSVIVAPNARETKIVGIdg...TRGRVKVNVAAPPVKGKANKELMKFF
  25 -----SPISQKGEAVC.LSVRVQPRSSKSGVAGM.....YGEQLKICLKSAPVDNAANKECCELL
  26 -APVPPCISQLDGGLVqVAIEVEDRAQRSAITRV.....NADDVRVTVAAPAARGEANNELLEFM
  27 ----------LEGFWV.LEVRVTTKARENRVVCL.....EDGILRVRVTEVPEKGKANDAVVALL
  28 -----------EGFWV.LEIRVTTKARENKVVSL.....EDGILRVRVTEAPERGKANDAVVALL
  29 ------IYNSFKHEAL.INVKVKPYAKQNLIGNFviin.NIPYIKLAIKATPEQGKANEGIIHYL
  30 --------NSSSHQAL.LSFKVKPNSKQNLISNFviin.NIPYLKLSIKAIPEQGKANEEIINYL
  31 -DRLKDAVEVLGNRVR.IRVYVKPEGRERRL-RL.....EEGELVFYTDEPPLEGRANASLINFL
  32 -----KIIKESREGVL.IDIDVQANAKKNEIVGIne...WRKRLSIKIKAPATEGKANKEIIKFF
  33 YMSFEEAIKTLDSGII.IEIEVTPGSRSLSVPSGyne..WRKRIAVKLTKNAQKGKANEQLIESL
  34 ----------------.--VKVKPNSKQQKIAEQ.....DDGSLTVHLKSPPVDGKANEELIKLL
  35 ------PLRIRENGID.LFVRLTPKSSLDRLEGVetsadGRSHLKARVRAVPENGAANQALERLV
  36 -----SPVKEHREGTL.IRVRVNPDADTTDLKGVde...WRGVLEVDVAAPPVKGKANRELLEFL
  37 -MSFEEAIKSLDSGII.VDIEVTPGSRSLSVPSGyne..WRKRIEVKLTRNAQKGKANEQLIESL
  38 --------------VT.LVVRLTPRGGRDAAEGWaldadGRLYLKVRVASPPVEGAANAALIAFL
  39 ----------------.-----------------.....----LKVAITAPPVDGAANAYLLKYL
  40 ----------------.VVVRVKPGSHKGPLVEVg....PNGELIIYVREPAIDGKANDAVTRLL
  41 ---------EAKDGVL.ISVHVSPGSKEVSFSYDe....WRRAVEVRIKSPAKEGKANRELLGIF
  42 -----AAVWQDGEDII.LKLYIQPKASRDKIVGL.....HGEELKIAITAPPVDGKANAHL----
  43 DAPVPPCISQLDGGLVqVAIEVEDRAQRSAITRV.....NADDVRVTVAAPAARGEANNELLEFM


      0        70        80        90       100
     |         |         |         |         |
   1 GKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEVAALIN
   2 GKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEIAALI-
   3 GKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEVAALI-
   4 GKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEIAALI-
   5 AKQFRVAKSQVIIEKGELGRHKQIKVINPQQIPPEVTILL-
   6 AKQFRVAKSQVIIEKGELGRHKQIKVINPQQIPPEVTILL-
   7 GKQFRVAKSQIVIEKGELGRHKQVKIIHPQQIPPEIAA---
   8 GKQFRVAKSQIVIEKGELGRHKQVKIIHPQQIPPEIAA---
   9 SKTFKVPKSSIVLEKGELNRHKQILIPNPKVIPTEVNVLL-
  10 AKLCKVAKGSVVVEKGELGRHKQVRILQPSQIPAEIAALI-
  11 SKVLGLRKSDVSLDKGSRSRNKIIMITKGVSTVEAIEQLL-
  12 AKKLGLPKSSITFISGETARKKILRIDTDPEDFEKLFKK--
  13 AKKLGLPKSSITFISGETARKKILRIDTDPEDFEKLFKK--
  14 SKVLDLRKSDVVLDKGGKSREKVVKLLA-STTPEEVLEKL-
  15 SKVLDLRKSDVVLDKGGKSREKVVKLLA-STTPEEVLEKL-
  16 SKVLELRKSDVVLDKGGKSREKVVKLLA-STTPEEILEKL-
  17 MSALGLRKNELQFDKGAKSRSKVVLIDTKRLTIDEI-----
  18 AKQWKIPSSCISLENGAISRYKQLRFSGGVEKIEKILHSL-
  19 SETFG---RDVEIVSGQKSRQKTIRIQG-------------
  20 SETFG---RDVEIVSGQKSRQKTIRIQG-------------
  21 AKALSLPKRDVTLIAGETSRKKKFLLPNR------------
  22 SNLFG---AEVEILKGETSREKDVLVRG-------------
  23 SKLLDLRKSDISLDRGSKSRQKTIVLD--------------
  24 KKLFG---AEVVIVRGETSREKDLLIKG-------------
  25 AKALGVPRSSVSVMKGASSRSKVLKVEG-------------
  26 GRVLGLRLSQMTLQRGWNSKSKLLVVED-------------
  27 ANFLSIPKSDVTLIAGEASRRKKVLL---------------
  28 AKFLSIPKNDVTLIAGEASRRKKVLL---------------
  29 AKEWELSRSSIEIIKGHTHSLKTILIKNI------------
  30 AKEWKLSRSNIEIIKGHTHSLKTILIKNI------------
  31 ARGLKVSVKNIEIVHGARSRSKVVEIRD-------------
  32 KEIFK---KDVEIVSGKLNPQKTVLIGD-------------
  33 AELFGISSSEILINSGATSSKKSLLIKG-------------
  34 AEKFDVPKSHITIKSGLSSKQKLIEIE--------------
  35 AKTLGVPASSVSVVAGGTSRLKTVRIVGDPE----------
  36 GRKLN---TTCELVSGEKSREKLVLA---------------
  37 AELFGICSSDIFISSGATSSKKSLLIKG-------------
  38 AKTLKIPRSAVRLAAGETARLKRLELEG-------------
  39 SKLFKVPKSSIVLEKGELQRHKQLFVPAPKLLPKEIE----
  40 AAHLQLPKSRVKLVSGATSRFKRFR----------------
  41 RQIFG----EVELVSGEKSRSKVL-----------------
  42 -----------------------------------------
  43 GR---------------------------------------
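
The alignment above is printed in two wrapped blocks, so each numbered row has to be rejoined before it can be used as a single aligned sequence. The following is a minimal Python sketch of that step, not part of SAM itself. The names `join_blocks`, `ungapped`, and `SAMPLE` are hypothetical, and the sketch assumes that every alignment row is a sequence number followed by a run of letters, `-`, and `.` characters, as in the listing above. The sample text is an excerpt of rows 1 and 19.

```python
import re

# An alignment row: a sequence number, then residues and gap characters only.
# Ruler lines (digits and '|') and label lines do not match this pattern.
ROW = re.compile(r"^\s*(\d+)\s+([A-Za-z.\-]+)\s*$")

def join_blocks(text):
    """Concatenate the wrapped blocks of each numbered row, in reading order."""
    rows = {}
    for line in text.splitlines():
        m = ROW.match(line)
        if m:
            num = int(m.group(1))
            rows[num] = rows.get(num, "") + m.group(2)
    return rows

def ungapped(row):
    """Drop '-' and '.' gap characters and upper-case the remaining residues."""
    return re.sub(r"[.\-]", "", row).upper()

# Excerpt of rows 1 and 19 from the two blocks shown above.
SAMPLE = """\
   1 MDGVMSAVTVNDDGLV.LRLYIQPKASRDSIVGL.....HGDEVKVAITAPPVDGQANSHLVKFL
  19 ---TMDCLREVGDDLL.VNIEVSPASGKFGIPSYne...WRKRIEVKIHSPPQKGKANREIIKEF

       0        70
     |         |
   1 GKQFRVAKSQVVIEKGELGRHKQIKIINPQQIPPEVAALIN
  19 SETFG---RDVEIVSGQKSRQKTIRIQG-------------
"""

if __name__ == "__main__":
    for num, row in sorted(join_blocks(SAMPLE).items()):
        print(num, len(row), ungapped(row))
```

Because the rows are keyed by sequence number, the same function should work on the full listing as long as every block uses the same numbering. Mapping those numbers back to the labels in the list is a separate step that this sketch does not attempt.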