ViewGene

GeneID:1700
TrivialName: PA14_66230
AnnotatorUID:
Modification Date/Time:2006-04-06 13:01:23
LocusName:PA14_66230
PAO1 Ortholog Locus:PA5010
Sequence Length:1122
Protein Length:373
Start: 5901186
Stop: 5900065
Strand: -
Type:
ChromosomeID:2
Status: ACTIVE
Frame Discrepancy: FALSE
Missense Discrepancy:FALSE
Comments:
Homology: gb|AAG08395.1|AE004913_10 (AE004913) UDP-glucose:(heptosyl) LPS alpha 1,3-glucosyltransferase WaaG [Pseudomonas aeruginosa PAO1]
Identities = 371/373 (99%), Positives = 372/373 (99%)
Sequence:
ATGACCCTGGCGTTCATCCTCTACAAATACTTCCCCTTCGGCGGCTTGCAGCGTGACTTCATGCGCATCGCC
CTGGAATGCCAGCGGCGCGGGCACGACATCCGCGTCTATACGCTGATCTGGGAGGGCGACGTGCCGGACGGC
TTCGAAGTGCTGGTGGCCCCGGTGCGCTCGATCTTCAACCACCGGCGCAACGAGAAGTTCACCGCGTGGGTC
CGCGCCGACCTGGCCAGGCGCCCGGTGCAGCGGGTGATCGGCTTCAACAAGATGCCCGGACTGGATGTCTAC
TACGCCGCCGACGCCTGTTTCGAGGAAAAGGCCCAGACCTTGCGCAACCCGCTGTACCGCCAGTGGGGCCGC
TACCGCCACTTCGCCGGCTACGAACGGGCAGTGTTCGACCCGGCCTCGAAGACCGAGATCCTGATGATCTCC
GAGGTGCAGCAGCCGCTCTTCGTCAAGCACTACGGCACCCAGGCCGAGCGTTTCCATCTGCTGCCGCCGGGG
ATCAGCCAGGATCGCCGGGCGCCGGCCAACGCCGCGGACGTGCGTGCGGAATTCCGCCGCGAGTTCGGCCTG
GAGGAGGACGACCTGCTGCTGGTGCAGATTGGCTCCGGCTTCAAGACCAAGGGCCTGGATCGCAGCCTGAAG
GCGCTGGCCGCGCTGCCCAAGGCGTTGCGCAGGCGTACCCGGCTGATCGCCATCGGCCAGGACGATCCCAAG
CCGTTCCTGCTACAGATCGCCGCCCTCGGTCTCAACGACCAGGTACAGATCCTCAAGGGTCGCAGCGATATC
CCGCGCTTCCTGCTCGGCGCCGACCTGCTGATCCACCCGGCCTACAACGAGAACACCGGTACGGTGCTGCTG
GAGGCGCTGGTCTCCGGCCTGCCGGTGTTGGTGACCGATGTCTGCGGCTATGCCCACTACATCGCCGAGGCC
GACGCCGGGCGGGTGCTGCCGAGTCCGTTCGAGCAGGACAGTCTCAACCGCCTGCTCGCGGAAATGCTGGAG
GACGCTCCGGCGCGCGCCGCCTGGTCGCGCAATGGCCTGGCCTACGCCGATCACGCCGACCTCTACAGCATG
CCGCAGCGCGCCGCCGACCTGATCCTCGGGGAGGCCTCATGA
Translation:
MTLAFILYKYFPFGGLQRDFMRIALECQRRGHDIRVYTLIWEGDVPDGFEVLVAPVRSIFNHRRNEKFTAWV
RADLARRPVQRVIGFNKMPGLDVYYAADACFEEKAQTLRNPLYRQWGRYRHFAGYERAVFDPASKTEILMIS
EVQQPLFVKHYGTQAERFHLLPPGISQDRRAPANAADVRAEFRREFGLEEDDLLLVQIGSGFKTKGLDRSLK
ALAALPKALRRRTRLIAIGQDDPKPFLLQIAALGLNDQVQILKGRSDIPRFLLGADLLIHPAYNENTGTVLL
EALVSGLPVLVTDVCGYAHYIAEADAGRVLPSPFEQDSLNRLLAEMLEDAPARAAWSRNGLAYADHADLYSM
PQRAADLILGEAS*
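The Start, Stop, Sequence Length, Protein Length, and Translation fields above can be cross-checked against each other. The following is a minimal sketch, assuming inclusive coordinates and the standard codon assignments; the function names are illustrative and not part of the database.

```python
# Sketch: cross-check the Start/Stop, Sequence Length, Protein Length and
# Translation fields of a gene record. Function names are illustrative only.

BASES = "TCAG"
# Amino acids of the standard genetic code, codons enumerated in TCAG order.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def feature_length(start: int, stop: int) -> int:
    """Length in nucleotides, assuming inclusive coordinates on either strand."""
    return abs(start - stop) + 1


def protein_length(nt_length: int) -> int:
    """Residue count for a complete CDS whose final codon is a stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return nt_length // 3 - 1


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line-wrapped sequence text
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    print(feature_length(5901186, 5900065))    # 1122
    print(protein_length(1122))                # 373
    print(translate("ATGACCCTGGCGTTCATCCTC"))  # MTLAFIL
    print(translate("GGGGAGGCCTCATGA"))        # GEAS*
```

With the values in this record, 5901186 - 5900065 + 1 gives the 1122 nt Sequence Length, and 1122 / 3 - 1 gives the 373-residue Protein Length once the terminal TGA stop codon is excluded.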
AnnotationID:1709
GeneID:1700
AnnotatorUID: diggins
Modification Date/Time: 2005-03-14 10:10:42
Gene Name:waaG
Confidence Code:2
GeneProduct:lipopolysaccharide core biosynthesis protein WaaG
Cell Localization:()
Synonyms:rfaG
Cell Localization Confidence Code:5
MolecularFunction:
Functional Category:(7) Cell wall / LPS / capsule
Alternate Gene Product Name:UDP-glucose:(heptosyl) LPS alpha 1,3-glucosyltransferase WaaG
Functional Category Confidence Code:5
COGs:pfam00534,COG0438
Secondary Functional Category(ies):
EC Number:
Status:ACTIVE
Pathway:
Homology:
>gb|AAN65974.1|AE016226_3 (AE016775) lipopolysaccharide core biosynthesis protein WaaG
           [Pseudomonas putida KT2440]
          Length = 374

 Score =  622 bits (1603), Expect = e-177
 Identities = 303/371 (81%), Positives = 332/371 (89%)

>gb|AAO58429.1| (AE016874) lipopolysaccharide core biosynthesis protein WaaG
           [Pseudomonas syringae pv. tomato str. DC3000]
          Length = 373

 Score =  604 bits (1557), Expect = e-172
 Identities = 295/371 (79%), Positives = 324/371 (87%)

>gb|AAN82891.1|AE016769_6 (AE016769) Lipopolysaccharide core biosynthesis protein rfaG
           [Escherichia coli CFT073]
          Length = 374

 Score =  397 bits (1020), Expect = e-110
 Identities = 191/370 (51%), Positives = 247/370 (66%)
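The Identities and Positives lines in the hits above can be read mechanically. The sketch below parses one such line and recomputes each percentage. It assumes the percentages are truncated rather than rounded, which is consistent with the values in this record (303/371 is 81.67%, reported as 81%). The helper names are hypothetical.

```python
import re

# Matches e.g. "Identities = 303/371 (81%)" or "Positives = 332/371 (89%)".
FIELD_RE = re.compile(
    r"(Identities|Positives|Gaps)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)"
)


def parse_alignment_line(line: str) -> dict:
    """Return {field: (count, aligned_length, reported_percent)} for one line."""
    return {
        name: (int(count), int(total), int(pct))
        for name, count, total, pct in FIELD_RE.findall(line)
    }


def truncated_percent(count: int, total: int) -> int:
    """Integer percentage with the fractional part dropped (not rounded)."""
    return 100 * count // total


def is_consistent(line: str) -> bool:
    """True if every reported percentage equals the truncated ratio."""
    return all(
        truncated_percent(count, total) == pct
        for count, total, pct in parse_alignment_line(line).values()
    )


if __name__ == "__main__":
    hit = " Identities = 303/371 (81%), Positives = 332/371 (89%)"
    print(parse_alignment_line(hit)["Identities"])  # (303, 371, 81)
    print(is_consistent(hit))                       # True
```

Note that `is_consistent` returns True for a line containing none of the three fields, so callers that need strict validation should check that the parsed dictionary is non-empty first.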

Structural Features:
pfam00534, Glycos_transf_1, Glycosyl transferases group 1.
CD-Length = 172 residues, 100.0% aligned
Score =  113 bits (285), Expect = 3e-26
Identities = 50/173 (28%), Positives = 85/173 (49%), Gaps = 4/173 (2%)

COG0438, RfaG, Glycosyltransferase [Cell envelope biogenesis, outer membrane]
CD-Length = 381 residues,  93.7% aligned
Score = 83.6 bits (204), Expect = 4e-17
Genomic Context:
Comment:
ReferenceID:3403
Author/Investigator(s): GenNotator
Title:
PubMed:
MedLine:
Source:
Reference Type:BLASTP
Data:
URL:
ReferenceID:20550
Author/Investigator(s): GenNotator
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:RPSBLAST
Data:
URL:
ReferenceID:35050
Author/Investigator(s):
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:Misc (text/plain)
Data:1700_cdd.html
URL:
ReferenceID:35872
Author/Investigator(s): Parker CT, Pradel E, Schnaitman CA.
Title: Identification and sequences of the lipopolysaccharide core biosynthetic genes rfaQ, rfaP, and rfaG of Escherichia coli K-12.
PubMed: 1732225
MedLine:
Source: J Bacteriol. 1992 Feb;174(3):930-4.
Reference Type:Journal
Data:
URL:http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=1732225
Homologs By Global Alignment
Gene ID:1700

HomologID | Accession | Description | Length | PctIdentity | PctSimilarity | Gaps | Score
2989 | PA5010_tr | translation of PA5010 | 373 | 99.46 | 99.73 | 0 | 1922.0
93104 | gb|AAG08395.1|AE004913_10 | (AE004913) UDP-glucose:(heptosyl) LPS alpha 1,3-glucosyltransferase WaaG [Pseudomonas aeruginosa PAO1] | 373 | 99.46 | 99.73 | 0 | 1922.0
93105 | gb|AAC33168.1| | (U63816) glucosyltransferase I homolog [Pseudomonas aeruginosa] | 373 | 99.46 | 99.73 | 0 | 1919.0
93106 | gb|AAD33103.1| | (AF090724) glucosyltransferase I [Pseudomonas aeruginosa] | 373 | 99.19 | 99.46 | 0 | 1916.0
93107 | gb|AAN65974.1|AE016226_3 | (AE016775) lipopolysaccharide core biosynthesis protein WaaG [Pseudomonas putida KT2440] | 374 | 81.01 | 89.03 | 1 | 1603.0
GC ORFID: 44702
How Found: BLASTX
GC_TrimmedSeqID: 69
Blast Result ID:
Subject Sequence Name: t_PA5010
Glimmer Score:
Start: 5164869
Stop: 5163748
Length: 1122
Start Codon: ATG
Truncated Start:
Stop Codon: TGA
Truncated Stop:
Homolog: t_PA5010 translation of PA5010
Homolog Bit Score: 744.0
Other Homologs: psyr_15may02_Scaffold2_revised_gene3380 (ORF: BLASTX 607.0), t_PA5010 (ORF: GLIMMER)
GC ORF Sequence:
ATGACCCTGGCGTTCATCCTCTACAAATACTTCCCCTTCGGCGGCTTGCAGCGTGACTTCATGCGCATCGCC
CTGGAATGCCAGCGGCGCGGGCACGACATCCGCGTCTATACGCTGATCTGGGAGGGCGACGTGCCGGACGGC
TTCGAAGTGCTGGTGGCCCCGGTGCGCTCGATCTTCAACCACCGGCGCAACGAGAAGTTCACCGCGTGGGTC
CGCGCCGACCTGGCCAGGCGCCCGGTGCAGCGGGTGATCGGCTTCAACAAGATGCCCGGACTGGATGTCTAC
TACGCCGCCGACGCCTGTTTCGAGGAAAAGGCCCAGACCTTGCGCAACCCGCTGTACCGCCAGTGGGGCCGC
TACCGCCACTTCGCCGGCTACGAACGGGCAGTGTTCGACCCGGCCTCGAAGACCGAGATCCTGATGATCTCC
GAGGTGCAGCAGCCGCTCTTCGTCAAGCACTACGGCACCCAGGCCGAGCGTTTCCATCTGCTGCCGCCGGGG
ATCAGCCAGGATCGCCGGGCGCCGGCCAACGCCGCGGACGTGCGTGCGGAATTCCGCCGCGAGTTCGGCCTG
GAGGAGGACGACCTGCTGCTGGTGCAGATTGGCTCCGGCTTCAAGACCAAGGGCCTGGATCGCAGCCTGAAG
GCGCTGGCCGCGCTGCCCAAGGCGTTGCGCAGGCGTACCCGGCTGATCGCCATCGGCCAGGACGATCCCAAG
CCGTTCCTGCTACAGATCGCCGCCCTCGGTCTCAACGACCAGGTACAGATCCTCAAGGGTCGCAGCGATATC
CCGCGCTTCCTGCTCGGCGCCGACCTGCTGATCCACCCGGCCTACAACGAGAACACCGGTACGGTGCTGCTG
GAGGCGCTGGTCTCCGGCCTGCCGGTGTTGGTGACCGATGTCTGCGGCTATGCCCACTACATCGCCGAGGCC
GACGCCGGGCGGGTGCTGCCGAGTCCGTTCGAGCAGGACAGTCTCAACCGCCTGCTCGCGGAAATGCTGGAG
GACGCTCCGGCGCGCGCCGCCTGGTCGCGCAATGGCCTGGCCTACGCCGATCACGCCGACCTCTACAGCATG
CCGCAGCGCGCCGCCGACCTGATCCTCGGGGAGGCCTCATGA