ViewGene

Locus Name:
GeneID:
Location:

GeneID:820
TrivialName: PA14_10630
AnnotatorUID:
Modification Date/Time:2006-04-06 13:01:23
LocusName:PA14_10630
PAO1 Ortholog Locus:PA4123
Sequence Length:1461
Protein Length:486
Start: 916843
Stop: 915383
Strand: -
Type:
ChromosomeID:2
Status: ACTIVE
Frame Discrepancy: FALSE
Missense Discrepancy:FALSE
Comments:
Homology: gb|AAG07510.1|AE004828_11 (AE004828) 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase [Pseudomonas aeruginosa PAO1]
Identities = 485/486 (99%), Positives = 485/486 (99%)
Sequence:
ATGATCAAACACTGGATCAATGGCCGCGAGGTCGAAAGCAAGGACGTCTTCGAGAACTACAACCCGGCCACC
GGCGAGCTGATCGGCGAGGTCGCCAGCGGCGGCGCGGCGGAAATCGACGCGGCGGTGGCGGCGGCCCGGGAA
GCCTTCCCGAAATGGGCCAATACCCCGGCCAAGGAGCGCGCGCGCCTGATGCGCCGGCTCGGCGAGCTGATC
GACCGGAACGTGCCGCACCTGGCGGAACTGGAGACCCTCGACACCGGCCTGCCGATCCACCAGACGAAGAAT
GTGCTGATCCCGCGCGCCTCGCACAACTTCGAGTTCTTCGCCGAAGTCTGCACGCGGATGAACGGGCACAGC
TATCCGGTCGACGACCAGATGCTCAACTACACCCTGTACCAGCCGGTGGGCGTCTGTGGCCTGGTCTCGCCG
TGGAACGTACCGTTCATGACCGCCACCTGGAAGACCGCGCCGTGCCTGGCGCTGGGCAACACGGCGGTGCTG
AAGATGTCCGAGCTGTCGCCGCTGACCGCCAACGAGCTGGGCCGCCTGGTGCACGAGGCGGGCATTCCGCCG
GGGGTGTTCAACGTGGTCCAGGGCTACGGCGCCAGCGCCGGCGACGCGCTGGTTCGCCACCGCGACGTGCGC
GCGGTGTCCTTCACCGGCGGCACCGCCACCGGGCGACGAATCATGGAGGCGGCCGGCATCAAGAAATACTCG
ATGGAGCTGGGCGGCAAGTCGCCGGTGCTGGTCTTCGAGGACGCCGACCTCGAGCGGGCGCTGGACGCCGCG
CTGTTCACCATCTTCTCGCTGAACGGCGAGCGCTGCACCGCCGGCAGTCGCATCTTCGTCCAGGAAAGCGTC
TACCCGCAGTTCGTCGCCGAGTTCGCCGCGCGCGCCAGGCGCCTGATCGTCGGCGATCCGCAGGACCCGAAG
ACCCAGGTCGGCTCGATGATCACCCAGGCCCACTACGACAAGGTCACCGGCTACATCCGCATCGGCCTCGAG
GAAGGCGCCACCCTGGTGGCCGGCGGCCTGGAGCGTCCGACCGGCCTGCCGGCGCACCTGAGCAAGGGGCAG
TTCATCCAGCCCACGGTGTTCGCCGACGTGGACAACCGCATGCGCATCGCCCAGGAGGAGATCTTCGGCCCG
GTGGTCTGCCTAATCCCGTTCAAGGACGAAGCCGAGGCGCTGCGCCTGGCCAACGACGTGGAATACGGCCTG
GCCTCCTACATCTGGACCCAGGACATCGGCAAGGCCCATCGCCTGGCACGCGGCATCGAGGCCGGGATGGTC
TTCATCAACAGCCAGAACGTGCGCGACCTGCGCCAGCCGTTCGGCGGGGTGAAGGCCTCGGGCACCGGACGC
GAAGGCGGGGAATACAGCTTCGAGGTATTCGCCGAGATCAAGAACGTGTGTATATCCATGGGCAGCCATCAC
ATCCCCCGCTGGGGCGTGTAG
Translation:
MIKHWINGREVESKDVFENYNPATGELIGEVASGGAAEIDAAVAAAREAFPKWANTPAKERARLMRRLGELI
DRNVPHLAELETLDTGLPIHQTKNVLIPRASHNFEFFAEVCTRMNGHSYPVDDQMLNYTLYQPVGVCGLVSP
WNVPFMTATWKTAPCLALGNTAVLKMSELSPLTANELGRLVHEAGIPPGVFNVVQGYGASAGDALVRHRDVR
AVSFTGGTATGRRIMEAAGIKKYSMELGGKSPVLVFEDADLERALDAALFTIFSLNGERCTAGSRIFVQESV
YPQFVAEFAARARRLIVGDPQDPKTQVGSMITQAHYDKVTGYIRIGLEEGATLVAGGLERPTGLPAHLSKGQ
FIQPTVFADVDNRMRIAQEEIFGPVVCLIPFKDEAEALRLANDVEYGLASYIWTQDIGKAHRLARGIEAGMV
FINSQNVRDLRQPFGGVKASGTGREGGEYSFEVFAEIKNVCISMGSHHIPRWGV*
AnnotationID:829GeneID:820
AnnotatorUID: diggins
Modification Date/Time: 2005-03-30 13:01:05
Gene Name:hpcC
Confidence Code:2
GeneProduct:5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase
Cell Localization:()
Synonyms:hpaE
Cell Localization Confidence Code:5
MolecularFunction:
Functional Category:(5) Carbon compound catabolism
Alternate Gene Product Name:4-hydroxyphenylacetate catabolism protein
Functional Category Confidence Code:5
COGs:pfam00171,COG1012
Secondary Functional Category(ies):
EC Number:1.2.1.-
Status:ACTIVE
Pathway:
Homology:
>gb|AAO17179.1| (AF346500) HpaE [Photorhabdus luminescens]
          Length = 488

 Score =  811 bits (2094), Expect = 0.0
 Identities = 383/486 (78%), Positives = 435/486 (89%)

>gb|AAL20034.1| (AE008747) 4-hydroxyphenylacetate catabolism protein [Salmonella
           typhimurium LT2]
          Length = 488

 Score =  810 bits (2093), Expect = 0.0
 Identities = 388/485 (80%), Positives = 434/485 (89%)

>emb|CAA86041.1| (Z37980) 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase
           [Escherichia coli]
          Length = 488

 Score =  806 bits (2081), Expect = 0.0
 Identities = 386/485 (79%), Positives = 433/485 (89%)

Structural Features:
pfam00171, Aldedh, Aldehyde dehydrogenase family. 
CD-Length = 464 residues,  99.6% aligned
Score =  535 bits (1381), Expect = 4e-153
Identities = 224/470 (47%), Positives = 305/470 (64%), Gaps = 11/470 (2%)

COG1012, PutA, NAD-dependent aldehyde dehydrogenases 
[Energy production and conversion]
CD-Length = 472 residues, 100.0% aligned
Score =  532 bits (1372), Expect = 3e-152
Identities = 236/482 (48%), Positives = 300/482 (62%), Gaps = 14/482 (2%)
Genomic Context:
Comment:
ReferenceID:1643
Author/Investigator(s): GenNotator
Title:
PubMed:
MedLine:
Source:
Reference Type:BLASTP
Data:
URL:
ReferenceID:18785
Author/Investigator(s): GenNotator
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:RPSBLAST
Data:
URL:
ReferenceID:36963
Author/Investigator(s): Prieto MA, Diaz E, Garcia JL.
Title: Molecular characterization of the 4-hydroxyphenylacetate catabolic pathway of Escherichia coli W: engineering a mobile aromatic degradative cluster.
PubMed: 8550403
MedLine:
Source:
J Bacteriol. 1996 Jan;178(1):111-20.
Reference Type:Journal
Data:
URL:http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=8550403
ReferenceID:36967
Author/Investigator(s): Roper DI, Stringfellow JM, Cooper RA.
Title: Sequence of the hpcC and hpcG genes of the meta-fission homoprotocatechuic acid pathway of Escherichia coli C: nearly 40% amino-acid identity with the analogous enzymes of the catechol pathway.
PubMed: 7737515
MedLine:
Source:
Gene. 1995 Apr 14;156(1):47-51.
Reference Type:Journal
Data:
URL:http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=7737515
ReferenceID:36977
Author/Investigator(s): Jenkins JR, Cooper RA.
Title: Molecular cloning, expression, and analysis of the genes of the homoprotocatechuate catabolic pathway of Escherichia coli C.
PubMed: 3053656
MedLine:
Source:
J Bacteriol. 1988 Nov;170(11):5317-24.
Reference Type:Journal
Data:
URL:http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=3053656
ReferenceID:38084
Author/Investigator(s):
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:Misc (text/plain)
Data:820_cdd.html
URL:
Homologs By Global Alignment
Gene ID:820

Identity:

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
HomologID Accession Description Length PctIdentity PctSimilarity Gaps Score
1472 PA4123_tr translation of PA4123 486 99.79 99.79 0 2523.0
53559 gb|AAG07510.1|AE004828_11 (AE004828) 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase [Pseudomonas aeruginosa PAO1] 486 99.79 99.79 0 2523.0
53560 gb|AAK03614.1| (AE006189) HpaE [Pasteurella multocida] 486 84.77 92.79 0 2181.0
53562 gb|AAL20034.1| (AE008747) 4-hydroxyphenylacetate catabolism protein [Salmonella typhimurium LT2] 488 79.50 88.93 2 2092.0
53563 emb|CAA86041.1| (Z37980) 5-carboxy-2-hydroxymuconate semialdehyde dehydrogenase [Escherichia coli] 488 79.09 88.72 2 2080.0
GC ORFID: 43822How Found: BLASTX
GC_TrimmedSeqID: 69Blast Result ID
Subject Sequence Name: t_PA4123Glimmer Score:
Start: 4037094Stop: 4038554
Length: 1461
Start Codon: ATGTruncated Start:
Stop Codon: TAGTruncated Stop:
Homolog: t_PA4123 translation of PA4123Homolog Bit Score 976.0
Other Homologs: t_PA5373 (ORF: BLASTX 369.0), psyr_15may02_Scaffold1_revised_gene155 (ORF: BLASTX 365.0), t_PA4022 (ORF: BLASTX 347.0), t_PA1984 (ORF: BLASTX 347.0), t_PA4073 (ORF: BLASTX 346.0), t_PA3504 (ORF: BLASTX 345.0), psyr_15may02_Scaffold2_revised_gene3624 (ORF: BLASTX 341.0), t_PA0219 (ORF: BLASTX 340.0), psyr_15may02_Scaffold2_revised_gene3988 (ORF: BLASTX 310.0), psyr_15may02_Scaffold2_revised_gene3643 (ORF: BLASTX 291.0), t_PA4123 (ORF: GLIMMER)
GC ORF Sequence
ATGATCAAACACTGGATCAATGGCCGCGAGGTCGAAAGCAAGGACGTCTTCGAGAACTACAACCCGGCCACC
GGCGAGCTGATCGGCGAGGTCGCCAGCGGCGGCGCGGCGGAAATCGACGCGGCGGTGGCGGCGGCCCGGGAA
GCCTTCCCGAAATGGGCCAATACCCCGGCCAAGGAGCGCGCGCGCCTGATGCGCCGGCTCGGCGAGCTGATC
GACCGGAACGTGCCGCACCTGGCGGAACTGGAGACCCTCGACACCGGCCTGCCGATCCACCAGACGAAGAAT
GTGCTGATCCCGCGCGCCTCGCACAACTTCGAGTTCTTCGCCGAAGTCTGCACGCGGATGAACGGGCACAGC
TATCCGGTCGACGACCAGATGCTCAACTACACCCTGTACCAGCCGGTGGGCGTCTGTGGCCTGGTCTCGCCG
TGGAACGTACCGTTCATGACCGCCACCTGGAAGACCGCGCCGTGCCTGGCGCTGGGCAACACGGCGGTGCTG
AAGATGTCCGAGCTGTCGCCGCTGACCGCCAACGAGCTGGGCCGCCTGGTGCACGAGGCGGGCATTCCGCCG
GGGGTGTTCAACGTGGTCCAGGGCTACGGCGCCAGCGCCGGCGACGCGCTGGTTCGCCACCGCGACGTGCGC
GCGGTGTCCTTCACCGGCGGCACCGCCACCGGGCGACGAATCATGGAGGCGGCCGGCATCAAGAAATACTCG
ATGGAGCTGGGCGGCAAGTCGCCGGTGCTGGTCTTCGAGGACGCCGACCTCGAGCGGGCGCTGGACGCCGCG
CTGTTCACCATCTTCTCGCTGAACGGCGAGCGCTGCACCGCCGGCAGTCGCATCTTCGTCCAGGAAAGCGTC
TACCCGCAGTTCGTCGCCGAGTTCGCCGCGCGCGCCAGGCGCCTGATCGTCGGCGATCCGCAGGACCCGAAG
ACCCAGGTCGGCTCGATGATCACCCAGGCCCACTACGACAAGGTCACCGGCTACATCCGCATCGGCCTCGAG
GAAGGCGCCACCCTGGTGGCCGGCGGCCTGGAGCGTCCGACCGGCCTGCCGGCGCACCTGAGCAAGGGGCAG
TTCATCCAGCCCACGGTGTTCGCCGACGTGGACAACCGCATGCGCATCGCCCAGGAGGAGATCTTCGGCCCG
GTGGTCTGCCTAATCCCGTTCAAGGACGAAGCCGAGGCGCTGCGCCTGGCCAACGACGTGGAATACGGCCTG
GCCTCCTACATCTGGACCCAGGACATCGGCAAGGCCCATCGCCTGGCACGCGGCATCGAGGCCGGGATGGTC
TTCATCAACAGCCAGAACGTGCGCGACCTGCGCCAGCCGTTCGGCGGGGTGAAGGCCTCGGGCACCGGACGC
GAAGGCGGGGAATACAGCTTCGAGGTATTCGCCGAGATCAAGAACGTGTGTATATCCATGGGCAGCCATCAC
ATCCCCCGCTGGGGCGTGTAG