ViewGene

Locus Name:
GeneID:
Location:

GeneID:199
TrivialName: PA14_41220
AnnotatorUID:
Modification Date/Time:2006-04-06 13:01:23
LocusName:PA14_41220
PAO1 Ortholog Locus:PA1803
Sequence Length:2397
Protein Length:798
Start: 3678450
Stop: 3676054
Strand: -
Type:
ChromosomeID:2
Status: ACTIVE
Frame Discrepancy: FALSE
Missense Discrepancy:FALSE
Comments:
Homology: gb|AAG05192.1|AE004606_6 (AE004606) Lon protease [Pseudomonas aeruginosa PAO1]
Identities = 797/798 (99%), Positives = 798/798 (100%)
Sequence:
ATGAAAACACTCGTCGAATTGCCCTTGCTGCCGCTACGTGACGTGGTGGTGTACCCGCACATGGTCATCCCG
TTGTTCGTCGGTCGGGAAAAGTCCATCGAGGCCCTGGAGGCAGCGATGACCGGCGACAAGCAAATCCTCCTG
CTGGCGCAGAAAAACCCCGCGGACGACGATCCGGGCGAAGATGGCCTGTACCGCATGGGTACGGTCGCCACC
GTGCTGCAGTTGCTCAAGCTGCCGGATGGCACCGTCAAGGTGCTGGTCGAAGGCGAGCAGCGCGGCCAGGTC
GAGCGCTTCATCGAGGAAGAAGGGCACATTCGTGCCGCGGTCCAGGCCATCGATGACGCCAATGTCGGCGAG
CGCGAGGCCGAGGTCTTCACTCGCAGCCTGCTGAGCCAGTTCGAGCAATACGTCCAGTTGGGCAAGAAAGTC
CCCGCCGAGGTGCTTTCTTCGCTGAACAGCATCGACGAGCCGAGCAGGCTGGTCGATACCATGGCCGCGCAC
ATGGCGCTGAAGATCGAGCAGAAGCAGGACATCCTGGAGATCACCGACCTGTCGTCGCGGGTCGAGCATGTC
CTGGCGCTGCTGGATGCCGAGATCGACCTGCTGCAGGTGGAAAAGCGTATCCGCGGCCGGGTCAAGAAGCAG
ATGGAGCGCAGCCAGCGCGAGTACTACCTGAATGAGCAGATGAAGGCCATTCAGAAGGAACTCGGCGATATC
GACGAAGGGCACAACGAAGTCGAGGAGCTGAAGAAGCGCATCGACGCCGCCGGCCTGACCAAGGAGGCGCAC
ACCAAGGCCACTGCCGAGCTGAACAAGCTCAAGCAGATGTCGCCGATGTCGGCGGAAGCCACCGTGGTGCGT
TCCTACATAGACTGGCTGCTGAACGTGCCGTGGAAGGCCGAGAGCAAGGTGCGCCATGATCTCGCCAAGGCG
GAAGACATCCTCGATGCCGACCATTACGGCCTGGAAGAGGTCAAGGAGCGCATTCTCGAGTATCTCGCCGTG
CAAAAGCGGGTGAAGAAGCTCAAGGGCCCGGTCCTTTGCCTGGTGGGGCCGCCCGGCGTGGGCAAGACCTCC
CTGGCCGAATCCATCGCTCGCGCCACCAATCGCAAGTTCGTGCGCATGGCGCTCGGCGGCGTGCGTGACGAG
GCCGAGATCCGCGGTCACCGTCGTACCTATATCGGCTCCATGCCGGGCCGCCTGATCCAGAAGATGACCAAG
GTCGGCGTGCGCAACCCGCTGTTCCTCCTCGACGAAATCGACAAGATGGGCAGCGACATGCGCGGGGATCCC
GCCTCGGCGCTGCTCGAGGTGCTCGACCCGGAGCAGAACCACAACTTCAACGATCACTACCTGGAGGTCGAC
TACGACCTGTCCGACGTGATGTTCCTCTGCACCGCAAACTCGATGAACATCCCAGCGCCCCTGCTGGATCGG
ATGGAAGTCATCCGCCTGCCGGGCTACACCGAGGACGAGAAGGTCAACATCGCCTCCAAGTACCTGATTCCG
AAGCAGGTCCAGGCCAACGGCCTGAAGAAGGGCGAGCTGACCTTCGAGGAAGGCGCCCTGCGCGACATCATT
CGCTACTACACCCGCGAAGCCGGGGTGCGCAGCCTCGAGCGGCAGATCGCCAAGGTCTGCCGCAAGGCGGTG
AAGGAGCATGCCAAGCTCAAGCGTATCCAGGCGGTGGTGTCCAGCGAGACGCTGGAGAACTACCTCGGCGTG
CGCAAGTTCCGTTATGGCCTCGCCGAACAGCAGGACCAGATCGGCCAGGTGACCGGGCTGGCCTGGACCCAG
GTCGGCGGCGAGCTGCTCACCATCGAGGCGGCCGTGGTACCGGGCAAGGGCCAGTTGACCAAGACCGGATCG
CTGGGCGACGTGATGGCGGAATCGATCACCGCGGCGTTGACCGTGGTGCGCAGCCGCGCCCAGAGCCTAGGA
ATCGCGGCGGACTTCCACGAGAAGCGTGACATCCATATCCACGTTCCGGAAGGCGCTACGCCGAAGGATGGC
CCGAGCGCAGGCATCGGCATGTGCACGGCGCTGGTTTCGGCGATCACGCAGATCCCGGTGCGCGCCGACGTG
GCAATGACCGGGGAGATCACCCTGCGTGGGCAGGTGCTGGCGATCGGCGGGCTGAAAGAAAAACTTTTGGCG
GCGCACCGCGGCGGGATCAAGACGGTGATCATTCCCGAGGAAAATGTTCGCGACCTGAAGGAAATTCCGGAC
AATATTAAGAGTGATCTGGTTATTAAACCGGTTAAATGGATTGACGAAGTCCTGCAAATTGCGCTGCAATAC
GCCCCGGAGCCCTTGCCCGATGCGGCTCCGGAGATGGTTGCAAAGGATGAAAAACGCGAGCCTGATTCCAAG
GAGCGAATTAGCACGCATTAG
Translation:
MKTLVELPLLPLRDVVVYPHMVIPLFVGREKSIEALEAAMTGDKQILLLAQKNPADDDPGEDGLYRMGTVAT
VLQLLKLPDGTVKVLVEGEQRGQVERFIEEEGHIRAAVQAIDDANVGEREAEVFTRSLLSQFEQYVQLGKKV
PAEVLSSLNSIDEPSRLVDTMAAHMALKIEQKQDILEITDLSSRVEHVLALLDAEIDLLQVEKRIRGRVKKQ
MERSQREYYLNEQMKAIQKELGDIDEGHNEVEELKKRIDAAGLTKEAHTKATAELNKLKQMSPMSAEATVVR
SYIDWLLNVPWKAESKVRHDLAKAEDILDADHYGLEEVKERILEYLAVQKRVKKLKGPVLCLVGPPGVGKTS
LAESIARATNRKFVRMALGGVRDEAEIRGHRRTYIGSMPGRLIQKMTKVGVRNPLFLLDEIDKMGSDMRGDP
ASALLEVLDPEQNHNFNDHYLEVDYDLSDVMFLCTANSMNIPAPLLDRMEVIRLPGYTEDEKVNIASKYLIP
KQVQANGLKKGELTFEEGALRDIIRYYTREAGVRSLERQIAKVCRKAVKEHAKLKRIQAVVSSETLENYLGV
RKFRYGLAEQQDQIGQVTGLAWTQVGGELLTIEAAVVPGKGQLTKTGSLGDVMAESITAALTVVRSRAQSLG
IAADFHEKRDIHIHVPEGATPKDGPSAGIGMCTALVSAITQIPVRADVAMTGEITLRGQVLAIGGLKEKLLA
AHRGGIKTVIIPEENVRDLKEIPDNIKSDLVIKPVKWIDEVLQIALQYAPEPLPDAAPEMVAKDEKREPDSK
ERISTH*
AnnotationID:209GeneID:199
AnnotatorUID: diggins
Modification Date/Time: 2005-06-02 06:06:58
Gene Name:lon
Confidence Code:2
GeneProduct:Lon protease
Cell Localization:(1) Cytoplasmic
Synonyms:
Cell Localization Confidence Code:2
MolecularFunction:
Functional Category:(26) Translation, post-translational modification, degradation
Alternate Gene Product Name:endopeptidase La
Functional Category Confidence Code:5
COGs:COG0466,pfam05362
Secondary Functional Category(ies):chaperones & heat shock proteins
EC Number:3.4.21.53
Status:ACTIVE
Pathway:
Homology:
>gb|AAF65564.1|AF250140_1 (AF250140) protease Lon [Pseudomonas fluorescens]
          Length = 798

 Score = 1410 bits (3651), Expect = 0.0
 Identities = 721/798 (90%), Positives = 752/798 (94%)

>gb|AAN67915.1|AE016424_1 (AE016782) ATP-dependent protease La [Pseudomonas putida KT2440]
          Length = 798

 Score = 1394 bits (3607), Expect = 0.0
 Identities = 711/798 (89%), Positives = 750/798 (93%)

>gb|AAO70000.1| (AE016842) Lon protease [Salmonella enterica subsp. enterica
           serovar Typhi Ty2]
          Length = 784

 Score = 1090 bits (2819), Expect = 0.0
 Identities = 546/769 (71%), Positives = 654/769 (85%), Gaps = 5/769 (0%)
Structural Features:
COG0466, Lon, ATP-dependent Lon protease, bacterial type 
[Posttranslational modification, protein turnover, chaperones]
CD-Length = 782 residues,  99.4% aligned
Score = 1209 bits (3129), Expect = 0.0
Identities = 532/777 (68%), Positives = 637/777 (81%), Gaps = 4/777 (0%)

pfam05362, Lon_C, Lon protease (S16) C-terminal proteolytic domain.
CD-Length = 205 residues,  99.0% aligned
Score =  336 bits (862), Expect = 9e-93
Identities = 144/203 (70%), Positives = 171/203 (84%)
Genomic Context:
Comment:
other possible gene names: capR, deg, lopA, muc
ReferenceID:401
Author/Investigator(s): GenNotator
Title:
PubMed:
MedLine:
Source:
Reference Type:BLASTP
Data:
URL:
ReferenceID:17543
Author/Investigator(s): GenNotator
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:RPSBLAST
Data:
URL:
ReferenceID:37130
Author/Investigator(s):
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:Misc (text/plain)
Data:199_cdd.html
URL:
ReferenceID:37801
Author/Investigator(s): Chin DT, Goff SA, Webster T, Smith T, Goldberg AL.
Title: Sequence of the lon gene in Escherichia coli. A heat-shock gene which encodes the ATP-dependent protease La.
PubMed: 3042779
MedLine:
Source:
J Biol Chem. 1988 Aug 25;263(24):11718-28.
Reference Type:Journal
Data:
URL:http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=3042779
ReferenceID:37802
Author/Investigator(s): Riethdorf S, Volker U, Gerth U, Winkler A, Engelmann S, Hecker M.
Title: Cloning, nucleotide sequence, and expression of the Bacillus subtilis lon gene.
PubMed: 7961402
MedLine:
Source:
J Bacteriol. 1994 Nov;176(21):6518-27.
Reference Type:Journal
Data:
URL:http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=7961402
Homologs By Global Alignment
Gene ID:199

Identity:

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
HomologID Accession Description Length PctIdentity PctSimilarity Gaps Score
340 PA1803_tr translation of PA1803 798 99.87 99.99 0 4005.0
21533 gb|AAG05192.1|AE004606_6 (AE004606) Lon protease [Pseudomonas aeruginosa PAO1] 798 99.87 99.99 0 4005.0
21534 gb|AAF65564.1|AF250140_1 (AF250140) protease Lon [Pseudomonas fluorescens] 798 90.35 94.23 0 3651.0
21535 gb|AAN67915.1|AE016424_1 (AE016782) ATP-dependent protease La [Pseudomonas putida KT2440] 798 89.09 93.98 0 3607.0
21537 gb|AAO57193.1| (AE016869) ATP-dependent protease La [Pseudomonas syringae pv. tomato str. DC3000] 798 88.72 93.48 0 3592.0
GC ORFID: 43201How Found: BLASTX
GC_TrimmedSeqID: 69Blast Result ID
Subject Sequence Name: t_PA1803Glimmer Score:
Start: 1275459Stop: 1277855
Length: 2397
Start Codon: ATGTruncated Start:
Stop Codon: TAGTruncated Stop:
Homolog: t_PA1803 translation of PA1803Homolog Bit Score 1547.0
Other Homologs: psyr_15may02_Scaffold1_revised_gene778 (ORF: BLASTX 1386.0), t_PA1803 (ORF: GLIMMER)
GC ORF Sequence
ATGAAAACACTCGTCGAATTGCCCTTGCTGCCGCTACGTGACGTGGTGGTGTACCCGCACATGGTCATCCCG
TTGTTCGTCGGTCGGGAAAAGTCCATCGAGGCCCTGGAGGCAGCGATGACCGGCGACAAGCAAATCCTCCTG
CTGGCGCAGAAAAACCCCGCGGACGACGATCCGGGCGAAGATGGCCTGTACCGCATGGGTACGGTCGCCACC
GTGCTGCAGTTGCTCAAGCTGCCGGATGGCACCGTCAAGGTGCTGGTCGAAGGCGAGCAGCGCGGCCAGGTC
GAGCGCTTCATCGAGGAAGAAGGGCACATTCGTGCCGCGGTCCAGGCCATCGATGACGCCAATGTCGGCGAG
CGCGAGGCCGAGGTCTTCACTCGCAGCCTGCTGAGCCAGTTCGAGCAATACGTCCAGTTGGGCAAGAAAGTC
CCCGCCGAGGTGCTTTCTTCGCTGAACAGCATCGACGAGCCGAGCAGGCTGGTCGATACCATGGCCGCGCAC
ATGGCGCTGAAGATCGAGCAGAAGCAGGACATCCTGGAGATCACCGACCTGTCGTCGCGGGTCGAGCATGTC
CTGGCGCTGCTGGATGCCGAGATCGACCTGCTGCAGGTGGAAAAGCGTATCCGCGGCCGGGTCAAGAAGCAG
ATGGAGCGCAGCCAGCGCGAGTACTACCTGAATGAGCAGATGAAGGCCATTCAGAAGGAACTCGGCGATATC
GACGAAGGGCACAACGAAGTCGAGGAGCTGAAGAAGCGCATCGACGCCGCCGGCCTGACCAAGGAGGCGCAC
ACCAAGGCCACTGCCGAGCTGAACAAGCTCAAGCAGATGTCGCCGATGTCGGCGGAAGCCACCGTGGTGCGT
TCCTACATAGACTGGCTGCTGAACGTGCCGTGGAAGGCCGAGAGCAAGGTGCGCCATGATCTCGCCAAGGCG
GAAGACATCCTCGATGCCGACCATTACGGCCTGGAAGAGGTCAAGGAGCGCATTCTCGAGTATCTCGCCGTG
CAAAAGCGGGTGAAGAAGCTCAAGGGCCCGGTCCTTTGCCTGGTGGGGCCGCCCGGCGTGGGCAAGACCTCC
CTGGCCGAATCCATCGCTCGCGCCACCAATCGCAAGTTCGTGCGCATGGCGCTCGGCGGCGTGCGTGACGAG
GCCGAGATCCGCGGTCACCGTCGTACCTATATCGGCTCCATGCCGGGCCGCCTGATCCAGAAGATGACCAAG
GTCGGCGTGCGCAACCCGCTGTTCCTCCTCGACGAAATCGACAAGATGGGCAGCGACATGCGCGGGGATCCC
GCCTCGGCGCTGCTCGAGGTGCTCGACCCGGAGCAGAACCACAACTTCAACGATCACTACCTGGAGGTCGAC
TACGACCTGTCCGACGTGATGTTCCTCTGCACCGCAAACTCGATGAACATCCCAGCGCCCCTGCTGGATCGG
ATGGAAGTCATCCGCCTGCCGGGCTACACCGAGGACGAGAAGGTCAACATCGCCTCCAAGTACCTGATTCCG
AAGCAGGTCCAGGCCAACGGCCTGAAGAAGGGCGAGCTGACCTTCGAGGAAGGCGCCCTGCGCGACATCATT
CGCTACTACACCCGCGAAGCCGGGGTGCGCAGCCTCGAGCGGCAGATCGCCAAGGTCTGCCGCAAGGCGGTG
AAGGAGCATGCCAAGCTCAAGCGTATCCAGGCGGTGGTGTCCAGCGAGACGCTGGAGAACTACCTCGGCGTG
CGCAAGTTCCGTTATGGCCTCGCCGAACAGCAGGACCAGATCGGCCAGGTGACCGGGCTGGCCTGGACCCAG
GTCGGCGGCGAGCTGCTCACCATCGAGGCGGCCGTGGTACCGGGCAAGGGCCAGTTGACCAAGACCGGATCG
CTGGGCGACGTGATGGCGGAATCGATCACCGCGGCGTTGACCGTGGTGCGCAGCCGCGCCCAGAGCCTAGGA
ATCGCGGCGGACTTCCACGAGAAGCGTGACATCCATATCCACGTTCCGGAAGGCGCTACGCCGAAGGATGGC
CCGAGCGCAGGCATCGGCATGTGCACGGCGCTGGTTTCGGCGATCACGCAGATCCCGGTGCGCGCCGACGTG
GCAATGACCGGGGAGATCACCCTGCGTGGGCAGGTGCTGGCGATCGGCGGGCTGAAAGAAAAACTTTTGGCG
GCGCACCGCGGCGGGATCAAGACGGTGATCATTCCCGAGGAAAATGTTCGCGACCTGAAGGAAATTCCGGAC
AATATTAAGAGTGATCTGGTTATTAAACCGGTTAAATGGATTGACGAAGTCCTGCAAATTGCGCTGCAATAC
GCCCCGGAGCCCTTGCCCGATGCGGCTCCGGAGATGGTTGCAAAGGATGAAAAACGCGAGCCTGATTCCAAG
GAGCGAATTAGCACGCATTAG