ViewGene

Locus Name:
GeneID:
Location:

GeneID:913
TrivialName: PA14_62770
AnnotatorUID:
Modification Date/Time:2006-04-06 13:01:23
LocusName:PA14_62770
PAO1 Ortholog Locus:PA4745
Sequence Length:1482
Protein Length:493
Start: 5603387
Stop: 5601906
Strand: -
Type:
ChromosomeID:2
Status: ACTIVE
Frame Discrepancy: FALSE
Missense Discrepancy:FALSE
Comments:
Homology: gb|AAG08131.1|AE004888_6 (AE004888) N utilization substance protein A [Pseudomonas aeruginosa PAO1]
Identities = 493/493 (100%), Positives = 493/493 (100%)
Sequence:
ATGAGCAAAGAAGTACTGCTGGTTGTTGAGTCGGTATCCAACGAAAAGGGCGTACCGGCCGGCGTGATTTTC
GAGGCGCTGGAGCTGGCTCTGGCGACCGCCACCAAGAAACGTTTCGAGGACGAGGTCGACCTGCGCGTCGAG
ATCAATCGTCATAACGGCAGCTACGAGACTTTCCGTCGTTGGCACGTCGTCGCCGACGAGGACTACCAGGAC
CCCGCCACCGAGATCACCGTCGAGGACGTCCAGGAGCAGAAGCCGGGCGCGAAGGTCGGCGAGGTCATCGAA
GAAAAGATCGAATCCATCGAGTTCGGCCGCATCGCTGCGCAGACTGCCAAGCAGGTCATCGTGCAGAAGGTC
CGCGAGGCCGAGCGCGCCCAGGTGGTCGATGCCTACCGCGAGAAGGTCGGCGAGATCATTTCCGGTACCGTG
AAGAAGGTCACCCGCGACAACGTCATCGTCGATCTCGGCAACAACGCCGAGGCGCTGCTGGCCCGCGACCAG
ATCATTCCGCGCGAGACCTTCCGCGTTGGCACCCGCGTGCGTGCCCTGCTGAAGGAGATCCGTACCGAGAAC
CGCGGTCCTCAGCTGGTCCTGTCGCGTACCGCGCCGGAAATGCTGATCGAGCTGTTCCGCATCGAAGTACCG
GAAATCGCCGAGCAGTTGATCGACGTGATGGCCGCCGCCCGTGACCCGGGCTCGCGCGCCAAGATCGCCGTT
CGTTCCAAGGACAAGCGCATCGACCCGCAGGGCGCCTGCATCGGCATGCGCGGTTCGCGCGTCCAGGCGGTA
TCCGGCGAGATCGGTGGAGAGCGGGTGGACATCGTCCTGTGGGACGACAACCCGGCGCAATTCGTGATCAAC
GCCATGGCTCCGGCCGAGGTGGCGGCGATCATCGTCGACGAGGATACCCATACCATGGATATCGCCGTCGCC
GAGGACAATCTGGCGCAGGCTATCGGCCGCAGTGGCCAGAACGTCCGTCTGGCCAGCCAGTTGACCGGCTGG
ACCCTGAATGTGATGACCGAGGCGGATATCCAGGCCAAGCAACAGGCCGAGACCGGGGATATCCTGCAGCGC
TTCGTCGACGAGCTGGATGTCGACGAGGAACTGGCCCAGGTACTGGTGGAAGAGGGCTTCACGACTCTTGAG
GAAATCGCCTACGTACCGATGGAAGAGATGCTCAGCATCGATGGCTTCGACGAAGACATCGTCAACGAGCTG
CGCTCCCGTGCCAAGGATCGCCTGCTGACCAAGGCCATTGCGACCGAAGAGAAGCTCGCCGACGCACAACCG
GCAGAAGACCTGCTCAGCCTCGATGGCATGAGCAAGGAGCTGGCCCTGGACCTGGCGCTGCGTGGCGTAACC
ACCCGTGAAGATCTGGCCGAGCAATCGATCGACGATCTGCTCGACATCGACGGCATGGACGAAGAGCGTGCC
GGCAAGTTGATCATGGCCGCCCGGGCCCATTGGTTCGAGTAA
Translation:
MSKEVLLVVESVSNEKGVPAGVIFEALELALATATKKRFEDEVDLRVEINRHNGSYETFRRWHVVADEDYQD
PATEITVEDVQEQKPGAKVGEVIEEKIESIEFGRIAAQTAKQVIVQKVREAERAQVVDAYREKVGEIISGTV
KKVTRDNVIVDLGNNAEALLARDQIIPRETFRVGTRVRALLKEIRTENRGPQLVLSRTAPEMLIELFRIEVP
EIAEQLIDVMAAARDPGSRAKIAVRSKDKRIDPQGACIGMRGSRVQAVSGEIGGERVDIVLWDDNPAQFVIN
AMAPAEVAAIIVDEDTHTMDIAVAEDNLAQAIGRSGQNVRLASQLTGWTLNVMTEADIQAKQQAETGDILQR
FVDELDVDEELAQVLVEEGFTTLEEIAYVPMEEMLSIDGFDEDIVNELRSRAKDRLLTKAIATEEKLADAQP
AEDLLSLDGMSKELALDLALRGVTTREDLAEQSIDDLLDIDGMDEERAGKLIMAARAHWFE*
AnnotationID:922GeneID:913
AnnotatorUID: diggins
Modification Date/Time: 2005-03-09 08:08:30
Gene Name:nusA
Confidence Code:2
GeneProduct:N utilization substance protein A
Cell Localization:()
Synonyms:
Cell Localization Confidence Code:5
MolecularFunction:
Functional Category:(24) Transcription, RNA processing and degradation
Alternate Gene Product Name:transcription pausing; L factor
Functional Category Confidence Code:5
COGs:COG0195
Secondary Functional Category(ies):
EC Number:
Status:ACTIVE
Pathway:
Homology:
>gb|AAN70285.1|AE016669_3 (AE016791) N utilization substance protein A [Pseudomonas putida
           KT2440]
          Length = 493

 Score =  843 bits (2178), Expect = 0.0
 Identities = 435/493 (88%), Positives = 466/493 (94%)

>gb|AAO57939.1| (AE016872) N utilization substance protein A [Pseudomonas syringae
           pv. tomato str. DC3000]
          Length = 493

 Score =  840 bits (2170), Expect = 0.0
 Identities = 432/493 (87%), Positives = 466/493 (94%)

>gb|AAG58305.1|AE005545_7 (AE005545) transcription pausing; L factor [Escherichia coli
           O157:H7 EDL933]
          Length = 495

 Score =  617 bits (1592), Expect = e-176
 Identities = 317/493 (64%), Positives = 391/493 (79%), Gaps = 3/493 (0%)
Structural Features:
COG0195, NusA, Transcription elongation factor [Transcription]
CD-Length = 190 residues, 100.0% aligned
Score =  180 bits (457), Expect = 4e-46
Identities = 97/196 (49%), Positives = 138/196 (70%), Gaps = 7/196 (3%)
Genomic Context:
Comment:
ReferenceID:1829
Author/Investigator(s): GenNotator
Title:
PubMed:
MedLine:
Source:
Reference Type:BLASTP
Data:
URL:
ReferenceID:18971
Author/Investigator(s): GenNotator
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:RPSBLAST
Data:
URL:
ReferenceID:37668
Author/Investigator(s):
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:Misc (text/plain)
Data:913_cdd.html
URL:
ReferenceID:37726
Author/Investigator(s): Saito M, Tsugawa A, Egawa K, Nakamura Y.
Title: Revised sequence of the nusA gene of Escherichia coli and identification of nusA11 (ts) and nusA1 mutations which cause changes in a hydrophobic amino acid cluster.
PubMed: 3027511
MedLine:
Source:
Mol Gen Genet. 1986 Nov;205(2):380-2.
Reference Type:Journal
Data:
URL:http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=3027511
ReferenceID:37727
Author/Investigator(s): Imamoto F, Nakamura Y.
Title: Escherichia coli proteins involved in regulation of transcription termination: function, structure, and expression of the nusA and nusB genes.
PubMed: 3019094
MedLine:
Source:
Adv Biophys. 1986;21:175-92.
Reference Type:Journal
Data:
URL:http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract&list_uids=3019094
ReferenceID:37728
Author/Investigator(s): Ishii S, Ihara M, Maekawa T, Nakamura Y, Uchida H, Imamoto F.
Title: The nucleotide sequence of the cloned nusA gene and its flanking region of Escherichia coli.
PubMed: 6326058
MedLine:
Source:
Nucleic Acids Res. 1984 Apr 11;12(7):3333-42.
Reference Type:Journal
Data:
URL:http://www.pubmedcentral.nih.gov/articlerender.fcgi?tool=pubmed&pubmedid=6326058
Homologs By Global Alignment
Gene ID:913

Identity:

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
HomologID Accession Description Length PctIdentity PctSimilarity Gaps Score
1623 PA4745_tr translation of PA4745 493 99.99 99.99 0 2431.0
58530 gb|AAG08131.1|AE004888_6 (AE004888) N utilization substance protein A [Pseudomonas aeruginosa PAO1] 493 99.99 99.99 0 2431.0
58531 gb|AAN70285.1|AE016669_3 (AE016791) N utilization substance protein A [Pseudomonas putida KT2440] 493 88.23 94.52 0 2178.0
58532 gb|AAO57939.1| (AE016872) N utilization substance protein A [Pseudomonas syringae pv. tomato str. DC3000] 493 87.62 94.52 0 2170.0
58540 dbj|BAB37473.1| (AP002564) transcription termination-antitermination factor NusA [Escherichia coli O157:H7] 497 63.78 78.67 6 1594.0
GC ORFID: 43915How Found: BLASTX
GC_TrimmedSeqID: 69Blast Result ID
Subject Sequence Name: t_PA4745Glimmer Score:
Start: 4867067Stop: 4865586
Length: 1482
Start Codon: ATGTruncated Start:
Stop Codon: TAATruncated Stop:
Homolog: t_PA4745 translation of PA4745Homolog Bit Score 941.0
Other Homologs: psyr_15may02_Scaffold4_revised_gene5492 (ORF: BLASTX 840.0), t_PA4745 (ORF: GLIMMER)
GC ORF Sequence
ATGAGCAAAGAAGTACTGCTGGTTGTTGAGTCGGTATCCAACGAAAAGGGCGTACCGGCCGGCGTGATTTTC
GAGGCGCTGGAGCTGGCTCTGGCGACCGCCACCAAGAAACGTTTCGAGGACGAGGTCGACCTGCGCGTCGAG
ATCAATCGTCATAACGGCAGCTACGAGACTTTCCGTCGTTGGCACGTCGTCGCCGACGAGGACTACCAGGAC
CCCGCCACCGAGATCACCGTCGAGGACGTCCAGGAGCAGAAGCCGGGCGCGAAGGTCGGCGAGGTCATCGAA
GAAAAGATCGAATCCATCGAGTTCGGCCGCATCGCTGCGCAGACTGCCAAGCAGGTCATCGTGCAGAAGGTC
CGCGAGGCCGAGCGCGCCCAGGTGGTCGATGCCTACCGCGAGAAGGTCGGCGAGATCATTTCCGGTACCGTG
AAGAAGGTCACCCGCGACAACGTCATCGTCGATCTCGGCAACAACGCCGAGGCGCTGCTGGCCCGCGACCAG
ATCATTCCGCGCGAGACCTTCCGCGTTGGCACCCGCGTGCGTGCCCTGCTGAAGGAGATCCGTACCGAGAAC
CGCGGTCCTCAGCTGGTCCTGTCGCGTACCGCGCCGGAAATGCTGATCGAGCTGTTCCGCATCGAAGTACCG
GAAATCGCCGAGCAGTTGATCGACGTGATGGCCGCCGCCCGTGACCCGGGCTCGCGCGCCAAGATCGCCGTT
CGTTCCAAGGACAAGCGCATCGACCCGCAGGGCGCCTGCATCGGCATGCGCGGTTCGCGCGTCCAGGCGGTA
TCCGGCGAGATCGGTGGAGAGCGGGTGGACATCGTCCTGTGGGACGACAACCCGGCGCAATTCGTGATCAAC
GCCATGGCTCCGGCCGAGGTGGCGGCGATCATCGTCGACGAGGATACCCATACCATGGATATCGCCGTCGCC
GAGGACAATCTGGCGCAGGCTATCGGCCGCAGTGGCCAGAACGTCCGTCTGGCCAGCCAGTTGACCGGCTGG
ACCCTGAATGTGATGACCGAGGCGGATATCCAGGCCAAGCAACAGGCCGAGACCGGGGATATCCTGCAGCGC
TTCGTCGACGAGCTGGATGTCGACGAGGAACTGGCCCAGGTACTGGTGGAAGAGGGCTTCACGACTCTTGAG
GAAATCGCCTACGTACCGATGGAAGAGATGCTCAGCATCGATGGCTTCGACGAAGACATCGTCAACGAGCTG
CGCTCCCGTGCCAAGGATCGCCTGCTGACCAAGGCCATTGCGACCGAAGAGAAGCTCGCCGACGCACAACCG
GCAGAAGACCTGCTCAGCCTCGATGGCATGAGCAAGGAGCTGGCCCTGGACCTGGCGCTGCGTGGCGTAACC
ACCCGTGAAGATCTGGCCGAGCAATCGATCGACGATCTGCTCGACATCGACGGCATGGACGAAGAGCGTGCC
GGCAAGTTGATCATGGCCGCCCGGGCCCATTGGTTCGAGTAA