ViewGene

GeneID:5684
TrivialName: PA14_46520
AnnotatorUID:
Modification Date/Time:2006-04-06 13:01:23
LocusName:PA14_46520
PAO1 Ortholog Locus:
Sequence Length:1140
Protein Length:379
Start: 4141240
Stop: 4140101
Strand: -
Type:
ChromosomeID:2
Status: ACTIVE
Frame Discrepancy: FALSE
Missense Discrepancy:FALSE
Comments:
Homology: gb|AAO35096.1| (AE015937) putative S-layer protein [Clostridium tetani E88]
Identities = 55/144 (38%), Positives = 68/144 (47%), Gaps = 21/144 (14%)
Sequence:
ATGAAACGGCACACAACCTTGGCCCGCTATTTCGGGTGTGCCCTGCTGACGCTCAGTGTCCAAGGATGCGTG
ACCTGGGTTGATGTGGAGAAATTCAAGCAACCTGCGCCTGACCCCAGGACTCCCAGCGAAGTGCCATCATCG
AGCCAGAAGGGGTTTCGGTACTCGATGCCGGAGCCCTACCTGTTGGTCAAGCCCAAGGCCGATGGCACTGCA
ACCTACGAGTGGGTCTTCCTGCCTGACCGGAACAACGAGTACGTGGTAGCACCCAAGTCCCTCTTTGCCACC
TACAAGATGACGGTCGCGACCGAGAACGGCTTTCTAACGTCCGCCAGCTTTGATGGCACGGCGAACGAGGTG
GCGAGCAAGCTGGCTAGCGTCGTTGGCGACGTGAATGCAGCAAACAAGACATCGGAGTCGGCTGCCCAGAAG
GCCATTGAGCAAGCAGCCCAGACCAAGGCAGCAGCGGATGAGACGGCTTTCAACACCAAGCTGGCGGCTGCG
CAAAAGGCAGTGGCCGATGCGGAGGCTACACAAACAAGTGCCGAGGCCGAACTTAAGTTTTACGAGTCGGAC
GCTGGCAAGGGCGCAAAGGACGAGGTGAAGCTGGCGGCCCAGCTCGCGAAGCAGAAAGCGGACGCAACGCTC
GCTTTGATGAACAAACGCCTCCAGGACTTGCTTGTCAGCAGCAGTGGCGCAAAGGACGCCGGCGCTGCCGAG
CCAGGCATTAAGCACGCCATGGGGCCGGTACTGTTCAAGCTCGTGCAGACGGCGAACTCGGTCTCCCTAGTG
CAGGTCGACATCCAGCGCACCTTCGAGACTTCCGGCACGCCGGCCAAGGAGACTTCACCACCCGCTGGAGCA
GCGCCTACGCTCACCGCGAAGAATGTGAGCAATACCGCAGGAACAACGGTTGTCGACTTCTCGGCATCGGCC
GCAATCGAGATCGTCAACGACGCTTTGCTAACTTTAAACAAGGGGAATGTCGCCTACGACAAGTCCAAGGTT
GTCTACAAGAAGGGAGCCGGGGACAAACAGTACCAGGCCACCTTCAAGCCCACGCTGCCTGCGGGCAAATAC
ACGCTGCTGGTCGTGTACGACAAGGACAAGTCGGCGCAGCTGGAGTTCACCGTCAAGTGA
Translation:
MKRHTTLARYFGCALLTLSVQGCVTWVDVEKFKQPAPDPRTPSEVPSSSQKGFRYSMPEPYLLVKPKADGTA
TYEWVFLPDRNNEYVVAPKSLFATYKMTVATENGFLTSASFDGTANEVASKLASVVGDVNAANKTSESAAQK
AIEQAAQTKAAADETAFNTKLAAAQKAVADAEATQTSAEAELKFYESDAGKGAKDEVKLAAQLAKQKADATL
ALMNKRLQDLLVSSSGAKDAGAAEPGIKHAMGPVLFKLVQTANSVSLVQVDIQRTFETSGTPAKETSPPAGA
APTLTAKNVSNTAGTTVVDFSASAAIEIVNDALLTLNKGNVAYDKSKVVYKKGAGDKQYQATFKPTLPAGKY
TLLVVYDKDKSAQLEFTVK*
AnnotationID:5686
GeneID:5684
AnnotatorUID: danlee
Modification Date/Time: 2005-09-28 13:01:55
Gene Name:
Confidence Code:4
GeneProduct:hypothetical protein
Cell Localization:()
Synonyms:
Cell Localization Confidence Code:5
MolecularFunction:
Functional Category:(14) Hypothetical, unclassified, unknown
Alternate Gene Product Name:
Functional Category Confidence Code:5
COGs:
Secondary Functional Category(ies):
EC Number:
Status:ACTIVE
Pathway:
Homology:
Structural Features:
Amino acids 121-374 align with amino acids 119-376 of: 
>emb|CAD05209.1| (AL627268) tolA protein [Salmonella enterica subsp. enterica serovar Typhi]
Length = 376 Score = 52.8 bits (125), Expect = 4e-06
 Identities = 86/285 (30%), Positives = 117/285.
Genomic Context:
Comment:
ReferenceID:11319
Author/Investigator(s): GenNotator
Title:
PubMed:
MedLine:
Source:
Reference Type:BLASTP
Data:
URL:
ReferenceID:28500
Author/Investigator(s): GenNotator
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:RPSBLAST
Data:
URL:
GC ORFID: 48686
How Found: Glimmer
GC_TrimmedSeqID: 69
Blast Result ID:
Subject Sequence Name:
Glimmer Score:
Start: 812669
Stop: 813808
Length: 1140
Start Codon: ATG
Truncated Start:
Stop Codon: TGA
Truncated Stop:
Homolog:
Homolog Bit Score:
Other Homologs:
GC ORF Sequence:
ATGAAACGGCACACAACCTTGGCCCGCTATTTCGGGTGTGCCCTGCTGACGCTCAGTGTCCAAGGATGCGTG
ACCTGGGTTGATGTGGAGAAATTCAAGCAACCTGCGCCTGACCCCAGGACTCCCAGCGAAGTGCCATCATCG
AGCCAGAAGGGGTTTCGGTACTCGATGCCGGAGCCCTACCTGTTGGTCAAGCCCAAGGCCGATGGCACTGCA
ACCTACGAGTGGGTCTTCCTGCCTGACCGGAACAACGAGTACGTGGTAGCACCCAAGTCCCTCTTTGCCACC
TACAAGATGACGGTCGCGACCGAGAACGGCTTTCTAACGTCCGCCAGCTTTGATGGCACGGCGAACGAGGTG
GCGAGCAAGCTGGCTAGCGTCGTTGGCGACGTGAATGCAGCAAACAAGACATCGGAGTCGGCTGCCCAGAAG
GCCATTGAGCAAGCAGCCCAGACCAAGGCAGCAGCGGATGAGACGGCTTTCAACACCAAGCTGGCGGCTGCG
CAAAAGGCAGTGGCCGATGCGGAGGCTACACAAACAAGTGCCGAGGCCGAACTTAAGTTTTACGAGTCGGAC
GCTGGCAAGGGCGCAAAGGACGAGGTGAAGCTGGCGGCCCAGCTCGCGAAGCAGAAAGCGGACGCAACGCTC
GCTTTGATGAACAAACGCCTCCAGGACTTGCTTGTCAGCAGCAGTGGCGCAAAGGACGCCGGCGCTGCCGAG
CCAGGCATTAAGCACGCCATGGGGCCGGTACTGTTCAAGCTCGTGCAGACGGCGAACTCGGTCTCCCTAGTG
CAGGTCGACATCCAGCGCACCTTCGAGACTTCCGGCACGCCGGCCAAGGAGACTTCACCACCCGCTGGAGCA
GCGCCTACGCTCACCGCGAAGAATGTGAGCAATACCGCAGGAACAACGGTTGTCGACTTCTCGGCATCGGCC
GCAATCGAGATCGTCAACGACGCTTTGCTAACTTTAAACAAGGGGAATGTCGCCTACGACAAGTCCAAGGTT
GTCTACAAGAAGGGAGCCGGGGACAAACAGTACCAGGCCACCTTCAAGCCCACGCTGCCTGCGGGCAAATAC
ACGCTGCTGGTCGTGTACGACAAGGACAAGTCGGCGCAGCTGGAGTTCACCGTCAAGTGA
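
The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal Python sketch and is not part of the original record. The function names `span_length` and `protein_length` are hypothetical, and the sketch assumes that Start and Stop are inclusive coordinates and that the coding sequence ends in exactly one stop codon (shown as `*` in the Translation field).

```python
def span_length(start: int, stop: int) -> int:
    """Number of bases covered by an inclusive coordinate pair.

    Works for either strand: on the minus strand Start is greater than Stop.
    """
    return abs(stop - start) + 1


def protein_length(nt_length: int) -> int:
    """Residue count for a coding sequence, excluding the terminal stop codon."""
    if nt_length % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return nt_length // 3 - 1


# Values copied from the record fields above.
assert span_length(4141240, 4140101) == 1140   # Start/Stop on strand "-"
assert span_length(812669, 813808) == 1140     # GC ORF Start/Stop
assert protein_length(1140) == 379             # Protein Length
```

Under these assumptions, both coordinate pairs give the 1140 bases listed as Sequence Length, and 1140 bases correspond to 380 codons, that is, 379 residues plus the stop codon.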