ViewGene

Locus Name:
GeneID:
Location:

GeneID:6084
TrivialName: PA14_31070
AnnotatorUID:
Modification Date/Time:2006-04-06 13:01:23
LocusName:PA14_31070
PAO1 Ortholog Locus:
Sequence Length:2361
Protein Length:786
Start: 2703530
Stop: 2701170
Strand: -
Type:
ChromosomeID:2
Status: ACTIVE
Frame Discrepancy: FALSE
Missense Discrepancy:FALSE
Comments:
Homology: emb|CAD16308.1| (AL646070) HYPOTHETICAL/UNKNOWN PROTEIN [Ralstonia solanacearum]
Identities = 604/682 (88%), Positives = 643/682 (94%), Gaps = 4/682 (0%)
Sequence:
ATGTCGTCGCGCGTGCAGTGCGAGCGCCGATCCTACGTATCGGGTGCCGCACTTCGCGCTGCCTTCCCATGT
GCAGGCGTCTTGCCTCGAACGCTGCCGGCCTATGGCCGTCGCCGCGTTCGCCGGCGCGCTGGCTGCTCGCGC
AGCAGCGCCGCCGGGCCGCCGCTGCCCGGAGCGCCAGCGAGGGGCAAAGGCGGAAGGCAAGACAAAAGGACG
CGGCACCGGGCCGCGTCGAAAGCCAGTCTGCACGTGGGGTTGGCGCGGCACGGCGCGGCTTTGCCGCCGTGC
CGCGTGGGGCGCGAGGCCCGCGCCAATGCAGGCATGTCCGCGTGCTTCGCACGACAGGACACGCCAGAGCTT
GCAGGGAGCGCAGCCATGAGCGACCGCCGCGACGACGATTTCCGCGTGCGCCCCAGCGCCCCGAAGAACCGG
GGCAAGGGCCAGGGACAGAGCTTCGTTTCCAAGGTGCTCAAGCAGACGGGCAAGGCCAGCGGCGGCAAGTCC
ACGGTGCGCCGTTCTGGCGCAGCGCGTGGCACCGGCCAGCGTCCCGGCTCGCGCCTGGGCCGCGGCCATACG
GCGGCGCGCTTCGCGGGCGCGAAGCTGACACCCATGTCGCGGCGCGTGACCATCAAGACGCTGCTGGTCAAC
CAGCGCCAGGCCAGCCCGCAATCGCTCGCCAAGCACCTTCGCTATATCGAACGTGATGGCGTGGGGCGCGAC
GGCGAGCCGGGCCAAGCCTACGGGCCGCAGACCGACGCCGCCGACCTCGACGCCTTCAAGGAACGCTGCGCC
GACGACCGGCACCATTTCCGCTTCATCCTCTCGCCCGAGGATGGCGCGGAACTGGAAGACCTGCGCACCTAT
ACGCGGCACCTCATGGGCCGCATGGAAGCCGACCTGGGCACAGGCCTCGATTGGGTGGCCGTGAACCACTGG
AACACCGACAACCCGCACATGCACATCGTCGTGCGCGGGCGCGACGACACCGGCAAAGACCTCATCATCGCG
GGCGACTACATCGCCGATGGTTTCCGCCACCGCGCCGCCGAGCTGGCGACCGAATGGCTGGGGCCGCGCACC
GAACTGGAGATCCAGCAGACTTTGCAGCGCGAGGTGGAACAGGAACGGTGGACGAGCCTGGATCGCACCTTG
AAGCGCGAGGCCGGCGACCATGGCCTGGTGCATGTCGAACGGCTCAACGAACCCCGCTTGCAGCGCCAACGC
CTGCTGCTGATCGGCCGCCTGCAACGCTTGCAGCGCCTGGGCCTGGCCGACGAGACGCAGCCCGGCACCTGG
GCCGTCCATGCCGATGCGGAAAAGACCTTGCGCGCCCTGGGCGAGCGCGGCGACATCATCCGCACGATGCAG
CGGGCCATGCGCGGCGAGCCGCGCGAGCTGGCGGTGTTCGAGCCTGGAGACGATGGCCGAACCATCCTCGGG
CGCGTGGCCGCGAAGGGACTGGCCGACGAGCTGCGCGACCGGGGCTATCTGGTCATCGACGGCGTGGACGGC
AAGGCCCACTACGTCGCGCTCAACGCCCGCGACGAGCTGGCGAACTATCCGACCGGGGCCGTGGTGGAGGTG
AAGGGATCGGCCGACGTGCGCGCGGCCGACAGGAACATCGCCGCGCTGGCGAGCGATGGCCTGTACCGCACC
GACCATCACCTCGCCATCGCGCAGGGCCAGGTCGTCCCCGGACGCGACCCGCAGGAGGTTGTGGCGGCCCAT
ATCCGCAGGCTGGAAGCCCTACGCCGGGCGGGCATCGTGGAGCGCGTGGCCGAGGGGCTATGGAAGGTGCCG
GGCGATCTGCCCGAGCAGGGCCGCCGCTACGACGCGCAGCGCATGGGCGGTGTGGCCGTGGAGCTGAAATCT
CACCTGCCCATCGAGCGGCAGGCCCGCGTAATCGGGGCCACCTGGCTAGATCAGCAGTTGATCGGTGGCGGC
TCGGGCCTGGGCGACCTGGGTTTCGGTAGCGAGGCCAGGCAGGCGATGCAGCAGCGCGCCGACTTCCTGGCC
GAACAGGGGCTGGCCGAGCGGCGCGGGCAGCGTGTGATCCTGGCGCGCAACCTGCTGGGCACGCTGCGCAAC
CGGGAACTGGCACAGGCCGCCAAAGACATTGCCGCCGATACCGGCCTGGAGCATCGCCCGGTGGCCGACGGG
CAGCGCGTGTCCGGCATCTACCGGCGCTCCGTCATGCTCGCCAGCGGGCGCTACGCGATGCTCGATGACGGC
ATGGGCTTCTCGCTGGTGCCGTGGAAGCCGGTGATCGAGCAGCGGCTGGGGCAGCAGCTTGCGGCAACCGTG
CGCGGTGGCGGGGTGTCCTGGGAGATTGGGCGGCAACGCGGGCCTACTGTCGCTTGA
Translation:
MSSRVQCERRSYVSGAALRAAFPCAGVLPRTLPAYGRRRVRRRAGCSRSSAAGPPLPGAPARGKGGRQDKRT
RHRAASKASLHVGLARHGAALPPCRVGREARANAGMSACFARQDTPELAGSAAMSDRRDDDFRVRPSAPKNR
GKGQGQSFVSKVLKQTGKASGGKSTVRRSGAARGTGQRPGSRLGRGHTAARFAGAKLTPMSRRVTIKTLLVN
QRQASPQSLAKHLRYIERDGVGRDGEPGQAYGPQTDAADLDAFKERCADDRHHFRFILSPEDGAELEDLRTY
TRHLMGRMEADLGTGLDWVAVNHWNTDNPHMHIVVRGRDDTGKDLIIAGDYIADGFRHRAAELATEWLGPRT
ELEIQQTLQREVEQERWTSLDRTLKREAGDHGLVHVERLNEPRLQRQRLLLIGRLQRLQRLGLADETQPGTW
AVHADAEKTLRALGERGDIIRTMQRAMRGEPRELAVFEPGDDGRTILGRVAAKGLADELRDRGYLVIDGVDG
KAHYVALNARDELANYPTGAVVEVKGSADVRAADRNIAALASDGLYRTDHHLAIAQGQVVPGRDPQEVVAAH
IRRLEALRRAGIVERVAEGLWKVPGDLPEQGRRYDAQRMGGVAVELKSHLPIERQARVIGATWLDQQLIGGG
SGLGDLGFGSEARQAMQQRADFLAEQGLAERRGQRVILARNLLGTLRNRELAQAAKDIAADTGLEHRPVADG
QRVSGIYRRSVMLASGRYAMLDDGMGFSLVPWKPVIEQRLGQQLAATVRGGGVSWEIGRQRGPTVA*
AnnotationID:6085GeneID:6084
AnnotatorUID: danlee
Modification Date/Time: 2005-09-30 16:04:06
Gene Name:
Confidence Code:4
GeneProduct:conserved hypothetical protein
Cell Localization:()
Synonyms:
Cell Localization Confidence Code:5
MolecularFunction:
Functional Category:(14) Hypothetical, unclassified, unknown
Alternate Gene Product Name:
Functional Category Confidence Code:5
COGs:COG3843
Secondary Functional Category(ies):
EC Number:
Status:ACTIVE
Pathway:
Homology:
gi|68526162|gb|EAN49136.1|  conserved hypothetical/unknown protein [Ralstonia metallidurans CH34]
     Length=700
     Score = 1283 bits (3319),  Expect = 0.0
     Identities = 641/702 (91%), Positives = 667/702 (95%), Gaps = 7/702 (0%)

gi|17547320|ref|NP_520722.1|  HYPOTHETICAL/UNKNOWN PROTEIN [Ralstonia solanacearum GMI1000]
     Length=683
     Score = 1221 bits (3159),  Expect = 0.0
     Identities = 604/682 (88%), Positives = 643/682 (94%), Gaps = 4/682 (0%)

gi|67088835|gb|EAM08301.1|  conserved hypothetical/unknown protein [Azotobacter vinelandii AvOP]
     Length=661
     Score = 1192 bits (3083),  Expect = 0.0
     Identities = 588/660 (89%), Positives = 623/660 (94%), Gaps = 0/660 (0%)

gi|72607449|gb|EAO43409.1|  conserved hypothetical/unknown protein [Burkholderia ambifaria  AMMD]
     Length=667
     Score = 1165 bits (3015),  Expect = 0.0
     Identities = 582/667 (87%), Positives = 617/667 (92%), Gaps = 7/667 (1%)

Structural Features:
COG3843, VirD2, Type IV secretory pathway, VirD2 components (relaxase) [Intracellular 
trafficking and secretion].
             CD-Length = 326 residues,  98.2% aligned
             Score =  253 bits (647), Expect = 6e-68
Genomic Context:
Comment:
putative Type IV secretion pathway component
ReferenceID:12048
Author/Investigator(s): GenNotator
Title:
PubMed:
MedLine:
Source:
Reference Type:BLASTP
Data:
URL:
ReferenceID:29288
Author/Investigator(s): GenNotator
Title: CDD Search
PubMed:
MedLine:
Source:
Reference Type:RPSBLAST
Data:
URL:
Homologs By Global Alignment
Gene ID:6084

Identity:

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
HomologID Accession Description Length PctIdentity PctSimilarity Gaps Score
198359 emb|CAD16308.1| (AL646070) HYPOTHETICAL/UNKNOWN PROTEIN [Ralstonia solanacearum] 790 76.45 81.39 111 3162.0
198360 emb|CAD61135.1| (AJ536756) hypothetical protein [Ralstonia oxalatica] 818 50.24 51.58 333 2016.0
198362 emb|CAD31511.1| (AL672112) HYPOTHETICAL PROTEIN [Mesorhizobium loti] 816 34.92 49.14 190 1258.0
198364 emb|CAE27677.1| (BX572600) conserved hypothetical protein [Rhodopseudomonas palustris CGA009] 812 34.60 47.04 254 1254.0
198361 dbj|BAB52523.1| (AP003008) unknown protein [Mesorhizobium loti] 812 34.60 49.13 182 1253.0
GC ORFID: 49086How Found: Glimmer
GC_TrimmedSeqID: 69Blast Result ID
Subject Sequence Name: Glimmer Score:
Start: 2250396Stop: 2252756
Length: 2361
Start Codon: ATGTruncated Start:
Stop Codon: TGATruncated Stop:
Homolog: Homolog Bit Score
Other Homologs:
GC ORF Sequence
ATGTCGTCGCGCGTGCAGTGCGAGCGCCGATCCTACGTATCGGGTGCCGCACTTCGCGCTGCCTTCCCATGT
GCAGGCGTCTTGCCTCGAACGCTGCCGGCCTATGGCCGTCGCCGCGTTCGCCGGCGCGCTGGCTGCTCGCGC
AGCAGCGCCGCCGGGCCGCCGCTGCCCGGAGCGCCAGCGAGGGGCAAAGGCGGAAGGCAAGACAAAAGGACG
CGGCACCGGGCCGCGTCGAAAGCCAGTCTGCACGTGGGGTTGGCGCGGCACGGCGCGGCTTTGCCGCCGTGC
CGCGTGGGGCGCGAGGCCCGCGCCAATGCAGGCATGTCCGCGTGCTTCGCACGACAGGACACGCCAGAGCTT
GCAGGGAGCGCAGCCATGAGCGACCGCCGCGACGACGATTTCCGCGTGCGCCCCAGCGCCCCGAAGAACCGG
GGCAAGGGCCAGGGACAGAGCTTCGTTTCCAAGGTGCTCAAGCAGACGGGCAAGGCCAGCGGCGGCAAGTCC
ACGGTGCGCCGTTCTGGCGCAGCGCGTGGCACCGGCCAGCGTCCCGGCTCGCGCCTGGGCCGCGGCCATACG
GCGGCGCGCTTCGCGGGCGCGAAGCTGACACCCATGTCGCGGCGCGTGACCATCAAGACGCTGCTGGTCAAC
CAGCGCCAGGCCAGCCCGCAATCGCTCGCCAAGCACCTTCGCTATATCGAACGTGATGGCGTGGGGCGCGAC
GGCGAGCCGGGCCAAGCCTACGGGCCGCAGACCGACGCCGCCGACCTCGACGCCTTCAAGGAACGCTGCGCC
GACGACCGGCACCATTTCCGCTTCATCCTCTCGCCCGAGGATGGCGCGGAACTGGAAGACCTGCGCACCTAT
ACGCGGCACCTCATGGGCCGCATGGAAGCCGACCTGGGCACAGGCCTCGATTGGGTGGCCGTGAACCACTGG
AACACCGACAACCCGCACATGCACATCGTCGTGCGCGGGCGCGACGACACCGGCAAAGACCTCATCATCGCG
GGCGACTACATCGCCGATGGTTTCCGCCACCGCGCCGCCGAGCTGGCGACCGAATGGCTGGGGCCGCGCACC
GAACTGGAGATCCAGCAGACTTTGCAGCGCGAGGTGGAACAGGAACGGTGGACGAGCCTGGATCGCACCTTG
AAGCGCGAGGCCGGCGACCATGGCCTGGTGCATGTCGAACGGCTCAACGAACCCCGCTTGCAGCGCCAACGC
CTGCTGCTGATCGGCCGCCTGCAACGCTTGCAGCGCCTGGGCCTGGCCGACGAGACGCAGCCCGGCACCTGG
GCCGTCCATGCCGATGCGGAAAAGACCTTGCGCGCCCTGGGCGAGCGCGGCGACATCATCCGCACGATGCAG
CGGGCCATGCGCGGCGAGCCGCGCGAGCTGGCGGTGTTCGAGCCTGGAGACGATGGCCGAACCATCCTCGGG
CGCGTGGCCGCGAAGGGACTGGCCGACGAGCTGCGCGACCGGGGCTATCTGGTCATCGACGGCGTGGACGGC
AAGGCCCACTACGTCGCGCTCAACGCCCGCGACGAGCTGGCGAACTATCCGACCGGGGCCGTGGTGGAGGTG
AAGGGATCGGCCGACGTGCGCGCGGCCGACAGGAACATCGCCGCGCTGGCGAGCGATGGCCTGTACCGCACC
GACCATCACCTCGCCATCGCGCAGGGCCAGGTCGTCCCCGGACGCGACCCGCAGGAGGTTGTGGCGGCCCAT
ATCCGCAGGCTGGAAGCCCTACGCCGGGCGGGCATCGTGGAGCGCGTGGCCGAGGGGCTATGGAAGGTGCCG
GGCGATCTGCCCGAGCAGGGCCGCCGCTACGACGCGCAGCGCATGGGCGGTGTGGCCGTGGAGCTGAAATCT
CACCTGCCCATCGAGCGGCAGGCCCGCGTAATCGGGGCCACCTGGCTAGATCAGCAGTTGATCGGTGGCGGC
TCGGGCCTGGGCGACCTGGGTTTCGGTAGCGAGGCCAGGCAGGCGATGCAGCAGCGCGCCGACTTCCTGGCC
GAACAGGGGCTGGCCGAGCGGCGCGGGCAGCGTGTGATCCTGGCGCGCAACCTGCTGGGCACGCTGCGCAAC
CGGGAACTGGCACAGGCCGCCAAAGACATTGCCGCCGATACCGGCCTGGAGCATCGCCCGGTGGCCGACGGG
CAGCGCGTGTCCGGCATCTACCGGCGCTCCGTCATGCTCGCCAGCGGGCGCTACGCGATGCTCGATGACGGC
ATGGGCTTCTCGCTGGTGCCGTGGAAGCCGGTGATCGAGCAGCGGCTGGGGCAGCAGCTTGCGGCAACCGTG
CGCGGTGGCGGGGTGTCCTGGGAGATTGGGCGGCAACGCGGGCCTACTGTCGCTTGA