Modification Date/Time: 2006-04-06 13:01:23
Missense Discrepancy: FALSE
Comments:Homology: emb|CAC17500.1| (AL939132) putative non-ribosomal peptide synthase [Streptomyces coelicolor A3(2)]
Identities = 401/951 (42%), Positives = 535/951 (56%), Gaps = 45/951 (4%)
Sequence:
ATGCTCCAGGCGCGAAAGGGAGAGATTCTGGCCCACCTGATCGCCACGGATACCAGCATCCAAGCTGACCCC
GCCAATCGGTTTGAGCCGTTTCCACTCACGGACCTGCAACTGGCGTACATCGTCGGGCGCCGCGACAACTAC
GAGCTCGGCGGCGTGGGCTGCCACAACTACCTCGAACTGCAGATGCCTGCCTTGGATCCGCAACGCCTGGAG
CGTGCATGGCATGCCCTGATCATGCGGCATGACATGTTGCGAGCGGAAATCGGCACCGACGGCCAACAACGC
GTCCTCAGAAGTGTCGTGCTGCCGCCGCTGCGCTGCGATGATCTGCGAGGGGCAAGCGCCGAGGAGTTTGAG
CGTGCCACGCTCGCCAGCCGAGACGAGATGGCATTTCGTCGATACGACTCCGAACGTTGGCCGCTCTATGAG
ATTCGCCTTACGCTGCACGACGACAGTTCGGTTCTGCACTACTCAACGGACCTGCTGATCGCCGACTTCGCC
AGCATCCAGTTGCTGCTGGCTGAGTTGGGTCAGCTCTACCATCGTCCCGAGACTGCTCCGGCACCACTAACG
CTGACTTTCCGCGACGTGGTGATGTCCGAGCGCCGGCGTCGTCAGCACCCGGATACCGAAGCTCGACAGCAA
AAGGACCGGGATTACTGGATGGCGCGGCTGCCGGACTTGCCCGGGGCCCCCGAGCTACCACTCCTGCCTCAT
GCTGCACGTCCCCTCCAGAACACCCCTAGCTTCGAGCGTCATGGCTTCGATCTTCCCGCGGGCTCCTGGCAA
CGGTTCAGCGAGATCGCAACGGCCCAGCAACTGACGCCGACCGCGGCCGTGCTAGCCGTCTTTACTGAGGTC
CTCCGGCTCTGGTCTCGCCAGCCCGACTTCTGCATCAACCTGACACTGTTCAATCGCCCGCCGGTCCATGAG
CAGATCCAGCATATCGTCGGCGACTTCATCGCCGTCAACGTCCTCGAGGTTCGCCTCGATGGTGGCACGACC
TTCCCGCAGCGTGCCCGGGCGTTGCAAACGCGCCTTTGGCAGGACATGGAGCATTCCGGCTTCACGGGCATC
GAGGTGCTTCGACAACTGTCGCGTCTGCATGGCAGCAATCAGCTCATCCCCGTGGTGTTCACCAGCACCGTG
GGCATCGCCGGTCAAGCACTGCCCCGGAATGACTTCATGCACGATGCCCAACTGCTCTACGGCATCACCCAG
ACGCCCCAGGTCTGGCTCGACTGCCAGGTAACCGAACGCAACGGTACATTGCATGTCGACTGGGACGTGCGC
AACGGCATCTTTCCCCCGGGCCTCATCGAACAGGCTTTCGCTGCCTTCACCCAGGCCATTACCTCGCTTTCT
CAAGGCCCGGACGCCTGGACACTCGGGCACCTGGTCACGCTACCCGAATCCACGCGACTGCAGCGCGAACGC
ATCAACACAGAGCGCAATGCCCCCTTGCCCTTGGGCTACCTGCATGGTGGCTTCTGCCGCCGCGCACTGGAT
TTTCCCGAGCGCCCTGCTTTGCTGTGCGGCGCAAGCGAATGGAACTACGGTCAGCTTGCTGCATGGGCCGTG
GCCATCGCCCGTGACCTGCGTGCCGCCGGCTGCGGGCCGGGCCAACCCGTCGCCTTGTTTCTCGACAAAGGG
CCCGCGCAGATCGCTGCCGTCCTGGGCGTCCTGCTGGCGGAGGGTGCCTACGTCCCCATAGACGTAGGCCAA
CCCGTCGAGCGAAGAGACACCATGCTGGCGGACGTCGGTGCGAAACTGCTGCTGACCGATTGCGAACATATC
GACGCTCAGTGGCCCGACGGCATCCAGCCGATGCTCGTTGGCGCCATGGACGCCTTGCCGCCCGAGGAACTG
GCCTCGGCCCTGCAAGAGGCCATCGCCGCGGCAGCGCACCGAGACACCGCAACGCAGTTGGCCTATGTGCTC
TATACCTCGGGCACCACGGGGCGGCCGAAGGGCGTCATGCTTACTCATCGGGGTGTCCTGAACACCATCCAA
GGTTTCAATCGCCAGTTCGGTCTTGACGAGAACGACAGATTCTTCGGTCTGGTGAACTACACCTTCGACCTC
TCGGTGCTGGACATTTTCTGCGCCTTCACCACCGGAGCGGCGTTGGTCCTGCCGCAGGGGCAGTGGCGCAAT
GACCCCGAGCAATGGGTATCGGCCATCGAGTTGCATCGGGCTACGGTCTGGAACTCCGTACCCGCGCACATG
CAAATGCTCTTAACCCACCTACCCCAGGGACGGATGCTCTCCAGCCTGCGGATAGGTTTTCTTTCTGGCGAC
TGGATCCCGGTGGCACTGCCGGACCAGGTACGTCAGCGCCTGCCCGGCATGGCACCCAAAAGCCTCGGTGGC
CCGACCGAGATTTCGGTGACCTGCATCTATCACGACATTGGTGATGTGCCCCAGGACGCAGTGTCGATCCCC
TACGGCTCGCCATTGAGCAACCACCGGCTTTATGTCCTGAATCACCAGCTTGAGCACTGCCCCAACTGGACC
CCGGGGGAAATGTATGTGGGCGGCCCCGGGGTCGCTCTCGGATTCGTCAATGATCCCGAGCGCACCCGCGAG
CGCTTCATCATTCACCCGCAGACCGGCGAGCGCCTGTACCGAACCGGCGATGTCTGCCGCTTCCGCGACGAC
GGCATCATCGAGATCCTGGGGCGCGAAGACAATCAGGTGAAGATCCGCGGCCATCGCATCGAATTGGGTGAT
GTGGAAGCGGCCTTTGCCTCACTGCCCGGTGTCGGCCGCGCTGTCGCGCTGGTACGCCGGGAGCCGCTGGAC
CTGGTGGCCGCAGTGCAGGTGTGCGAACCCTGCGATGATCCAGCCACGCTCATCGAGCAATGGCGTAAGGAC
CTGGCCACGCGTCTGCCTCGCTACATGTTGCCGTCGGCCATCGAGGTGTTGCCTCAGATTCCCTTGAGCCGC
AACGGCAAGGTCGACCGCAAGGCGCTCGCCGAACGCTTCCAGGGCGCGTTGGCGGGCGGCCGCGACCGGCAA
CCGCTCCGAGAGGACCCGCTGGAGCAGAAGCTGGCCGCGATCTGGCGCGAACTGACGCTGGCAGAAGACATC
GCGCGCGACGACGACTTCTTCATGATTGGCGGCACCAGCCTGACGGCGGTAGGACTGCTCAATCGGCTTTCC
AGCGAAGGACTGCGTGTCAATATCGACCTTATCTTCAACCACTCCGTGTTCCATGACATGGTTGAGGCACTG
AAGCGTGCCGAAGACGAAGAGGAGAATTTCCGCCAGGGCATCGACCTCGAAGCGCTGCTCACCCGAGCGATG
CGTAACCTGCACACGGCCGCGCCCGAGCCGGCAGCGGAGCAAGTCCGCAACATCTTCCTGACCGGCGCTACT
GGTTACCTGGGCATCTACGTGCTACGTGCCCTCGTGCAATCCACAAATTGTCGGATTCACTGCCTGCTGCGA
TGCCGCGACGAAGACAGCGGCTACCAGCGTCTTCGTCAGATGAGCGAGGAAAAGGGGCTGGGCTTCGATTTG
GACCGTGACCGCGTGCGCATCATCCCCGGCGATCTGTCTGCCGAACGTTTCGGACTGGACGAGGCCGCCTAT
GCTGCGCTGGCACAGGACATGGACAAGGTTCTGCATATCGCTGCGCTGATCAGCCTCATCGCGCCCCTTTCC
GGTCTGTACCCGATCAACGTTCAGGGCTCGGCCAACGTCATCGAACTGGCAACCACCGGCAAGCGAAAACCC
ATCCACTACATGTCCACCATCGGTGTCCACTACCGGCTTCCGTACGGTGAGGACGAGCCCCCTATTCCCGAG
GCTACAGGGCCGGATGCCCCCTGGCACAAGCCCGAGCTCACCTACGAACACACCAAGTACATGGCCGAACAG
CTATTCCATCGGGCGCGGGAACTGGGCGTGAAGGTGAACATCTTCCGCTCCGGGGCCATCACCTGGGACAGC
GAACAGCCCCAGCCTTTTATCAACGACGATGCCTTCGTCAAGTTCTTCCGGACCTGCCTGAGCGTACAGGGC
TATCCGGACTCCTCGATCCTGATAAGCATCACTCCGGTCAATGTCGTCGCCCGGTACATCGGGATGATCGCT
CAGCGTGAAATCCGCGACCAGGGGCAGAACTTCCATCTGGTGTCCCAGCACAGCCTCCCTGGCGGCAGGATC
TACGCCTGGTTCAACGAGCTGGGCTGCCGTTTTTCGCCCCTGGACTTCGAGACTTGGGACGAGCGGCTTTCG
GACAGTTTCGGGCGAGGCTTCATCAACCGCTACTTCAAGCACGGCATCGGCCAAGGTGGCCATCATCAGTAC
CGTATCGACAACCTGGTTGCCGTGCTGGAAAAACACGGCATGCAGCCGAACCAGGTCGACCGTGCCTACTTC
AAACCTTTGCTATCGCAGCTTGCAGGCGTGGGCCAGGACGACGAAGGAGACATGTCATGCACGCCCCGTTGA
Translation:
MLQARKGEILAHLIATDTSIQADPANRFEPFPLTDLQLAYIVGRRDNYELGGVGCHNYLELQMPALDPQRLE
RAWHALIMRHDMLRAEIGTDGQQRVLRSVVLPPLRCDDLRGASAEEFERATLASRDEMAFRRYDSERWPLYE
IRLTLHDDSSVLHYSTDLLIADFASIQLLLAELGQLYHRPETAPAPLTLTFRDVVMSERRRRQHPDTEARQQ
KDRDYWMARLPDLPGAPELPLLPHAARPLQNTPSFERHGFDLPAGSWQRFSEIATAQQLTPTAAVLAVFTEV
LRLWSRQPDFCINLTLFNRPPVHEQIQHIVGDFIAVNVLEVRLDGGTTFPQRARALQTRLWQDMEHSGFTGI
EVLRQLSRLHGSNQLIPVVFTSTVGIAGQALPRNDFMHDAQLLYGITQTPQVWLDCQVTERNGTLHVDWDVR
NGIFPPGLIEQAFAAFTQAITSLSQGPDAWTLGHLVTLPESTRLQRERINTERNAPLPLGYLHGGFCRRALD
FPERPALLCGASEWNYGQLAAWAVAIARDLRAAGCGPGQPVALFLDKGPAQIAAVLGVLLAEGAYVPIDVGQ
PVERRDTMLADVGAKLLLTDCEHIDAQWPDGIQPMLVGAMDALPPEELASALQEAIAAAAHRDTATQLAYVL
YTSGTTGRPKGVMLTHRGVLNTIQGFNRQFGLDENDRFFGLVNYTFDLSVLDIFCAFTTGAALVLPQGQWRN
DPEQWVSAIELHRATVWNSVPAHMQMLLTHLPQGRMLSSLRIGFLSGDWIPVALPDQVRQRLPGMAPKSLGG
PTEISVTCIYHDIGDVPQDAVSIPYGSPLSNHRLYVLNHQLEHCPNWTPGEMYVGGPGVALGFVNDPERTRE
RFIIHPQTGERLYRTGDVCRFRDDGIIEILGREDNQVKIRGHRIELGDVEAAFASLPGVGRAVALVRREPLD
LVAAVQVCEPCDDPATLIEQWRKDLATRLPRYMLPSAIEVLPQIPLSRNGKVDRKALAERFQGALAGGRDRQ
PLREDPLEQKLAAIWRELTLAEDIARDDDFFMIGGTSLTAVGLLNRLSSEGLRVNIDLIFNHSVFHDMVEAL
KRAEDEEENFRQGIDLEALLTRAMRNLHTAAPEPAAEQVRNIFLTGATGYLGIYVLRALVQSTNCRIHCLLR
CRDEDSGYQRLRQMSEEKGLGFDLDRDRVRIIPGDLSAERFGLDEAAYAALAQDMDKVLHIAALISLIAPLS
GLYPINVQGSANVIELATTGKRKPIHYMSTIGVHYRLPYGEDEPPIPEATGPDAPWHKPELTYEHTKYMAEQ
LFHRARELGVKVNIFRSGAITWDSEQPQPFINDDAFVKFFRTCLSVQGYPDSSILISITPVNVVARYIGMIA
QREIRDQGQNFHLVSQHSLPGGRIYAWFNELGCRFSPLDFETWDERLSDSFGRGFINRYFKHGIGQGGHHQY
RIDNLVAVLEKHGMQPNQVDRAYFKPLLSQLAGVGQDDEGDMSCTPR*
AnnotationID:5519 GeneID:5517
Modification Date/Time:
2005-10-03 15:03:11
GeneProduct: putative non-ribosomal peptide synthetase
Cell Localization Confidence Code: 5
Functional Category: (20) Putative enzymes
Alternate Gene Product Name:
Functional Category Confidence Code: 5
Secondary Functional Category(ies):
Homology: gi|68345153| gb|AAY92759.1| pyochelin synthetase F [Pseudomonas fluorescens Pf-5]
Length=1807
Score = 750 bits (1936), Expect = 0.0
Identities = 425/937 (45%), Positives = 578/937 (61%), Gaps = 30/937
gi|21225943| ref|NP_631722.1| putative non-ribosomal peptide synthase [Streptomyces coelicolor A3(2)]
Length=1842
Score = 708 bits (1827), Expect = 0.0
Identities = 401/951 (42%), Positives = 535/951 (56%), Gaps = 45/951 (4%)
gi|63255743| gb|AAY36839.1| Amino acid adenylation [Pseudomonas syringae pv. syringae B728a]
Length=3021
Score = 698 bits (1802), Expect = 0.0
Identities = 427/1049 (40%), Positives = 597/1049 (56%), Gaps = 35/1049 (3%)
Structural Features: COG1020 , EntF, Non-ribosomal peptide synthetase modules and related proteins [Secondary
metabolites biosynthesis, transport, and catabolism].
CD-Length = 642 residues, 100.0% aligned
Score = 395 bits (1016), Expect = 2e-110
pfam00501 , AMP-binding, AMP-binding enzyme.
CD-Length = 412 residues, 99.8% aligned
Score = 302 bits (773), Expect = 3e-82
COG0318 , CaiC, Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II [Lipid metabolism / Secondary
metabolites biosynthesis, transport, and catabolism].
CD-Length = 534 residues, 96.1% aligned
Score = 222 bits (566), Expect = 3e-58
Comment: This gene and upstream gene (GeneID 5516) are putative non-ribosomal peptide synthetases with homologies
to conserved domains COG1020 , pfam00501 , and COG0318 .