BGC0002044: empedopeptin biosynthetic gene cluster from Massilia sp. YMA4
Shows the layout of the region, marking coding sequences and areas of interest. Clicking a gene will select it and show any relevant details. Clicking an area feature (e.g. a candidate cluster) will select all coding sequences within that area. Double clicking an area feature will zoom to that area. Multiple genes and area features can be selected by clicking them while holding the Ctrl key.
Location: 2,138,714 - 2,204,004 nt. (total: 65,291 nt).
This entry is originally from NCBI GenBank CP030092.1, but has been modified (see Modifications tab for details).

Legend:

core biosynthetic genes
additional biosynthetic genes
transport-related genes
regulatory genes
other genes
resistance
TTA codons
reset zoomreset view
zoomzoom to selection
Gene details
Shows details of the most recently selected gene, including names, products, location, and other annotations.
Select a gene to view the details available for it
General
Compounds
Genes
Biosynthesis
History
NRPS/PKS domains
KnownClusterBlast
Modifications
General information about the BGC
MIBiG accession BGC0002044
Short description empedopeptin biosynthetic gene cluster from Massilia sp. YMA4
Status Quality: questionable
The quality level of this entry.

Status: active
The status of this entry.

Completeness: unknown
Whether the entry covers everything needed for the pathway producing the compound(s)
Biosynthetic class(es)
  • NRPS (Type I)
Loci
CP030092.1
2138714 - 2204004
via Knock-out studies
Compounds
  • empedopeptin
Species Massilia sp. YMA4 [taxonomy]
References
Chemical products information
empedopeptin Evidence:
Copy SMILES
Chemical database entries
NPAtlas
List of genes involved in compound(s) production
Identifiers Position Product Functions Evidence Extra
  • DPH57_09050
  • AXA91284.1
2138714 - 2139727 (-) hypothetical protein
    copy AA seq
    copy Nt seq
    • DPH57_09055
    • AXA91285.1
    2139766 - 2140737 (-) MBL fold metallo-hydrolase
      copy AA seq
      copy Nt seq
      • DPH57_09060
      • AXA91286.1
      2140841 - 2143039 (+) hybrid sensor histidine kinase/response regulator
        copy AA seq
        copy Nt seq
        • DPH57_09065
        • AXA91287.1
        2143084 - 2143401 (+) hypothetical protein
          copy AA seq
          copy Nt seq
          • DPH57_09070
          • AXA91288.1
          2143454 - 2143702 (+) DUF3297 family protein
            copy AA seq
            copy Nt seq
            • DPH57_09075
            • AXA91289.1
            2143858 - 2144421 (-) hypothetical protein
              copy AA seq
              copy Nt seq
              • DPH57_09080
              • AXA91290.1
              2144544 - 2145080 (-) hypothetical protein
                copy AA seq
                copy Nt seq
                • DPH57_09085
                • AXA94653.1
                2145114 - 2146127 (-) alpha/beta hydrolase
                  copy AA seq
                  copy Nt seq
                  • DPH57_09090
                  • AXA91291.1
                  2146305 - 2146682 (-) cupin
                    copy AA seq
                    copy Nt seq
                    • DPH57_09095
                    • AXA91292.1
                    2146964 - 2147236 (+) hypothetical protein
                      copy AA seq
                      copy Nt seq
                      • DPH57_09100
                      • AXA91293.1
                      2147325 - 2148485 (+) hypothetical protein
                        copy AA seq
                        copy Nt seq
                        • DPH57_09105
                        • AXA91294.1
                        2148482 - 2150320 (-) AI-2E family transporter
                          copy AA seq
                          copy Nt seq
                          • DPH57_09110
                          • AXA91295.1
                          2151013 - 2151735 (+) CDP-diacylglycerol--glycerol-3-phosphate 3-phosphatidyltransferase
                            copy AA seq
                            copy Nt seq
                            • DPH57_09115
                            • AXA91296.1
                            • modA
                            2151668 - 2152399 (-) molybdate ABC transporter substrate-binding protein
                              copy AA seq
                              copy Nt seq
                              • DPH57_09120
                              • AXA91297.1
                              2152503 - 2153897 (-) hypothetical protein
                                copy AA seq
                                copy Nt seq
                                • DPH57_09125
                                • AXA91298.1
                                2153907 - 2155088 (-) hypothetical protein
                                  copy AA seq
                                  copy Nt seq
                                  • DPH57_09130
                                  • AXA91299.1
                                  2155136 - 2156053 (-) hypothetical protein
                                    copy AA seq
                                    copy Nt seq
                                    • DPH57_09135
                                    • AXA91300.1
                                    2156095 - 2157075 (-) TauD/TfdA family dioxygenase
                                      copy AA seq
                                      copy Nt seq
                                      • DPH57_09140
                                      • AXA91301.1
                                      2157075 - 2165330 (-) non-ribosomal peptide synthetase
                                        copy AA seq
                                        copy Nt seq
                                        • DPH57_09145
                                        • AXA94654.1
                                        2165327 - 2168695 (-) hypothetical protein
                                          copy AA seq
                                          copy Nt seq
                                          • DPH57_09150
                                          • AXA91302.1
                                          2168806 - 2184774 (-) non-ribosomal peptide synthetase
                                            copy AA seq
                                            copy Nt seq
                                            • DPH57_09155
                                            • AXA91303.1
                                            2184851 - 2186272 (-) hypothetical protein
                                              copy AA seq
                                              copy Nt seq
                                              • DPH57_09160
                                              • AXA91304.1
                                              • zwf
                                              2187342 - 2188808 (-) glucose-6-phosphate dehydrogenase
                                                copy AA seq
                                                copy Nt seq
                                                • DPH57_09165
                                                • AXA91305.1
                                                2189016 - 2189306 (-) hypothetical protein
                                                  copy AA seq
                                                  copy Nt seq
                                                  • DPH57_09170
                                                  • AXA91306.1
                                                  2190085 - 2193171 (+) ABC transporter
                                                    copy AA seq
                                                    copy Nt seq
                                                    • DPH57_09175
                                                    • AXA91307.1
                                                    2193548 - 2195215 (+) hypothetical protein
                                                      copy AA seq
                                                      copy Nt seq
                                                      • DPH57_09180
                                                      • AXA91308.1
                                                      2195212 - 2195562 (+) hypothetical protein
                                                        copy AA seq
                                                        copy Nt seq
                                                        • DPH57_09185
                                                        • AXA91309.1
                                                        2197908 - 2198933 (+) hypothetical protein
                                                          copy AA seq
                                                          copy Nt seq
                                                          • DPH57_09190
                                                          • AXA91310.1
                                                          2198970 - 2199422 (-) hypothetical protein
                                                            copy AA seq
                                                            copy Nt seq
                                                            • DPH57_09195
                                                            • AXA91311.1
                                                            2200204 - 2202525 (+) peptidase domain-containing ABC transporter
                                                              copy AA seq
                                                              copy Nt seq
                                                              • DPH57_09200
                                                              • AXA91312.1
                                                              2202581 - 2202790 (-) hypothetical protein
                                                                copy AA seq
                                                                copy Nt seq
                                                                • DPH57_09205
                                                                • AXA91313.1
                                                                2203666 - 2204004 (+) hypothetical protein
                                                                  copy AA seq
                                                                  copy Nt seq
                                                                  Biosynthesis information

                                                                  Biosynthetic modules

                                                                  Name
                                                                  Unk01
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09140
                                                                  Substrates
                                                                  proline (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Name
                                                                  Unk02
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09140
                                                                  Substrates
                                                                  aspartic acid (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Name
                                                                  Unk03
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09150
                                                                  Substrates
                                                                  proline (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Name
                                                                  Unk04
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09150
                                                                  Substrates
                                                                  serine (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Name
                                                                  Unk05
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09150
                                                                  Substrates
                                                                  proline (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Name
                                                                  Unk06
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09150
                                                                  Substrates
                                                                  arginine (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Name
                                                                  Unk07
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09150
                                                                  Substrates
                                                                  aspartic acid (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Name
                                                                  Unk08
                                                                  Type
                                                                  nrps-type1
                                                                  Genes
                                                                  DPH57_09145
                                                                  Substrates
                                                                  serine (evidence: Structure-based inference [2])
                                                                  Integrated Monomers
                                                                  Domains
                                                                  adenylation
                                                                  Annotation changelog

                                                                  Entry version: next

                                                                  Date
                                                                  Changes
                                                                  Submitters
                                                                  Reviewers
                                                                  MIBiG v4 annotathon
                                                                  • (ID: 4CEPXXFDQQ46VQJXBT4B4AUO)
                                                                  • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)

                                                                  Entry version: 1

                                                                  Date
                                                                  Changes
                                                                  Submitters
                                                                  Reviewers
                                                                  Entry added
                                                                  • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                                                  • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                                                  Added NRP substrate specificities
                                                                  • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                                                  • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                                                  Detailed domain annotation
                                                                  Shows NRPS- and PKS-related domains for each feature that contains them. Click on each domain for more information about the domain's location, consensus monomer prediction, and other details.
                                                                  A domain glossary is available here, and an explanation of the visualisation is available here.
                                                                  Selected features only
                                                                  Show module domains
                                                                  Similar known gene clusters from MIBiG 4.0
                                                                  Shows clusters from the MiBIG database that are similar to the current region. Genes marked with the same colour are interrelated. White genes have no relationship.
                                                                  Click on reference genes to show details of similarities to genes within the current region.
                                                                  Click on an accession to open that entry in the MiBIG database.
                                                                  Modifications to original record
                                                                  • DPH57_00005 crossed the origin and was split into two features