BGC0001448: malacidin A biosynthetic gene cluster from uncultured bacterium
Location: 1 - 72,298 nt. (total: 72,298 nt).
This entry originates from NCBI GenBank record KY654519.1.

General information about the BGC
MIBiG accession: BGC0001448
Short description: malacidin A biosynthetic gene cluster from uncultured bacterium
Quality: questionable
Status: active
Completeness: complete
Biosynthetic class(es):
  • NRPS (Type I)
Loci: KY654519.1 (via heterologous expression)
Compounds:
  • malacidin A
  • malacidin B
Species: uncultured bacterium
Chemical products information
malacidin A
  • Evidence: NMR, Mass spectrometry
  • Molecular formula: C56H88N12O20
malacidin B
  • Evidence: NMR, Mass spectrometry
  • Molecular formula: C57H90N12O20
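The two molecular formulas listed for malacidin A and malacidin B can be compared programmatically. The following is a minimal sketch, not part of the MIBiG entry: the helper names `parse_formula` and `average_mass` are hypothetical, and the atomic masses are rounded standard atomic weights, so the computed masses are approximate average masses rather than exact monoisotopic values.

```python
import re
from collections import Counter

# Rounded standard atomic weights (assumption: average, not monoisotopic, masses).
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}


def parse_formula(formula: str) -> Counter:
    """Parse a simple formula such as 'C56H88N12O20' into element counts."""
    counts = Counter()
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return counts


def average_mass(formula: str) -> float:
    """Approximate average molecular mass from the rounded atomic weights above."""
    return sum(ATOMIC_MASS[el] * n for el, n in parse_formula(formula).items())


# Formulas as listed in the entry.
MALACIDIN_A = "C56H88N12O20"
MALACIDIN_B = "C57H90N12O20"

# Counter subtraction keeps only positive differences.
difference = parse_formula(MALACIDIN_B) - parse_formula(MALACIDIN_A)

print("B minus A:", dict(difference))
print("malacidin A average mass:", round(average_mass(MALACIDIN_A), 3))
print("malacidin B average mass:", round(average_mass(MALACIDIN_B), 3))
```

Under these assumptions the two formulas differ by one carbon and two hydrogens (a CH2 unit); the formulas alone do not say where in the structure that difference lies.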
List of genes involved in compound(s) production
Identifiers         Position              Product               Functions
ARU08060.1          1 - 459 (-)           hypothetical protein
ARU08061.1          459 - 1232 (-)        hypothetical protein
ARU08062.1          1271 - 2197 (-)       hypothetical protein
ARU08063.1, mlcA    2471 - 5602 (+)       mlcA                  Scaffold biosynthesis
ARU08064.1, mlcB    5593 - 6489 (+)       mlcB                  Regulation
ARU08065.1, mlcC    6507 - 7445 (+)       mlcC                  Regulation
ARU08066.1, mlcD    7504 - 8319 (-)       mlcD                  Transport
ARU08067.1, mlcE    8316 - 9533 (-)       mlcE                  Precursor biosynthesis
ARU08068.1, mlcF    9539 - 10012 (-)      mlcF                  Precursor biosynthesis
ARU08069.1, mlcG    10224 - 11987 (+)     mlcG                  Precursor biosynthesis
ARU08070.1, mlcH    11987 - 13420 (+)     mlcH                  Precursor biosynthesis
ARU08071.1, mlcI    13417 - 15012 (+)     mlcI                  Precursor biosynthesis
ARU08072.1, mlcJ    15023 - 15292 (+)     mlcJ                  Precursor biosynthesis
ARU08073.1, mlcK    15319 - 24513 (+)     mlcK                  Scaffold biosynthesis
ARU08074.1, mlcL    24510 - 41396 (+)     mlcL                  Scaffold biosynthesis
ARU08075.1, mlcM    41401 - 48243 (+)     mlcM                  Scaffold biosynthesis
ARU08076.1          48218 - 49435 (-)     hypothetical protein
ARU08077.1          49495 - 49689 (-)     hypothetical protein
ARU08078.1          49752 - 50345 (-)     hypothetical protein
ARU08079.1          50606 - 51100 (+)     hypothetical protein
ARU08080.1          51182 - 52045 (+)     hypothetical protein
ARU08081.1          52042 - 52446 (-)     hypothetical protein
ARU08082.1, mlcN    52469 - 53380 (-)     mlcN                  Regulation
ARU08083.1, mlcO    53377 - 54312 (-)     mlcO                  Transport
ARU08084.1, mlcP    54377 - 55243 (-)     mlcP                  Precursor biosynthesis
ARU08085.1          55242 - 55457 (+)     hypothetical protein
ARU08086.1, mlcQ    55515 - 56600 (-)     mlcQ                  Precursor biosynthesis
ARU08087.1, mlcR    56597 - 57397 (-)     mlcR                  Precursor biosynthesis
ARU08088.1, mlcS    57408 - 59663 (-)     mlcS                  Precursor biosynthesis
ARU08089.1, mlcT    59660 - 61381 (-)     mlcT                  Precursor biosynthesis
ARU08090.1, mlcU    62005 - 62271 (+)     mlcU                  Regulation
ARU08091.1, mlcV    62354 - 62572 (+)     mlcV                  Scaffold biosynthesis
ARU08092.1          62712 - 63257 (+)     hypothetical protein
ARU08093.1, mlcW    63298 - 63984 (+)     mlcW                  Regulation
ARU08094.1          64023 - 64817 (+)     hypothetical protein
ARU08095.1, mlcX    64814 - 65797 (+)     mlcX                  Precursor biosynthesis
ARU08096.1, mlcY    66442 - 69489 (+)     mlcY                  Regulation
ARU08097.1          69828 - 70919 (-)     hypothetical protein
ARU08098.1          70916 - 72298 (-)     hypothetical protein
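The position strings in the gene list above (for example `2471 - 5602 (+)`) can be turned into numeric coordinates for downstream checks. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, as is conventional for GenBank records; `GeneLocation`, `parse_position`, and `overlap` are hypothetical names introduced for illustration and are not part of any MIBiG or antiSMASH API.

```python
import re
from typing import NamedTuple


class GeneLocation(NamedTuple):
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: 1-based inclusive coordinates, so both ends count.
        return self.end - self.start + 1


# Matches strings such as "2471 - 5602 (+)"; thousands separators are tolerated.
POSITION_RE = re.compile(r"^\s*([\d,]+)\s*-\s*([\d,]+)\s*\(([+-])\)")


def parse_position(text: str) -> GeneLocation:
    """Parse a 'start - end (strand)' string into a GeneLocation."""
    match = POSITION_RE.match(text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    start = int(match.group(1).replace(",", ""))
    end = int(match.group(2).replace(",", ""))
    return GeneLocation(start, end, match.group(3))


def overlap(a: GeneLocation, b: GeneLocation) -> int:
    """Number of nucleotides shared by two locations (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Positions copied from the gene list.
mlcA = parse_position("2471 - 5602 (+)")
mlcB = parse_position("5593 - 6489 (+)")
mlcL = parse_position("24510 - 41396 (+)")

for name, loc in [("mlcA", mlcA), ("mlcB", mlcB), ("mlcL", mlcL)]:
    # A length divisible by 3 is consistent with a complete coding sequence.
    print(name, loc.length, "nt, multiple of 3:", loc.length % 3 == 0)

print("mlcA/mlcB overlap:", overlap(mlcA, mlcB), "nt")
```

Several adjacent genes in the list share a few nucleotides at their boundaries (mlcA and mlcB, for instance), which is why the sketch reports overlap rather than assuming disjoint intervals.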
Biosynthesis information

Biosynthetic modules

Substrate evidence for every module: Structure-based inference.

Name  Type        Genes  Substrates                          Domains
2     nrps-type1  mlcA   2,3-diamino-3-methylpropanoic acid  adenylation
1     nrps-type1  mlcK   3-methylaspartic acid               condensation (Starter), adenylation
4     nrps-type1  mlcK   valine                              condensation (LCL), adenylation, epimerase
5     nrps-type1  mlcL   lysine                              condensation, adenylation
6     nrps-type1  mlcL   3-hydroxyaspartic acid              condensation (LCL), adenylation
7     nrps-type1  mlcL   aspartic acid                       condensation (LCL), adenylation
8     nrps-type1  mlcL   glycine                             condensation (LCL), adenylation
9     nrps-type1  mlcL   3-methylaspartic acid               condensation (LCL), adenylation, epimerase
10    nrps-type1  mlcM   valine                              condensation (DCL), adenylation
11    nrps-type1  mlcM   4R-methylproline                    condensation (LCL), adenylation
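The module listing above is not in numerical order, and its numbering has a gap. The following is a minimal sketch that sorts the modules by name, reports missing numbers, and picks out the modules carrying an epimerase domain. It assumes, without confirmation from the entry, that the module number reflects the position in the assembly line; `Module`, `sort_modules`, `missing_numbers`, and `modules_with_domain` are hypothetical names used only for illustration.

```python
from typing import List, NamedTuple, Tuple


class Module(NamedTuple):
    name: int
    gene: str
    substrate: str
    domains: Tuple[str, ...]


# Data copied from the module listing, in the order it appears there.
MODULES = [
    Module(2, "mlcA", "2,3-diamino-3-methylpropanoic acid", ("adenylation",)),
    Module(1, "mlcK", "3-methylaspartic acid", ("condensation (Starter)", "adenylation")),
    Module(4, "mlcK", "valine", ("condensation (LCL)", "adenylation", "epimerase")),
    Module(5, "mlcL", "lysine", ("condensation", "adenylation")),
    Module(6, "mlcL", "3-hydroxyaspartic acid", ("condensation (LCL)", "adenylation")),
    Module(7, "mlcL", "aspartic acid", ("condensation (LCL)", "adenylation")),
    Module(8, "mlcL", "glycine", ("condensation (LCL)", "adenylation")),
    Module(9, "mlcL", "3-methylaspartic acid", ("condensation (LCL)", "adenylation", "epimerase")),
    Module(10, "mlcM", "valine", ("condensation (DCL)", "adenylation")),
    Module(11, "mlcM", "4R-methylproline", ("condensation (LCL)", "adenylation")),
]


def sort_modules(modules: List[Module]) -> List[Module]:
    """Order modules by their numeric name (assumed, not confirmed, to be assembly order)."""
    return sorted(modules, key=lambda m: m.name)


def missing_numbers(modules: List[Module]) -> List[int]:
    """Module numbers absent between the lowest and highest listed names."""
    numbers = {m.name for m in modules}
    return sorted(set(range(min(numbers), max(numbers) + 1)) - numbers)


def modules_with_domain(modules: List[Module], domain: str) -> List[int]:
    """Names of modules whose domain list contains the given domain exactly."""
    return [m.name for m in sort_modules(modules) if domain in m.domains]


for module in sort_modules(MODULES):
    print(module.name, module.gene, module.substrate)

print("missing module numbers:", missing_numbers(MODULES))
print("modules with an epimerase domain:", modules_with_domain(MODULES, "epimerase"))
```

The sketch only reorganises what the entry lists; it makes no claim about which residue a missing module number would correspond to.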
Annotation changelog

Entry version: 4
  • Update chemical activity to schema version 2.11 (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)

Entry version: 3
  • Corrected NRP module activity (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)
  • Removed gene names of 'No gene ID' (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)
  • Fixed duplicate gene names (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)
  • Updated NRP substrate specificities (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)

Entry version: 2
  • Migrated from v1.4 (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)
  • Updated compound(s) information (NPAtlas curation) (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)

Entry version: 1
  • Submitted (submitter ID: IE42ACYYGHZNO4BAWZY6ERNR; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)