BGC0001033: paenilamicin biosynthetic gene cluster from Paenibacillus larvae subsp. larvae DSM 25430
Location: 1,729,923 - 1,788,675 nt. (total: 58,753 nt).
This entry is originally from NCBI GenBank CP003355.1.

General information about the BGC
MIBiG accession: BGC0001033
Short description: paenilamicin biosynthetic gene cluster from Paenibacillus larvae subsp. larvae DSM 25430
Quality: questionable
Status: active
Completeness: unknown (whether the entry covers everything needed for the pathway producing the compound(s))
Biosynthetic class(es)
  • NRPS (Type I)
  • PKS (Unknown)
Loci: CP003355.1, 1729923 - 1788675
Compounds
  • paenilamicin
  • paenilamicin A2
  • paenilamicin B1
  • paenilamicin B2
Species: Paenibacillus larvae subsp. larvae DSM 25430
References
Chemical products information
  • paenilamicin: C43H88N14O13
  • paenilamicin A2: C42H86N14O13
  • paenilamicin B1: C43H88N16O13
  • paenilamicin B2: C42H86N16O13
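
The four molecular formulas above differ only by small, regular increments. The sketch below parses the formulas and reports element-wise differences. The helper names (`parse_formula`, `formula_difference`) are illustrative and not part of MIBiG. A CH2 difference matches the difference between lysine (C6H14N2O2) and ornithine (C5H12N2O2), and an N2 difference matches the difference between arginine (C6H14N4O2) and lysine. Both are consistent with the substrate alternatives listed for the modules in this entry, but this is only a consistency check, not evidence about which residue sits at which position.

```python
import re
from collections import Counter

# Molecular formulas exactly as listed in this entry.
FORMULAS = {
    "paenilamicin": "C43H88N14O13",
    "paenilamicin A2": "C42H86N14O13",
    "paenilamicin B1": "C43H88N16O13",
    "paenilamicin B2": "C42H86N16O13",
}


def parse_formula(formula: str) -> Counter:
    """Turn a simple formula such as 'C43H88N14O13' into element counts.

    Assumes no brackets, charges, or isotope labels (illustrative helper).
    """
    counts = Counter()
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return counts


def formula_difference(reference: str, other: str) -> dict:
    """Return element counts of `other` minus `reference`, omitting zeros."""
    ref, oth = parse_formula(reference), parse_formula(other)
    elements = sorted(set(ref) | set(oth))
    return {el: oth[el] - ref[el] for el in elements if oth[el] != ref[el]}


if __name__ == "__main__":
    base = FORMULAS["paenilamicin"]
    for name, formula in FORMULAS.items():
        print(name, formula_difference(base, formula))
```

Relative to paenilamicin, the A2 formula has one CH2 unit fewer, the B1 formula has two additional nitrogens, and the B2 formula combines both changes.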
List of genes involved in compound(s) production
Each gene is listed as identifiers: position (strand), product. The Functions, Evidence, and Extra columns are empty for every gene.
  • ERIC2_c18040, AHD05614.1: 1729923 - 1737689 (+), putative non-ribosomal peptide ligase/polyketide synthase hybrid
  • ERIC2_c18050, AHD05615.1: 1737728 - 1746766 (+), putative non-ribosomal peptide ligase/polyketide synthase hybrid
  • ERIC2_c18060, AHD05616.1: 1746763 - 1752078 (+), putative non-ribosomal peptide ligase domain protein
  • ERIC2_c18070, AHD05617.1: 1752098 - 1756096 (+), putative non-ribosomal peptide ligase domain protein
  • ERIC2_c18080, AHD05618.1: 1756143 - 1759442 (+), putative non-ribosomal peptide ligase domain protein
  • ERIC2_c18090, AHD05619.1: 1759625 - 1764208 (+), putative polyketide synthase subunit
  • ERIC2_c18100, AHD05620.1: 1764209 - 1769791 (+), putative polyketide synthase subunit
  • ERIC2_c18110, AHD05621.1: 1769806 - 1778028 (+), non-ribosomal peptide ligase domain protein
  • ERIC2_c18120, AHD05622.1: 1778104 - 1778985 (+), hypothetical protein
  • ERIC2_c18130, AHD05623.1: 1778996 - 1782103 (+), cyclic peptide transporter
  • ERIC2_c18140, AHD05624.1: 1782135 - 1782992 (+), 3-hydroxybutyryl-CoA dehydrogenase
  • ERIC2_c18150, AHD05625.1: 1783021 - 1783290 (+), phosphopantetheine-binding protein
  • ERIC2_c18160, AHD05626.1, nusG1: 1783305 - 1783835 (-), transcription antitermination protein NusG
  • ERIC2_c18170, AHD05627.1: 1784182 - 1788675 (+), putative non-ribosomal peptide ligase domain protein
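
The coordinates in the gene list are 1-based and inclusive, so a gene's length is end - start + 1; this is the same convention that gives the locus total of 58,753 nt. The sketch below copies the coordinates from the list above and computes gene lengths and the spacing between neighbouring genes. The function names (`length_nt`, `neighbour_gaps`) are illustrative assumptions, not part of any MIBiG or antiSMASH tooling.

```python
# Locus bounds and gene coordinates as listed in this entry (1-based, inclusive).
LOCUS_START, LOCUS_END = 1729923, 1788675

GENES = [
    ("ERIC2_c18040", 1729923, 1737689, "+"),
    ("ERIC2_c18050", 1737728, 1746766, "+"),
    ("ERIC2_c18060", 1746763, 1752078, "+"),
    ("ERIC2_c18070", 1752098, 1756096, "+"),
    ("ERIC2_c18080", 1756143, 1759442, "+"),
    ("ERIC2_c18090", 1759625, 1764208, "+"),
    ("ERIC2_c18100", 1764209, 1769791, "+"),
    ("ERIC2_c18110", 1769806, 1778028, "+"),
    ("ERIC2_c18120", 1778104, 1778985, "+"),
    ("ERIC2_c18130", 1778996, 1782103, "+"),
    ("ERIC2_c18140", 1782135, 1782992, "+"),
    ("ERIC2_c18150", 1783021, 1783290, "+"),
    ("ERIC2_c18160", 1783305, 1783835, "-"),
    ("ERIC2_c18170", 1784182, 1788675, "+"),
]


def length_nt(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def neighbour_gaps(genes):
    """Return (upstream_id, downstream_id, gap_nt) for adjacent genes.

    A gap of 0 means the genes abut; a negative gap is an overlap.
    """
    gaps = []
    for (id_a, _, end_a, _), (id_b, start_b, _, _) in zip(genes, genes[1:]):
        gaps.append((id_a, id_b, start_b - end_a - 1))
    return gaps


if __name__ == "__main__":
    print("locus length:", length_nt(LOCUS_START, LOCUS_END), "nt")
    for gene_id, start, end, strand in GENES:
        print(gene_id, strand, length_nt(start, end), "nt")
    for id_a, id_b, gap in neighbour_gaps(GENES):
        print(f"{id_a} -> {id_b}: {gap} nt")
```

With these coordinates, the first gene starts at the locus start, the last gene ends at the locus end, and ERIC2_c18050 and ERIC2_c18060 overlap by 4 nt.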
Biosynthesis information

Biosynthetic modules

Each module is listed as name: type; gene; substrates (evidence); domains. No integrated monomers are listed for any module.
  • Unk01: nrps-type1; ERIC2_c18040; lysine, arginine (evidence: Structure-based inference, Mass spectrometry); adenylation
  • Unk02: nrps-type1; ERIC2_c18050; D-alanine (evidence: Structure-based inference [1], [2]); adenylation
  • Unk03: nrps-type1; ERIC2_c18060; 2,3-diaminopropionic acid (evidence: Structure-based inference [1], [2]); adenylation
  • Unk04: nrps-type1; ERIC2_c18070; lysine, ornithine (evidence: Structure-based inference, Mass spectrometry); adenylation
  • Unk05: nrps-type1; ERIC2_c18080; serine (evidence: Structure-based inference [1], [2]); adenylation
  • Unk06: nrps-type1; ERIC2_c18110; no substrates listed; adenylation
  • Unk07: nrps-type1; ERIC2_c18110; 2,3-diaminopropionic acid (evidence: Structure-based inference [1], [2]); adenylation
  • Unk08: nrps-type1; ERIC2_c18110; glycine (evidence: Structure-based inference [1], [2]); adenylation
  • Unk09: nrps-type1; ERIC2_c18170; asparagine (evidence: Sequence-based prediction, Homology); adenylation
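
The module annotations above can be held in a small data structure for downstream use. The sketch below is a minimal illustration: the `Module` class and the helper functions are hypothetical names, not the MIBiG schema. It groups modules by gene and counts how many substrate combinations the listed alternatives allow. The order of the modules here follows the listing and is not a claim about the order of incorporation.

```python
import math
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Module:
    """One biosynthetic module as listed in this entry (illustrative type)."""
    name: str
    gene: str
    substrates: tuple = ()            # empty when no substrate is listed
    module_type: str = "nrps-type1"
    domains: tuple = ("adenylation",)


MODULES = [
    Module("Unk01", "ERIC2_c18040", ("lysine", "arginine")),
    Module("Unk02", "ERIC2_c18050", ("D-alanine",)),
    Module("Unk03", "ERIC2_c18060", ("2,3-diaminopropionic acid",)),
    Module("Unk04", "ERIC2_c18070", ("lysine", "ornithine")),
    Module("Unk05", "ERIC2_c18080", ("serine",)),
    Module("Unk06", "ERIC2_c18110"),
    Module("Unk07", "ERIC2_c18110", ("2,3-diaminopropionic acid",)),
    Module("Unk08", "ERIC2_c18110", ("glycine",)),
    Module("Unk09", "ERIC2_c18170", ("asparagine",)),
]


def modules_by_gene(modules):
    """Map each gene identifier to the names of the modules it carries."""
    grouped = defaultdict(list)
    for module in modules:
        grouped[module.gene].append(module.name)
    return dict(grouped)


def count_substrate_combinations(modules):
    """Number of distinct substrate choices across all modules.

    A module with no listed substrate contributes a single (unknown) choice.
    """
    return math.prod(max(len(m.substrates), 1) for m in modules)


if __name__ == "__main__":
    for gene, names in modules_by_gene(MODULES).items():
        print(gene, names)
    print("substrate combinations:", count_substrate_combinations(MODULES))
```

Two modules each list two alternative substrates, giving four combinations, which is consistent with the four compounds listed for this cluster.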
Annotation changelog

No dates are given for any change.

Entry version: 4
  • Update chemical activity to schema version 2.11. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA

Entry version: 3
  • Add new compound structure (SMILES) for paenilamicin. Submitter ID: 5UL74VURKJ25VSPPPO3H2NYB; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA
  • Add new compound: paenilamicin A2. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA
  • Add new compound: paenilamicin B1. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA
  • Add new compound: paenilamicin B2. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA
  • Fixed incorrect publication. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA
  • Updated bioactivity data. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA
  • Added NRP substrate specificities. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA

Entry version: 2
  • Migrated from v1.4. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA

Entry version: 1
  • Submitted. Submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA