BGC0000433: surfactin biosynthetic gene cluster from Bacillus velezensis FZB42
Shows the layout of the region, marking coding sequences and areas of interest. Clicking a gene will select it and show any relevant details. Clicking an area feature (e.g. a candidate cluster) will select all coding sequences within that area. Double clicking an area feature will zoom to that area. Multiple genes and area features can be selected by clicking them while holding the Ctrl key.
Location: 1 - 41,884 nt. (total: 41,884 nt).
This entry is originally from NCBI GenBank AJ575642.1.

Legend:

core biosynthetic genes
additional biosynthetic genes
transport-related genes
regulatory genes
other genes
resistance
reset zoomreset view
zoomzoom to selection
Gene details
Shows details of the most recently selected gene, including names, products, location, and other annotations.
Select a gene to view the details available for it
General
Compounds
Genes
Biosynthesis
History
NRPS/PKS domains
KnownClusterBlast
General information about the BGC
MIBiG accession BGC0000433
Short description surfactin biosynthetic gene cluster from Bacillus velezensis FZB42
Status Quality: questionable
The quality level of this entry.

Status: active
The status of this entry.

Completeness: complete
Whether the entry covers everything needed for the pathway producing the compound(s)
Biosynthetic class(es)
  • NRPS (Type I)
Loci
AJ575642.1
via Knock-out studies
Compounds
  • surfactin
Species Bacillus velezensis FZB42 [taxonomy]
References
Chemical products information
surfactin Evidence:
Copy SMILES
C52H91N7O13
Chemical database entries
NPAtlas
ChemSpider
List of genes involved in compound(s) production
Identifiers Position Product Functions Evidence Extra
  • CAE02619.1
  • yciC
64 - 1341 (+) YciC protein
    copy AA seq
    copy Nt seq
    • CAE02620.1
    • yx01
    1406 - 2542 (+) Yx01 protein
      copy AA seq
      copy Nt seq
      • CAE02621.1
      • yckC
      2557 - 2991 (+) YckC protein
        copy AA seq
        copy Nt seq
        • CAE02622.1
        • yckD
        3064 - 3387 (+) YckD protein
          copy AA seq
          copy Nt seq
          • CAE02623.1
          • yckE
          3491 - 4927 (+) YckE protein
            copy AA seq
            copy Nt seq
            • CAE02624.1
            • nin
            4968 - 5366 (-) Nin
              copy AA seq
              copy Nt seq
              • CAE02625.1
              • nuc
              5387 - 5833 (-) NucA
                copy AA seq
                copy Nt seq
                • CAE02626.1
                • hxlB
                6183 - 6740 (-) HxlB protein
                  copy AA seq
                  copy Nt seq
                  • CAE02627.1
                  • hxlA
                  6737 - 7372 (-) HxlA protein
                    copy AA seq
                    copy Nt seq
                    • CAE02628.1
                    • hxlR
                    7604 - 7966 (+) transcriptional regulator
                      copy AA seq
                      copy Nt seq
                      • CAE02629.1
                      • xy02
                      8150 - 8494 (-) Xy02 protein
                        copy AA seq
                        copy Nt seq
                        • CAE02630.1
                        • srfAA
                        8558 - 19312 (+) surfactin synthetase A
                        • Scaffold biosynthesis
                        • Knock-out
                        copy AA seq
                        copy Nt seq
                        • CAE02631.1
                        • srfAB
                        19334 - 30094 (+) surfactin synthetase B
                        • Scaffold biosynthesis
                        copy AA seq
                        copy Nt seq
                        • CAE02632.1
                        • comS
                        22467 - 22607 (+) competence protein S
                          copy AA seq
                          copy Nt seq
                          • CAE02633.1
                          • srfAC
                          30129 - 33965 (+) surfactin synthetase C
                          • Scaffold biosynthesis
                          copy AA seq
                          copy Nt seq
                          • CAE02634.1
                          • srfAD
                          33985 - 34716 (+) surfactin synthetase D
                            copy AA seq
                            copy Nt seq
                            • CAE02635.1
                            • aat
                            34820 - 36148 (+) amino transferase
                              copy AA seq
                              copy Nt seq
                              • CAE02636.1
                              • ycxc
                              36719 - 37855 (-) transporter
                              • Transport
                              copy AA seq
                              copy Nt seq
                              • CAE02637.1
                              • ycxD
                              37800 - 39110 (+) transcriptional regulator containing an aminotransferase domain
                              • Regulation
                              copy AA seq
                              copy Nt seq
                              • CAE02638.1
                              • sfp
                              39105 - 39779 (-) phosphopantetheinyl transferase involved in nonribosomal synthesis
                              • Activation / processing
                              • Knock-out
                              copy AA seq
                              copy Nt seq
                              • CAE02639.1
                              • yczE
                              39878 - 40525 (-) integral membrane protein involved in nonribosomal synthesis
                                copy AA seq
                                copy Nt seq
                                • CAE02640.1
                                • yckI
                                40607 - 41350 (-) YckI protein
                                  copy AA seq
                                  copy Nt seq
                                  • CAE02641.1
                                  • yckJ
                                  41363 - 41869 (-) YckJ protein
                                    copy AA seq
                                    copy Nt seq
                                    Biosynthesis information

                                    Biosynthetic modules

                                    Name
                                    1
                                    Type
                                    nrps-type1
                                    Genes
                                    srfAA
                                    Substrates
                                    glutamic acid (evidence: Structure-based inference)
                                    Integrated Monomers
                                    Domains
                                    condensation (LCL), adenylation
                                    Name
                                    2
                                    Type
                                    nrps-type1
                                    Genes
                                    srfAA
                                    Substrates
                                    leucine (evidence: Structure-based inference)
                                    Integrated Monomers
                                    Domains
                                    condensation (LCL), adenylation
                                    Name
                                    3
                                    Type
                                    nrps-type1
                                    Genes
                                    srfAA
                                    Substrates
                                    leucine (evidence: Structure-based inference)
                                    Integrated Monomers
                                    Domains
                                    condensation (DCL), adenylation, epimerase
                                    Name
                                    4
                                    Type
                                    nrps-type1
                                    Genes
                                    srfAB
                                    Substrates
                                    valine (evidence: Structure-based inference)
                                    Integrated Monomers
                                    Domains
                                    condensation (LCL), adenylation
                                    Name
                                    5
                                    Type
                                    nrps-type1
                                    Genes
                                    srfAB
                                    Substrates
                                    aspartic acid (evidence: Structure-based inference)
                                    Integrated Monomers
                                    Domains
                                    condensation (LCL), adenylation
                                    Name
                                    6
                                    Type
                                    nrps-type1
                                    Genes
                                    srfAB
                                    Substrates
                                    leucine (evidence: Structure-based inference)
                                    Integrated Monomers
                                    Domains
                                    condensation (DCL), adenylation, epimerase
                                    Name
                                    7
                                    Type
                                    nrps-type1
                                    Genes
                                    srfAC
                                    Substrates
                                    leucine (evidence: Structure-based inference)
                                    Integrated Monomers
                                    Domains
                                    condensation (LCL), adenylation
                                    Annotation changelog

                                    Entry version: 4

                                    Date
                                    Changes
                                    Submitters
                                    Reviewers
                                    Update chemical activity to schema version 2.11
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)

                                    Entry version: 3

                                    Date
                                    Changes
                                    Submitters
                                    Reviewers
                                    Corrected NRP module activity
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    Remove leading/trailing whitespace in gene identifiers
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    Corrected gene identifiers
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    Updated bioactivity data
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    Updated NRP substrate specificities
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)

                                    Entry version: 2

                                    Date
                                    Changes
                                    Submitters
                                    Reviewers
                                    Migrated from v1.4
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    Updated compound(s) information (NPAtlas curation)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)

                                    Entry version: 1

                                    Date
                                    Changes
                                    Submitters
                                    Reviewers
                                    Submitted
                                    • (ID: FICEQDRJCNKWHHGNCRVDDP5D)
                                    • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                    Detailed domain annotation
                                    Shows NRPS- and PKS-related domains for each feature that contains them. Click on each domain for more information about the domain's location, consensus monomer prediction, and other details.
                                    A domain glossary is available here, and an explanation of the visualisation is available here.
                                    Selected features only
                                    Show module domains
                                    Similar known gene clusters from MIBiG 4.0
                                    Shows clusters from the MiBIG database that are similar to the current region. Genes marked with the same colour are interrelated. White genes have no relationship.
                                    Click on reference genes to show details of similarities to genes within the current region.
                                    Click on an accession to open that entry in the MiBIG database.