BGC0000433: surfactin biosynthetic gene cluster from Bacillus velezensis FZB42
Location: 1 - 41,884 nt. (total: 41,884 nt).
This entry is originally from NCBI GenBank AJ575642.1.

General information about the BGC
MIBiG accession: BGC0000433
Short description: surfactin biosynthetic gene cluster from Bacillus velezensis FZB42
Status: Minimal annotation: no
(A minimal annotation contains only information on the BGC loci and one or more linked chemical products.)
Completeness: complete
(Whether the locus encodes everything needed for the pathway producing the compound(s).)
Biosynthetic class(es):
  • NRP (Other lipopeptide)
Loci: NCBI GenBank AJ575642.1
Compounds:
  • surfactin
Species: Bacillus velezensis FZB42
Chemical products information
surfactin
Molecular formula: C52H91N7O13
Chemical database entries: NPAtlas, ChemSpider
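As a quick sanity check on the molecular formula given above, the average molecular mass can be computed from the element counts. The following is a minimal sketch, not part of the MIBiG entry: the helper names `parse_formula` and `average_mass` are hypothetical, and the atomic weights are the usual abridged standard values.

```python
import re

# Abridged standard atomic weights in g/mol (assumed values, not from the entry).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}


def parse_formula(formula):
    """Split a simple formula such as 'C52H91N7O13' into element counts.

    Only flat formulas are handled (no parentheses, charges, or isotopes).
    """
    counts = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def average_mass(formula):
    """Sum atomic weight times count over all elements in the formula."""
    return sum(ATOMIC_WEIGHT[el] * n for el, n in parse_formula(formula).items())


if __name__ == "__main__":
    print(parse_formula("C52H91N7O13"))
    print(round(average_mass("C52H91N7O13"), 3))
```

With these weights the formula works out to roughly 1022.3 g/mol. Surfactin occurs as a family of homologues with different fatty acid chain lengths, so other variants will give different values.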
List of genes involved in compound(s) production
Identifier | Gene | Position (strand) | Product | Functions | Evidence
CAE02619.1 | yciC | 64 - 1341 (+) | YciC protein | - | -
CAE02620.1 | yx01 | 1406 - 2542 (+) | Yx01 protein | - | -
CAE02621.1 | yckC | 2557 - 2991 (+) | YckC protein | - | -
CAE02622.1 | yckD | 3064 - 3387 (+) | YckD protein | - | -
CAE02623.1 | yckE | 3491 - 4927 (+) | YckE protein | - | -
CAE02624.1 | nin | 4968 - 5366 (-) | Nin | - | -
CAE02625.1 | nuc | 5387 - 5833 (-) | NucA | - | -
CAE02626.1 | hxlB | 6183 - 6740 (-) | HxlB protein | - | -
CAE02627.1 | hxlA | 6737 - 7372 (-) | HxlA protein | - | -
CAE02628.1 | hxlR | 7604 - 7966 (+) | transcriptional regulator | - | -
CAE02629.1 | xy02 | 8150 - 8494 (-) | Xy02 protein | - | -
CAE02630.1 | srfAA | 8558 - 19312 (+) | surfactin synthetase A | Scaffold biosynthesis | Knock-out
CAE02631.1 | srfAB | 19334 - 30094 (+) | surfactin synthetase B | Scaffold biosynthesis | Sequence-based prediction
CAE02632.1 | comS | 22467 - 22607 (+) | competence protein S | Unknown | Sequence-based prediction
CAE02633.1 | srfAC | 30129 - 33965 (+) | surfactin synthetase C | Scaffold biosynthesis | Sequence-based prediction
CAE02634.1 | srfAD | 33985 - 34716 (+) | surfactin synthetase D | Other enzymatic | Sequence-based prediction
CAE02635.1 | aat | 34820 - 36148 (+) | amino transferase | - | -
CAE02636.1 | ycxc | 36719 - 37855 (-) | transporter | Transport | Sequence-based prediction
CAE02637.1 | ycxD | 37800 - 39110 (+) | transcriptional regulator containing an aminotransferase domain | Regulation | Sequence-based prediction
CAE02638.1 | sfp | 39105 - 39779 (-) | phosphopantetheinyl transferase involved in nonribosomal synthesis | Activation / processing | Knock-out
CAE02639.1 | yczE | 39878 - 40525 (-) | integral membrane protein involved in nonribosomal synthesis | Unknown | Knock-out
CAE02640.1 | yckI | 40607 - 41350 (-) | YckI protein | - | -
CAE02641.1 | yckJ | 41363 - 41869 (-) | YckJ protein | - | -
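The gene coordinates listed above lend themselves to simple arithmetic, such as gene lengths and overlaps between neighbouring genes. The sketch below is illustrative only: it copies a subset of the coordinates from the list, assumes they are 1-based and inclusive (as is conventional for GenBank features), and uses hypothetical names (`Gene`, `length_nt`, `overlaps`).

```python
from typing import NamedTuple


class Gene(NamedTuple):
    name: str
    start: int   # assumed 1-based, inclusive
    end: int     # assumed 1-based, inclusive
    strand: str


# Subset of the gene list above (coordinates copied from the entry).
GENES = [
    Gene("srfAA", 8558, 19312, "+"),
    Gene("srfAB", 19334, 30094, "+"),
    Gene("comS", 22467, 22607, "+"),
    Gene("srfAC", 30129, 33965, "+"),
    Gene("srfAD", 33985, 34716, "+"),
    Gene("ycxc", 36719, 37855, "-"),
    Gene("ycxD", 37800, 39110, "+"),
    Gene("sfp", 39105, 39779, "-"),
]


def length_nt(gene):
    """Length in nucleotides for inclusive coordinates."""
    return gene.end - gene.start + 1


def overlaps(genes):
    """Return (name_a, name_b, shared_nt) for each pair sharing at least 1 nt."""
    found = []
    for i, a in enumerate(genes):
        for b in genes[i + 1:]:
            shared = min(a.end, b.end) - max(a.start, b.start) + 1
            if shared > 0:
                found.append((a.name, b.name, shared))
    return found


if __name__ == "__main__":
    for g in GENES:
        # A coding sequence is expected to be a whole number of codons.
        print(g.name, length_nt(g), "nt", "(multiple of 3)" if length_nt(g) % 3 == 0 else "")
    print(overlaps(GENES))
```

Run on this subset, the check reports comS as lying entirely within srfAB (141 nt shared), a 56 nt overlap between ycxc and ycxD, and a 6 nt overlap between ycxD and sfp.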
NRP-specific information
Subclass: Other lipopeptide
Cyclic: yes
NRP-synthases
Gene | Module | Specificity | Evidence for specificity | Condensation domain type
srfAA | 1 | glutamic acid | Structure-based inference | LCL
srfAA | 2 | leucine | Structure-based inference | LCL
srfAA | 3 | leucine | Structure-based inference | DCL
srfAB | 4 | valine | Structure-based inference | LCL
srfAB | 5 | aspartic acid | Structure-based inference | LCL
srfAB | 6 | leucine | Structure-based inference | DCL
srfAC | 7 | leucine | Structure-based inference | LCL
Modification domains: Unknown
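Reading the module specificities above in module order gives the predicted linear amino acid sequence of the peptide. The following is a minimal sketch under stated assumptions: the `MODULES` list is transcribed from the annotation above, the three-letter codes are the standard ones, and the function names are hypothetical. It says nothing about stereochemistry, the fatty acid moiety, or cyclization.

```python
# (gene, module number, annotated substrate), transcribed from the entry above.
MODULES = [
    ("srfAA", 1, "glutamic acid"),
    ("srfAA", 2, "leucine"),
    ("srfAA", 3, "leucine"),
    ("srfAB", 4, "valine"),
    ("srfAB", 5, "aspartic acid"),
    ("srfAB", 6, "leucine"),
    ("srfAC", 7, "leucine"),
]

# Standard three-letter amino acid codes for the substrates that occur here.
THREE_LETTER = {
    "glutamic acid": "Glu",
    "leucine": "Leu",
    "valine": "Val",
    "aspartic acid": "Asp",
}


def predicted_peptide(modules):
    """Join substrate codes in module order (assumes a colinear assembly line)."""
    ordered = sorted(modules, key=lambda m: m[1])
    return "-".join(THREE_LETTER[substrate] for _, _, substrate in ordered)


def modules_per_gene(modules):
    """Count how many modules each synthetase gene contributes."""
    counts = {}
    for gene, _, _ in modules:
        counts[gene] = counts.get(gene, 0) + 1
    return counts


if __name__ == "__main__":
    print(predicted_peptide(MODULES))
    print(modules_per_gene(MODULES))
```

For the annotation above this yields Glu-Leu-Leu-Val-Asp-Leu-Leu, with three modules each on srfAA and srfAB and one on srfAC.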
Annotation changelog
MIBiG version 1.0
  • Submitter: Hidden contributor (ID: FICEQDRJCNKWHHGNCRVDDP5D, no GDPR consent given)
  • Notes: Submitted
MIBiG version 2.0
  • Submitter: Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given)
  • Notes: Migrated from v1.4
  • Notes: Updated compound(s) information (NPAtlas curation)
MIBiG version 3.0
  • Submitter: Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given)
  • Notes: Corrected NRP module activity
  • Notes: Remove leading/trailing whitespace in gene identifiers
  • Notes: Corrected gene identifiers
  • Notes: Updated bioactivity data
  • Notes: Updated NRP substrate specificities
MIBiG version 3.1
  • Submitter: Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given)
  • Notes: Update chemical activity to schema version 2.11