BGC0000315: CDA1b biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Location: 3,519,449 - 3,602,320 nt. (total: 82,872 nt).
This entry originates from NCBI GenBank accession AL645882.2.
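The total length quoted above follows from treating both coordinates as inclusive, 1-based positions. A minimal sketch of that arithmetic, assuming the location string is laid out as shown on this page (the helper names `parse_location` and `inclusive_length` are illustrative and not part of any MIBiG tooling):

```python
import re


def parse_location(text: str) -> tuple[int, int]:
    """Return the first two comma-grouped integers in a location string."""
    numbers = re.findall(r"\d[\d,]*", text)
    start, end = (int(n.replace(",", "")) for n in numbers[:2])
    return start, end


def inclusive_length(start: int, end: int) -> int:
    """Length of a region whose start and end positions are both included."""
    return end - start + 1


start, end = parse_location("3,519,449 - 3,602,320 nt. (total: 82,872 nt)")
print(inclusive_length(start, end))  # 82872
```

The `+ 1` is what separates an inclusive interval from a simple difference of the two coordinates.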

General information about the BGC
MIBiG accession: BGC0000315
Short description: CDA1b biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Status:
  • Minimal annotation: no (a minimal annotation contains only the BGC loci and one or more linked chemical products)
  • Completeness: complete (whether the loci encode everything needed for the pathway producing the compounds)
Biosynthetic class(es)
  • NRP (Ca2+-dependent lipopeptide)
Loci NCBI GenBank: AL645882.2
Compounds
  • CDA1b
  • CDA2a
  • CDA2b
  • CDA3a
  • CDA3b
  • CDA4a
  • CDA4b
Species Streptomyces coelicolor A3(2)
Chemical products information
CDA1b: C66H79N14O29P1
CDA2a: C66H77N14O29P1
CDA2b: C67H81N14O29P1
CDA3a: C66H76N14O26
CDA3b: C67H80N14O25
CDA4a: C67H78N14O26
CDA4b: C67H78N14O26
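The molecular formulas listed above can be compared programmatically. The following is a rough sketch, not part of the MIBiG record: it parses a formula into element counts, reports the difference between two congeners, and estimates an average mass from rounded standard atomic weights. The function names and the weight table are assumptions introduced for illustration.

```python
import re
from collections import Counter

# Rounded standard atomic weights, limited to the elements seen in this entry.
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999, "P": 30.974}


def parse_formula(formula: str) -> Counter:
    """Turn a formula such as 'C66H79N14O29P1' into element counts."""
    counts: Counter = Counter()
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return counts


def formula_difference(a: str, b: str) -> dict[str, int]:
    """Per-element count of a minus b, omitting elements that do not differ."""
    ca, cb = parse_formula(a), parse_formula(b)
    return {e: ca[e] - cb[e] for e in sorted(set(ca) | set(cb)) if ca[e] != cb[e]}


def average_mass(formula: str) -> float:
    """Approximate average molecular mass in g/mol."""
    return sum(ATOMIC_WEIGHT[e] * n for e, n in parse_formula(formula).items())


# CDA1b versus CDA2a, using the formulas as listed in this entry.
print(formula_difference("C66H79N14O29P1", "C66H77N14O29P1"))  # {'H': 2}
```

With the formulas as given here, CDA4a and CDA4b yield an empty difference, which is a quick way to spot entries that may deserve a second look against the primary literature.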
Chemical database entries
NPAtlas
List of genes involved in compound(s) production
| Identifiers | Position (strand) | Product | Functions | Evidence |
| CAB38581.1, SCO3210 | 3519449 - 3520903 (-) | putative 2-dehydro-3-deoxyheptonate aldolase | | |
| CAB38582.1, SCO3211 | 3520900 - 3521676 (-) | putative indoleglycerol phosphate synthase (trpC2) | Precursor biosynthesis | Sequence-based prediction |
| CAB38583.1, SCO3212 | 3521673 - 3522680 (-) | probable anthranilate phosphoribosyltransferase (trpD2) | Precursor biosynthesis | Sequence-based prediction |
| CAB38584.1, SCO3213 | 3522697 - 3523299 (-) | probable anthranilate synthase component II (trpG) | Precursor biosynthesis | Sequence-based prediction |
| CAB38585.1, SCO3214 | 3523296 - 3524831 (-) | probable anthranilate synthase component I (trpE2) | Precursor biosynthesis | Sequence-based prediction |
| CAB38586.1, SCO3215 | 3524828 - 3525844 (-) | 2-oxoglutarate-3-methyltransferase (glmT) | Precursor biosynthesis | Activity assay, Knock-out |
| CAB38587.1, SCO3216 | 3526137 - 3528527 (+) | putative integral membrane ATPase | | |
| CAB38588.1, SCO3217 | 3529272 - 3531188 (+) | putative transcriptional regulator | Regulation | Sequence-based prediction |
| CAB38589.1, SCO3218 | 3531250 - 3531465 (-) | putative small conserved hypothetical protein | | |
| CAB38590.1, SCO3219 | 3531588 - 3532763 (-) | putative lipase (putative secreted protein) | | |
| CAB38591.1, SCO3220 | 3532971 - 3533399 (-) | putative secreted protein | | |
| CAB38592.1, SCO3221 | 3533492 - 3534346 (-) | putative oxidoreductase | | |
| CAB38593.1, SCO3222 | 3534444 - 3534899 (-) | putative secreted protein | | |
| CAB38594.1, SCO3223 | 3535054 - 3535848 (-) | putative ABC transporter integral membrane protein | Transport | Sequence-based prediction |
| CAB38595.1, SCO3224 | 3535855 - 3536808 (-) | putative ABC transporter ATP-binding protein | Transport | Sequence-based prediction |
| CAB38596.1, SCO3225 | 3536945 - 3538660 (+) | two component sensor kinase (absA1) | Regulation | Other in vivo study |
| CAB38597.1, SCO3226 | 3538679 - 3539347 (+) | two component system response regulator (absA2) | Regulation | Other in vivo study |
| CAD55497.1, SCO3227 | 3539337 - 3540671 (-) | 4-hydroxyphenylglycine aminotransferase (hpgT) | Precursor biosynthesis | Sequence-based prediction |
| CAB38520.1, SCO3228 | 3540668 - 3541801 (-) | 4-hydroxymandelate oxidase (Hmo) | Precursor biosynthesis | Knock-out |
| CAB38519.1, SCO3229 | 3541951 - 3543066 (-) | 4-hydroxymandelic acid synthase (HmaS) | Precursor biosynthesis | Knock-out |
| CAB38518.1, SCO3230 | 3543335 - 3565726 (+) | CDA peptide synthetase I (CdaPs1) | Scaffold biosynthesis | Other in vivo study |
| CAB38517.1, SCO3231 | 3565723 - 3576735 (+) | CDA peptide synthetase II (CdaPs2) | Scaffold biosynthesis | Other in vivo study |
| CAD55498.1, SCO3232 | 3576735 - 3583988 (+) | CDA peptide synthetase III (CdaPs3) | Scaffold biosynthesis | Other in vivo study |
| CAB38877.1, SCO3233 | 3583992 - 3584810 (+) | putative type II thioesterase | Unknown | Knock-out |
| CAB38878.1, SCO3234 | 3584822 - 3585724 (+) | 3-hydroxyasparagine phosphotransferase (HasP) | Tailoring (Phosphorylation) | Knock-out |
| CAB38879.1, SCO3235 | 3585800 - 3587647 (-) | putative ABC transporter | Transport | Sequence-based prediction |
| CAB38880.1, SCO3236 | 3587687 - 3588688 (-) | Asparagine oxygenase (AsnO) | Precursor biosynthesis | Activity assay, Knock-out |
| CAB38881.1, SCO3237 | 3588746 - 3590134 (-) | conserved hypothetical protein | | |
| CAB38882.1, SCO3238 | 3590146 - 3591306 (-) | hypothetical protein | | |
| CAB38883.1, SCO3239 | 3591313 - 3592182 (-) | conserved hypothetical protein | | |
| CAB38884.1, SCO3240 | 3592179 - 3592889 (-) | conserved hypothetical protein | | |
| CAB38885.1, SCO3241 | 3592886 - 3593758 (-) | putative isomerase | | |
| CAB38886.1, SCO3242 | 3593755 - 3594630 (-) | putative transferase | | |
| CAB38887.1, SCO3243 | 3594627 - 3595793 (-) | putative myo-inositol phosphate synthase | | |
| CAB38888.1, SCO3244 | 3595843 - 3596640 (-) | putative secreted protein | | |
| CAB38889.1, SCO3245 | 3596708 - 3597970 (-) | Hexenoyl-S-ACP Monooxygenase (hcmO) | Precursor biosynthesis | Activity assay |
| CAB38890.1, SCO3246 | 3598017 - 3599009 (-) | 3-Oxoacyl-[acyl carrier protein] synthase III (fabH4) | Precursor biosynthesis | Sequence-based prediction |
| CAB38891.1, SCO3247 | 3599006 - 3600808 (-) | Hexanoyl-S-ACP oxidase/epoxidase (hxcO) | Precursor biosynthesis | Activity assay, Knock-out |
| CAB38892.1, SCO3248 | 3600858 - 3602078 (-) | 3-Oxoacyl-[acyl carrier protein] synthase II (fabF3) | Precursor biosynthesis | Other in vivo study |
| CAB38893.1, SCO3249 | 3602075 - 3602320 (-) | acyl carrier protein | Precursor biosynthesis | Activity assay |
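The position strings in the gene list above encode start, end, and strand. As an illustrative sketch (the `Gene` class, `parse_gene`, and `overlap` are hypothetical helpers, and the coordinates are assumed to be 1-based and inclusive), they can be parsed to derive gene lengths and to detect the short overlaps between neighbouring reading frames that appear in this cluster:

```python
import re
from dataclasses import dataclass

POSITION = re.compile(r"(\d+)\s*-\s*(\d+)\s*\(([+-])\)")


@dataclass(frozen=True)
class Gene:
    locus_tag: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        """Length in nucleotides, counting both end positions."""
        return self.end - self.start + 1


def parse_gene(locus_tag: str, position: str) -> Gene:
    """Build a Gene from a string such as '3543335 - 3565726 (+)'."""
    match = POSITION.search(position)
    if match is None:
        raise ValueError(f"unrecognised position: {position!r}")
    start, end, strand = match.groups()
    return Gene(locus_tag, int(start), int(end), strand)


def overlap(a: Gene, b: Gene) -> int:
    """Number of nucleotides shared by two genes (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


sco3210 = parse_gene("SCO3210", "3519449 - 3520903 (-)")
sco3211 = parse_gene("SCO3211", "3520900 - 3521676 (-)")
cda_ps1 = parse_gene("SCO3230", "3543335 - 3565726 (+)")

print(cda_ps1.length, cda_ps1.length % 3)  # 22392 0
print(overlap(sco3210, sco3211))           # 4
```

A length divisible by three is a cheap sanity check that the coordinates describe a complete coding sequence.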
NRP-specific information
Subclass Ca2+-dependent lipopeptide
Cyclic? yes
Release type
  • Macrolactonization
Lipid moiety 2,3-epoxyhexanoyl
NRP-synthases
Gene Modules
SCO3230
Module 1
Specificity: serine
Evidence for specificity:
  • Sequence-based prediction
Condensation domain type: Starter
Module 2
Specificity: threonine
Evidence for specificity:
  • Sequence-based prediction
Condensation domain type: LCL
Module 3
Specificity: tryptophan
Evidence for specificity:
  • Structure-based inference
Condensation domain type: LCL
Module 4
Specificity: aspartic acid
Evidence for specificity:
  • Sequence-based prediction
Condensation domain type: DCL
Module 5
Specificity: aspartic acid
Evidence for specificity:
  • Sequence-based prediction
Condensation domain type: LCL
Module 6
Specificity: 4-hydroxyphenylglycine
Evidence for specificity:
  • Sequence-based prediction
Condensation domain type: LCL
SCO3231
Module 7
Specificity: aspartic acid
Evidence for specificity:
Condensation domain type: DCL
Module 8
Specificity: glycine
Evidence for specificity:
  • Sequence-based prediction
Condensation domain type: LCL
Module 9
Specificity: asparagine / 3S-hydroxyasparagine
Evidence for specificity:
  • Structure-based inference
Condensation domain type: LCL
SCO3232
Module 10
Specificity: glutamic acid / 3R-methylglutamic acid
Evidence for specificity:
Condensation domain type: DCL
Module 11
Specificity: tryptophan
Evidence for specificity:
  • Structure-based inference
Condensation domain type: LCL
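The module list above can be read as an assembly line: each module contributes one residue, in module order, across the three synthetase genes. The sketch below encodes the list as data and derives the residue order and the number of modules per gene. It also applies one assumption that is not stated in this entry: a DCL condensation domain is generally taken to accept a D-configured upstream residue, so the residue from the module immediately before each DCL module is flagged as a candidate D-amino acid. The `Module` type and function names are illustrative.

```python
from collections import Counter
from typing import NamedTuple


class Module(NamedTuple):
    gene: str
    number: int
    specificity: str
    condensation: str


# Transcribed from the module list in this entry.
MODULES = [
    Module("SCO3230", 1, "serine", "Starter"),
    Module("SCO3230", 2, "threonine", "LCL"),
    Module("SCO3230", 3, "tryptophan", "LCL"),
    Module("SCO3230", 4, "aspartic acid", "DCL"),
    Module("SCO3230", 5, "aspartic acid", "LCL"),
    Module("SCO3230", 6, "4-hydroxyphenylglycine", "LCL"),
    Module("SCO3231", 7, "aspartic acid", "DCL"),
    Module("SCO3231", 8, "glycine", "LCL"),
    Module("SCO3231", 9, "asparagine / 3S-hydroxyasparagine", "LCL"),
    Module("SCO3232", 10, "glutamic acid / 3R-methylglutamic acid", "DCL"),
    Module("SCO3232", 11, "tryptophan", "LCL"),
]


def peptide_order(modules: list[Module]) -> list[str]:
    """Residue specificities in assembly-line order."""
    return [m.specificity for m in sorted(modules, key=lambda m: m.number)]


def modules_per_gene(modules: list[Module]) -> Counter:
    """How many modules each synthetase gene carries."""
    return Counter(m.gene for m in modules)


def candidate_d_positions(modules: list[Module]) -> list[int]:
    """Module numbers located directly upstream of a DCL condensation domain."""
    return [m.number - 1 for m in modules if m.condensation == "DCL"]


print(modules_per_gene(MODULES))
print(candidate_d_positions(MODULES))  # [3, 6, 9]
```

The flagged positions are a prediction from domain typing only; the stereochemistry of the final product should be confirmed against experimental data.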
Annotation changelog
MIBiG version Submitter Notes
1.0
  • Hidden contributor (ID: JEQAH6TWUDBPITIQUEGTRXT6, no GDPR consent given).
  • Submitted
2.0
  • Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given).
  • Migrated from v1.4
  • Updated compound(s) information (NPAtlas curation)
3.0
  • Hidden contributor (ID: 3UOU7PODQJXIM6BEPUHHRRA5, no GDPR consent given).
  • Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given).
  • Removed duplicate citation
  • Fix duplication in gene evidence (issue #1)
  • Corrected NRP module activity
  • Corrected NRP gene names
  • Updated NRP substrate specificities
3.1
  • Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given).
  • Update chemical activity to schema version 2.11