BGC0000315: CDA1b biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Location: 3,519,449 - 3,602,320 nt. (total: 82,872 nt).
This entry is originally from NCBI GenBank AL645882.2.
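The stated total is consistent with treating the two coordinates as 1-based and inclusive. A minimal sketch of that arithmetic follows; the function name `region_length` is hypothetical and the inclusive-coordinate convention is an assumption inferred from the numbers above.

```python
def region_length(start: int, end: int) -> int:
    """Length in nt of a region given 1-based, inclusive coordinates (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Coordinates taken from the Location line of this entry.
cluster_length = region_length(3_519_449, 3_602_320)
print(cluster_length)  # 82872
```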

General information about the BGC
MIBiG accession: BGC0000315
Short description: CDA1b biosynthetic gene cluster from Streptomyces coelicolor A3(2)
Quality: questionable
Status: active
Completeness: complete (whether the entry covers everything needed for the pathway producing the compound(s))
Biosynthetic class(es)
  • NRPS (Type I)
Loci: AL645882.2, 3519449 - 3602320 (evidence: knock-out studies)
Compounds
  • CDA1b
  • CDA2a
  • CDA2b
  • CDA3a
  • CDA3b
  • CDA4a
  • CDA4b
Species: Streptomyces coelicolor A3(2)
References
Chemical products information
Compound | Evidence | Molecular formula
CDA1b |  | C66H79N14O29P1
CDA2a |  | C66H77N14O29P1
CDA2b |  | C67H81N14O29P1
CDA3a |  | C66H76N14O26
CDA3b |  | C67H80N14O25
CDA4a |  | C67H78N14O26
CDA4b |  | C67H78N14O26
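The molecular formulas listed for the compounds can be compared element by element. The sketch below parses the formula strings into element counts and reports the difference between two compounds. The helper names `parse_formula` and `formula_difference` are hypothetical, and the parser assumes simple formulas without brackets or charges, which holds for the strings in this entry.

```python
import re
from collections import Counter

# Molecular formulas as listed in this entry.
FORMULAS = {
    "CDA1b": "C66H79N14O29P1",
    "CDA2a": "C66H77N14O29P1",
    "CDA2b": "C67H81N14O29P1",
    "CDA3a": "C66H76N14O26",
    "CDA3b": "C67H80N14O25",
    "CDA4a": "C67H78N14O26",
    "CDA4b": "C67H78N14O26",
}


def parse_formula(formula: str) -> Counter:
    """Turn a flat formula such as 'C66H79N14O29P1' into element counts."""
    counts = Counter()
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] += int(number) if number else 1
    return counts


def formula_difference(a: str, b: str) -> dict:
    """Element counts of a minus b, keeping only elements that differ."""
    ca, cb = parse_formula(a), parse_formula(b)
    return {el: ca[el] - cb[el] for el in sorted(set(ca) | set(cb)) if ca[el] != cb[el]}


print(formula_difference(FORMULAS["CDA1b"], FORMULAS["CDA2a"]))  # {'H': 2}
```

With these inputs, CDA4a and CDA4b parse to identical element counts, so the formulas alone do not distinguish them.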
Chemical database entries
NPAtlas
List of genes involved in compound(s) production
Identifiers | Position | Product | Functions | Evidence
CAB38581.1, SCO3210 | 3519449 - 3520903 (-) | putative 2-dehydro-3-deoxyheptonate aldolase |  |
CAB38582.1, SCO3211 | 3520900 - 3521676 (-) | putative indoleglycerol phosphate synthase (trpC2) | Precursor biosynthesis |
CAB38583.1, SCO3212 | 3521673 - 3522680 (-) | probable anthranilate phosphoribosyltransferase (trpD2) | Precursor biosynthesis |
CAB38584.1, SCO3213 | 3522697 - 3523299 (-) | probable anthranilate synthase component II (trpG) | Precursor biosynthesis |
CAB38585.1, SCO3214 | 3523296 - 3524831 (-) | probable anthranilate synthase component I (trpE2) | Precursor biosynthesis |
CAB38586.1, SCO3215 | 3524828 - 3525844 (-) | 2-oxoglutarate-3-methyltransferase (glmT) | Precursor biosynthesis | Activity assay, Knock-out
CAB38587.1, SCO3216 | 3526137 - 3528527 (+) | putative integral membrane ATPase |  |
CAB38588.1, SCO3217 | 3529272 - 3531188 (+) | putative transcriptional regulator | Regulation |
CAB38589.1, SCO3218 | 3531250 - 3531465 (-) | putative small conserved hypothetical protein |  |
CAB38590.1, SCO3219 | 3531588 - 3532763 (-) | putative lipase (putative secreted protein) |  |
CAB38591.1, SCO3220 | 3532971 - 3533399 (-) | putative secreted protein |  |
CAB38592.1, SCO3221 | 3533492 - 3534346 (-) | putative oxidoreductase |  |
CAB38593.1, SCO3222 | 3534444 - 3534899 (-) | putative secreted protein |  |
CAB38594.1, SCO3223 | 3535054 - 3535848 (-) | putative ABC transporter integral membrane protein | Transport |
CAB38595.1, SCO3224 | 3535855 - 3536808 (-) | putative ABC transporter ATP-binding protein | Transport |
CAB38596.1, SCO3225 | 3536945 - 3538660 (+) | two component sensor kinase (absA1) | Regulation | Other in vivo study
CAB38597.1, SCO3226 | 3538679 - 3539347 (+) | two component system response regulator (absA2) | Regulation | Other in vivo study
CAD55497.1, SCO3227 | 3539337 - 3540671 (-) | 4-hydroxyphenylglycine aminotransferase (hpgT) | Precursor biosynthesis |
CAB38520.1, SCO3228 | 3540668 - 3541801 (-) | 4-hydroxymandelate oxidase (Hmo) | Precursor biosynthesis | Knock-out
CAB38519.1, SCO3229 | 3541951 - 3543066 (-) | 4-hydroxymandelic acid synthase (HmaS) | Precursor biosynthesis | Knock-out
CAB38518.1, SCO3230 | 3543335 - 3565726 (+) | CDA peptide synthetase I (CdaPs1) | Scaffold biosynthesis | Other in vivo study
CAB38517.1, SCO3231 | 3565723 - 3576735 (+) | CDA peptide synthetase II (CdaPs2) | Scaffold biosynthesis | Other in vivo study
CAD55498.1, SCO3232 | 3576735 - 3583988 (+) | CDA peptide synthetase III (CdaPs3) | Scaffold biosynthesis | Other in vivo study
CAB38877.1, SCO3233 | 3583992 - 3584810 (+) | putative type II thioesterase |  |
CAB38878.1, SCO3234 | 3584822 - 3585724 (+) | 3-hydroxyasparagine phosphotransferase (HasP) | Tailoring | Knock-out
CAB38879.1, SCO3235 | 3585800 - 3587647 (-) | putative ABC transporter | Transport |
CAB38880.1, SCO3236 | 3587687 - 3588688 (-) | Asparagine oxygenase (AsnO) | Precursor biosynthesis | Activity assay, Knock-out
CAB38881.1, SCO3237 | 3588746 - 3590134 (-) | conserved hypothetical protein |  |
CAB38882.1, SCO3238 | 3590146 - 3591306 (-) | hypothetical protein |  |
CAB38883.1, SCO3239 | 3591313 - 3592182 (-) | conserved hypothetical protein |  |
CAB38884.1, SCO3240 | 3592179 - 3592889 (-) | conserved hypothetical protein |  |
CAB38885.1, SCO3241 | 3592886 - 3593758 (-) | putative isomerase |  |
CAB38886.1, SCO3242 | 3593755 - 3594630 (-) | putative transferase |  |
CAB38887.1, SCO3243 | 3594627 - 3595793 (-) | putative myo-inositol phosphate synthase |  |
CAB38888.1, SCO3244 | 3595843 - 3596640 (-) | putative secreted protein |  |
CAB38889.1, SCO3245 | 3596708 - 3597970 (-) | Hexenoyl-S-ACP Monooxygenase (hcmO) | Precursor biosynthesis | Activity assay
CAB38890.1, SCO3246 | 3598017 - 3599009 (-) | 3-Oxoacyl-[acyl carrier protein] synthase III (fabH4) | Precursor biosynthesis |
CAB38891.1, SCO3247 | 3599006 - 3600808 (-) | Hexanoyl-S-ACP oxidase/epoxidase (hxcO) | Precursor biosynthesis | Activity assay, Knock-out
CAB38892.1, SCO3248 | 3600858 - 3602078 (-) | 3-Oxoacyl-[acyl carrier protein] synthase II (fabF3) | Precursor biosynthesis | Other in vivo study
CAB38893.1, SCO3249 | 3602075 - 3602320 (-) | acyl carrier protein | Precursor biosynthesis | Activity assay
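The position strings in the gene list can be turned into coordinates for simple checks such as gene length or overlap between neighbouring genes. The following is a sketch only: `GenePosition`, `parse_position`, `nt_length`, and `overlap` are hypothetical names, and coordinates are assumed to be 1-based and inclusive.

```python
import re
from typing import NamedTuple


class GenePosition(NamedTuple):
    start: int
    end: int
    strand: str


# Matches strings of the form "3519449 - 3520903 (-)".
POSITION_RE = re.compile(r"(\d+)\s*-\s*(\d+)\s*\(([+-])\)")


def parse_position(text: str) -> GenePosition:
    """Extract start, end and strand from a position string of the gene list."""
    match = POSITION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    start, end, strand = match.groups()
    return GenePosition(int(start), int(end), strand)


def nt_length(pos: GenePosition) -> int:
    """Gene length in nt, assuming inclusive coordinates."""
    return pos.end - pos.start + 1


def overlap(a: GenePosition, b: GenePosition) -> int:
    """Number of nt shared by two genes (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


sco3230 = parse_position("3543335 - 3565726 (+)")  # CDA peptide synthetase I
print(nt_length(sco3230))  # 22392

sco3210 = parse_position("3519449 - 3520903 (-)")
sco3211 = parse_position("3520900 - 3521676 (-)")
print(overlap(sco3210, sco3211))  # 4
```

Under the same assumption, SCO3230 spans 22392 nt, which is a whole number of codons (7464, counting the stop codon if it is included in the annotated range).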
Biosynthesis information

Biosynthetic modules

Name | Type | Genes | Substrates | Substrate evidence | Integrated Monomers | Domains
1 | nrps-type1 | SCO3230 | serine | Sequence-based prediction |  | condensation (Starter), adenylation
2 | nrps-type1 | SCO3230 | threonine | Sequence-based prediction |  | condensation (LCL), adenylation
3 | nrps-type1 | SCO3230 | tryptophan | Structure-based inference [1], [3] |  | condensation (LCL), adenylation, epimerase
4 | nrps-type1 | SCO3230 | aspartic acid | Sequence-based prediction |  | condensation (DCL), adenylation
5 | nrps-type1 | SCO3230 | aspartic acid | Sequence-based prediction |  | condensation (LCL), adenylation
6 | nrps-type1 | SCO3230 | 4-hydroxyphenylglycine | Sequence-based prediction |  | condensation (LCL), adenylation, epimerase
7 | nrps-type1 | SCO3231 | aspartic acid |  |  | condensation (DCL), adenylation
8 | nrps-type1 | SCO3231 | glycine | Sequence-based prediction |  | condensation (LCL), adenylation
9 | nrps-type1 | SCO3231 | asparagine, 3S-hydroxyasparagine | Structure-based inference |  | condensation (LCL), adenylation, epimerase
10 | nrps-type1 | SCO3232 | glutamic acid, 3R-methylglutamic acid |  |  | condensation (DCL), adenylation
11 | nrps-type1 | SCO3232 | tryptophan | Structure-based inference [1], [2], [3] |  | condensation (LCL), adenylation
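The module annotations can be checked for internal consistency. In NRPS assembly lines, a DCL condensation domain is generally described as joining a D-configured upstream residue to an L-configured one, so a DCL domain would be expected directly after a module that carries an epimerase domain, and an LCL domain otherwise. The sketch below encodes the eleven modules from this entry and tests that expectation. The `Module` type and `condensation_mismatches` function are hypothetical, and the rule itself is a simplifying assumption, not something this entry states.

```python
from typing import NamedTuple


class Module(NamedTuple):
    number: int
    gene: str
    substrate: str
    condensation: str    # condensation subtype as annotated: Starter, LCL or DCL
    has_epimerase: bool


# Transcribed from the biosynthetic modules listed in this entry.
MODULES = [
    Module(1, "SCO3230", "serine", "Starter", False),
    Module(2, "SCO3230", "threonine", "LCL", False),
    Module(3, "SCO3230", "tryptophan", "LCL", True),
    Module(4, "SCO3230", "aspartic acid", "DCL", False),
    Module(5, "SCO3230", "aspartic acid", "LCL", False),
    Module(6, "SCO3230", "4-hydroxyphenylglycine", "LCL", True),
    Module(7, "SCO3231", "aspartic acid", "DCL", False),
    Module(8, "SCO3231", "glycine", "LCL", False),
    Module(9, "SCO3231", "asparagine, 3S-hydroxyasparagine", "LCL", True),
    Module(10, "SCO3232", "glutamic acid, 3R-methylglutamic acid", "DCL", False),
    Module(11, "SCO3232", "tryptophan", "LCL", False),
]


def condensation_mismatches(modules):
    """List (module number, expected, annotated) where the assumed rule is violated."""
    mismatches = []
    for upstream, current in zip(modules, modules[1:]):
        expected = "DCL" if upstream.has_epimerase else "LCL"
        if current.condensation != expected:
            mismatches.append((current.number, expected, current.condensation))
    return mismatches


epimerase_modules = [m.number for m in MODULES if m.has_epimerase]
print(epimerase_modules)                 # [3, 6, 9]
print(condensation_mismatches(MODULES))  # []
```

Under this assumption the annotation is self-consistent: the epimerase-containing modules 3, 6 and 9 are each followed by a DCL condensation domain in modules 4, 7 and 10.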
Annotation changelog

Entry version: 4

Date | Changes | Submitters | Reviewers
 | Update chemical activity to schema version 2.11 | ID: AAAAAAAAAAAAAAAAAAAAAAAA | ID: AAAAAAAAAAAAAAAAAAAAAAAA

Entry version: 3

Date | Changes | Submitters | Reviewers
 | Removed duplicate citation | ID: 3UOU7PODQJXIM6BEPUHHRRA5 | ID: AAAAAAAAAAAAAAAAAAAAAAAA
 | Fix duplication in gene evidence (issue #1) | ID: AAAAAAAAAAAAAAAAAAAAAAAA | ID: AAAAAAAAAAAAAAAAAAAAAAAA
 | Corrected NRP module activity | ID: AAAAAAAAAAAAAAAAAAAAAAAA | ID: AAAAAAAAAAAAAAAAAAAAAAAA
 | Corrected NRP gene names | ID: AAAAAAAAAAAAAAAAAAAAAAAA | ID: AAAAAAAAAAAAAAAAAAAAAAAA
 | Updated NRP substrate specificities | ID: AAAAAAAAAAAAAAAAAAAAAAAA | ID: AAAAAAAAAAAAAAAAAAAAAAAA

Entry version: 2

Date | Changes | Submitters | Reviewers
 | Migrated from v1.4 | ID: AAAAAAAAAAAAAAAAAAAAAAAA | ID: AAAAAAAAAAAAAAAAAAAAAAAA
 | Updated compound(s) information (NPAtlas curation) | ID: AAAAAAAAAAAAAAAAAAAAAAAA | ID: AAAAAAAAAAAAAAAAAAAAAAAA

Entry version: 1

Date | Changes | Submitters | Reviewers
 | Submitted | ID: JEQAH6TWUDBPITIQUEGTRXT6 | ID: AAAAAAAAAAAAAAAAAAAAAAAA