BGC0001984: sarpeptin A biosynthetic gene cluster from Streptomyces sp.
Location: 1 - 96,578 nt. (total: 96,578 nt).
This entry is originally from NCBI GenBank MN068049.1.

General information about the BGC
MIBiG accession: BGC0001984
Short description: sarpeptin A biosynthetic gene cluster from Streptomyces sp.
Quality: questionable
Status: active
Completeness: unknown (whether the entry covers everything needed for the pathway producing the compounds)
Biosynthetic class(es):
  • NRPS (Type I)
Loci: MN068049.1
Compounds:
  • sarpeptin A
  • sarpeptin B
Species: Streptomyces sp.
References
Chemical products information
  • sarpeptin A: no structure information available
  • sarpeptin B: no structure information available
List of genes involved in compound(s) production
Identifier    Gene   Position (strand)    Product
QED55413.1    speR   9395 - 11188 (+)     SARP transcriptional regulator
QED55414.1    speH   12052 - 12978 (-)    alpha-ketoglutarate dependent oxygenase
QED55415.1    speJ   13326 - 13856 (+)    flavin-dependent monooxygenase
QED55416.1    speE   16153 - 16434 (-)    acyl carrier protein
QED55417.1    speF   16493 - 18175 (-)    acyl-CoA dehydrogenase
QED55418.1    speG   18172 - 20004 (-)    acyl-CoA dehydrogenase
QED55419.1    speD   20001 - 21782 (-)    fatty-acyl AMP ligase
QED55420.1    speI   27998 - 28942 (-)    alpha-ketoglutarate dependent oxygenase
QED55421.1    speC   31375 - 42978 (-)    nonribosomal peptide synthetase
QED55422.1    speB   42975 - 55616 (-)    nonribosomal peptide synthetase
QED55423.1    speA   55618 - 76578 (-)    nonribosomal peptide synthetase
QED55424.1    speK   82006 - 83565 (-)    alpha/beta hydrolase
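The positions in the gene list can be turned into gene and protein lengths with a little arithmetic. The Python sketch below is illustrative only and not part of the MIBiG entry. It assumes the coordinates are 1-based and inclusive (the GenBank convention) and that each coding sequence includes its stop codon. The function names are hypothetical.

```python
# Coordinates copied from the gene list: (gene, start, end, strand).
GENES = [
    ("speR", 9395, 11188, "+"),
    ("speH", 12052, 12978, "-"),
    ("speJ", 13326, 13856, "+"),
    ("speE", 16153, 16434, "-"),
    ("speF", 16493, 18175, "-"),
    ("speG", 18172, 20004, "-"),
    ("speD", 20001, 21782, "-"),
    ("speI", 27998, 28942, "-"),
    ("speC", 31375, 42978, "-"),
    ("speB", 42975, 55616, "-"),
    ("speA", 55618, 76578, "-"),
    ("speK", 82006, 83565, "-"),
]


def cds_length_nt(start, end):
    """Feature length in nucleotides, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(start, end):
    """Codon count minus one, assuming the CDS includes its stop codon."""
    length = cds_length_nt(start, end)
    if length % 3 != 0:
        raise ValueError(f"CDS length {length} nt is not a multiple of 3")
    return length // 3 - 1


for gene, start, end, strand in GENES:
    print(f"{gene} ({strand}): {cds_length_nt(start, end)} nt, "
          f"about {protein_length_aa(start, end)} aa")
```

Under these assumptions speA, the largest gene, spans 20,961 nt, which corresponds to 6,986 residues.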
Biosynthesis information

Biosynthetic modules

Name    Type         Gene   Substrate       Substrate evidence                                      Domains
Unk01   nrps-type1   speC   glycine         Sequence-based prediction [2]                           adenylation
Unk02   nrps-type1   speC   alanine         Sequence-based prediction [2]                           adenylation
Unk03   nrps-type1   speC   glycine         Sequence-based prediction [2]                           adenylation
Unk04   nrps-type1   speB   aspartic acid   Sequence-based prediction [2]                           adenylation
Unk05   nrps-type1   speB   glycine         Sequence-based prediction [2]                           adenylation
Unk06   nrps-type1   speB   proline         Sequence-based prediction [2]                           adenylation
Unk07   nrps-type1   speB   glycine         Sequence-based prediction [2]                           adenylation
Unk08   nrps-type1   speA   glycine         Structure-based inference, Sequence-based prediction    adenylation
Unk09   nrps-type1   speA   aspartic acid   Structure-based inference, Sequence-based prediction    adenylation
Unk10   nrps-type1   speA   tyrosine        Structure-based inference, Sequence-based prediction    adenylation
Unk11   nrps-type1   speA   threonine       Structure-based inference, Sequence-based prediction    adenylation
Unk12   nrps-type1   speA   leucine         Structure-based inference, Sequence-based prediction    adenylation

The Integrated Monomers field is empty for every module.
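The module annotations can be summarised per gene and per substrate. The Python sketch below is illustrative only: it groups the predicted adenylation-domain substrates by gene and counts how often each substrate is predicted. The modules are kept in the order the entry lists them, which is not assumed to be the order of peptide assembly. The variable names are hypothetical.

```python
from collections import Counter

# (module name, gene, predicted substrate), copied from the module annotations.
MODULES = [
    ("Unk01", "speC", "glycine"),
    ("Unk02", "speC", "alanine"),
    ("Unk03", "speC", "glycine"),
    ("Unk04", "speB", "aspartic acid"),
    ("Unk05", "speB", "glycine"),
    ("Unk06", "speB", "proline"),
    ("Unk07", "speB", "glycine"),
    ("Unk08", "speA", "glycine"),
    ("Unk09", "speA", "aspartic acid"),
    ("Unk10", "speA", "tyrosine"),
    ("Unk11", "speA", "threonine"),
    ("Unk12", "speA", "leucine"),
]

# Group substrates by gene, preserving the listing order within each gene.
substrates_by_gene = {}
for _name, gene, substrate in MODULES:
    substrates_by_gene.setdefault(gene, []).append(substrate)

# Count how many modules are predicted to activate each substrate.
substrate_counts = Counter(substrate for _, _, substrate in MODULES)

for gene, substrates in substrates_by_gene.items():
    print(f"{gene}: {', '.join(substrates)}")
print(substrate_counts.most_common())
```

In this listing glycine is the most frequent predicted substrate, with five of the twelve modules.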
Annotation changelog

Entry version: 2
  • Fixed DOIs (submitter ID: CUSGWVK3I4TVHJH7NHPFAP2L; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)
  • Added NRP substrate specificities (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)

Entry version: 1
  • Submitted (submitter ID: AAAAAAAAAAAAAAAAAAAAAAAA; reviewer ID: AAAAAAAAAAAAAAAAAAAAAAAA)