BGC0002906: arabidiol biosynthetic gene cluster from Arabidopsis thaliana
Shows the layout of the region, marking coding sequences and areas of interest. Clicking a gene will select it and show any relevant details. Clicking an area feature (e.g. a candidate cluster) will select all coding sequences within that area. Double clicking an area feature will zoom to that area. Multiple genes and area features can be selected by clicking them while holding the Ctrl key.
Location: 8,730,499 - 8,813,593 nt. (total: 83,095 nt).
This entry is originally from NCBI GenBank NC_003076, but has been modified (see Modifications tab for details).

Legend:

core biosynthetic genes
additional biosynthetic genes
transport-related genes
regulatory genes
other genes
resistance
reset zoomreset view
zoomzoom to selection
Gene details
Shows details of the most recently selected gene, including names, products, location, and other annotations.
Select a gene to view the details available for it
General
Compounds
Genes
Biosynthesis
History
KnownClusterBlast
Modifications
General information about the BGC
MIBiG accession BGC0002906
Short description arabidiol biosynthetic gene cluster from Arabidopsis thaliana
Status Quality: high
The quality level of this entry.

Status: pending
The status of this entry.

Completeness: complete
Whether the entry covers everything needed for the pathway producing the compound(s)
Biosynthetic class(es)
  • terpene (Triterpene)
Loci
NC_003076
8730499 - 8813593
via Heterologous expression [2]
Compounds
  • arabidiol
  • baruol
Species Arabidopsis thaliana [taxonomy]
References
Chemical products information
arabidiol [synonyms: 5E)-2-hydroxy-6, 3aR, 9bR)-3-[(2R, 9a-tetramethyldodecahydro-1H-cyclopenta[a]naphthalen-7-ol, 10-dimethylundeca-5, 14-diol, (3R, (13R)-malabarica-17, 5aR, 21-diene-3beta, 9aR, 7S, 6, 9-dien-2-yl]-3a] Evidence: Mass spectrometry [2]
Copy SMILES
C30H52O2
Chemical database entries
PubCHEM
ChEBI
baruol [synonyms: 6a, 4aS, 21-dien-3-ol, 3, 8, 10b, 6aS, 4b, 4a, 9, (2S, 4, 11-hexadecahydrochrysen-2-ol, 10bS)-1, 2, 10a, D:B-friedobaccharan-5, 1, 10a-hexamethyl-8-(4-methylpent-3-en-1-yl)-1, 8R, 10, 10aR, 6, 7, 4bR, 5] Evidence: Mass spectrometry [1]
Copy SMILES
C30H50O1
Chemical database entries
PubCHEM
ChEBI
List of genes involved in compound(s) production
Identifiers Position Product Functions Evidence Extra
  • AT5G25220_rename1
  • NP_001031938.1
  • KNAT3
8736208 - 8738087 (+) homeobox protein knotted-1-like 3
    copy AA seq
    copy Nt seq
    • AT5G25220
    • NP_197904.1
    • KNAT3
    8736208 - 8738115 (+) homeobox protein knotted-1-like 3
      copy AA seq
      copy Nt seq
      • AT5G25230
      • NP_001318643.1
      8739709 - 8743594 (+) Ribosomal protein S5/Elongation factor G/III/V family protein
        copy AA seq
        copy Nt seq
        • AT5G25240
        • NP_197906.1
        8746779 - 8747174 (-) stress induced protein
          copy AA seq
          copy Nt seq
          • AT5G25250
          • NP_197907.1
          • FLOT1
          8749774 - 8751430 (+) SPFH/Band 7/PHB domain-containing membrane-associated protein family
            copy AA seq
            copy Nt seq
            • AT5G25260
            • NP_197908.1
            8752751 - 8754282 (+) SPFH/Band 7/PHB domain-containing membrane-associated protein family
              copy AA seq
              copy Nt seq
              • AT5G25265
              • NP_680219.1
              8754794 - 8756855 (-) Hyp O-arabinosyltransferase-like protein
                copy AA seq
                copy Nt seq
                • AT5G25270
                • NP_001318644.1
                8757783 - 8762002 (-) Ubiquitin-like superfamily protein
                  copy AA seq
                  copy Nt seq
                  • AT5G25280
                  • NP_001332588.1
                  8773882 - 8774544 (+) serine-rich protein-like protein
                    copy AA seq
                    copy Nt seq
                    • AT5G25290
                    • NP_197911.1
                    8778592 - 8779785 (+) F-box protein (DUF295)
                      copy AA seq
                      copy Nt seq
                      • AT5G25300
                      • NP_197912.4
                      8780258 - 8783093 (+) F-box protein
                        copy AA seq
                        copy Nt seq
                        • AT5G25310
                        • NP_197913.4
                        8784820 - 8787235 (+) Exostosin family protein
                          copy AA seq
                          copy Nt seq
                          • AT5G25320_rename1
                          • NP_001330098.1
                          8787403 - 8789209 (-) ACT-like superfamily protein
                            copy AA seq
                            copy Nt seq
                            • AT5G25320
                            • NP_197914.1
                            8787403 - 8789530 (-) ACT-like superfamily protein
                              copy AA seq
                              copy Nt seq
                              • AT5G25320_rename2
                              • NP_001330097.1
                              8787865 - 8789530 (-) ACT-like superfamily protein
                                copy AA seq
                                copy Nt seq
                                • AT5G25330
                                • NP_197915.1
                                8791564 - 8792664 (+) Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
                                  copy AA seq
                                  copy Nt seq
                                  • AT5G25340
                                  • NP_197916.1
                                  8793238 - 8794211 (+) Ubiquitin-like superfamily protein
                                    copy AA seq
                                    copy Nt seq
                                    • AT5G25350
                                    • NP_197917.1
                                    • EBF2
                                    8794842 - 8796882 (-) EIN3-binding F box protein 2
                                      copy AA seq
                                      copy Nt seq
                                      • AT5G25360_rename1
                                      • NP_001330286.1
                                      8799934 - 8802161 (-) uncharacterized protein
                                        copy AA seq
                                        copy Nt seq
                                        • AT5G25360
                                        • NP_197918.1
                                        8799934 - 8802333 (-) uncharacterized protein
                                          copy AA seq
                                          copy Nt seq
                                          • AT5G25370_rename1
                                          • NP_001331648.1
                                          • PLDALPHA3
                                          8804240 - 8807237 (-) phospholipase D alpha 3
                                            copy AA seq
                                            copy Nt seq
                                            • AT5G25370
                                            • NP_001318645.1
                                            • PLDALPHA3
                                            8804240 - 8807547 (-) phospholipase D alpha 3
                                              copy AA seq
                                              copy Nt seq
                                              Biosynthesis information

                                              No biosynthesis information available for this record.

                                              Annotation changelog

                                              Entry version: 1

                                              Date
                                              Changes
                                              Submitters
                                              Reviewers
                                              MIBiG v4 annotathon
                                              • (ID: AP4OPPCC6CSE5F433WMY7TMD)
                                              • (ID: Q4G2APLPZNOU4XV72H6GYP54)
                                              • (ID: AAAAAAAAAAAAAAAAAAAAAAAA)
                                              Similar known clusters from MIBiG 4.0
                                              Shows clusters from the MIBiG database that are similar to the current region. Genes marked with the same colour are interrelated. White genes have no relationship.
                                              Detailed help and explanations are available here.

                                              Click on reference genes to show details of similarities to genes within the current region.
                                              Double click on a reference drawing to reverse the display of the genes.

                                              Click on an accession to open that entry in the MIBiG database.

                                              No significant matches found.

                                              Modifications to original record
                                              • renamed CDS with name AT5G25220 at join{[8736207:8736888](+), [8737163:8737284](+), [8737372:8737557](+), [8737634:8737766](+), [8737844:8737981](+), [8738083:8738087](+)} to AT5G25220_rename1 to avoid duplicates
                                              • removed an exact duplicate of CDS feature AT5G25230
                                              • removed an exact duplicate of CDS feature AT5G25280
                                              • removed an exact duplicate of CDS feature AT5G25280
                                              • renamed CDS with name AT5G25320 at join{[8789088:8789209](-), [8788931:8788999](-), [8788136:8788844](-), [8787949:8788054](-), [8787402:8787786](-)} to AT5G25320_rename1 to avoid duplicates
                                              • renamed CDS with name AT5G25320 at join{[8789459:8789530](-), [8789301:8789353](-), [8789088:8789203](-), [8788931:8788999](-), [8788136:8788844](-), [8787949:8788054](-), [8787864:8787879](-)} to AT5G25320_rename2 to avoid duplicates
                                              • removed an exact duplicate of CDS feature AT5G25360
                                              • renamed CDS with name AT5G25360 at join{[8801964:8802161](-), [8801504:8801749](-), [8800764:8800852](-), [8800068:8800135](-), [8799933:8799987](-)} to AT5G25360_rename1 to avoid duplicates
                                              • removed an exact duplicate of CDS feature AT5G25370
                                              • removed an exact duplicate of CDS feature AT5G25370
                                              • renamed CDS with name AT5G25370 at join{[8806153:8807237](-), [8805048:8805756](-), [8804239:8804685](-)} to AT5G25370_rename1 to avoid duplicates