BGC0000362: glycopeptidolipid biosynthetic gene cluster from Mycobacterium avium
Location: 1 - 71,286 nt. (total: 71,286 nt).
This entry is originally from NCBI GenBank AF143772.2.

General information about the BGC
MIBiG accession: BGC0000362
Short description: glycopeptidolipid biosynthetic gene cluster from Mycobacterium avium
Status: Minimal annotation: no (a minimal annotation contains only the BGC loci and one or more linked chemical products)
Completeness: Unknown (whether the loci encode everything needed for the pathway producing the compound(s))
Biosynthetic class(es):
  • NRP
Loci: NCBI GenBank AF143772.2
Compounds:
  • glycopeptidolipid
Species: Mycobacterium avium
Chemical products information
glycopeptidolipid
Molecular formula: C81H142N4O27
Chemical database entries: PubChem
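As a quick sanity check on the formula C81H142N4O27 given above, the following minimal sketch counts the atoms and estimates the average molecular mass. The function names `parse_formula` and `average_mass` are hypothetical helpers, not part of MIBiG, and the atomic masses are standard values rounded to three decimals.

```python
import re

# Standard average atomic masses in g/mol, rounded to three decimals.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "O": 15.999}


def parse_formula(formula: str) -> dict[str, int]:
    """Return element counts for a flat formula such as 'C81H142N4O27'.

    Handles only element symbols followed by optional counts
    (no brackets, charges, or isotopes).
    """
    counts: dict[str, int] = {}
    for element, number in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        counts[element] = counts.get(element, 0) + (int(number) if number else 1)
    return counts


def average_mass(formula: str) -> float:
    """Sum the average atomic masses of all atoms in the formula."""
    return sum(ATOMIC_MASS[el] * n for el, n in parse_formula(formula).items())


counts = parse_formula("C81H142N4O27")
mass = average_mass("C81H142N4O27")
print(counts)
print(f"{mass:.2f} g/mol")
```

With these rounded masses the estimate comes out at roughly 1604 g/mol; a cheminformatics toolkit would be the better choice for anything beyond a rough check.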
List of genes involved in compound(s) production
Identifier    Gene     Position                Product
AAD44199.1    -        47 - 1132 (+)           unknown
AAD44200.1    -        1524 - 2441 (-)         IS1601-D
AAD44201.1    -        2441 - 2785 (-)         IS1601-C
AAD44202.1    -        2837 - 4039 (-)         IS1601-B
AAD44203.1    -        4298 - 5545 (-)         IS1601-A
AAD44204.1    -        5791 - 6876 (+)         A-protein-like protein
AAD44205.1    gepiA    6996 - 8057 (+)         GepiA
AAD44206.1    mtfA     8321 - 9448 (+)         MtfA
AAD44207.1    mtfB     9715 - 10536 (+)        MtfB
AAD44208.1    gtfA     10658 - 11935 (+)       GtfA
AAD44209.1    rtfA     12339 - 13625 (+)       RtfA
AAD44210.1    mtfC     13727 - 14527 (+)       MtfC
AAD44211.1    mtfD     14627 - 15445 (+)       mtfD
AAD44212.1    -        16004 - 17236 (-)       transposase
AAD44213.2    gtfB     17728 - 18984 (-)       GtfB
AAD44214.2    -        19126 - 19779 (+)       unknown
AAD44215.1    -        20437 - 20781 (+)       transposase
AAD44216.1    -        20781 - 21698 (+)       transposase
AAD44217.2    gdhgA    22689 - 23729 (-)       GdhgA
AAD44218.1    gtfC     24002 - 24811 (+)       GtfC
AAD44219.1    mdhtA    25020 - 26051 (+)       mdhtA
AAD44220.1    merA     25991 - 27010 (+)       MerA
AAD44221.1    -        27286 - 28008 (+)       unknown
AAD44222.1    gtfD     28153 - 28953 (+)       GtfD
AAD44223.1    -        29601 - 30518 (-)       IS1601-D
AAD44224.1    -        30518 - 30862 (-)       IS1601-C
AAD44225.1    -        30914 - 32116 (-)       IS1601-B
AAD44226.1    -        32375 - 33622 (-)       IS1601-A
AAD44227.1    drrC     33901 - 34692 (-)       DrrC
AAD44228.1    drrB     34692 - 35432 (-)       DrrB
AAD44229.1    drrA     35429 - 36376 (-)       DrrA
AAD44230.1    tmtpC    36589 - 39513 (-)       TmtpC
AAD44231.1    tmtpB    39572 - 42463 (-)       TmtpB
AAD44232.1    tmtpA    42460 - 43125 (-)       TmtpA
AAD44233.1    pstA     44047 - 54294 (+)       PstA
AAD44234.1    pstB     54291 - 61949 (+)       PstB
AAD44235.1    -        65266 - 66183 (-)       IS1601-D
AAD44236.1    -        66183 - 66527 (-)       IS1601-C
AAF63833.1    pstD     67088 - >71286 (+)      PstD
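The position strings in the gene list above (for example `47 - 1132 (+)` or `67088 - >71286 (+)`) can be turned into coordinates for simple checks such as gene length, partial genes, and overlaps between neighbours. The sketch below is one possible way to do this; the `Gene` type and the `parse_gene` and `overlap` helpers are hypothetical names introduced for illustration. It assumes coordinates are 1-based and inclusive, and that a `<` or `>` prefix marks a feature extending past the end of the sequenced region, as in GenBank location syntax.

```python
import re
from typing import NamedTuple


class Gene(NamedTuple):
    name: str
    start: int     # 1-based, inclusive (assumed)
    end: int       # 1-based, inclusive (assumed)
    strand: str    # "+" or "-"
    partial: bool  # True when a coordinate carries "<" or ">"

    @property
    def length(self) -> int:
        """Length in nucleotides of the annotated span."""
        return self.end - self.start + 1


# Matches strings such as "47 - 1132 (+)" or "67088 - >71286 (+)".
POSITION = re.compile(r"([<>]?)(\d+)\s*-\s*([<>]?)(\d+)\s*\(([+-])\)")


def parse_gene(name: str, position: str) -> Gene:
    """Parse one position string from the gene list into a Gene."""
    match = POSITION.fullmatch(position.strip())
    if match is None:
        raise ValueError(f"unrecognised position: {position!r}")
    start_flag, start, end_flag, end, strand = match.groups()
    return Gene(name, int(start), int(end), strand, bool(start_flag or end_flag))


def overlap(a: Gene, b: Gene) -> int:
    """Number of nucleotides shared by two genes (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


pst_a = parse_gene("pstA", "44047 - 54294 (+)")
pst_b = parse_gene("pstB", "54291 - 61949 (+)")
pst_d = parse_gene("pstD", "67088 - >71286 (+)")

for gene in (pst_a, pst_b, pst_d):
    print(gene.name, gene.length, "nt", "(partial)" if gene.partial else "")
print("pstA/pstB overlap:", overlap(pst_a, pst_b), "nt")
```

Under these assumptions, pstA and pstB share a few nucleotides at their junction, and pstD is flagged as partial because its end coordinate runs past the 71,286 nt region.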
NRP-specific information
Subclass: N/A
Cyclic: no
NRP-synthases
Gene    Module    Specificity      Evidence for specificity
pstA    ?         phenylalanine    Structure-based inference
pstA    ?         threonine        Structure-based inference
pstB    ?         alanine          Structure-based inference
pstB    ?         alaninol         Structure-based inference
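The module specificities listed above can be held in a small data structure and joined into a predicted monomer order. This is only a sketch: the entry gives the module numbers as "?", so the code assumes that the order in which modules are listed reflects the order of assembly, which the entry itself does not confirm. The names `MODULES`, `predicted_backbone`, and `modules_per_gene` are hypothetical.

```python
# (gene, substrate specificity) pairs in the order they appear in the entry.
# Assumption: listing order equals assembly order; module numbers are unknown.
MODULES = [
    ("pstA", "phenylalanine"),
    ("pstA", "threonine"),
    ("pstB", "alanine"),
    ("pstB", "alaninol"),
]


def predicted_backbone(modules: list[tuple[str, str]]) -> str:
    """Join the module specificities into a linear monomer string."""
    return "-".join(specificity for _, specificity in modules)


def modules_per_gene(modules: list[tuple[str, str]]) -> dict[str, list[str]]:
    """Group specificities by the NRPS gene that carries the module."""
    grouped: dict[str, list[str]] = {}
    for gene, specificity in modules:
        grouped.setdefault(gene, []).append(specificity)
    return grouped


print(predicted_backbone(MODULES))
print(modules_per_gene(MODULES))
```

The joined string is linear, in line with the "Cyclic? no" annotation in this section.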
Annotation changelog
MIBiG version 1.0
  • Submitter: Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given)
  • Notes: Submitted
MIBiG version 2.0
  • Submitter: Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given)
  • Notes: Migrated from v1.4
MIBiG version 3.0
  • Submitter: Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given)
  • Notes: Updated bioactivity data; Added NRP substrate specificities
MIBiG version 3.1
  • Submitter: Hidden contributor (ID: AAAAAAAAAAAAAAAAAAAAAAAA, no GDPR consent given)
  • Notes: Update chemical activity to schema version 2.11