The two main cannabinoids CBDA and THCA characterizing hemp- and drug-type varieties are produced in a biosynthetic reaction catalyzed by the enzymes CBDA and THCA synthase, which compete for the same substrate cannabigerolic acid (CBGA) [reviewed in (
8)]. The two synthases are encoded by the genes
CBDAS and
THCAS, which belong to the berberine bridge enzyme (BBE)–like multigene family, from which they possibly arose by duplication and neofunctionalization [reviewed in (
41)]. When involved in secondary metabolism, the homologs of these genes likely play a major role in chemical plant defense (
8). Confirming earlier genetic studies, recent genome assemblies showed that
CBDAS and
THCAS (and their multiple pseudogenic copies) lie scattered within closely linked loci, in a retrotransposon-rich, highly repetitive region of the genome with suppressed recombination, and with a history of extensive rearrangement and tandem duplication/pseudogenization events (
4,
16–
19). Using strict filtering criteria, we mapped the reads of the 104 analyzed genomes to a reference CBDA/THCA hybrid cultivar genome [Jamaican Lion DASH (
42)], in which full-length coding sequences for
THCAS,
CBDAS, and more than 30 pseudogene copies of these genes are assembled. The results (
Fig. 3A) show that all marijuana cultivars from the Drug-type genetic group D always map a complete coding sequence for
THCAS and two
CBDAS pseudogenes (with 93 to 94% similarity to the full
CBDAS; pseudogenes 1 and 2 in
Fig. 3A; see Materials and Methods), with the exception of only five samples that also map a full
CBDAS gene. Conversely, within the Hemp-type genetic group B constituted of plants selected for fiber production, all accessions only map a complete sequence for
CBDAS, with the exception of nine samples (mostly landraces;
Fig. 3B) that either map both genes and the
CBDAS pseudogenes or map
THCAS and the
CBDAS pseudogenes. The main pattern inferred from our comparative analysis confirms previous structural data based on full genome sequencing of single cultivars (
18,
19). It is also consistent with published chemotype inheritance models validated among a wide variety of
Cannabis accessions (
16,
17,
20,
43,
44), thus providing complementary evidence for the latter at the genomic sequence level and global validation across a comprehensive panel of
Cannabis domestication types distributed worldwide. Although our results would require confirmation with associated phenotypic or expression data, they nevertheless provide support for a genetic model of inheritance based on
CBDAS genotyping (
20), in which plants that are homozygous for functional or nonfunctional alleles of
CBDAS have the CBD-type or THC-type chemotype, respectively, whereas plants that are heterozygous have the intermediate-type chemotype (consistent with codominant Mendelian inheritance due to the documented physical linkage of the two synthase genes). The occurrence of five samples mapping full
THCAS and two
CBDAS pseudogenes (i.e., with a presumed THC chemotype) nested within the Hemp-type genetic group and, more generally, the scattered phylogenetic clustering of synthase gene combination (i.e., of more than one presumed chemotype class) across the Hemp-type and Drug-type genetic groups provide a compelling argument for the independence of cannabinoid synthase inheritance from a multitude of other positively selected traits differentiating fiber-type from drug-type
Cannabis [see also the high-CBDA cultivar CBDRx, which has full
CBDAS and lacks full
THCAS (i.e., CBD chemotype) but clusters genetically among marijuana cultivars;
figure 1 in (
18)]. As such, the results call into question, from both a biological and functional point of view, the current binary categorization of
Cannabis plants as “hemp” or “marijuana” derived from the assignment to a single phenotype [see also (
20)].
In contrast with these results, samples belonging to the Basal cannabis group (and to a lesser extent to the Drug-type feral group) show a more variable pattern, with the presence of one or another synthase gene, or co-occurrence. Overall, our results point to a loss of complete coding
THCAS or
CBDAS sequence during intensive and recent selection for increased fiber production or psychoactive properties, respectively (
Fig. 3B). They suggest the ancestral possession of both genes in a functional state, a polymorphic condition before or during the early stages of domestication with loss of function of one of the two synthase genes, and the extensive loss of full
THCAS in hemp-type and
CBDAS in drug-type cultivars due to strong selection for beneficial crop phenotypes (
Fig. 3, A and B).
The pseudogenization of
CBDAS and exclusive presence of full
THCAS in marijuana cultivars are consistent with artificial selection of high THCA synthesis through the suppression of competition between the two synthase enzymes for their common substrate CBGA [
Fig. 3B; (
45,
46)], possibly also because CBDA synthase has been shown to be a superior competitor for CBGA when both synthases are present (
17). The predominant occurrence of
CBDAS and loss of function of
THCAS in hemp types, by contrast, is more puzzling. Our analysis of transcriptomics data (
47) from a cultivar having both synthase genes and the two
CBDAS pseudogenes reveals that the expression level of
CBDAS is always significantly higher than that of
THCAS, although both are expressed in all tissues and vegetative stages (
Fig. 3C). A functional
CBDAS does not seem a prerequisite for good quality fiber production in hemp [e.g., hemp cultivar Santhica 27, lacking both synthase genes (FSA in
Fig. 3A) and known to mostly produce CBGA (
48)], but it is plausible that CBDA-synthase activity (and/or the corresponding loss of that of THCA synthase) may have allowed increased bast fiber production via a physiological trade-off. Although such a trade-off might appear unlikely, it would resonate with the known role played not only in plant defense but also in the processes of cell wall biosynthesis and/or immunity by the primordial BBE-like enzymes from which cannabinoids evolved (
49,
50). Of course, the loss of full
THCAS sequence observed in modern hemp types may also simply reflect selective breeding of varieties with very low levels of THCA licensed for cultivation.
Add Comment