Analysis of the NMI01 marker for a population database of cannabis seeds

We have analyzed the distribution of genotypes at a single hexanucleotide short tandem repeat (STR) locus in a Cannabis sativa seed database along with seed-packaging information. This STR locus is defined by the polymerase chain reaction amplification primers CS1F and CS1R and is referred to as NMI01 (for National Marijuana Initiative) in our study. The population database consists of seed seizures in two categories: seed samples from packages that are labeled with a seed bank source and seed samples from packages that are not. In a population database of 93 processed seeds, including 12 labeled Cannabis varieties, the genotypes generated from single seeds exhibited between one and three peaks (potentially six alleles if in homozygous state). The total number of observed genotypes was 54, making this marker highly specific and highly individualizing even among seeds of common lineage. Cluster analysis associated many, but not all, of the hand-labeled seed varieties tested to date, as well as the National Park seizure, with our known reference database, which contains commercially packaged reference samples from Mr. Nice Seedbank and Sensi Seeds.
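The abstract does not say how the genotypes were tallied, so the following is only a minimal sketch of one way to count distinct genotypes at a single locus. It assumes each seed's genotype is recorded as a set of peak sizes. The function names and the peak sizes in `toy_seeds` are hypothetical and are not data from the study. For the figures reported above, the same ratio would be 54 distinct genotypes over 93 seeds, or about 0.58.

```python
from collections import Counter


def normalize(peaks):
    """Return a genotype as a sorted tuple of distinct peak sizes.

    Sorting makes (216, 210) and (210, 216) compare equal.
    """
    return tuple(sorted(set(peaks)))


def summarize(genotypes):
    """Count seeds, distinct genotypes, and their ratio for one locus."""
    counts = Counter(normalize(g) for g in genotypes)
    n_seeds = sum(counts.values())
    n_distinct = len(counts)
    ratio = n_distinct / n_seeds if n_seeds else 0.0
    return {"seeds": n_seeds, "distinct": n_distinct, "ratio": ratio}


# Hypothetical peak sizes (in base pairs), one entry per seed, with one to
# three peaks each. These values are invented for illustration only.
toy_seeds = [(210, 216), (216, 210), (210,), (210, 216, 222), (228,)]
summary = summarize(toy_seeds)
print(summary)
```

Storing each genotype as a sorted tuple keeps the comparison independent of the order in which peaks were called, which matters when the same seed lot is scored more than once.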

Phylos Galaxy

Search the largest evolutionary map of cannabis genetic insights for thousands of varieties from over 80 countries.

We believe in Open Data.

We believe the foundational knowledge of the cannabis genome can be advanced and better understood by sharing this incredible data we have collected for the Galaxy.

As of January 10, 2022, our latest (and last) Galaxy public dataset has been published in the European Bioinformatics Institute (EMBL-EBI) archives. The European Variation Archive (EVA) is an open-access database of all types of genetic variation data from all species. Download the Phylos-collected public genotype data.