CB1 CB1
Open Standard -- Version 1.0

CB1 Token Schema

The first structured, research-grounded data standard for cannabis product intelligence.

6,200+
Compounds
20
Effect Dimensions
20
Customer Needs
30+
Receptor Targets
10
Conformance Req
Section 1

Introduction

Cannabis is the only regulated consumer product category without a structured compound intelligence standard. Certificates of Analysis -- the documents that define what is in a cannabis product -- exist as unstructured PDFs with no interoperability, no scoring methodology, and no machine-readable format.

The CB1 Token Schema defines a structured, versioned, research-grounded data object called a Product Intelligence Token. This token transforms raw COA data into actionable intelligence: effect predictions, consumer need matching, compound interaction analysis, and multi-audience content -- all anchored to public blockchain infrastructure for provenance verification.

1.1 Scope

This specification defines the structure, data types, required fields, optional fields, compound taxonomy reference, effect scoring methodology, customer need mapping, and provenance chain requirements for a CB1 Product Intelligence Token.

This specification does not define how tokens are generated, how they are stored, or how they are displayed. It defines what a token contains and how that content is structured.

1.2 Terminology

TermDefinition
TokenA Product Intelligence Token: the structured data object defined by this schema.
COACertificate of Analysis. The source laboratory document from which a token is derived.
CompoundA cannabinoid or terpene identified in the COA and tracked in the CB1 Compound Taxonomy.
Effect DimensionOne of 20 scored dimensions representing a predicted physiological or psychological effect.
Customer NeedOne of 20 consumer use-case categories derived from weighted effect dimension scores.
ProvenanceCryptographic proof that a token's immutable data has not been modified since creation.
AnchorThe act of recording a cryptographic hash of token data on a blockchain.

1.3 Design Principles

#PrincipleDescription
1Lab-verified data onlyTokens are derived from licensed laboratory COAs, not user-submitted data.
2Deterministic scoringGiven identical compound inputs, the scoring engine must produce identical outputs. No LLM in the scoring loop.
3Immutable provenanceThe COA hash and compound profile hash are anchored to a public ledger at creation and cannot be retroactively modified.
4Mutable intelligenceEffect scores and narratives may be regenerated as the taxonomy evolves. Provenance covers source data, not derived intelligence.
5Multi-audienceA single tokenization event produces intelligence for laboratories, brands, retailers, and consumers simultaneously.
Section 2

Token Structure

A conforming Product Intelligence Token MUST contain the following top-level objects.

2.1 Root Object

FieldTypeReqDescription
token_idUUID v4REQUIREDGlobally unique identifier.
schema_versionstringREQUIREDSchema version. Value: "1.0".
created_atISO 8601REQUIREDTimestamp of token creation.
sourceobjectREQUIREDCOA source metadata.
cannabinoidsarrayREQUIREDCannabinoid compound objects.
terpenesarrayREQUIREDTerpene compound objects.
contaminantsarrayOPTIONALContaminant test results.
metadataobjectREQUIREDComputed summary values.
intelligenceobjectREQUIREDEffect scoring and need mapping.
narrativesobjectOPTIONALMulti-audience generated content.
blockchain_provenanceobjectREQUIREDImmutable provenance anchors.

2.2 Source Object

FieldTypeReqDescription
batch_idstringREQUIREDLaboratory batch or sample identifier.
strain_namestringOPTIONALProduct or strain name from COA.
sample_typeenumREQUIREDFLOWER, CONCENTRATE, VAPE, EDIBLE, TINCTURE, BEVERAGE, CAPSULE, TOPICAL, PRE_ROLL, INFUSED_PRE_ROLL, LIVE_RESIN, LIVE_ROSIN, DISTILLATE, RSO, HASH.
lab_namestringREQUIREDName of the testing laboratory.
lab_licensestringOPTIONALLaboratory license number.
statestringREQUIREDUS state code (2-letter).
test_dateISO 8601REQUIREDDate the COA was issued.
Section 3

Intelligence Output

The intelligence object contains the output of the deterministic scoring engine. All scores are floating-point values between 0.0 and 1.0.

3.1 Effect Profile (20 Dimensions)

A conforming token MUST include an effect_profile object with exactly 20 keys. The scoring methodology is deterministic: given identical compound inputs, the scoring engine MUST produce identical outputs.

KeyLabelGroup
relaxationRelaxationExperience
euphoriaEuphoriaExperience
energyEnergyExperience
focusFocusExperience
creativityCreativityExperience
sedationSedationExperience
sensory_enhancementSensory EnhancementExperience
introspectionIntrospectionExperience
social_easeSocial EaseExperience
arousalArousalExperience
stress_reliefStress ReliefWellness
mood_elevationMood ElevationWellness
muscle_relaxationMuscle RelaxationWellness
anti_inflammationAnti-InflammationWellness
anti_nauseaAnti-NauseaWellness
pain_reliefPain ReliefTherapeutic
neuroprotectionNeuroprotectionTherapeutic
anti_spasmAnti-SpasmTherapeutic
appetite_stimulationAppetite StimulationTherapeutic
appetite_suppressionAppetite SuppressionTherapeutic

3.2 Customer Need Map (20 Categories)

A conforming token MUST include a customer_need_map object with exactly 20 keys. Each need score is derived from a weighted combination of effect dimension scores.

KeyLabelPrimary Effect Drivers
sleepSleepsedation, relaxation
anxietyAnxiety Reliefstress_relief, relaxation
chronic_painPain Managementpain_relief, anti_inflammation
recreationRecreationeuphoria, social_ease, creativity
focus_workFocus / Productivityfocus, energy
socialSocialsocial_ease, euphoria
nausea_giNausea / GIanti_nausea, appetite_stimulation
migraine_headacheMigrainepain_relief, anti_inflammation
inflammation_arthritisInflammationanti_inflammation, pain_relief
muscle_recoveryMuscle Recoverymuscle_relaxation, anti_inflammation
mood_depressionMood / Depressionmood_elevation, euphoria
stress_ptsdStress / PTSDstress_relief, relaxation
appetite_managementAppetite Controlappetite_suppression, energy
seizure_epilepsySeizure Supportanti_spasm, neuroprotection
neurological_supportNeurologicalneuroprotection, anti_inflammation
skin_wellnessSkin Wellnessanti_inflammation
cancer_supportCancer Supportanti_nausea, pain_relief
metabolic_healthMetabolic Healthappetite_suppression, anti_inflammation
intimacyIntimacyarousal, sensory_enhancement
meditationMeditationintrospection, relaxation
Section 4

Compound Taxonomy Reference

CB1 Compound Taxonomy v2.1 is the scientific foundation of the scoring engine. All compound names in a conforming token MUST use canonical names defined in this taxonomy.

Research Database Extension
Beyond the 41 curated compounds below, the CB1 intelligence layer draws on 6,200+ research compounds from the Cannabis Compound Database (University of Alberta) with 256 compound-protein receptor binding links. Compounds not in the curated taxonomy are dynamically scored via the receptor-effect mapping matrix with a 0.85x inference discount.

4.1 Cannabinoids (11)

Canonical NameClassificationPrimary Receptor Affinities
thcMajorCB1, CB2
cbdMajor5-HT1A, TRPV1, GPR55, PPAR-y
thcaMajorTRPA1, TRPM8
cbdaMajor5-HT1A, COX-2
cbgMinorCB1, CB2, 5-HT1A
cbgaMinorPPAR-y
cbnMinorCB1, CB2, TRPV2
cbcMinorTRPV1, TRPA1
thcvMinorCB1, CB2
delta-8-thcMinorCB1, CB2
cbdvMinorTRPV1

4.2 Terpenes (30)

Canonical NamePrimary ReceptorAroma Profile
myrceneTRPV1, GABA-Aearthy, musky, herbal
limonene5-HT1A, A2Acitrus, lemon, orange
linaloolGABA-Afloral, lavender
alpha-pineneAChE inhibitorpine, fresh, forest
beta-pineneAChE inhibitorpine, woody, herbal
beta-caryophylleneCB2peppery, spicy, woody
humuleneTRPV1earthy, woody, hoppy
terpinolene5-HT1Afloral, piney, herbal
ocimene--sweet, herbal, woody
bisabololTRPM8floral, chamomile
nerolidol--woody, floral, citrus
guaiol--piney, woody, rose
camphene--camphor, herbal, woody
geraniol--rose, floral, citrus
valencene--citrus, sweet, woody

Additional terpenes (15): fenchol, borneol, terpineol, cis-nerolidol, trans-nerolidol, alpha-terpinene, gamma-terpinene, para-cymene, eucalyptol, isopulegol, sabinene, alpha-phellandrene, delta-3-carene, alpha-cedrene, farnesene.

4.3 Receptor Systems (13 Mapped)

The schema maps 13 primary receptor systems to body map positions. The research database tracks 124 unique protein targets across 256 compound-protein binding links from peer-reviewed studies.

ReceptorSystem
CB1Central Nervous System
CB2Immune and Peripheral
5-HT1ASerotonin
TRPV1Pain / Heat
GABA-ACalming
GPR55Novel Cannabinoid
PPAR-yMetabolic
AChEMemory / Cholinergic
Adenosine A2ADopamine Modulation
TRPV2Immune Sensing
TRPA1Nociceptive
TRPM8Thermosensory
COX-2Inflammatory Pathway
Section 5

Narrative Content

A conforming token MAY include a narratives object containing generated text content derived from the token's compound data and intelligence output. Narrative content is audience-specific and SHOULD be regenerated when the scoring methodology or research corpus is updated.

AudienceFieldPurpose
Consumerquick_learn60-second consumer summary of what to expect.
Consumerdeep_diveResearch-grounded explanation of compound profile and effects.
Retailmerch_copyMenu description for dispensary product listings.
Retailbudtender_tipsTalking points for budtender-customer conversations.
Wholesalesales_pitchWholesale pitch positioning product for dispensary buyers.
Section 6

Blockchain Provenance

A conforming token MUST include a blockchain_provenance object documenting the cryptographic anchoring of immutable token data to one or more public distributed ledgers.

6.1 What Is Anchored (Immutable)

SHA-256 hash of the original COA PDF document. SHA-256 hash of the canonical compound profile JSON (sorted by compound name, values truncated to 2 decimal places). Token ID, creating organization ID, and creation timestamp.

6.2 What Is NOT Anchored (Mutable)

Effect scores, customer need scores, and interaction flags. Narrative content. Brand assignments, token status, and consumer-facing metadata. These are intelligence outputs that improve over time -- anchoring them would freeze the product.

6.3 Chain Requirements

Hedera HCS
Scientific Provenance

COA hash, compound profile hash, token ID, org ID, timestamp. Append-only topic. Governed by 39 Fortune 500 council members.

REQUIRED
Ethereum L1
Legal Anchor

Weekly Merkle root of all tokens created in the period. Court-admissible provenance.

OPTIONAL
XRPL
Financial Compliance

XLS-20 NFToken for ownership chain, clawback, and global freeze capabilities.

OPTIONAL

6.4 Verification

Any party with access to a Hedera mirror node or explorer can independently verify that a token's COA and compound profile have not been modified since creation, without relying on CB1's API or infrastructure.

Section 7

Token Lifecycle

A Product Intelligence Token progresses through a governed lifecycle that establishes chain-of-custody from laboratory creation through consumer access.

1
Laboratory
Upload COA

Token minted. Compounds extracted and normalized. Effect scoring runs. Hedera HCS anchor fires. Lab org stamped.

minted
2
Laboratory
Assign to Brand

Brand org ID linked. Brand receives token in their portal inbox.

pending
3
Brand
Accept

Brand confirms the token represents their product.

accepted
4
Brand
Publish

Token visible to retailers and consumers. QR code active.

published
5
Retailer
Display

Token data flows to POS systems, menus, and budtender tools.

6
Consumer
Scan / View

Full intelligence page accessible via QR code or direct link.

Section 8

Conformance Requirements

A system that produces Product Intelligence Tokens conforming to this specification MUST satisfy the following requirements:

  1. Tokens MUST contain all fields marked REQUIRED in this specification.
  2. Compound names MUST use canonical names from CB1 Compound Taxonomy v2.1.
  3. Effect scoring MUST be deterministic: identical compound inputs MUST produce identical effect_profile outputs.
  4. effect_profile MUST contain exactly 20 keys as defined in Section 3.1.
  5. customer_need_map MUST contain exactly 20 keys as defined in Section 3.2.
  6. All effect and need scores MUST be floating-point values between 0.0 and 1.0.
  7. Tokens MUST be anchored to at least one public distributed ledger with a SHA-256 hash of the COA PDF and compound profile JSON.
  8. Token source data MUST originate from a Certificate of Analysis issued by a licensed testing laboratory.
  9. Narrative content MUST NOT make medical claims. All content is educational and informational.
  10. Systems implementing this schema MUST declare the schema_version in every token produced.