INDEX
Explanations
mentions of cannabis or marijuana-related terms
references to marijuana or cannabis culture
Negative Logits
Token        Logit
livest       -0.75
simultane    -0.72
IGH          -0.70
obser        -0.69
Hitman       -0.66
silence      -0.64
distingu     -0.63
-+           -0.61
Prosecut     -0.61
mosqu        -0.61
Positive Logits
Token        Logit
pour         1.36
tery         1.29
luck         1.21
assium       1.12
atoes        1.04
ting         1.04
bell         1.03
tering       0.98
entially     0.95
atos         0.93
Activation Density: 0.019%