Explanations
references to marijuana and its various forms
Negative Logits
kova    -0.18
aft     -0.17
ritch   -0.16
ÑĢеж    -0.15
frey    -0.15
loads   -0.14
ruit    -0.14
olon    -0.14
Freder  -0.14
resi    -0.14
Positive Logits
juana   0.22
etta    0.22
insky   0.20
mari    0.19
iage    0.18
Mari    0.18
iaux    0.17
ette    0.17
enburg  0.17
itime   0.17
Activations Density 0.008%