INDEX
Explanations
mentions of cigarettes
references to cigarettes and their various contexts
Negative Logits
Token     Logit
hip       -1.13
hips      -1.13
paces     -1.03
ourcing   -1.00
ourced    -0.92
ettings   -0.88
cale      -0.83
olving    -0.80
peak      -0.80
terday    -0.79
Positive Logits
Token     Logit
holder    0.95
holders   0.88
brush     0.83
Worker    0.74
holder    0.73
bott      0.73
pole      0.69
coaster   0.69
belt      0.69
peel      0.65
Activations Density 0.023%