Explanations
references to smoke
references to smoke in various contexts
Negative Logits
Token        Logit
Volt         -0.72
":["         -0.69
HCR          -0.66
*/(          -0.65
mathematic   -0.64
eanor        -0.62
onomous      -0.60
emort        -0.59
irlf         -0.58
Vanguard     -0.58

Positive Logits
Token        Logit
smoke        1.10
Smoke        1.05
crow         0.84
smokes       0.82
lake         0.82
haze         0.81
smoked       0.81
pipe         0.80
creen        0.78
smoking      0.78
Activations Density: 0.011%