INDEX
Explanations
mentions of the word "Sm" followed by either a numerical value of 9 or 10
mentions of the "Smoker" brand or related terminology
New Auto-Interp
Negative Logits
ARCH
-0.72
orial
-0.70
aga
-0.68
bishop
-0.66
orus
-0.65
confir
-0.64
yrinth
-0.63
oire
-0.63
Cerberus
-0.63
agos
-0.63
POSITIVE LOGITS
Sm
3.68
Sm
2.54
sm
1.89
sm
1.60
Smoke
1.52
SM
1.44
Smash
1.38
Smile
1.37
Smoking
1.35
Sn
1.24
Activations Density 0.020%