Explanations
phrases related to someone being in a specific position or role
phrases that denote a position or role in various contexts
Negative Logits
Token      Logit
zhou       -0.66
acca       -0.64
adays      -0.64
ife        -0.64
sparing    -0.62
mates      -0.62
anie       -0.61
isan       -0.61
bots       -0.61
uria       -0.61
Positive Logits
Token        Logit
midst        0.91
same         0.90
forefront    0.88
wrong        0.88
equation     0.86
proverbial   0.84
jaws         0.84
doorway      0.83
deepest      0.82
category     0.79
Activations Density: 0.223%