Explanations
specific instances of the word "the"
occurrences of the word "the."
Negative Logits
Token      Logit
bg         -0.82
Ò          -0.79
thereby    -0.76
thood      -0.74
tec        -0.73
ceive      -0.71
arate      -0.71
pection    -0.70
cpu        -0.69
visory     -0.69
Positive Logits
Token      Logit
oret       1.23
biggest    1.11
simplest   1.10
resa       1.08
slightest  1.08
greatest   1.06
sheer      1.06
easiest    1.05
smartest   1.04
notion     1.04
Activations Density 0.621%