INDEX
Explanations
instances of the word "one" in various forms and contexts
Negative Logits
Token       Logit
relude      -0.17
greater     -0.17
greater     -0.16
llx         -0.16
_greater    -0.16
ouce        -0.15
Greater     -0.15
osci        -0.15
011         -0.15
alyze       -0.15
Positive Logits
Token       Logit
one         0.23
of          0.23
among       0.20
из          0.19
salah       0.18
of          0.17
one         0.16
amongst     0.16
half        0.16
Of          0.16
Activations Density: 0.050%