INDEX
Explanations
occurrences of the word "as."
New Auto-Interp
Negative Logits
bef
-0.16
Might
-0.15
thinkable
-0.15
.assertThat
-0.14
nown
-0.14
ever
-0.14
ër
-0.14
atIndex
-0.14
öt
-0.13
avier
-0.13
POSITIVE LOGITS
far
0.28
much
0.23
far
0.20
long
0.20
corn
0.19
ides
0.19
long
0.19
soon
0.19
oppose
0.18
much
0.17
Activations Density 0.085%