INDEX
Explanations
the word "as" in various contexts
New Auto-Interp
Negative Logits
iyim
-0.14
orts
-0.14
igrations
-0.14
allerdings
-0.14
ondon
-0.14
ippers
-0.14
ably
-0.14
sys
-0.13
omens
-0.13
aç
-0.13
POSITIVE LOGITS
there
0.28
opposed
0.23
there
0.23
none
0.21
regards
0.19
There
0.19
it
0.18
oppose
0.17
There
0.17
unlike
0.17
Activations Density 0.100%