INDEX
Explanations
the word "except" and its variations, indicating exclusions or exceptions
New Auto-Interp
Negative Logits
coni
-0.21
kir
-0.15
486
-0.15
kola
-0.14
ause
-0.14
(
-0.14
izza
-0.14
Lair
-0.14
FIX
-0.14
ftar
-0.14
POSITIVE LOGITS
ing
0.37
ting
0.19
ING
0.19
s
0.17
ed
0.17
ingly
0.16
reme
0.15
antly
0.15
edException
0.15
eur
0.15
Activations Density 0.011%