INDEX
Explanations
occurrences of the word "even."
NEGATIVE LOGITS
chwitz     -0.18
wort       -0.16
chin       -0.15
èm         -0.15
xies       -0.15
conte      -0.14
ocaly      -0.14
undra      -0.14
quate      -0.14
dit        -0.14
POSITIVE LOGITS
wel        0.24
-handed    0.19
ness       0.19
slightest  0.16
though     0.16
flo        0.16
quiry      0.16
zo         0.16
398        0.15
Though     0.15
Activations Density 0.066%