INDEX
Explanations
occurrences of the word "everything" and variations, indicating a focus on comprehensive or overarching statements
Negative Logits
atak    -0.15
side    -0.15
aktu    -0.14
omik    -0.14
ijd     -0.14
immer   -0.14
ibr     -0.14
ps      -0.13
ulumi   -0.13
unate   -0.13
Positive Logits
else    0.52
else    0.34
Else    0.29
ELSE    0.29
Else    0.28
_else   0.28
else    0.27
except  0.25
ELSE    0.23
/e      0.23
Activations Density: 0.035%