INDEX
Explanations
emphatic affirmations or the word "absolutely" in various contexts
Negative Logits
ison      -0.17
mie       -0.17
ways      -0.16
pty       -0.15
retty     -0.15
ponder    -0.15
wald      -0.15
sdale     -0.15
la        -0.15
lems      -0.14
Positive Logits
positively    0.23
-таки         0.18
olutely       0.18
certain       0.17
-zero         0.17
certainty     0.17
correct       0.16
posit         0.16
utely         0.16
monarchy      0.16
Activations Density 0.020%
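
To get a feel for how sparse a density of 0.020% is, the figure can be turned into a rate. This is a minimal sketch that assumes the density is the fraction of sampled tokens on which the feature has a nonzero activation. That definition is an assumption, not something stated on this page, and the function names and the 1,000,000-token sample size are hypothetical, chosen for illustration only.

```python
def tokens_per_activation(density_percent: float) -> float:
    """Average number of tokens between activations, given a density in percent."""
    # Convert the percentage to a plain fraction (0.020% -> 0.0002).
    fraction = density_percent / 100.0
    return 1.0 / fraction


def expected_activations(density_percent: float, n_tokens: int) -> float:
    """Expected count of activating tokens in a sample of n_tokens."""
    return n_tokens * density_percent / 100.0


# Density reported above; the sample size is a hypothetical value.
density_percent = 0.020
spacing = tokens_per_activation(density_percent)
count = expected_activations(density_percent, 1_000_000)

print(f"about one activation every {round(spacing)} tokens")
print(f"about {round(count)} activating tokens per 1,000,000")
```

Under that assumption the feature fires on roughly one token in 5,000, which fits an explanation as narrow as a single emphatic word.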