Explanations
statements emphasizing an unexpected or extreme outcome or circumstance
instances of the word "even"
Negative Logits
rend              -0.74
idon              -0.68
alogy             -0.64
_-                -0.63
plex              -0.63
eme               -0.62
ugal              -0.60
=-=-=-=-=-=-=-=-  -0.60
ymph              -0.60
rog               -0.60
Positive Logits
remotely    1.13
bother      0.78
mention     0.76
bothered    0.75
bothering   0.74
slightest   0.71
mentioning  0.70
outright    0.69
THING       0.66
handed      0.66
Activations Density: 0.033%