INDEX
Explanations
occurrences of the word "grim" or its variations in the context of negative situations
Negative Logits
aurus     -0.08
_CLIP     -0.07
elian     -0.07
oun       -0.07
asurer    -0.07
ียบ       -0.07
esis      -0.06
ylko      -0.06
roll      -0.06
use       -0.06
Positive Logits
linger    0.08
ness      0.08
elda      0.08
dest      0.07
aces      0.07
eton      0.07
acing     0.07
ities     0.07
lich      0.06
shaw      0.06
Activations Density 0.005%