INDEX
Explanations
phrases related to emotional struggle and pain
New Auto-Interp
Negative Logits
enthal
-0.72
accordingly
-0.70
Austral
-0.68
prohibiting
-0.67
policy
-0.66
Fair
-0.65
caution
-0.63
recommends
-0.62
farious
-0.62
priority
-0.59
POSITIVE LOGITS
oneself
0.95
surrounded
0.95
numb
0.85
immersed
0.84
strangers
0.80
somebody
0.79
someone
0.79
someone
0.76
suddenly
0.74
loved
0.72
Activations Density 0.416%