INDEX
Explanations
references to fear and anxiety in various contexts
New Auto-Interp
Negative Logits
ennen
-0.16
ätz
-0.16
croll
-0.14
frustr
-0.14
frustrating
-0.14
ovat
-0.14
zcze
-0.13
Cros
-0.13
егоÑĢ
-0.13
drift
-0.13
POSITIVE LOGITS
fear
0.65
Fear
0.60
Fear
0.59
fears
0.53
afraid
0.53
fearful
0.52
terrified
0.52
scared
0.51
nervous
0.48
panic
0.47
Activations Density 0.910%