INDEX
Explanations
themes related to fear and its psychological impacts
Negative Logits
Token         Logit
ätz           -0.18
Fault         -0.17
Forgery       -0.16
tring         -0.15
irritating    -0.14
icide         -0.14
unut          -0.14
oday          -0.14
Fault         -0.14
ãĥŃãĥ³        -0.14
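The token `ãĥŃãĥ³` in the list above looks like a byte-level BPE rendering rather than readable text. The page does not name the tokenizer, so the following is only a sketch under the assumption that the token uses the GPT-2 style byte-to-unicode mapping; the helper names are hypothetical. The other tokens appear to be shown already decoded and should not be passed through it.

```python
def bytes_to_unicode():
    """Rebuild the GPT-2 style byte -> printable character table (assumed mapping)."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    offset = 0
    # Bytes without a printable form are shifted to code points 256 and up.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + offset)
            offset += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_byte_level_token(token):
    """Map each displayed character back to its byte, then decode as UTF-8."""
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_byte_level_token("ãĥŃãĥ³"))  # prints: ロン
```

If the assumption holds, the token is the katakana sequence ロン. If the vocabulary literally contains the string as displayed, the raw form is the correct one to keep.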
Positive Logits
Token         Logit
fear          0.74
Fear          0.65
Fear          0.62
fears         0.58
afraid        0.57
fearful       0.51
fearing       0.49
scared        0.48
feared        0.48
terrified     0.47
Activations Density 0.328%
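An activation density such as the one above is commonly read as the share of tokens on which the feature fires. The page does not state its exact definition or threshold, so this is a minimal sketch assuming "active" means an activation strictly above zero; the function name and the sample values are made up for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds the threshold (assumed definition)."""
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1000 tokens, of which 3 activate the feature.
sample = [0.0] * 997 + [1.2, 0.4, 3.1]
print(f"{activation_density(sample):.3%}")  # prints: 0.300%
```

Under this reading, a density of 0.328% would correspond to roughly 3 active tokens per 1000.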