INDEX
Explanations
expressions of fear and anxiety
New Auto-Interp
Negative Logits
WriteTagHelper
-0.79
OGND
-0.77
EconPapers
-0.75
nahilalakip
-0.74
:✨
-0.72
McKe
-0.71
AddTagHelper
-0.69
ویکیآمباردا
-0.68
colspan
-0.64
protoimpl
-0.64
POSITIVE LOGITS
Fear
1.65
fear
1.62
FEAR
1.59
fears
1.58
Fear
1.50
fear
1.50
Fears
1.43
feared
1.32
fearful
1.23
afraid
1.23
Activations Density 0.066%