Explanations
negative emotions or actions such as neglect, contempt, disdain, and scorn
negative sentiments related to neglect and contempt
Negative Logits
misunder    -0.63
helicop     -0.61
Mehran      -0.59
livest      -0.58
TIT         -0.58
>>>>>>>>    -0.58
anecd       -0.57
skating     -0.57
toget       -0.57
Kurd        -0.56
Positive Logits
ful         2.65
fully       2.33
FUL         1.80
fulness     1.78
full        1.55
ible        1.46
eful        1.32
iful        1.31
uously      1.30
ingly       1.28
Activations Density 0.103%