INDEX
Explanations
expressions of anger and related emotions
New Auto-Interp
Negative Logits
виправивши
-0.73
PyTuple
-0.66
hindurch
-0.66
dalamnya
-0.65
rimid
-0.65
EconPapers
-0.65
Landmark
-0.62
schmidt
-0.61
skapet
-0.61
quartered
-0.60
POSITIVE LOGITS
anger
2.67
angry
2.39
Anger
2.37
Angry
2.06
angry
2.05
Anger
2.04
Angry
2.03
rage
1.88
angered
1.76
angrily
1.72
Activations Density 0.105%