INDEX
Explanations
expressions of strong emotions, particularly anger and frustration
New Auto-Interp
Negative Logits
DAOImpl
-0.38
interess
-0.35
createState
-0.34
")]
-0.32
忑
-0.31
ník
-0.31
("")]
-0.31
fragility
-0.31
disease
-0.30
сыз
-0.30
POSITIVE LOGITS
angry
1.04
anger
1.02
Angry
0.96
angrily
0.95
colère
0.94
Anger
0.93
Anger
0.93
angry
0.88
Angry
0.88
wrath
0.88
Activations Density 0.554%