INDEX
Explanations
adjectives expressing negative emotions such as disappointment, frustration, and exasperation
words related to emotional reactions or states of distress
Negative Logits
glas            -0.81
faire           -0.78
reorgan         -0.75
appropriated    -0.74
vernment        -0.73
©¶æ             -0.72
ascript         -0.72
livest          -0.72
guided          -0.71
lighter         -0.69
Positive Logits
ingly           0.99
Anger           0.78
Beir            0.78
ly              0.75
Dak             0.74
atically        0.72
LY              0.72
vu              0.71
ciating         0.71
Clown           0.71
Activations Density: 0.098%