INDEX
Explanations
expressions of dissatisfaction or negative sentiments about situations
New Auto-Interp
Negative Logits
ÃŃn
-0.15
ereum
-0.14
rett
-0.14
Sharper
-0.14
bitrary
-0.14
inality
-0.14
uliar
-0.13
ansa
-0.13
Elias
-0.13
_visibility
-0.13
POSITIVE LOGITS
Optim
0.19
optimism
0.18
optim
0.18
optimistic
0.18
optim
0.18
happy
0.17
AGO
0.17
ä¹IJ
0.17
happy
0.16
hopeful
0.16
Activations Density 0.002%