INDEX
Explanations
negative emotions or sentiments relating to confidence and self-worth
Negative Logits
expandindo        -0.50
dafx              -0.40
تقاوى             -0.37
گاب               -0.36
ilton             -0.36
виправивши        -0.35
magist            -0.35
®                 -0.33
anthin            -0.32
-------------</   -0.32
POSITIVE LOGITS
whatsoever        1.89
affatto           1.47
vůbec             1.30
כלל               1.17
alls              1.05
вовсе             1.03
เลย               1.00
absoluto          1.00
soever            0.97
вообще            0.97
Activations Density: 0.327%