INDEX
Explanations
negative sentiment or criticism in the text
Negative Logits
uku    -0.15
ülü    -0.14
rue    -0.14
959    -0.13
_fds   -0.13
isons  -0.13
onds   -0.13
672    -0.13
ukkan  -0.13
simp   -0.13
Positive Logits
opup       0.15
leh        0.15
disposing  0.15
èģĺ        0.14
ingu       0.14
ongo       0.14
cad        0.14
tainment   0.14
걸         0.14
(JS        0.13
Activations Density 0.015%
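
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number is commonly computed: the fraction of token positions at which the feature fires. The function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical; the page does not say how its own value was produced, and the 3-in-20,000 example is made up only to show what a density of 0.015% looks like.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    `activations` is a flat list of per-token activation values for one
    feature. Both the name and the threshold default are assumptions made
    for this illustration.
    """
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data (invented): the feature fires on 3 of 20,000 token positions.
toy = [0.0] * 20000
toy[5] = 1.2
toy[777] = 0.4
toy[19999] = 3.1

density = activation_density(toy)
print(f"{density:.3%}")  # prints 0.015%
```

At this density the feature would be silent on the vast majority of tokens, which is typical for sparse features.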