Explanations
negative sentiments or phrases that indicate a lack of positivity
Negative Logits
anan        -0.17
ãģıãĤī      -0.17
-looking    -0.16
mith        -0.16
-ÑĤо        -0.15
ije         -0.14
addon       -0.14
ا           -0.14
heed        -0.14
rap         -0.14
Positive Logits
/+          0.25
/-          0.25
webkit      0.21
jekt        0.17
ÂĢÂ         0.17
+-+-        0.15
egin        0.14
vs          0.14
_-          0.14
vironment   0.14
Activations Density 0.132%