INDEX
Explanations
negative evaluations of situations or places
New Auto-Interp
Negative Logits
_nl
-0.16
ugi
-0.15
idak
-0.15
POLITICO
-0.14
Leather
-0.14
دارÛĮ
-0.14
Bid
-0.14
igram
-0.14
Clay
-0.14
roti
-0.14
POSITIVE LOGITS
ken
0.16
imes
0.15
mit
0.15
kel
0.14
ensitive
0.14
еÑĩ
0.14
109
0.14
cho
0.13
èģ
0.13
has
0.13
Activations Density 0.160%