INDEX
Explanations
significant instances of negation or emphasis in relation to opinion and argumentation
New Auto-Interp
Negative Logits
OWER
-0.15
lest
-0.14
ije
-0.14
Does
-0.13
ODO
-0.13
enci
-0.13
REFERENCES
-0.13
ÙĨÙĩ
-0.13
ISCO
-0.13
Ala
-0.13
POSITIVE LOGITS
æĺ¯æĪij
0.42
is
0.40
æĺ¯
0.36
adalah
0.35
æĺ¯
0.34
lÃł
0.33
merupakan
0.31
ÑıвлÑıеÑĤÑģÑı
0.31
είναι
0.28
ãģ®ãģĮ
0.27
Activations Density 0.331%