INDEX
Explanations
phrases indicating a reduction or decrease in quantity or quality
New Auto-Interp
Negative Logits
umu
-0.15
uild
-0.14
kostenlose
-0.14
ÙĨدر
-0.14
iaÅĤ
-0.14
REAK
-0.13
å¡
-0.13
ÄIJT
-0.13
outh
-0.13
.setView
-0.13
POSITIVE LOGITS
than
0.23
-than
0.22
ening
0.19
_than
0.18
Than
0.18
než
0.17
eren
0.17
/no
0.16
THAN
0.16
ere
0.16
Activations Density 0.028%