INDEX
Negative Logits
редел
-0.08
defin
-0.08
concern
-0.08
ensure
-0.08
garant
-0.07
aseg
-0.07
ulate
-0.07
aplic
-0.07
sicher
-0.07
clos
-0.07
POSITIVE LOGITS
opting
0.10
opted
0.09
deliberately
0.09
optar
0.09
عدم
0.08
択
0.08
lựa
0.08
abst
0.08
voluntarily
0.08
anonymity
0.08
Activations Density 0.081%