Explanations
concepts related to neutrality and impartiality in various contexts
Negative Logits
olla      -0.16
asan      -0.15
akit      -0.14
die       -0.14
Solid     -0.14
alı       -0.14
'{$       -0.13
IMS       -0.13
beden     -0.13
už        -0.13
Positive Logits
isplay      0.16
hend        0.16
egl         0.15
source      0.15
Poz         0.14
asel        0.14
Bindable    0.14
referee     0.14
<source     0.14
neutrality  0.14
Activations Density 0.031%