INDEX
Explanations
terms and concepts related to equity and equality
New Auto-Interp
Negative Logits
enburg
-0.16
scratch
-0.16
billig
-0.15
usan
-0.15
tings
-0.15
emez
-0.15
ARGIN
-0.14
ãĥ©ãĥĥãĤ¯
-0.14
casts
-0.14
emies
-0.14
POSITIVE LOGITS
atorial
0.24
Equ
0.21
ipping
0.20
ipped
0.20
ilibrium
0.20
ipment
0.19
equ
0.19
inox
0.17
ator
0.17
anim
0.16
Activations Density 0.025%