INDEX
Explanations
terms related to social issues and social constructs
Negative Logits
баÑĩ          -0.16
elan          -0.15
uito          -0.15
.sponge       -0.15
andal         -0.15
icari         -0.15
_principal    -0.14
iosa          -0.14
ptune         -0.14
_INITIALIZ    -0.14
Positive Logits
ware          0.16
ordes         0.15
justice       0.15
emez          0.14
iedade        0.14
orro          0.14
sciences      0.14
oley          0.14
/community    0.14
proof         0.14
Activations Density 0.037%
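
An activation density figure like the one above is usually read as the share of token positions on which the feature fires. The sketch below shows that arithmetic under stated assumptions: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical and are not taken from the dashboard or its tooling.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a position counts as "active" when its value is strictly
    greater than threshold; the dashboard's own definition may differ.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 10,000 token positions, 4 of which activate the feature.
acts = [0.0] * 10000
for i in (12, 507, 4431, 9002):
    acts[i] = 1.5

density = activation_density(acts)
print(f"{density:.3%}")  # prints 0.040%
```

On this reading, a density of 0.037% would correspond to roughly 37 active positions per 100,000 tokens sampled.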