INDEX
Explanations
discussions about gender and representation in the arts
New Auto-Interp
Negative Logits
ron
-0.08
sky
-0.07
lius
-0.07
OTOR
-0.06
omor
-0.06
erin
-0.06
(ConfigurationManager
-0.06
él
-0.06
ÙİÙĤ
-0.06
TEX
-0.06
POSITIVE LOGITS
unlike
0.12
unless
0.08
ibri
0.07
like
0.07
Unlike
0.07
åıĬåħ¶
0.07
ikat
0.07
eca
0.07
Unlike
0.07
δοÏĤ
0.07
Activations Density 0.037%