INDEX
Explanations
keywords and concepts related to academic critique and analysis of power structures
Negative Logits
agy        -0.16
zas        -0.15
ocale      -0.14
rych       -0.14
urm        -0.14
ะ          -0.14
Discrim    -0.14
eki        -0.13
onest      -0.13
Hamm       -0.13
Positive Logits
spaces     0.18
spaces     0.17
space      0.16
labour     0.15
scripts    0.15
åĢį        0.15
-space     0.15
Practices  0.15
(er        0.15
legit      0.14
Activations Density 0.238%