INDEX
Explanations
discussions about socioeconomic status and class differences
New Auto-Interp
Negative Logits
onya
-0.17
inis
-0.15
ynam
-0.15
äm
-0.14
lbrace
-0.14
ifter
-0.14
è°ĭ
-0.14
UTO
-0.14
NavController
-0.14
ingleton
-0.14
POSITIVE LOGITS
.hm
0.15
_ENCODING
0.15
hood
0.15
452
0.14
blame
0.13
brow
0.13
425
0.13
umbo
0.13
hood
0.13
Äħ
0.13
Activations Density 0.028%