INDEX
Explanations
titles of books and themes related to social issues and identities
New Auto-Interp
Negative Logits
ymous
-0.16
ilyn
-0.15
moon
-0.15
rtle
-0.15
è£½ä½ľ
-0.15
ẩy
-0.14
Bien
-0.14
oleÄį
-0.13
azen
-0.13
dom
-0.13
POSITIVE LOGITS
;:
0.15
!:
0.15
ourg
0.15
:CGRect
0.14
jÃŃ
0.14
ando
0.14
?:
0.14
ÌĨ
0.14
asting
0.14
ãĥ«
0.14
Activations Density 0.180%