INDEX
Explanations
phrases related to social justice and rights advocacy
New Auto-Interp
Negative Logits
illis
-0.16
eros
-0.14
elor
-0.14
outes
-0.14
reb
-0.14
aira
-0.14
اÛĮØ´
-0.14
linger
-0.14
ij
-0.14
Blo
-0.14
POSITIVE LOGITS
individual
0.17
odic
0.15
individ
0.14
Caucus
0.14
åĿĤ
0.14
_phys
0.14
ắm
0.14
ëŀĻ
0.14
ãĤ¡
0.14
ograd
0.14
Activations Density 0.628%