Explanations
words related to activism and active participation in social or political movements
Negative Logits
eda         -0.20
'..',       -0.17
ers         -0.17
wig         -0.16
enance      -0.15
ापà¤ķ       -0.15
pector      -0.15
ç¼          -0.15
oriously    -0.14
ald         -0.14
Positive Logits
/de         0.19
/react      0.19
/pass       0.19
uator       0.18
XObject     0.17
ewear       0.16
-duty       0.16
irth        0.15
748         0.15
verb        0.15
Activations Density 0.020%